DESCRIPTION
Looks after the character map. For ease of use, the actual cmap is held in a hash against codepoint. Thus for a given table:
$gid = $font->{'cmap'}{'Tables'}[0]{'val'}{$code};
Note that $code should be a true value (0x1234) rather than a string representation.
INSTANCE VARIABLES
The instance variables listed here are not preceded by a space due to their emulating structural information in the font.- Num
- Number of subtables in this table
- Tables
-
An array of subtables ([0..Num-1])
Each subtable also has its own instance variables which are, again, not preceded by a space.
-
- Platform
- The platform number for this subtable
- Encoding
- The encoding number for this subtable
- Format
- Gives the stored format of this subtable
- Ver
- Gives the version (or language) information for this subtable
- val
- A hash keyed by the codepoint value (not a string) storing the glyph id
-
The following cmap options are controlled by instance variables that start with a space:
- allowholes
- By default, when generating format 4 cmap subtables character codes that point to glyph zero (normally called .notdef) are not included in the subtable. In some cases including some of these character codes can result in a smaller format 4 subtable. To enable this behavior, set allowholes to non-zero.
METHODS
$t->read
Reads the cmap into memory. Format 4 subtables read the whole subtable and fill in the segmented array accordingly.$t->ms_lookup($uni)
Finds a Unicode table, giving preference to the MS one, and looks up the given Unicode codepoint in it to find the glyph id.$t->find_ms
Finds the a Unicode table, giving preference to the Microsoft one, and sets the "mstable" instance variable to it if found. Returns the table it finds.$t->ms_enc
Returns the encoding of the microsoft table (0 => symbol, etc.). Returns undef if there is no Microsoft cmap.$t->out($fh)
Writes out a cmap table to a filehandle. If it has not been read, then just copies from input file to output$t->XML_element($context, $depth, $name, $val)
Outputs the elements of the cmap in XML. We only need to process val here$t->minsize()
Returns the minimum size this table can be in bytes. If it is smaller than this, then the table must be bad and should be deleted or whatever.$t->update
Tidies the cmap table.Removes MS Fmt12 cmap if it is no longer needed.
Removes from all cmaps any codepoint that map to GID=0. Note that such entries will be re-introduced as necessary depending on the cmap format.
@map = $t->reverse(%opt)
Returns a reverse map of the Unicode cmap. I.e. given a glyph gives the Unicode value for it. Options are:- tnum
- Table number to use rather than the default Unicode table
- array
- Returns each element of reverse as an array since a glyph may be mapped by more than one Unicode value. The arrays are unsorted. Otherwise store any one unicode value for a glyph.
is_unicode($index)
Returns whether the table of a given index is known to be a unicode table (as specified in the specifications)BUGS
- Format 14 (Unicode Variation Sequences) cmaps are not supported.
AUTHOR
Martin Hosken <http://scripts.sil.org/FontUtils>.LICENSING
Copyright (c) 1998-2014, SIL International (http://www.sil.org)This module is released under the terms of the Artistic License 2.0. For details, see the full text of the license in the file LICENSE.