PDF::API2(3) Facilitates the creation and modification of PDF files

SYNOPSIS


use PDF::API2;
# Create a blank PDF file
$pdf = PDF::API2->new();
# Open an existing PDF file
$pdf = PDF::API2->open('some.pdf');
# Add a blank page
$page = $pdf->page();
# Retrieve an existing page
$page = $pdf->openpage($page_number);
# Set the page size
$page->mediabox('Letter');
# Add a built-in font to the PDF
$font = $pdf->corefont('Helvetica-Bold');
# Add an external TTF font to the PDF
$font = $pdf->ttfont('/path/to/font.ttf');
# Add some text to the page
$text = $page->text();
$text->font($font, 20);
$text->translate(200, 700);
$text->text('Hello World!');
# Save the PDF
$pdf->saveas('/path/to/new.pdf');

GENERIC METHODS

$pdf = PDF::API2->new(%options)
Creates a new PDF object. If you will be saving it as a file and already know the filename, you can give the '-file' option to minimize possible memory requirements later on.

Example:

    $pdf = PDF::API2->new();
    ...
    print $pdf->stringify();
    $pdf = PDF::API2->new();
    ...
    $pdf->saveas('our/new.pdf');
    $pdf = PDF::API2->new(-file => 'our/new.pdf');
    ...
    $pdf->save();
$pdf = PDF::API2->open($pdf_file)
Opens an existing PDF file.

Example:

    $pdf = PDF::API2->open('our/old.pdf');
    ...
    $pdf->saveas('our/new.pdf');
    $pdf = PDF::API2->open('our/to/be/updated.pdf');
    ...
    $pdf->update();
$pdf = PDF::API2->open_scalar($pdf_string)
Opens a PDF contained in a string.

Example:

    # Read a PDF into a string, for the purpose of demonstration
    open $fh, 'our/old.pdf' or die $@;
    undef $/;  # Read the whole file at once
    $pdf_string = <$fh>;
    $pdf = PDF::API2->open_scalar($pdf_string);
    ...
    $pdf->saveas('our/new.pdf');
$pdf->preferences(%options)
Controls viewing preferences for the PDF.

Page Mode Options:

-fullscreen
Full-screen mode, with no menu bar, window controls, or any other window visible.
-thumbs
Thumbnail images visible.
-outlines
Document outline visible.

Page Layout Options:

-singlepage
Display one page at a time.
-onecolumn
Display the pages in one column.
-twocolumnleft
Display the pages in two columns, with oddnumbered pages on the left.
-twocolumnright
Display the pages in two columns, with oddnumbered pages on the right.

Viewer Options:

-hidetoolbar
Specifying whether to hide tool bars.
-hidemenubar
Specifying whether to hide menu bars.
-hidewindowui
Specifying whether to hide user interface elements.
-fitwindow
Specifying whether to resize the document's window to the size of the displayed page.
-centerwindow
Specifying whether to position the document's window in the center of the screen.
-displaytitle
Specifying whether the window's title bar should display the document title taken from the Title entry of the document information dictionary.
-afterfullscreenthumbs
Thumbnail images visible after Full-screen mode.
-afterfullscreenoutlines
Document outline visible after Full-screen mode.
-printscalingnone
Set the default print setting for page scaling to none.

Initial Page Options:

-firstpage => [ $page, %options ]
Specifying the page (either a page number or a page object) to be displayed, plus one of the following options:
-fit => 1
Display the page designated by page, with its contents magnified just enough to fit the entire page within the window both horizontally and vertically. If the required horizontal and vertical magnification factors are different, use the smaller of the two, centering the page within the window in the other dimension.
-fith => $top
Display the page designated by page, with the vertical coordinate top positioned at the top edge of the window and the contents of the page magnified just enough to fit the entire width of the page within the window.
-fitv => $left
Display the page designated by page, with the horizontal coordinate left positioned at the left edge of the window and the contents of the page magnified just enough to fit the entire height of the page within the window.
-fitr => [ $left, $bottom, $right, $top ]
Display the page designated by page, with its contents magnified just enough to fit the rectangle specified by the coordinates left, bottom, right, and top entirely within the window both horizontally and vertically. If the required horizontal and vertical magnification factors are different, use the smaller of the two, centering the rectangle within the window in the other dimension.
-fitb => 1
Display the page designated by page, with its contents magnified just enough to fit its bounding box entirely within the window both horizontally and vertically. If the required horizontal and vertical magnification factors are different, use the smaller of the two, centering the bounding box within the window in the other dimension.
-fitbh => $top
Display the page designated by page, with the vertical coordinate top positioned at the top edge of the window and the contents of the page magnified just enough to fit the entire width of its bounding box within the window.
-fitbv => $left
Display the page designated by page, with the horizontal coordinate left positioned at the left edge of the window and the contents of the page magnified just enough to fit the entire height of its bounding box within the window.
-xyz => [ $left, $top, $zoom ]
Display the page designated by page, with the coordinates (left, top) positioned at the top-left corner of the window and the contents of the page magnified by the factor zoom. A zero (0) value for any of the parameters left, top, or zoom specifies that the current value of that parameter is to be retained unchanged.

Example:

    $pdf->preferences(
        -fullscreen => 1,
        -onecolumn => 1,
        -afterfullscreenoutlines => 1,
        -firstpage => [$page, -fit => 1],
    );
$val = $pdf->default($parameter)
$pdf->default($parameter, $value)
Gets/sets the default value for a behaviour of PDF::API2.

Supported Parameters:

nounrotate
prohibits API2 from rotating imported/opened page to re-create a default pdf-context.
pageencaps
enables than API2 will add save/restore commands upon imported/opened pages to preserve graphics-state for modification.
copyannots
enables importing of annotations (*EXPERIMENTAL*).
$version = $pdf->version([$new_version])
Get/set the PDF version (e.g. 1.4)
$bool = $pdf->isEncrypted()
Checks if the previously opened PDF is encrypted.
%infohash = $pdf->info(%infohash)
Gets/sets the info structure of the document.

Example:

    %h = $pdf->info(
        'Author'       => "Alfred Reibenschuh",
        'CreationDate' => "D:20020911000000+01'00'",
        'ModDate'      => "D:YYYYMMDDhhmmssOHH'mm'",
        'Creator'      => "fredos-script.pl",
        'Producer'     => "PDF::API2",
        'Title'        => "some Publication",
        'Subject'      => "perl ?",
        'Keywords'     => "all good things are pdf"
    );
    print "Author: $h{Author}\n";
@metadata_attributes = $pdf->infoMetaAttributes(@metadata_attributes)
Gets/sets the supported info-structure tags.

Example:

    @attributes = $pdf->infoMetaAttributes;
    print "Supported Attributes: @attr\n";
    @attributes = $pdf->infoMetaAttributes('CustomField1');
    print "Supported Attributes: @attributes\n";
$xml = $pdf->xmpMetadata($xml)
Gets/sets the XMP XML data stream.

Example:

    $xml = $pdf->xmpMetadata();
    print "PDFs Metadata reads: $xml\n";
    $xml=<<EOT;
    <?xpacket begin='' id='W5M0MpCehiHzreSzNTczkc9d'?>
    <?adobe-xap-filters esc="CRLF"?>
    <x:xmpmeta
      xmlns:x='adobe:ns:meta/'
      x:xmptk='XMP toolkit 2.9.1-14, framework 1.6'>
        <rdf:RDF
          xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'
          xmlns:iX='http://ns.adobe.com/iX/1.0/'>
            <rdf:Description
              rdf:about='uuid:b8659d3a-369e-11d9-b951-000393c97fd8'
              xmlns:pdf='http://ns.adobe.com/pdf/1.3/'
              pdf:Producer='Acrobat Distiller 6.0.1 for Macintosh'></rdf:Description>
            <rdf:Description
              rdf:about='uuid:b8659d3a-369e-11d9-b951-000393c97fd8'
              xmlns:xap='http://ns.adobe.com/xap/1.0/'
              xap:CreateDate='2004-11-14T08:41:16Z'
              xap:ModifyDate='2004-11-14T16:38:50-08:00'
              xap:CreatorTool='FrameMaker 7.0'
              xap:MetadataDate='2004-11-14T16:38:50-08:00'></rdf:Description>
            <rdf:Description
              rdf:about='uuid:b8659d3a-369e-11d9-b951-000393c97fd8'
              xmlns:xapMM='http://ns.adobe.com/xap/1.0/mm/'
              xapMM:DocumentID='uuid:919b9378-369c-11d9-a2b5-000393c97fd8'/></rdf:Description>
            <rdf:Description
              rdf:about='uuid:b8659d3a-369e-11d9-b951-000393c97fd8'
              xmlns:dc='http://purl.org/dc/elements/1.1/'
              dc:format='application/pdf'>
                <dc:description>
                  <rdf:Alt>
                    <rdf:li xml:lang='x-default'>Adobe Portable Document Format (PDF)</rdf:li>
                  </rdf:Alt>
                </dc:description>
                <dc:creator>
                  <rdf:Seq>
                    <rdf:li>Adobe Systems Incorporated</rdf:li>
                  </rdf:Seq>
                </dc:creator>
                <dc:title>
                  <rdf:Alt>
                    <rdf:li xml:lang='x-default'>PDF Reference, version 1.6</rdf:li>
                  </rdf:Alt>
                </dc:title>
            </rdf:Description>
        </rdf:RDF>
    </x:xmpmeta>
    <?xpacket end='w'?>
    EOT
    $xml = $pdf->xmpMetadata($xml);
    print "PDF metadata now reads: $xml\n";
$pdf->pageLabel($index, $options)
Sets page label options.

Supported Options:

-style
Roman, roman, decimal, Alpha or alpha.
-start
Restart numbering at given number.
-prefix
Text prefix for numbering.

Example:

    # Start with Roman Numerals
    $pdf->pageLabel(0, {
        -style => 'roman',
    });
    # Switch to Arabic
    $pdf->pageLabel(4, {
        -style => 'decimal',
    });
    # Numbering for Appendix A
    $pdf->pageLabel(32, {
        -start => 1,
        -prefix => 'A-'
    });
    
    # Numbering for Appendix B
    $pdf->pageLabel( 36, {
        -start => 1,
        -prefix => 'B-'
    });
    
    # Numbering for the Index
    $pdf->pageLabel(40, {
        -style => 'Roman'
        -start => 1,
        -prefix => 'Index '
    });
$pdf->finishobjects(@objects)
Force objects to be written to file if possible.

Example:

    $pdf = PDF::API2->new(-file => 'our/new.pdf');
    ...
    $pdf->finishobjects($page, $gfx, $txt);
    ...
    $pdf->save();
$pdf->update()
Saves a previously opened document.

Example:

    $pdf = PDF::API2->open('our/to/be/updated.pdf');
    ...
    $pdf->update();
$pdf->saveas($file)
Save the document to $file and remove the object structure from memory.

Example:

    $pdf = PDF::API2->new();
    ...
    $pdf->saveas('our/new.pdf');
$string = $pdf->stringify()
Return the document as a string and remove the object structure from memory.

Example:

    $pdf = PDF::API2->new();
    ...
    print $pdf->stringify();
$pdf->end()
Remove the object structure from memory. PDF::API2 contains circular references, so this call is necessary in long-running processes to keep from running out of memory.

This will be called automatically when you save or stringify a PDF. You should only need to call it explicitly if you are reading PDF files and not writing them.

PAGE METHODS

$page = $pdf->page()
$page = $pdf->page($page_number)
Returns a new page object. By default, the page is added to the end of the document. If you include an existing page number, the new page will be inserted in that position, pushing existing pages back.

$page_number can also have one of the following values:

-1 inserts the new page as the second-last page
0 inserts the page as the last page

Example:

    $pdf = PDF::API2->new();
    # Add a page.  This becomes page 1.
    $page = $pdf->page();
    # Add a new first page.  $page becomes page 2.
    $another_page = $pdf->page(1);
$page = $pdf->openpage($page_number)
Returns the PDF::API2::Page object of page $page_number.

If $page_number is 0 or -1, it will return the last page in the document.

Example:

    $pdf = PDF::API2->open('our/99page.pdf');
    $page = $pdf->openpage(1);   # returns the first page
    $page = $pdf->openpage(99);  # returns the last page
    $page = $pdf->openpage(-1);  # returns the last page
    $page = $pdf->openpage(999); # returns undef
$xoform = $pdf->importPageIntoForm($source_pdf, $source_page_number)
Returns a Form XObject created by extracting the specified page from $source_pdf.

This is useful if you want to transpose the imported page somewhat differently onto a page (e.g. two-up, four-up, etc.).

If $source_page_number is 0 or -1, it will return the last page in the document.

Example:

    $pdf = PDF::API2->new();
    $old = PDF::API2->open('our/old.pdf');
    $page = $pdf->page();
    $gfx = $page->gfx();
    # Import Page 2 from the old PDF
    $xo = $pdf->importPageIntoForm($old, 2);
    # Add it to the new PDF's first page at 1/2 scale
    $gfx->formimage($xo, 0, 0, 0.5); 
    $pdf->saveas('our/new.pdf');

Note: You can only import a page from an existing PDF file.

$page = $pdf->import_page($source_pdf, $source_page_number, $target_page_number)
Imports a page from $source_pdf and adds it to the specified position in $pdf.

If $source_page_number or $target_page_number is 0 or -1, the last page in the document is used.

Note: If you pass a page object instead of a page number for $target_page_number, the contents of the page will be merged into the existing page.

Example:

    $pdf = PDF::API2->new();
    $old = PDF::API2->open('our/old.pdf');
    # Add page 2 from the old PDF as page 1 of the new PDF
    $page = $pdf->import_page($old, 2);
    $pdf->saveas('our/new.pdf');

Note: You can only import a page from an existing PDF file.

$count = $pdf->pages()
Returns the number of pages in the document.
$pdf->mediabox($name)
$pdf->mediabox($w, $h)
$pdf->mediabox($llx, $lly, $urx, $ury)
Sets the global mediabox.

Example:

    $pdf = PDF::API2->new();
    $pdf->mediabox('A4');
    ...
    $pdf->saveas('our/new.pdf');
    $pdf = PDF::API2->new();
    $pdf->mediabox(595, 842);
    ...
    $pdf->saveas('our/new.pdf');
    $pdf = PDF::API2->new;
    $pdf->mediabox(0, 0, 595, 842);
    ...
    $pdf->saveas('our/new.pdf');
$pdf->cropbox($name)
$pdf->cropbox($w, $h)
$pdf->cropbox($llx, $lly, $urx, $ury)
Sets the global cropbox.
$pdf->bleedbox($name)
$pdf->bleedbox($w, $h)
$pdf->bleedbox($llx, $lly, $urx, $ury)
Sets the global bleedbox.
$pdf->trimbox($name)
$pdf->trimbox($w, $h)
$pdf->trimbox($llx, $lly, $urx, $ury)
Sets the global trimbox.
$pdf->artbox($name)
$pdf->artbox($w, $h)
$pdf->artbox($llx, $lly, $urx, $ury)
Sets the global artbox.

FONT METHODS

@directories = PDF::API2::addFontDirs($dir1, $dir2, ...)
Adds one or more directories to the search path for finding font files.

Returns the list of searched directories.

$font = $pdf->corefont($fontname, [%options])
Returns a new Adobe core font object.

Examples:

    $font = $pdf->corefont('Times-Roman');
    $font = $pdf->corefont('Times-Bold');
    $font = $pdf->corefont('Helvetica');
    $font = $pdf->corefont('ZapfDingbats');

Valid %options are:

-encode
Changes the encoding of the font from its default.
-dokern
Enables kerning if data is available.

See Also: PDF::API2::Resource::Font::CoreFont.

$font = $pdf->psfont($ps_file, [%options])
Returns a new Adobe Type1 font object.

Examples:

    $font = $pdf->psfont('Times-Book.pfa', -afmfile => 'Times-Book.afm');
    $font = $pdf->psfont('/fonts/Synest-FB.pfb', -pfmfile => '/fonts/Synest-FB.pfm');

Valid %options are:

-encode
Changes the encoding of the font from its default.
-afmfile
Specifies the location of the font metrics file.
-pfmfile
Specifies the location of the printer font metrics file. This option overrides the -encode option.
-dokern
Enables kerning if data is available.
$font = $pdf->ttfont($ttf_file, [%options])
Returns a new TrueType or OpenType font object.

Examples:

    $font = $pdf->ttfont('Times.ttf');
    $font = $pdf->ttfont('Georgia.otf');

Valid %options are:

-encode
Changes the encoding of the font from its default.
-isocmap
Use the ISO Unicode Map instead of the default MS Unicode Map.
-dokern
Enables kerning if data is available.
-noembed
Disables embedding of the font file.
$font = $pdf->cjkfont($cjkname, [%options])
Returns a new CJK font object.

Examples:

    $font = $pdf->cjkfont('korean');
    $font = $pdf->cjkfont('traditional');

Valid %options are:

-encode
Changes the encoding of the font from its default.

See Also: PDF::API2::Resource::CIDFont::CJKFont

$font = $pdf->synfont($basefont, [%options])
Returns a new synthetic font object.

Examples:

    $cf  = $pdf->corefont('Times-Roman', -encode => 'latin1');
    $sf  = $pdf->synfont($cf, -slant => 0.85);  # compressed 85%
    $sfb = $pdf->synfont($cf, -bold => 1);      # embolden by 10em
    $sfi = $pdf->synfont($cf, -oblique => -12); # italic at -12 degrees

Valid %options are:

-slant
Slant/expansion factor (0.1-0.9 = slant, 1.1+ = expansion).
-oblique
Italic angle (+/-)
-bold
Emboldening factor (0.1+, bold = 1, heavy = 2, ...)
-space
Additional character spacing in ems (0-1000)

See Also: PDF::API2::Resource::Font::SynFont

$font = $pdf->bdfont($bdf_file)
Returns a new BDF font object, based on the specified Adobe BDF file.

See Also: PDF::API2::Resource::Font::BdFont

$font = $pdf->unifont(@fontspecs, %options)
Returns a new uni-font object, based on the specified fonts and options.

BEWARE: This is not a true pdf-object, but a virtual/abstract font definition!

See Also: PDF::API2::Resource::UniFont.

Valid %options are:

-encode
Changes the encoding of the font from its default.

IMAGE METHODS

$jpeg = $pdf->image_jpeg($file)
Imports and returns a new JPEG image object.
$tiff = $pdf->image_tiff($file)
Imports and returns a new TIFF image object.
$pnm = $pdf->image_pnm($file)
Imports and returns a new PNM image object.
$png = $pdf->image_png($file)
Imports and returns a new PNG image object.
$gif = $pdf->image_gif($file)
Imports and returns a new GIF image object.
$gdf = $pdf->image_gd($gd_object, %options)
Imports and returns a new image object from GD::Image.

Options: The only option currently supported is "-lossless => 1".

COLORSPACE METHODS

$cs = $pdf->colorspace_act($file)
Returns a new colorspace object based on an Adobe Color Table file.

See PDF::API2::Resource::ColorSpace::Indexed::ACTFile for a reference to the file format's specification.

$cs = $pdf->colorspace_web()
Returns a new colorspace-object based on the web color palette.
$cs = $pdf->colorspace_hue()
Returns a new colorspace-object based on the hue color palette.

See PDF::API2::Resource::ColorSpace::Indexed::Hue for an explanation.

$cs = $pdf->colorspace_separation($tint, $color)
Returns a new separation colorspace object based on the parameters.

$tint can be any valid ink identifier, including but not limited to: 'Cyan', 'Magenta', 'Yellow', 'Black', 'Red', 'Green', 'Blue' or 'Orange'.

$color must be a valid color specification limited to: '#rrggbb', '!hhssvv', '%ccmmyykk' or a ``named color'' (rgb).

The colorspace model will automatically be chosen based on the specified color.

$cs = $pdf->colorspace_devicen(\@tintCSx, [$samples])
Returns a new DeviceN colorspace object based on the parameters.

Example:

    $cy = $pdf->colorspace_separation('Cyan',    '%f000');
    $ma = $pdf->colorspace_separation('Magenta', '%0f00');
    $ye = $pdf->colorspace_separation('Yellow',  '%00f0');
    $bk = $pdf->colorspace_separation('Black',   '%000f');
    $pms023 = $pdf->colorspace_separation('PANTONE 032CV', '%0ff0');
    $dncs = $pdf->colorspace_devicen( [ $cy,$ma,$ye,$bk,$pms023 ] );

The colorspace model will automatically be chosen based on the first colorspace specified.

BARCODE METHODS

$bc = $pdf->xo_codabar(%options)
$bc = $pdf->xo_code128(%options)
$bc = $pdf->xo_2of5int(%options)
$bc = $pdf->xo_3of9(%options)
$bc = $pdf->xo_ean13(%options)
Creates the specified barcode object as a form XObject.

OTHER METHODS

$xo = $pdf->xo_form()
Returns a new form XObject.
$egs = $pdf->egstate()
Returns a new extended graphics state object.
$obj = $pdf->pattern()
Returns a new pattern object.
$obj = $pdf->shading()
Returns a new shading object.
$otls = $pdf->outlines()
Returns a new or existing outlines object.

RESOURCE METHODS

$pdf->resource($type, $key, $obj, $force)
Adds a resource to the global PDF tree.

Example:

    $pdf->resource('Font', $fontkey, $fontobj);
    $pdf->resource('XObject', $imagekey, $imageobj);
    $pdf->resource('Shading', $shadekey, $shadeobj);
    $pdf->resource('ColorSpace', $spacekey, $speceobj);

Note: You only have to add the required resources if they are NOT handled by the font, image, shade or space methods.

KNOWN ISSUES

This module does not work with perl's -l command-line switch.

PDFs using cross-reference streams instead of cross-reference tables are not yet supported. Cross-reference streams were added as an option in version 1.5 of the PDF spec, but were only used infrequently until Adobe Acrobat 9 started using them by default. A patch would be welcome --- see the PDF 1.7 specification, sections 7.5.4 and 7.5.8 for implementation details.

AUTHOR

PDF::API2 was originally written by Alfred Reibenschuh.

It is currently being maintained by Steve Simms.