Poppler (software)

Poppler is a free and open-source software library for rendering Portable Document Format (PDF) documents. Its development is supported by freedesktop.org. Commonly used on Linux systems,[4] it powers the PDF viewers of the GNOME and KDE desktop environments.

Poppler
Developer(s)freedesktop.org
Initial release4 March 2005; 19 years ago (2005-03-04)[nb 1]
Stable release
24.07.0[2] Edit this on Wikidata / 1 July 2024; 8 days ago (1 July 2024)[1]
Repository
Written inC++
Operating systemLinux, Unix, BSD, Windows
TypeLibrary
LicenseGPLv2 or GPLv3[3]
Websitepoppler.freedesktop.org Edit this at Wikidata

History

The project was started by Kristian Høgsberg with two goals:[5] to provide PDF rendering functionality as a shared library, to centralize maintenance effort and to go beyond the goals of Xpdf, and to integrate with functionality provided by modern operating systems.

By the version 0.18 release in 2011, the poppler library represented a complete implementation of ISO 32000-1,[4] the PDF format standard, and was the first major free PDF library to support its forms (only Acroforms but not full XFA forms)[6][7] and annotations features.[4]

Poppler is a fork of Xpdf-3.0, a PDF file viewer developed by Derek Noonburg of Glyph and Cog, LLC.[5][8]

The name Poppler comes from the animated series Futurama episode "The Problem with Popplers."[8]

Applications

Notable free software applications using Poppler to render PDF documents include:[9]

ApplicationGUI widgets
EvinceGTK
InkscapeGTK
LibreOffice 4.xGTK[10]
OkularQt
pdftotext, pdftohtml, etc.none
TeXstudioQt
TeXworksQt
xpoppleMotif
ZathuraGTK

Features

Poppler can use two back-ends for drawing PDF documents, Cairo and Splash. Its features may depend on which back-end it employs. A third back-end based on Qt4's painting framework "Arthur", is available, but is incomplete and no longer under active development.[11] Bindings exist for Glib and Qt5, that provide interfaces to the Poppler backends, although the Qt5 bindings support only the Splash and Arthur backends. There is a patchset available to add support for the Cairo backend to the Qt5 bindings,[12] but the Poppler project does not currently wish to integrate the feature into the library proper.[13]

Some characteristics of the back-ends include:

Poppler comes with a text-rendering back-end as well, which can be invoked from the command line utility pdftotext. It is useful for searching for strings in PDFs from the command line, using the utility grep, for instance.[14]

Example:

pdftotext file.pdf - | grep string

Poppler partially supports annotations and Acroforms. It does not support JavaScript[15] nor the rendering of full XFA forms.[6]

poppler-utils

poppler-utils is a collection of command-line utilities built on Poppler's library API, to manage PDF and extract contents:

  • pdfattach – add a new embedded file (attachment) to an existing PDF
  • pdfdetach – extract embedded documents from a PDF
  • pdffonts – lists the fonts used in a PDF
  • pdfimages – extract all embedded images at native resolution from a PDF
  • pdfinfo – list all information of a PDF
  • pdfseparate – extract single pages from a PDF
  • pdftocairo – convert single pages from a PDF to vector or bitmap formats using cairo
  • pdftohtml – convert PDF to HTML format retaining formatting
  • pdftoppm – convert a PDF page to a bitmap
  • pdftops – convert PDF to printable PS format
  • pdftotext – extract all text from PDF
  • pdfunite – merges several PDFs

See also

Notes

References

  • Albert Astals Cid (29 August 2005) The Poppler Library, presentation at the 2005 KDE conference