Xenos d2e Vision PDF Parser
The innovative Xenos d2e Vision PDF Parser component processes Adobe Portable Document Format (PDF) input files, versions 1.0 to 1.6 (Acrobat 7.0 or earlier). Written 100% in Java by Xenos, the d2e Vision PDF Parser supports full indexing from anywhere in the document and full font correlation, and it offers rich functionality and faster performance than rasterized solutions.
The Xenos d2e Vision PDF Parser features full double-byte character set (DBCS) support, including outline fonts and mixed Latin and double-byte characters, for Chinese, Japanese or Korean language documents. Indexes can be extracted from DBCS data and used, provided that the retrieval application can read them as XML or binary with UTF-8 encoding, or as an escape character sequence.
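To illustrate the two representations mentioned above, the following Java sketch encodes a mixed Latin and double-byte index value as UTF-8 bytes and as an escape character sequence. It is an illustration only and not part of the d2e Vision product: the class and method names are hypothetical, and the backslash-u escape convention is an assumption, since the exact escape format expected by a retrieval application is not specified here.

```java
import java.nio.charset.StandardCharsets;

final class IndexValueEncoding {

    /** Encodes an index value as UTF-8 bytes, as a binary index file might store it. */
    static byte[] toUtf8(String value) {
        return value.getBytes(StandardCharsets.UTF_8);
    }

    /**
     * Replaces every non-ASCII UTF-16 code unit with a backslash, the letter u
     * and four hexadecimal digits. This escape convention is an assumption.
     */
    static String toEscaped(String value) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < value.length(); i++) {
            char c = value.charAt(i);
            if (c < 0x80) {
                sb.append(c);                                   // plain ASCII passes through
            } else {
                sb.append(String.format("\\u%04X", (int) c));   // escape everything else
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Hypothetical index value: two kanji followed by a Latin given name.
        // The kanji are written as Unicode escapes so the source file stays ASCII.
        String customerName = "\u5C71\u7530 Taro";
        System.out.println("UTF-8 length in bytes: " + toUtf8(customerName).length);
        System.out.println("Escaped form: " + toEscaped(customerName));
    }
}
```

Each of the two kanji occupies three bytes in UTF-8, while the Latin characters and the space occupy one byte each, so a retrieval application must decode the bytes as UTF-8 rather than assume one byte per character.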
Contact us to find out how you can benefit from the following advanced PDF parsing capabilities.
PDF Parser Applications
Print
- Reduce costs and save time by eliminating the inefficient use of many slower desktop printers to print PDF files
- Repurpose large volumes of existing PDF files by transforming them to AFP format
- Leverage your investment in existing high-volume AFP printers
- Achieve fidelity similar to the original PDF, assuming the corresponding AFP font resources are available and, for color content, the AFP printer has color capability
Archive and Electronic Content Management
- Extract index information from anywhere in a PDF input file
- Separate the file into individual, smaller PDF documents
- Load PDF files into archive or ECM systems for viewing in PDF format
- For PDF-to-PDF applications, add value-added functions such as bookmarks for easier document navigation, URL links for 1-to-1 marketing, and standard Acrobat 128-bit encryption for document security and fraud protection
- Transform PDF files to TIFF for loading into archives that require this image format
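One common way to drive the separation of a large file into individual documents is to start a new document whenever an extracted index value changes from one page to the next. The Java sketch below shows that idea only. It is not the d2e Vision API: the class PageGrouper, the method groupByIndex and the sample account numbers are hypothetical, and it assumes that one index value has already been extracted for each page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class PageGrouper {

    /**
     * Groups consecutive pages that share the same index value.
     * Returns {firstPage, lastPage} pairs, 1-based and inclusive.
     */
    static List<int[]> groupByIndex(List<String> indexPerPage) {
        List<int[]> ranges = new ArrayList<>();
        int start = 0;  // 0-based position of the first page in the current group
        for (int i = 1; i <= indexPerPage.size(); i++) {
            boolean atEnd = i == indexPerPage.size();
            // Close the current group at the end of input or when the value changes.
            if (atEnd || !indexPerPage.get(i).equals(indexPerPage.get(i - 1))) {
                ranges.add(new int[] {start + 1, i});
                start = i;
            }
        }
        return ranges;
    }

    public static void main(String[] args) {
        // Hypothetical account numbers, one per page of a six-page input file.
        List<String> accounts = Arrays.asList("A100", "A100", "B200", "C300", "C300", "C300");
        for (int[] range : groupByIndex(accounts)) {
            System.out.println("Document covers pages " + range[0] + " to " + range[1]);
        }
    }
}
```

For the six sample pages, the sketch yields three page ranges (1 to 2, 3 to 3 and 4 to 6), each of which would become one individual document.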
Supported Formats
- Adobe PDF 1.0 to 1.6 files (Acrobat 7.0 or earlier), including those generated by the d2e Platform and d2e Vision PDF Generators.
PDF Parser Features
The PDF Parser includes the following major features.
- Vector graphics support (pie charts, histograms etc.)
- Images (image objects, logos etc.)
- True color (24-bit) RGB images supported in PDF output and in AFP output with Function Set 11 RGB color support
- Decoding of Flate, LZW and ASCII85 encoded objects
- Processing encrypted PDF files
- Standard Adobe Base 14 font support (Type 1, Type 3 and TrueType, but no rasterization)
- CJK (Chinese, Japanese, Korean) font support for double-byte and multi-byte character fonts, including outline fonts and mixed Latin and double-byte characters
- Text orientations: Supported in the 4 standard orientations: 0° (Portrait), 90° (Inverse Landscape), 180° (Inverse Portrait), 270° (Landscape)
- Bullets: Supported, with font correlation using glyph mapping, if there is a corresponding output font; otherwise the default font is used
- Word shadows: Supported with font correlation
- Tables: Supported for graphical tables, such as row/column data inside lines
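As background for the stream decoding listed above, the following Java sketch shows how ASCII85 and Flate encoded data can be decoded in general, using only the standard Java library. It is not the d2e Vision implementation: the class PdfFilters and its methods are hypothetical names, LZW decoding is omitted, and the ASCII85 handling follows the conventions of the PDF filter (whitespace ignored, z for four zero bytes, a tilde ending the data).

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.DataFormatException;
import java.util.zip.Inflater;

final class PdfFilters {

    /** Decodes ASCII85 text; decoding stops at the first '~' of the end marker. */
    static byte[] ascii85Decode(String text) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        long group = 0;   // accumulated base-85 value of the current group
        int count = 0;    // number of characters read into the current group
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c == '~') break;
            if (Character.isWhitespace(c)) continue;
            if (c == 'z' && count == 0) {           // shorthand for four zero bytes
                writeBytes(out, 0L, 4);
                continue;
            }
            if (c < '!' || c > 'u') {
                throw new IllegalArgumentException("Invalid ASCII85 character: " + c);
            }
            group = group * 85 + (c - '!');
            if (++count == 5) {                     // five characters yield four bytes
                writeBytes(out, group, 4);
                group = 0;
                count = 0;
            }
        }
        if (count == 1) {
            throw new IllegalArgumentException("Incomplete final ASCII85 group");
        }
        if (count > 1) {                            // partial group of n characters yields n - 1 bytes
            int byteCount = count - 1;
            for (int i = count; i < 5; i++) {
                group = group * 85 + 84;            // pad with 'u', the highest digit
            }
            writeBytes(out, group, byteCount);
        }
        return out.toByteArray();
    }

    /** Writes the n most significant bytes of a 32-bit group value. */
    private static void writeBytes(ByteArrayOutputStream out, long group, int n) {
        for (int i = 0; i < n; i++) {
            out.write((int) (group >> (24 - 8 * i)) & 0xFF);
        }
    }

    /** Inflates zlib-wrapped deflate data, the format used by Flate encoded streams. */
    static byte[] flateDecode(byte[] data) {
        Inflater inflater = new Inflater();
        inflater.setInput(data);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[4096];
        try {
            while (!inflater.finished()) {
                int n = inflater.inflate(buffer);
                if (n == 0 && (inflater.needsInput() || inflater.needsDictionary())) {
                    throw new IllegalArgumentException("Truncated or unsupported Flate data");
                }
                out.write(buffer, 0, n);
            }
        } catch (DataFormatException e) {
            throw new IllegalArgumentException("Corrupt Flate data", e);
        } finally {
            inflater.end();
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        byte[] decoded = ascii85Decode("9jqo^~>");
        System.out.println(new String(decoded, StandardCharsets.US_ASCII));
    }
}
```

When a stream declares both filters, the decoders are applied in the order the filters are listed, for example flateDecode(ascii85Decode(text)) for a stream that was compressed first and then ASCII85 encoded.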
Note:
Because of the large number of features in PDF, many of which are not relevant or usable for content management, archiving or high volume printing, Xenos technical consultants will work closely with you to validate that your data is suitable for this application.