Fresh from our labs: Adobe PDF import improvements

ckrause

Well-known member
Joined
Oct 9, 2009
Messages
87
Programming Experience
5-10
I was just able to get hold of the latest builds fresh out of our development labs. Our engineering is currently working on innovative new features for version 18.0 of TX Text Control. In this article, I will give a deep look into the drastically improved Adobe PDF import functionality.

Since version 15.0, the .NET versions of TX Text Control can import PDF files. While the first version was able to import text only, later versions already recognized paragraphs, fonts and positioned text using anchored text frames.

While these essential features describe the basement of such an innovative technology, version 18.0 is the next big milestone. Thanks to powerful new features in the TX Text Control core, the PDF import filter converts even complex pages into editable documents that can be easily modified and converted.

A very commonly used feature in PDF documents are layers. A PDF can consist of unlimited layers with a different z-order, so they can overlap each other. This z-order of objects was required in TX Text Control for images and text frames to enable the import of more complex PDF files. The following screenshots give an overview of what will be possible with the next version of TX Text Control.

Floating Text

This sample document shows how text is converted into floating text. On the right-hand screenshot, you can see that text can be selected like any other text in a TX Text Control document. Fonts, font weights, colors and paragraph margins are recognized and applied to the document. If a document contains more sections with different page sizes, they are converted to the proper landscape or portrait settings as well.

Original PDF in Adobe Acrobat Reader:

tx_pdf_import_1_pdf.png


Opened in TX Text Control 18.0 (Beta):


tx_pdf_import_1_tx.png


Modify Text in Complex Forms

Form documents with complex tables and perfectly positioned elements can be imported 1:1. You can easily change each string using the editor interface or you can find and replace text programmatically using the powerful TX Text Control API.

Original PDF in Adobe Acrobat Reader:

tx_pdf_import_2_pdf.png


Opened in TX Text Control 18.0 (Beta):


tx_pdf_import_2_tx.png


Import Postscript Vector Graphics

When importing business reports from PDF, they often contain charts or diagrams to visualize data. Using TX Text Control 18.0, these vector graphics can not only be imported, the included text can be changed as well.

Original PDF in Adobe Acrobat Reader:

tx_pdf_import_3_pdf.png


Opened in TX Text Control 18.0 (Beta):

tx_pdf_import_3_tx.png


These are just a few samples that gives an impression of the power of our new PDF import functionality that will come with TX Text Control 18.0. In the next couple of days, we will publish a live preview demo that enables you to test this functionality on your own.

Stay tuned and subscribe to the newsletter:

Subscribe to the TX Text Control Newsletter


About TX Text Control:

TX Text Control was originally released in 1991, since then more than 50,000 copies have been sold. Starting off as a single, small DLL, TX Text Control has made its way through 16-bit DLL and VBX versions to today‘s Enterprise edition with its .NET and ActiveX components. The recent addition to the family, TX Text Control .NET Server, offers all of TX Text Control advanced word processing functionality in an easy-to-use server-side .NET component. Customers benefit from these years of experience, large user base, and at the same time, appreciate developing with a mature, reliable product.

Contact Informations:

support@textcontrol.com

North & South America:
Phone: +1 704-370-0110
Phone: +1 877-462-4772 (toll free)

Europe:
Phone: +49 (0)421 42 70 67 10

Asia Pacific:
Phone: +886 2-2797-8508
 
Back
Top