The service extracts the content of a PDF document and generates structured XML. The provider's site offers an interactive application that converts PDF to either XML or ePub. The web service supports programmatic conversion to XML only.
API methods detect headers and footers, and segment and order the text found in the PDF file. They also detect and process an embedded table of contents, captions, and footnotes.
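As a rough illustration of how a client might consume this kind of structured XML, the sketch below separates body text from headers, footers, and footnotes. The listing does not document the output schema, so every element and attribute name here (`document`, `page`, `header`, `footer`, `block`, `order`, `footnote`) is a hypothetical placeholder, and the sample is hand-written rather than real service output.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: the real service's element and attribute names are
# not given in this listing, so these are invented for illustration only.
SAMPLE_XML = """\
<document>
  <page number="1">
    <header>Annual Report</header>
    <block order="2">Second paragraph.</block>
    <block order="1">First paragraph.</block>
    <footnote>Source: internal data.</footnote>
    <footer>Page 1</footer>
  </page>
</document>
"""


def body_text_in_reading_order(xml_text):
    """Return body blocks per page, sorted by the assumed 'order' attribute.

    Headers, footers, and footnotes are skipped because only <block>
    elements are collected.
    """
    root = ET.fromstring(xml_text)
    lines = []
    for page in root.iter("page"):
        blocks = sorted(page.findall("block"), key=lambda b: int(b.get("order")))
        lines.extend(b.text.strip() for b in blocks)
    return lines


def footnotes(xml_text):
    """Return the text of every (assumed) <footnote> element."""
    root = ET.fromstring(xml_text)
    return [note.text.strip() for note in root.iter("footnote")]


if __name__ == "__main__":
    print(body_text_in_reading_order(SAMPLE_XML))
    print(footnotes(SAMPLE_XML))
```

If the actual output uses different tags or encodes reading order some other way, only the tag names and the sort key would need to change.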