The service analyzes and extracts data from a number of popular application file types, including PDF and the Microsoft Office formats for Word, Excel, and PowerPoint. It also extracts text and other data from scanned documents.
API methods support uploading a source document file in one of the recognized formats along with specifications of the data to be exported. In addition to PDF and MS Office formats, methods can process OpenOffice and WordPerfect documents, plain-text files, RTF files, and a number of common image file formats. Requests also specify the source document language and MIME type, the desired output template, and other technical parameters. Methods generate output in the requested format, such as HTML for web display.
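As a rough illustration of the request parameters described above, the following Python sketch assembles the fields that would accompany an uploaded file. It sends nothing over the network. The field names (`language`, `mime_type`, `output_format`, `template`) and the function `build_extraction_request` are hypothetical, chosen for this example only; the service's actual parameter names are not given in this description.

```python
import mimetypes
from pathlib import Path


def build_extraction_request(path, language="en", output_format="html", template=None):
    """Assemble form fields for a document upload (hypothetical field names).

    Nothing is sent; the function only returns the data a client would
    attach to the uploaded file.
    """
    # Guess the MIME type from the file extension; fall back to a generic
    # binary type when the extension is not recognized.
    mime_type, _ = mimetypes.guess_type(path)
    if mime_type is None:
        mime_type = "application/octet-stream"

    fields = {
        "language": language,            # source document language (assumed name)
        "mime_type": mime_type,          # source document MIME type (assumed name)
        "output_format": output_format,  # e.g. "html" for web display (assumed name)
    }
    if template is not None:
        fields["template"] = template    # desired output template (assumed name)

    return {"fields": fields, "filename": Path(path).name}


request = build_extraction_request("reports/q3-summary.pdf", language="en")
print(request["fields"])
```

A real client would pass these fields, together with the file contents, to whatever upload method the service documents.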