The service analyzes and extracts data from a number of popular application file types, including PDF and the Microsoft Office formats Word, Excel, and PowerPoint. It also extracts text or other data from scanned documents.
API methods support uploading a source document file in one of the recognized formats, along with a specification of the data to be exported. In addition to PDF and MS Office formats, methods can process OpenOffice and WordPerfect documents, text files, .rtf files, and a number of common image file formats. Requests also specify the source document's language and MIME type, the desired output template, and other technical parameters. Methods generate output in the requested format, such as HTML for web display.
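As a rough illustration of the request parameters described above, the following Python sketch assembles the metadata that might accompany an uploaded document. It is not the service's actual API: the function name `build_extraction_request`, the field names (`filename`, `mime_type`, `language`, `output_format`, `template`), and the `SUPPORTED_OUTPUTS` set are all assumptions made for this example. Only the standard-library `mimetypes` module is used, and no network call is made.

```python
import mimetypes
from pathlib import Path

# Hypothetical list of output formats; the description above names only HTML.
SUPPORTED_OUTPUTS = {"html"}


def build_extraction_request(path, language, output_format="html", template=None):
    """Assemble illustrative request metadata for one source document.

    All field names are assumptions for this sketch, not the service's schema.
    """
    filename = Path(path).name

    # Guess the MIME type from the file extension; fall back to a generic type.
    mime_type, _encoding = mimetypes.guess_type(filename)
    if mime_type is None:
        mime_type = "application/octet-stream"

    if output_format not in SUPPORTED_OUTPUTS:
        raise ValueError(f"unsupported output format: {output_format!r}")

    return {
        "filename": filename,
        "mime_type": mime_type,
        "language": language,
        "output_format": output_format,
        "template": template,
    }


request = build_extraction_request("reports/q3.pdf", language="en")
print(request)
```

In a real integration, these values would be sent together with the file contents, using whatever parameter names and transport the service's reference documentation defines.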