The Captricity Python Sample Code by Captricity demonstrates how to interact with the API, providing list jobs and upload instances as available examples. Developers can integrate data extraction into applications with resources available at https://shreddr.captricity.com/developer/overview/
Named-Entity Recognition involves identifying an entity in a text and assigning it a class label. These classifications can include People, Locations, and Organisations, among others depending on the tool. This comparison looks at the performance of 10 natural language processing APIs.
Diffbot is one of the coolest new ideas on the internet. The service brings monitoring to the web in a new and interesting way. The company is just about to release a whole slew of projects built on its collection of Diffbot APIs for following changes to web pages and RSS, as well as extracting clean text from websites.
Diffbot has come out of beta announcing the APIs ability to extract content from sites that fit into two page-types: article and front page. The Diffbot engine can determine, just by rendering and looking at a page, what type of page it is. Is it an article or a front page news site? Maybe it's a profile page from a social network. Diffbot’s artificial brain has been literally trained to know the difference. Developers can make 50,000 calls to the Diffbot API per month for free with additional calls available for fractions of a cent. This pricing should encourage wide adoption and experimentation.