The Diffbot Article API extracts clean article text from article web pages. It is especially powerful when combined with the permalinks extracted by the above APIs. The API accepts any news story page as input. Statistical machine learning algorithms run over all of the visual elements on the page to extract the article text and associated metadata, such as its images, videos, and tags. If the article spans multiple pages, Diffbot follows the next-page links to retrieve the whole article. There is also experimental support for extracting reader comments.
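As a rough illustration of the flow described above (send a news story URL, get back the article text and metadata), here is a minimal Python sketch. Nothing in it comes from this profile page: the endpoint URL, the `token` and `url` query parameters, and the response fields (`objects`, `title`, `text`, `images`, `tags`) are assumptions and should be checked against Diffbot's own documentation. The helper names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: endpoint and parameter names are illustrative, not taken from
# this page. Verify them against Diffbot's documentation before use.
ARTICLE_ENDPOINT = "https://api.diffbot.com/v3/article"


def build_article_request_url(token, page_url, endpoint=ARTICLE_ENDPOINT):
    """Return the GET URL for one article page, with URL-encoded parameters."""
    query = urlencode({"token": token, "url": page_url})
    return f"{endpoint}?{query}"


def summarize_article(payload):
    """Pick a few fields out of a decoded JSON response (assumed shape)."""
    objects = payload.get("objects") or [{}]
    article = objects[0]
    return {
        "title": article.get("title"),
        "text": article.get("text"),
        "image_urls": [img.get("url") for img in article.get("images", [])],
        "tags": [tag.get("label") for tag in article.get("tags", [])],
    }


def fetch_article(token, page_url, timeout=30):
    """Call the API over HTTP and return the summarized article fields."""
    request_url = build_article_request_url(token, page_url)
    with urlopen(request_url, timeout=timeout) as response:
        return summarize_article(json.load(response))
```

Calling `fetch_article("YOUR_TOKEN", "http://example.com/story")` would perform the network request. The URL building and response parsing are kept in separate functions so they can be exercised without an API token.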

©ProgrammableWeb.com 2013. All rights reserved.