Topicalizer is a text analysis and topic extraction tool. Based on methods from computational linguistics, it provides various analyses for a given URL or plain text. These include language recognition, lexical density, keywords, collocations, word and phrase frequencies, readability, and a short abstract. Topicalizer can be used, for instance, to tag a website or blog entry automatically with semantic information or to retrieve summaries of web pages.
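To make two of the listed analyses concrete, here is a minimal Python sketch of word frequencies and lexical density. It does not call Topicalizer and is not its implementation: the function names, the tiny function-word list, and the definition of lexical density as the share of non-function words are all assumptions chosen for illustration.

```python
import re
from collections import Counter

# Tiny illustrative list of function words (an assumption; real tools
# rely on much larger lists or on part-of-speech tagging).
FUNCTION_WORDS = {
    "a", "an", "the", "and", "or", "of", "to", "in", "on",
    "for", "is", "are", "it", "with", "be", "can",
}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(text):
    """Count how often each token occurs in the text."""
    return Counter(tokenize(text))


def lexical_density(text):
    """Return the share of tokens that are not function words.

    This is one common definition, assumed here for illustration.
    """
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    content = [t for t in tokens if t not in FUNCTION_WORDS]
    return len(content) / len(tokens)
```

Calling `word_frequencies(text).most_common(5)` gives a rough keyword list, and `lexical_density(text)` gives a value between 0 and 1, where higher values suggest more content-bearing words.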
The Topicalizer API now offers another method, getSemWeb, which builds a web of related terms for a given term by making use of the Google API and Wikipedia.
The Topicalizer API now has another new method, getCoOccurrences, which makes use of large corpora of different text categories to guess which words co-occur most frequently with a given text.
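The idea of counting co-occurrences in a corpus can be sketched locally in Python. This is not the getCoOccurrences method itself: the function name `co_occurrences`, the whitespace tokenization, and the fixed window size are hypothetical choices made for illustration only.

```python
from collections import Counter


def co_occurrences(corpus, term, window=2):
    """Count words appearing within `window` tokens of `term`.

    `corpus` is an iterable of sentences (plain strings). Tokenization
    is a simple whitespace split, which is an assumption of this sketch.
    """
    counts = Counter()
    term = term.lower()
    for sentence in corpus:
        tokens = sentence.lower().split()
        for i, token in enumerate(tokens):
            if token != term:
                continue
            start = max(0, i - window)
            end = min(len(tokens), i + window + 1)
            for j in range(start, end):
                if j != i:
                    counts[tokens[j]] += 1
    return counts
```

With a larger corpus, `co_occurrences(corpus, "api").most_common(10)` would list the words seen most often near the term.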