ACHE by New York University provides web crawler capabilities capable of collecting web pages that satisfy domains or user-specified patterns. This service uses page classifiers to distinguish between relevant and irrelevant pages in a given domain. The REST API can be utilized to retrieve crawler metrics, and effectuate crawler actions.
Twelve APIs have been added to the ProgrammableWeb directory in categories including Artificial Intelligence, Data Mining, and Auto. Some highlights are an API that returns data about Superheros from multiple sources, and an API that uses machine learning for crawling the web.