An upcoming PDF Liberation Hackathon aims to build developers' skills in unlocking data from PDF sources. The hackathon will be held onsite in Washington, D.C., and in San Francisco on Jan. 17-19, 2014, and international developers can also compete remotely. ProgrammableWeb spoke with organizer Marc Joffe about how API-focused developers can participate.
Getting structured data out of web pages, often referred to as "web scraping," is a real need, particularly for people whose job is to prepare and analyze the information available in those pages. Data extraction tools such as Import.io are built to meet this need.
One of the more interesting views the directory offers is which sectors are seeing the most growth in APIs. The directory data model allows for one primary category as well as multiple secondary categories, and in this article we look at which categories are most represented.