In a well-received move, major British newspaper The Guardian has opened up access to its content and data sets to third parties. The Guardian’s Open Platform comprises both an API (our new Guardian API Profile) and a Data Store that provide access to multimedia content and data sets respectively. The Content API includes approximately 1,000,000 articles that go as far back as 1999 and in some cases much further back.
As stated in The Guardian’s Open Platform Blog:
We’ve opened up our platform to give access to our journalism, our brand, and the technologies that power guardian.co.uk. The first two services we are releasing as part of the platform are the Content API and the Data Store.
The Content API is a service that allows partners to select and collect content from the Guardian.
The Data Store is a directory of useful data curated by Guardian journalists.
And in contrast to some other media company offerings the Guardian is providing the full text of their content, both here in the API and in their RSS feeds. And from a business model perspective, it’s an interesting play. PaidContent:UK’s good coverage has a few relevant points about this:
Open Platform is currently in beta and is being released on a limited basis. Although the service is free, developers interested in using the API and Data Store will need to register for an API Key. The RESTFUL API provides several methods for retrieving various types of content, including text and multimedia, and results are provided in XML, JSON or ATOM.
The Guardian has set up several resources for developers including documentation, a Google Group, and an FAQ page. Developers should take note that the API is limited to 5,000 daily requests and ads may be included in pages that display full content.
This is a nice addition to our API directory, and certainly this move by The Guardian highlights an emerging trend by traditional media providers, such as The New York Times and NPR, towards opening up content for integration by third parties. Be sure to check out Simon Willison, ReadWriteWeb, PC World, TechCrunch, and BuzzMachine.