17. Url Combiners
A Logical AND
Almost any combination from before
technology/internet+profile/bobbiejohnson
theguardian/technologyguardian+technology/internet
Except things that aren't actual tags:
Dated newspaper sections
theguardian/2009/jun/11... + anything = 404
the all page
.../all + anything = 404
18. Problems with this approach
Fragile
No Discovery
Not well documented
Copyright Issues
20. Open Platform
Opening up how we work with people both internally
and externally
A suite of services enabling partners to build
applications with the Guardian
Content API
Data Store
21. Data Store
A directory of useful data curated by Guardian editors
22. Data Store
A directory of useful data curated by Guardian editors
32. So why?
guardian.co.uk has an amazing amount of
quality content
Incredible amounts of meta-data, curated by
editors
Aim to allow the guardian to become a rich
source of facts and journalism for the web
33. How?
Backed of our search platform
Provides access to 10 years of article content and
metadata
Supports multiple output formats: XML, JSON, ATOM
Supported free text search across content
Search for keywords
Guardian supported api projects in Java, PHP, Python
and Ruby
Community supported api projects in Perl, ActionScript
and Coldfusion (and probably more?)
36. How free is free?
You can publish full articles from the guardian on your
website
5k queries per day limit
24 hour maximum cache lifetime
Online support
Partner with us on advertising
37. Beta trial
Limited number of keys
Collecting feedback
Will open more widely at end of beta program
38. How do I use it?
Home page
http://www.guardian.co.uk/open-platform
Sign up for an API key at
http://guardian.mashery.com
Use the API Explorer at
http://api.guardianapis.com/docs/
Use the python library at
http://code.google.com/p/openplatform-python/
39. What can I do with it?
Search our tag hierarchy
Find content by tag
Find content by search terms
Display on your website!
46. Using the Guardian API
Getting all articles about an MP
results = client.search(q='"%S"' % (name))
returns a paginating iterator
for x in results:
print x['headline']
Shows only first 10 by default
56. Public API Key?
jbynv3fwdp8ju5625mt2axw3
Only 5k queries a day
Will be closed on Friday 14th August
Will be closed if abused
Strongly encourage you to sign up for your own key