16. Url Combiners
A Logical AND
Almost any combination from before
technology/internet+profile/bobbiejohnson
theguardian/technologyguardian+technology/internet
Except things that aren't actual tags:
Dated newspaper sections
theguardian/2009/jun/11... + anything = 404
the all page
.../all + anything = 404
17. Problems with this approach
Fragile
No Discovery
Not well documented
Copyright Issues
19. Open Platform
Opening up how we work with people both internally
and externally
A suite of services enabling partners to build
applications with the Guardian
Content API
Data Store
20. Data Store
A directory of useful data curated by Guardian editors
21. Data Store
A directory of useful data curated by Guardian editors
31. So why?
guardian.co.uk has an amazing amount of
quality content
Incredible amounts of meta-data, curated by
editors
Aim to allow the guardian to become a rich
source of facts and journalism for the web
32. How?
Backed of our search platform
Provides access to 10 years of article content
and metadata
Supports multiple output formats: XML, JSON,
ATOM
Supported free text search across content
Search for keywords
Guardian supported api projects in Java, PHP,
Python and Ruby
Community supported api projects in Perl,
ActionScript and Coldfusion (and probably
more?)
35. How free is free?
You can publish full articles from the guardian on your
website
5k queries per day limit
24 hour maximum cache lifetime
Online support
Partner with us on advertising
36. Beta trial
Limited number of keys
Collecting feedback
Will open more widely at end of beta program
37. How do I use it?
Home page
http://www.guardian.co.uk/open-platform
Sign up for an API key at
http://guardian.mashery.com
Use the API Explorer at
http://api.guardianapis.com/docs/
Use the python library at
http://code.google.com/p/openplatform-python/
38. What can I do with it?
Search our tag hierarchy
Find content by tag
Find content by search terms
Display on your website!
45. Using the Guardian API
Getting all articles about an MP
results = client.search(q='"%S"' % (name))
returns a paginating iterator
for x in results:
print x['headline']
Shows only first 10 by default
55. Public API Key?
jbynv3fwdp8ju5625mt2axw3
Only 5k queries a day
Will be closed on Monday 6th July
Will be closed if abused
Strongly encourage you to sign up for your own key