Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Combining data through standards and metrics by Mike Thacker
1.
2. Mike Thacker – Porism Limited
Open Data
COMBINING DATA THROUGH STANDARDS
AND METRICS
Combining data through standards and
metrics
2
3. WHAT WE MEAN BY STANDARDS
Combining data through standards and
metrics 3
4. Making local data useful
• Municipalities need to be compare with one another
• Innovators need to build apps cost-effectively
Hence we offer
standards for:
• what we call things
• format of open data
5. Defining datasets and linking via URIs
Schemas
Define
structure of
Datasets
Contain
Data
items
Local
authorities
Official
geographies
Neighbour-hoods
Services
grouped
by function
Other eg:
• Planning
categories
• Entertainment
types
7. URIs - Uniform Resource Identifiers
Codes that give precise definitions of things, eg:
• http://id.esd.org.uk/service/860 – premises licence service
• http://opendatacommunities.org/id/unitary-authority/york
York City Council
• http://statistics.data.gov.uk/id/statistical-geography/
E06000014 York area - from ONS
These normally resolve to descriptions with properties
that are human and machine readable
11. STANDARDS FOR DATA FORMATS - SCHEMAS
Combining data through standards and
metrics 11
12. The inventory schema
Inventory
Dataset Documents
ODF, PDF, HTML
Data
CSV, XML, ...
Dataset Documents
ODF, PDF , HTML
Data
CSV, XML, ...
inventory.esd.org.uk
• Indexes datasets & their schemas against functions & services
• Automatically harvested by data.gov.uk
• Automatically output by DataShare DKAN and CKAN following
• Can be uploaded to and validated by esd-toolkit
13. Dataset schemas
• Define the structure of data for a service or function (group of
services)
• Shared to allow (not mandate) consistency
• Some validated schemas encouraged
• Formats:
– Tabular: DataShare definitions and CSV validation files as
used by the ODI’s csvlint.io
– XML
– Linked data profiles
21. Metrics structure
Metric
type
Metric
value
Date/
Time
Organis
ation
has
Dimensions Area
Examples
• Miles of roads
• Road accidents
• Spending on roads
Example
• Norfolk County
Council
EoE £573.2M
England £208.8M
2012 6,142.5
£75.3M
Example
• Norfolk administrative area
• East of England
• England
At a point
or
Over a period
Examples
• Age
• Gender
• Severity
183,003.4
2010-12
353
24,027.8 2,473
21,534
2011/12
has
has has has
governs
25. Further information
• UK local government open data: http://opendata.esd.org.uk/
• Standards: http://standards.esd.org.uk/
• Metric types http://id.esd.org.uk/list/metricTypes
• Dimensions (Circumstances) http://id.esd.org.uk/list/circumstances
• Municipalities http://opendatacommunities.org/data/local-authorities
• Administrative areas http://statistics.data.gov.uk/doc/statistical-geography/
• Natural neighbourhoods: http://neighbourhoods.esd.org.uk/
• Tools http://about.esd.org.uk/
• API: http://api.esd.org.uk/
Mike.Thacker@Porism.com
@MikeThacker
Notas del editor
Local government is different from central government when it comes to providing open data in that hundreds of councils provide the same type of data. It helps councils to be able to compare with one another. We want to encourage innovators (eg SMEs, interest groups) to develop applications that make use of the data, but it is not cost-effective for them to do so differently for every council. Hence standard means or offered for referring to elements of data and standard ways of sharing them.
Little is mandated but you make it easier for others to consume your data of you use the same references and same formats as other councils.
Standard classifications, in the form of sets of URIs, are important so everyone using the data understands the terms used and these terms have precise meaning. URI sets of particular importance to local authorities are from:
DCLG’s Open data Communities
ONS, published via statistics.data.gov.uk
our esd-standards and natural neighbourhoods (developed by Cheshire East) sites
URIs are used to:
Classify schemas
define the scope of datasets
- define teh scope of induividual data items within datasets
Our Open data URIs page lets you do a text search across important URI sets to find a specific URI.
We also provide a facility for councils to add their own local “alternative labels” against URIs and an API that converts a local label back to the official URI. The facility is used in the incentive scheme if, for example, a council has its own name for a type of planning application which it needs to cross-reference with the official definition.
As always, Services are central to our work in local government. The current model means we have as open data these categories of information for some or all services in the Local Government Services List.
The Services List is under continual review. A few changes were made for more precise definition of planning services referenced in the incentive scheme.
Record retention guidance data is changed as we and Kent County Council get feedback from users and as legislation changes. The last update was in June and a further one is scheduled for March.
data.gov.uk have in beta a “harvester” that automatically reads council inventories and updates their entries in the data.gov.uk catalogue.
We have implemented a similar harvester in our page where you can register your inventory for automatic harvesting.
The datasets page lets you see what local authority datasets exist for specified functions and services. You can filter by dataset format and can also see links to local documentation on the datasets.
A well as reading details of open datasets from inventories, we also read them from submissions under the local open data incentive scheme.
Note that these submissions are for live data feeds, but many are for static files, so a different address applies to each week’s, month’s or year’s data. Hence we really need an automatically maintained inventory to be sure of picking up the latest datasets.