SlideShare una empresa de Scribd logo
1 de 55
CKAN Tutorial 
@ odw2013 
Presenter: 李承錱 Cheng-Jen Lee (Sol) 
Email: cjlee AT iis.sinica.edu.tw 
This work is licensed under a 
Creative Commons Attribution-ShareAlike 3.0 Taiwan License.
2013/11/9 
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Installation 
● Harvesters 
● Issues
2013/11/9 
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Installation 
● Harvesters 
● Issues
2013/11/9 
About CKAN 
● The Comprehensive Knowledge Archive 
Network 
● A powerful data management system 
● Publishing 
● Sharing 
● Finding 
● Using Data
2013/11/9 
About CKAN 
61 instances
2013/11/9 
CKAN 2
2013/11/9 
Feature Tour
2013/11/9 
Demo Site 
demo.ckan.org
2013/11/9 
Feature Tour (1) 
Publish & Find Datasets 
Add Dataset Basic 
Information
2013/11/9 
Feature Tour (1) 
Publish & Find Datasets 
Add Data 
Under the Dataset
2013/11/9 
Feature Tour (1) 
Publish & Find Datasets 
Add Metadata 
About the Dataset
2013/11/9 
Feature Tour (1) 
Publish & Find Datasets 
Filter By 
Keywords
2013/11/9 
Feature Tour (1) 
Publish & Find Datasets 
Filter By 
Geographical Features
2013/11/9 
Feature Tour (2) 
Store & Manage Data 
Data Explorer: 
recline_preview (csv, xls) 
json_preview 
pdf_preview 
ckanext-spatial
2013/11/9 
Feature Tour (2) 
Store & Manage Data 
Graphing data
2013/11/9 
Feature Tour (3) 
Engage with Users & Others 
Share
2013/11/9 
Feature Tour (3) 
Engage with Users & Others 
Organization
2013/11/9 
Feature Tour (3) 
Engage with Users & Others 
Manage Users of 
an Organization
2013/11/9 
Feature Tour (3) 
Engage with Users & Others 
Manage Role of Members 
Admin: edit datasets & members 
Editor: edit datasets 
Viewer: view (private) datasets 
Note: Public datasets are visible to 
everyone
2013/11/9 
Feature Tour (3) 
Engage with Users & Others 
Harvest and Federation
2013/11/9 
Feature Tour (3) 
Engage with Users & Others 
History
2013/11/9 
Feature Tour (4) 
Customise & Extend 
● RESTful JSON APIs 
– The Action API 
– The DataStore API 
– The FileStore API... 
● Extensions (over 60) 
– ckanext-harvest 
– ckanext-spatial 
Open source is good! 
● Themable 
● Integrates with other CMS (ex. Drupal)
CKAN and 5 ★ Open Data 
2013/11/9
CKAN and 5 ★ Open Data 
● ★ Make your stuff available on the Web (whatever 
format) under an open license 
2013/11/9
CKAN and 5 ★ Open Data 
● ★★ Make it available as structured data (e.g., 
Excel instead of image scan of a table) 
★★★ Use non-proprietary formats (e.g., CSV 
instead of Excel) 
– Accept any data format 
– Beautiful data demonstration 
2013/11/9
CKAN and 5 ★ Open Data 
● ★★★★ Use URIs to denote things, so that people 
can point at your stuff 
– Permanent link for each dataset 
– Get Dataset URI through API 
● ★★★★★ Link your data to other data to provide 
context 
– Linked data and RDF for metadata 
2013/11/9
CKAN and 5 ★ Open Data 
● RDF for metadata 
– DCAT and Dublin Core 
– curl -L -H "Accept:application/rdf+xml" 
http://thedatahub.org/dataset/gold-prices 
2013/11/9
2013/11/9 
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Installation 
● Harvesters 
● Issues
2013/11/9 
United Kingdom 
DATA.GOV.UK
2013/11/9 
United States 
DATA.GOV
2013/11/9 
Brazil 
DADOS.GOV.BR
2013/11/9 
European Union 
PUBLICDATA.EU
2013/11/9 
Geospatial Data Explorer: 
Lat/Long field
2013/11/9 
Geospatial Data Explorer: 
GeoJSON
2013/11/9 
Geospatial Data Explorer: 
WMS
2013/11/9 
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Installation 
● Harvesters 
● Issues
2013/11/9 
System Architecture
What You Should Know 
● Python, Pylons 
● ckan plugins toolkit 
● SQLAlchemy, SQL 
● HTML, JavaScript 
● Babel 
● Web Server (Nginx, uWSGI...) 
2013/11/9
2013/11/9 
Install from Source 
● Virtual environment 
● Checkout the source (via GIT) 
– https://github.com/okfn/ckan 
● Create a CKAN config file 
● Setup Jetty & Solr 
● Initialize Database (user, db) 
● Link to who.ini 
● Create a Sysadmin User 
● Deployment (nginx + uWSGI) 
● Install other extensions...
2013/11/9 
Installation Notes 
● https://ckan-docs-tw.readthedocs.org/
2013/11/9 
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Installation 
● Harvesters 
● Issues
2013/11/9 
Harvesters 
● ckanext-harvest 
– Remote harvesting extension 
– https://github.com/okfn/ckanext-harvest 
● Source Type 
– CKAN (built-in) 
– CSW 
– WAF 
– Custom (csv/xls/website… etc)
2013/11/9 
Harvested from TGOS 
CSW service
2013/11/9 
Harvesters 
http://Mydomain.com/harvest
2013/11/9 
Harvesters 
Add a new harvest source
2013/11/9 
Harvesters 
Create a harvest job
2013/11/9 
Harvesters 
Overview of harvested datasets
2013/11/9 
Harvesters 
Background Process 
● Manually 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester gather_consumer -c 
/etc/ckan/default/production.ini 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester fetch_consumer -c 
/etc/ckan/default/production.ini 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester run -c 
/etc/ckan/default/production.ini
2013/11/9 
Harvesters 
Background Process 
● Automatically 
– Supervisor (for gather & fetch consumer) 
– Cron (for run)
2013/11/9 
Harvesters 
Custom harvester 
● Implement the harvester interface 
to perform harvesting operations 
● Three stages 
● gather: get the identification 
● fetch: fetch the contents 
● import: create ckan package(dataset)
2013/11/9 
Harvesters 
The harvesting interface 
from base import HarvesterBase 
class SRDAHarvester(HarvesterBase): 
def _set_config(self,config_str): 
def info(self): 
... 
def gather_stage(self, harvest_job): 
def fetch_stage(self, harvest_object): 
def import_stage(self, harvest_object): 
See the extension site for details 
An example (SRDA): http://goo.gl/ZMnND7
2013/11/9 
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Installation 
● Harvesters 
● Issues
2013/11/9 
Issues 
● CJK Support 
– CJK Search 
– Some broken translations 
– Frequent updates 
● Extensions compatibilities 
● Tons of tweaks needed 
● Performance Issue 
● Complicated Architecture
● Official Documents: 
– http://docs.ckan.org/en/latest/ 
● Installation Notes (in Chinese): 
– https://ckan-docs-tw.readthedocs.org/ 
● CKAN Development Discussions: 
– http://lists.okfn.org/mailman/listinfo/ckan-dev 
● CKAN Taiwan Interest Group: 
– https://groups.google.com/forum/#!forum/ckan-taiwan-interest-group 
2013/11/9 
Resources
Thanks for your attention! 
2013/11/9 
Any Q? 
Email: u10313335 AT citi.sinica.edu.tw 
http://about.me/sollee

Más contenido relacionado

La actualidad más candente

DataFusion-and-Arrow_Supercharge-Your-Data-Analytical-Tool-with-a-Rusty-Query...
DataFusion-and-Arrow_Supercharge-Your-Data-Analytical-Tool-with-a-Rusty-Query...DataFusion-and-Arrow_Supercharge-Your-Data-Analytical-Tool-with-a-Rusty-Query...
DataFusion-and-Arrow_Supercharge-Your-Data-Analytical-Tool-with-a-Rusty-Query...
aiuy
 

La actualidad más candente (20)

Powering a Graph Data System with Scylla + JanusGraph
Powering a Graph Data System with Scylla + JanusGraphPowering a Graph Data System with Scylla + JanusGraph
Powering a Graph Data System with Scylla + JanusGraph
 
Graph Analytics with ArangoDB
Graph Analytics with ArangoDBGraph Analytics with ArangoDB
Graph Analytics with ArangoDB
 
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
 
Building a Data Pipeline using Apache Airflow (on AWS / GCP)
Building a Data Pipeline using Apache Airflow (on AWS / GCP)Building a Data Pipeline using Apache Airflow (on AWS / GCP)
Building a Data Pipeline using Apache Airflow (on AWS / GCP)
 
Creating data apps using Streamlit in Python
Creating data apps using Streamlit in PythonCreating data apps using Streamlit in Python
Creating data apps using Streamlit in Python
 
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
 
JupyterHub: Learning at Scale
JupyterHub: Learning at ScaleJupyterHub: Learning at Scale
JupyterHub: Learning at Scale
 
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
 
Data catalog
Data catalogData catalog
Data catalog
 
Accelerated Training of Transformer Models
Accelerated Training of Transformer ModelsAccelerated Training of Transformer Models
Accelerated Training of Transformer Models
 
DataFusion-and-Arrow_Supercharge-Your-Data-Analytical-Tool-with-a-Rusty-Query...
DataFusion-and-Arrow_Supercharge-Your-Data-Analytical-Tool-with-a-Rusty-Query...DataFusion-and-Arrow_Supercharge-Your-Data-Analytical-Tool-with-a-Rusty-Query...
DataFusion-and-Arrow_Supercharge-Your-Data-Analytical-Tool-with-a-Rusty-Query...
 
Orchestrating workflows Apache Airflow on GCP & AWS
Orchestrating workflows Apache Airflow on GCP & AWSOrchestrating workflows Apache Airflow on GCP & AWS
Orchestrating workflows Apache Airflow on GCP & AWS
 
Full Stack Graph in the Cloud
Full Stack Graph in the CloudFull Stack Graph in the Cloud
Full Stack Graph in the Cloud
 
RubiX
RubiXRubiX
RubiX
 
Koalas: Making an Easy Transition from Pandas to Apache Spark
Koalas: Making an Easy Transition from Pandas to Apache SparkKoalas: Making an Easy Transition from Pandas to Apache Spark
Koalas: Making an Easy Transition from Pandas to Apache Spark
 
KFServing and Feast
KFServing and FeastKFServing and Feast
KFServing and Feast
 
Importing Data into Neo4j quickly and easily - StackOverflow
Importing Data into Neo4j quickly and easily - StackOverflowImporting Data into Neo4j quickly and easily - StackOverflow
Importing Data into Neo4j quickly and easily - StackOverflow
 
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
 
Evolution from EDA to Data Mesh: Data in Motion
Evolution from EDA to Data Mesh: Data in MotionEvolution from EDA to Data Mesh: Data in Motion
Evolution from EDA to Data Mesh: Data in Motion
 
Making Acquisitions Open to All: Alma interested users as a service for patro...
Making Acquisitions Open to All: Alma interested users as a service for patro...Making Acquisitions Open to All: Alma interested users as a service for patro...
Making Acquisitions Open to All: Alma interested users as a service for patro...
 

Destacado

ckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sourcesckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sources
Chengjen Lee
 

Destacado (12)

Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
 
ODN - Technical introduction of the platform
ODN - Technical introduction of the platformODN - Technical introduction of the platform
ODN - Technical introduction of the platform
 
Data Management Systems for Government Agencies - with CKAN
Data Management Systems for Government Agencies - with CKANData Management Systems for Government Agencies - with CKAN
Data Management Systems for Government Agencies - with CKAN
 
Slides Steffen Becker, "Code for Europe Workshop"
Slides Steffen Becker, "Code for Europe Workshop"Slides Steffen Becker, "Code for Europe Workshop"
Slides Steffen Becker, "Code for Europe Workshop"
 
Getting to Know CKAN, 24 June 2015, Singapore
Getting to Know CKAN, 24 June 2015, SingaporeGetting to Know CKAN, 24 June 2015, Singapore
Getting to Know CKAN, 24 June 2015, Singapore
 
Ckan foo - CKAN Association overview at CKANcon 2015, Ottawa
Ckan foo - CKAN Association overview at CKANcon 2015, OttawaCkan foo - CKAN Association overview at CKANcon 2015, Ottawa
Ckan foo - CKAN Association overview at CKANcon 2015, Ottawa
 
DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drup...
DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drup...DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drup...
DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drup...
 
Cloud Asia presentation in Singapore, 29 October 2015
Cloud Asia presentation in Singapore, 29 October 2015Cloud Asia presentation in Singapore, 29 October 2015
Cloud Asia presentation in Singapore, 29 October 2015
 
ckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sourcesckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sources
 
Global - Local Open Data Index
Global - Local Open Data Index Global - Local Open Data Index
Global - Local Open Data Index
 
CKAN by Friedrich Lindenberg
CKAN by Friedrich LindenbergCKAN by Friedrich Lindenberg
CKAN by Friedrich Lindenberg
 
Minicurso de CKAN
Minicurso de CKANMinicurso de CKAN
Minicurso de CKAN
 

Similar a Ckan tutorial odw2013 131109

ckan 2.0 Introduction (20140618 updated)
ckan 2.0 Introduction (20140618 updated)ckan 2.0 Introduction (20140618 updated)
ckan 2.0 Introduction (20140618 updated)
Chengjen Lee
 
ckan 2.0 Introduction (20140522 updated)
ckan 2.0 Introduction  (20140522 updated)ckan 2.0 Introduction  (20140522 updated)
ckan 2.0 Introduction (20140522 updated)
Chengjen Lee
 
ckan 2.0 Introduction
ckan 2.0 Introductionckan 2.0 Introduction
ckan 2.0 Introduction
Chengjen Lee
 
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin BuzzwordsDataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
scorlosquet
 
Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012
scorlosquet
 
Proud to be polyglot!
Proud to be polyglot!Proud to be polyglot!
Proud to be polyglot!
NLJUG
 

Similar a Ckan tutorial odw2013 131109 (20)

ckan 2.0 Introduction (20140618 updated)
ckan 2.0 Introduction (20140618 updated)ckan 2.0 Introduction (20140618 updated)
ckan 2.0 Introduction (20140618 updated)
 
ckan 2.0 Introduction (20140522 updated)
ckan 2.0 Introduction  (20140522 updated)ckan 2.0 Introduction  (20140522 updated)
ckan 2.0 Introduction (20140522 updated)
 
ckan 2.0 Introduction
ckan 2.0 Introductionckan 2.0 Introduction
ckan 2.0 Introduction
 
Sparkler - Spark Crawler
Sparkler - Spark Crawler Sparkler - Spark Crawler
Sparkler - Spark Crawler
 
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin BuzzwordsDataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin Buzzwords
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
 
Introduction to Globus: Research Data Management Software at the ALCF
Introduction to Globus: Research Data Management Software at the ALCFIntroduction to Globus: Research Data Management Software at the ALCF
Introduction to Globus: Research Data Management Software at the ALCF
 
Using oracle12c pluggable databases to archive
Using oracle12c pluggable databases to archiveUsing oracle12c pluggable databases to archive
Using oracle12c pluggable databases to archive
 
OpenSearch.pdf
OpenSearch.pdfOpenSearch.pdf
OpenSearch.pdf
 
Drupal 7 and RDF
Drupal 7 and RDFDrupal 7 and RDF
Drupal 7 and RDF
 
Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012
 
Gluster overview & future directions vault 2015
Gluster overview & future directions vault 2015Gluster overview & future directions vault 2015
Gluster overview & future directions vault 2015
 
From Data Warehouse to Lakehouse
From Data Warehouse to LakehouseFrom Data Warehouse to Lakehouse
From Data Warehouse to Lakehouse
 
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
 
Gobblin @ NerdWallet (Nov 2015)
Gobblin @ NerdWallet (Nov 2015)Gobblin @ NerdWallet (Nov 2015)
Gobblin @ NerdWallet (Nov 2015)
 
Gluster.community.day.2013
Gluster.community.day.2013Gluster.community.day.2013
Gluster.community.day.2013
 
Hong Kong Drupal User Group - Sep 13th
Hong Kong Drupal User Group - Sep 13thHong Kong Drupal User Group - Sep 13th
Hong Kong Drupal User Group - Sep 13th
 
Proud to be polyglot!
Proud to be polyglot!Proud to be polyglot!
Proud to be polyglot!
 
Drupal and the Semantic Web - ESIP Webinar
Drupal and the Semantic Web - ESIP WebinarDrupal and the Semantic Web - ESIP Webinar
Drupal and the Semantic Web - ESIP Webinar
 
OSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo Seidel
OSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo SeidelOSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo Seidel
OSDC 2013 | Distributed Storage with GlusterFS by Dr. Udo Seidel
 

Más de Chengjen Lee (12)

Preserving Collaborative Documents in Contemporary Events
Preserving Collaborative Documents in Contemporary EventsPreserving Collaborative Documents in Contemporary Events
Preserving Collaborative Documents in Contemporary Events
 
Retooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.ioRetooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.io
 
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
 
CKANCon 2016 & IODC16
CKANCon 2016 & IODC16CKANCon 2016 & IODC16
CKANCon 2016 & IODC16
 
CKAN 技術介紹 (開發篇)
CKAN 技術介紹 (開發篇)CKAN 技術介紹 (開發篇)
CKAN 技術介紹 (開發篇)
 
CKAN 技術介紹 (基礎篇)
CKAN 技術介紹 (基礎篇)CKAN 技術介紹 (基礎篇)
CKAN 技術介紹 (基礎篇)
 
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
 
CKAN 應用介紹 - 以台江計畫為例
CKAN 應用介紹 - 以台江計畫為例CKAN 應用介紹 - 以台江計畫為例
CKAN 應用介紹 - 以台江計畫為例
 
Report 140227
Report 140227Report 140227
Report 140227
 
Report 140213
Report 140213Report 140213
Report 140213
 
Introduction to Pelican
Introduction to PelicanIntroduction to Pelican
Introduction to Pelican
 
ckan 2.0: a deeper look
ckan 2.0: a deeper lookckan 2.0: a deeper look
ckan 2.0: a deeper look
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Ckan tutorial odw2013 131109

  • 1. CKAN Tutorial @ odw2013 Presenter: 李承錱 Cheng-Jen Lee (Sol) Email: cjlee AT iis.sinica.edu.tw This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Taiwan License.
  • 2. 2013/11/9 Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Installation ● Harvesters ● Issues
  • 3. 2013/11/9 Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Installation ● Harvesters ● Issues
  • 4. 2013/11/9 About CKAN ● The Comprehensive Knowledge Archive Network ● A powerful data management system ● Publishing ● Sharing ● Finding ● Using Data
  • 5. 2013/11/9 About CKAN 61 instances
  • 8. 2013/11/9 Demo Site demo.ckan.org
  • 9. 2013/11/9 Feature Tour (1) Publish & Find Datasets Add Dataset Basic Information
  • 10. 2013/11/9 Feature Tour (1) Publish & Find Datasets Add Data Under the Dataset
  • 11. 2013/11/9 Feature Tour (1) Publish & Find Datasets Add Metadata About the Dataset
  • 12. 2013/11/9 Feature Tour (1) Publish & Find Datasets Filter By Keywords
  • 13. 2013/11/9 Feature Tour (1) Publish & Find Datasets Filter By Geographical Features
  • 14. 2013/11/9 Feature Tour (2) Store & Manage Data Data Explorer: recline_preview (csv, xls) json_preview pdf_preview ckanext-spatial
  • 15. 2013/11/9 Feature Tour (2) Store & Manage Data Graphing data
  • 16. 2013/11/9 Feature Tour (3) Engage with Users & Others Share
  • 17. 2013/11/9 Feature Tour (3) Engage with Users & Others Organization
  • 18. 2013/11/9 Feature Tour (3) Engage with Users & Others Manage Users of an Organization
  • 19. 2013/11/9 Feature Tour (3) Engage with Users & Others Manage Role of Members Admin: edit datasets & members Editor: edit datasets Viewer: view (private) datasets Note: Public datasets are visible to everyone
  • 20. 2013/11/9 Feature Tour (3) Engage with Users & Others Harvest and Federation
  • 21. 2013/11/9 Feature Tour (3) Engage with Users & Others History
  • 22. 2013/11/9 Feature Tour (4) Customise & Extend ● RESTful JSON APIs – The Action API – The DataStore API – The FileStore API... ● Extensions (over 60) – ckanext-harvest – ckanext-spatial Open source is good! ● Themable ● Integrates with other CMS (ex. Drupal)
  • 23. CKAN and 5 ★ Open Data 2013/11/9
  • 24. CKAN and 5 ★ Open Data ● ★ Make your stuff available on the Web (whatever format) under an open license 2013/11/9
  • 25. CKAN and 5 ★ Open Data ● ★★ Make it available as structured data (e.g., Excel instead of image scan of a table) ★★★ Use non-proprietary formats (e.g., CSV instead of Excel) – Accept any data format – Beautiful data demonstration 2013/11/9
  • 26. CKAN and 5 ★ Open Data ● ★★★★ Use URIs to denote things, so that people can point at your stuff – Permanent link for each dataset – Get Dataset URI through API ● ★★★★★ Link your data to other data to provide context – Linked data and RDF for metadata 2013/11/9
  • 27. CKAN and 5 ★ Open Data ● RDF for metadata – DCAT and Dublin Core – curl -L -H "Accept:application/rdf+xml" http://thedatahub.org/dataset/gold-prices 2013/11/9
  • 28. 2013/11/9 Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Installation ● Harvesters ● Issues
  • 32. 2013/11/9 European Union PUBLICDATA.EU
  • 33. 2013/11/9 Geospatial Data Explorer: Lat/Long field
  • 34. 2013/11/9 Geospatial Data Explorer: GeoJSON
  • 35. 2013/11/9 Geospatial Data Explorer: WMS
  • 36. 2013/11/9 Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Installation ● Harvesters ● Issues
  • 38. What You Should Know ● Python, Pylons ● ckan plugins toolkit ● SQLAlchemy, SQL ● HTML, JavaScript ● Babel ● Web Server (Nginx, uWSGI...) 2013/11/9
  • 39. 2013/11/9 Install from Source ● Virtual environment ● Checkout the source (via GIT) – https://github.com/okfn/ckan ● Create a CKAN config file ● Setup Jetty & Solr ● Initialize Database (user, db) ● Link to who.ini ● Create a Sysadmin User ● Deployment (nginx + uWSGI) ● Install other extensions...
  • 40. 2013/11/9 Installation Notes ● https://ckan-docs-tw.readthedocs.org/
  • 41. 2013/11/9 Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Installation ● Harvesters ● Issues
  • 42. 2013/11/9 Harvesters ● ckanext-harvest – Remote harvesting extension – https://github.com/okfn/ckanext-harvest ● Source Type – CKAN (built-in) – CSW – WAF – Custom (csv/xls/website… etc)
  • 43. 2013/11/9 Harvested from TGOS CSW service
  • 45. 2013/11/9 Harvesters Add a new harvest source
  • 47. 2013/11/9 Harvesters Overview of harvested datasets
  • 48. 2013/11/9 Harvesters Background Process ● Manually – (pyenv) $ paster --plugin=ckanext-harvest harvester gather_consumer -c /etc/ckan/default/production.ini – (pyenv) $ paster --plugin=ckanext-harvest harvester fetch_consumer -c /etc/ckan/default/production.ini – (pyenv) $ paster --plugin=ckanext-harvest harvester run -c /etc/ckan/default/production.ini
  • 49. 2013/11/9 Harvesters Background Process ● Automatically – Supervisor (for gather & fetch consumer) – Cron (for run)
  • 50. 2013/11/9 Harvesters Custom harvester ● Implement the harvester interface to perform harvesting operations ● Three stages ● gather: get the identification ● fetch: fetch the contents ● import: create ckan package(dataset)
  • 51. 2013/11/9 Harvesters The harvesting interface from base import HarvesterBase class SRDAHarvester(HarvesterBase): def _set_config(self,config_str): def info(self): ... def gather_stage(self, harvest_job): def fetch_stage(self, harvest_object): def import_stage(self, harvest_object): See the extension site for details An example (SRDA): http://goo.gl/ZMnND7
  • 52. 2013/11/9 Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Installation ● Harvesters ● Issues
  • 53. 2013/11/9 Issues ● CJK Support – CJK Search – Some broken translations – Frequent updates ● Extensions compatibilities ● Tons of tweaks needed ● Performance Issue ● Complicated Architecture
  • 54. ● Official Documents: – http://docs.ckan.org/en/latest/ ● Installation Notes (in Chinese): – https://ckan-docs-tw.readthedocs.org/ ● CKAN Development Discussions: – http://lists.okfn.org/mailman/listinfo/ckan-dev ● CKAN Taiwan Interest Group: – https://groups.google.com/forum/#!forum/ckan-taiwan-interest-group 2013/11/9 Resources
  • 55. Thanks for your attention! 2013/11/9 Any Q? Email: u10313335 AT citi.sinica.edu.tw http://about.me/sollee

Notas del editor

  1. 由 Open Knowledge Foundation 支持
  2. 由 Open Knowledge Foundation 支持
  3. 公布網址
  4. Data.gov.uk 即是 drupal+CKAN
  5. Store the raw data and metadata. Visualise structured data with interactive tables, graphs and maps.
  6. TGOS: 國土資訊圖資服務平台