SlideShare una empresa de Scribd logo
1 de 55
CKAN 2 Introduction 
Presenter: 李承錱 Cheng-Jen Lee (Sol) 
Email: cjlee AT iis.sinica.edu.tw 
This work is licensed under a 
Creative Commons Attribution-ShareAlike 3.0 Taiwan License.
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 2
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 3
About CKAN 
● The Comprehensive Knowledge Archive 
Network 
● A powerful data management system 
● Publishing 
● Sharing 
● Finding 
● Using Data 
2014/5/22 4
About CKAN 
83 instances 
around the world 
in May 2014 
2014/5/22 5
CKAN 2 
2014/5/22 6
Feature Tour 
2014/5/22 7
Demo Site 
demo.ckan.org 
2014/5/22 8
Feature Tour (1) 
Publish & Find Datasets 
Add Dataset Basic 
Information 
2014/5/22 9
Feature Tour (1) 
Publish & Find Datasets 
Add Data 
Under the Dataset 
2014/5/22 10
Feature Tour (1) 
Publish & Find Datasets 
Add Metadata 
About the Dataset 
2014/5/22 11
Feature Tour (1) 
Publish & Find Datasets 
Filter By 
Keywords 
2014/5/22 12
Feature Tour (1) 
Publish & Find Datasets 
Filter By 
Geographical Features 
2014/5/22 13
Feature Tour (2) 
Store & Manage Data 
Data Explorer: 
recline_preview (csv, xls) 
json_preview 
pdf_preview 
ckanext-spatial 
2014/5/22 14
Feature Tour (2) 
Store & Manage Data 
Graphing data 
2014/5/22 15
Feature Tour (3) 
Engage with Users & Others 
Share 
2014/5/22 16
Feature Tour (3) 
Engage with Users & Others 
Organization 
2014/5/22 17
Feature Tour (3) 
Engage with Users & Others 
Manage Users of 
an Organization 
2014/5/22 18
Feature Tour (3) 
Engage with Users & Others 
Manage Role of Members 
Admin: edit datasets & members 
Editor: edit datasets 
Viewer: view (private) datasets 
Note: Public datasets are visible to 
everyone 
2014/5/22 19
Feature Tour (3) 
Engage with Users & Others 
Harvest and Federation 
2014/5/22 20
Feature Tour (3) 
Engage with Users & Others 
History 
2014/5/22 21
Feature Tour (4) 
Customise & Extend 
● RESTful JSON APIs 
– The Action API 
– The DataStore API 
– The FileStore API... 
● Extensions (over 60) 
– ckanext-harvest 
– ckanext-spatial 
Open source is good! 
● Themable 
● Integrates with other CMS (ex. Drupal) 
2014/5/22 22
CKAN and 5 ★ Open Data 
2014/5/22 23
CKAN and 5 ★ Open Data 
● ★ Make your stuff available on the Web (whatever 
format) under an open license 
2014/5/22 24
CKAN and 5 ★ Open Data 
● ★★ Make it available as structured data (e.g., 
Excel instead of image scan of a table) 
★★★ Use non-proprietary formats (e.g., CSV 
instead of Excel) 
– Accept any data format 
– Beautiful data demonstration 
– Datastore : Indexing for structure data 
2014/5/22 25
CKAN and 5 ★ Open Data 
● ★★★★ Use URIs to denote things, so that people 
can point at your stuff 
– Permanent link for each dataset 
– Get Dataset URI through API 
● ★★★★★ Link your data to other data to provide 
context 
– Linked data and RDF for metadata 
2014/5/22 26
CKAN and 5 ★ Open Data 
● RDF for metadata 
– DCAT and Dublin Core 
– curl -L -H "Accept:application/rdf+xml" 
http://thedatahub.org/dataset/gold-prices 
2014/5/22 27
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 28
United Kingdom 
DATA.GOV.UK 
2014/5/22 29
United States 
DATA.GOV 
2014/5/22 30
Brazil 
DADOS.GOV.BR 
2014/5/22 31
European Union 
PUBLICDATA.EU 
2014/5/22 32
Geospatial Data Explorer: 
Lat/Long field 
2014/5/22 33
Geospatial Data Explorer: 
GeoJSON 
2014/5/22 34
Geospatial Data Explorer: 
WMS 
2014/5/22 35
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 36
Issues 
● CJK Support 
– CJK Search 
– Some broken translations 
– Frequent updates 
● Extensions compatibilities 
● Tons of tweaks needed 
● Performance Issue 
● Complicated Architecture 
2014/5/22 37
System Architecture 
2014/5/22 38
Issues (Cont'd) 
● What You Should Know 
– Python & Pylons 
– ckan plugins toolkit 
– SQLAlchemy & SQL 
– HTML, JavaScript 
– Babel 
– Web Server (Nginx, uWSGI...) 
2014/5/22 39
Resources 
● Official Documents: 
– http://docs.ckan.org/en/latest/ 
● Installation Notes (in Chinese): 
– https://ckan-docs-tw.readthedocs.org/ 
● CKAN Development Discussions: 
– http://lists.okfn.org/mailman/listinfo/ckan-dev 
● CKAN Taiwan Interest Group: 
– https://groups.google.com/forum/#!forum/ckan-taiwan-interest-group 
2014/5/22 40
Thanks for your attention! 
Any Q? 
Email: u10313335 AT citi.sinica.edu.tw 
http://about.me/sollee 
2014/5/22 41
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 42
Install from Source 
● Virtual environment 
● Checkout the source (via GIT) 
– https://github.com/okfn/ckan 
● Create a CKAN config file 
● Setup Jetty & Solr 
● Initialize Database (user, db) 
● Link to who.ini 
● Create a Sysadmin User 
● Deployment (nginx + uWSGI) 
● Install other extensions... 
2014/5/22 43
Installation Notes 
● https://ckan-docs-tw.readthedocs.org/ 
2014/5/22 44
Agenda 
● About CKAN 
● Feature Tour 
– Publish & Find Datasets 
– Store & Manage Data 
– Engage with Users & Others 
– Customise & Extend 
● CKAN and 5 ★ Open Data 
● Showcase 
● Issues 
● Extended Topic 1: Installation 
● Extended Topic 2: Harvesters 
2014/5/22 45
Harvesters 
● ckanext-harvest 
– Remote harvesting extension 
– https://github.com/okfn/ckanext-harvest 
● Source Type 
– CKAN (built-in) 
– CSW 
– WAF 
– Custom (csv/xls/website… etc) 
2014/5/22 46
Harvested from TGOS 
CSW service 
2014/5/22 47
Harvesters 
http://Mydomain.com/harvest 
2014/5/22 48
Harvesters 
Add a new harvest source 
2014/5/22 49
Harvesters 
Create a harvest job 
2014/5/22 50
Harvesters 
Overview of harvested datasets 
2014/5/22 51
Harvesters 
Background Process 
● Manually 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester gather_consumer -c 
/etc/ckan/default/production.ini 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester fetch_consumer -c 
/etc/ckan/default/production.ini 
– (pyenv) $ paster --plugin=ckanext-harvest 
harvester run -c 
/etc/ckan/default/production.ini 
2014/5/22 52
Harvesters 
Background Process 
● Automatically 
– Supervisor (for gather & fetch consumer) 
– Cron (for run) 
2014/5/22 53
Harvesters 
Custom harvester 
● Implement the harvester interface 
to perform harvesting operations 
● Three stages 
● gather: get the identification 
● fetch: fetch the contents 
● import: create ckan package(dataset) 
2014/5/22 54
Harvesters 
The harvesting interface 
from base import HarvesterBase 
class SRDAHarvester(HarvesterBase): 
def _set_config(self,config_str): 
def info(self): 
... 
def gather_stage(self, harvest_job): 
def fetch_stage(self, harvest_object): 
def import_stage(self, harvest_object): 
See the extension site for details 
An example (SRDA): http://goo.gl/ZMnND7 
2014/5/22 55

Más contenido relacionado

La actualidad más candente

ckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sourcesckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sourcesChengjen Lee
 
CKAN - the open source data portal platform
CKAN - the open source data portal platformCKAN - the open source data portal platform
CKAN - the open source data portal platformMaurizio Napolitano
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016Sergio Fernández
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Sergio Fernández
 
Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)jottevanger
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012scorlosquet
 
The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013scorlosquet
 
LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data LIFE GreenYourMove
 
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...Jenn Riley
 
Presentation data collection and gtfs
Presentation data collection and gtfsPresentation data collection and gtfs
Presentation data collection and gtfsLIFE GreenYourMove
 
Culture Geeks Feb talk: Adventures in Linked Data Land
Culture Geeks Feb talk: Adventures in Linked Data LandCulture Geeks Feb talk: Adventures in Linked Data Land
Culture Geeks Feb talk: Adventures in Linked Data Landval.cartei
 
Graph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandraGraph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandraRavindra Ranwala
 
Updates from Hungary (Jozsef Kovacs)
Updates from Hungary (Jozsef Kovacs)Updates from Hungary (Jozsef Kovacs)
Updates from Hungary (Jozsef Kovacs)EOSC-hub project
 
Pinot: Realtime Distributed OLAP datastore
Pinot: Realtime Distributed OLAP datastorePinot: Realtime Distributed OLAP datastore
Pinot: Realtime Distributed OLAP datastoreKishore Gopalakrishna
 
Iceberg: a fast table format for S3
Iceberg: a fast table format for S3Iceberg: a fast table format for S3
Iceberg: a fast table format for S3DataWorks Summit
 
The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)Ryan Blue
 

La actualidad más candente (20)

ckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sourcesckan 2.0: Harvesting from other sources
ckan 2.0: Harvesting from other sources
 
CKAN - the open source data portal platform
CKAN - the open source data portal platformCKAN - the open source data portal platform
CKAN - the open source data portal platform
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
 
Apache Marmotta - Introduction
Apache Marmotta - IntroductionApache Marmotta - Introduction
Apache Marmotta - Introduction
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
 
Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
 
The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013
 
LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data LIFE GreenYourMove Project - GTFS data
LIFE GreenYourMove Project - GTFS data
 
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...
An Introduction to the Open Archives Initiative Object Reuse and Exchange (OA...
 
Towards Data Operations
Towards Data OperationsTowards Data Operations
Towards Data Operations
 
Presentation data collection and gtfs
Presentation data collection and gtfsPresentation data collection and gtfs
Presentation data collection and gtfs
 
Culture Geeks Feb talk: Adventures in Linked Data Land
Culture Geeks Feb talk: Adventures in Linked Data LandCulture Geeks Feb talk: Adventures in Linked Data Land
Culture Geeks Feb talk: Adventures in Linked Data Land
 
Graph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandraGraph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandra
 
Drupal and RDF
Drupal and RDFDrupal and RDF
Drupal and RDF
 
Updates from Hungary (Jozsef Kovacs)
Updates from Hungary (Jozsef Kovacs)Updates from Hungary (Jozsef Kovacs)
Updates from Hungary (Jozsef Kovacs)
 
PhD Defense
PhD DefensePhD Defense
PhD Defense
 
Pinot: Realtime Distributed OLAP datastore
Pinot: Realtime Distributed OLAP datastorePinot: Realtime Distributed OLAP datastore
Pinot: Realtime Distributed OLAP datastore
 
Iceberg: a fast table format for S3
Iceberg: a fast table format for S3Iceberg: a fast table format for S3
Iceberg: a fast table format for S3
 
The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)The evolution of Netflix's S3 data warehouse (Strata NY 2018)
The evolution of Netflix's S3 data warehouse (Strata NY 2018)
 

Destacado

EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederOpenAIRE
 
Open Data and CKAN Data Catalogues
Open Data and CKAN Data CataloguesOpen Data and CKAN Data Catalogues
Open Data and CKAN Data Cataloguesdavid-read
 
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016Steven De Costa
 
Getting to Know CKAN, 24 June 2015, Singapore
Getting to Know CKAN, 24 June 2015, SingaporeGetting to Know CKAN, 24 June 2015, Singapore
Getting to Know CKAN, 24 June 2015, SingaporeSteven De Costa
 
Introduction to CKAN
Introduction to CKANIntroduction to CKAN
Introduction to CKANOKCon2013
 
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...Amazon Web Services
 
20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結andrea huang
 
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKANandrea huang
 
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...Amazon Web Services
 

Destacado (10)

EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan Broeder
 
Open Data and CKAN Data Catalogues
Open Data and CKAN Data CataloguesOpen Data and CKAN Data Catalogues
Open Data and CKAN Data Catalogues
 
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016Drupal, CKAN and Public Data. DrupalGov 08 february 2016
Drupal, CKAN and Public Data. DrupalGov 08 february 2016
 
Getting to Know CKAN, 24 June 2015, Singapore
Getting to Know CKAN, 24 June 2015, SingaporeGetting to Know CKAN, 24 June 2015, Singapore
Getting to Know CKAN, 24 June 2015, Singapore
 
Introduction to CKAN
Introduction to CKANIntroduction to CKAN
Introduction to CKAN
 
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...
AWS re:Invent 2016: Case Study: How Atlassian Uses Amazon EFS with JIRA to Cu...
 
20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結20160602 典藏目錄的語意與連結
20160602 典藏目錄的語意與連結
 
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
 
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...
AWS re:Invent 2016: Case Study: How Monsanto Uses Amazon EFS with Their Large...
 
Minicurso de CKAN
Minicurso de CKANMinicurso de CKAN
Minicurso de CKAN
 

Similar a ckan 2.0 Introduction (20140522 updated)

DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...Digitised Manuscripts to Europeana
 
Open Data Node - Platform and Methodology - 2015-May
Open Data Node - Platform and Methodology - 2015-MayOpen Data Node - Platform and Methodology - 2015-May
Open Data Node - Platform and Methodology - 2015-MayComsode - FP7 project
 
Spark China Summit 2015 Guancheng Chen
Spark China Summit 2015 Guancheng ChenSpark China Summit 2015 Guancheng Chen
Spark China Summit 2015 Guancheng ChenGuancheng (G.C.) Chen
 
Drupal 7 and RDF
Drupal 7 and RDFDrupal 7 and RDF
Drupal 7 and RDFscorlosquet
 
Sparkler - Spark Crawler
Sparkler - Spark Crawler Sparkler - Spark Crawler
Sparkler - Spark Crawler Thamme Gowda
 
ODN - Technical introduction of the platform
ODN - Technical introduction of the platformODN - Technical introduction of the platform
ODN - Technical introduction of the platformComsode - FP7 project
 
Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012scorlosquet
 
PostgreSQL ecosystem
PostgreSQL ecosystemPostgreSQL ecosystem
PostgreSQL ecosystemJosé Lin
 
Introduction to Globus: Research Data Management Software at the ALCF
Introduction to Globus: Research Data Management Software at the ALCFIntroduction to Globus: Research Data Management Software at the ALCF
Introduction to Globus: Research Data Management Software at the ALCFGlobus
 
OpenMetadata Community Meeting - 14 Dec. 2023
OpenMetadata Community Meeting - 14 Dec. 2023OpenMetadata Community Meeting - 14 Dec. 2023
OpenMetadata Community Meeting - 14 Dec. 2023OpenMetadata
 
[scala.by] Launching new application fast
[scala.by] Launching new application fast[scala.by] Launching new application fast
[scala.by] Launching new application fastDenis Karpenko
 
scale12x
scale12xscale12x
scale12xkaiyzen
 
Database Migrations with Gradle and Liquibase
Database Migrations with Gradle and LiquibaseDatabase Migrations with Gradle and Liquibase
Database Migrations with Gradle and LiquibaseDan Stine
 
Improving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch FixImproving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch FixStitch Fix Algorithms
 
Integrating Flink with Hive - Flink Forward SF 2019
Integrating Flink with Hive - Flink Forward SF 2019Integrating Flink with Hive - Flink Forward SF 2019
Integrating Flink with Hive - Flink Forward SF 2019Bowen Li
 
Efficient & effective data management for research projects : ILRI's Data Ma...
Efficient & effective  data management for research projects : ILRI's Data Ma...Efficient & effective  data management for research projects : ILRI's Data Ma...
Efficient & effective data management for research projects : ILRI's Data Ma...CIARD Movement
 
Deploy Eclipse hawBit in Production
Deploy Eclipse hawBit in ProductionDeploy Eclipse hawBit in Production
Deploy Eclipse hawBit in ProductionKynetics
 
Infrastructure for Deep Learning in Apache Spark
Infrastructure for Deep Learning in Apache SparkInfrastructure for Deep Learning in Apache Spark
Infrastructure for Deep Learning in Apache SparkDatabricks
 

Similar a ckan 2.0 Introduction (20140522 updated) (20)

DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
DM2E Project meeting Bergen: WP2 presentation, Kai Eckert (University of Mann...
 
Open Data Node - Platform and Methodology - 2015-May
Open Data Node - Platform and Methodology - 2015-MayOpen Data Node - Platform and Methodology - 2015-May
Open Data Node - Platform and Methodology - 2015-May
 
Spark China Summit 2015 Guancheng Chen
Spark China Summit 2015 Guancheng ChenSpark China Summit 2015 Guancheng Chen
Spark China Summit 2015 Guancheng Chen
 
Drupal 7 and RDF
Drupal 7 and RDFDrupal 7 and RDF
Drupal 7 and RDF
 
Sparkler - Spark Crawler
Sparkler - Spark Crawler Sparkler - Spark Crawler
Sparkler - Spark Crawler
 
ODN - Technical introduction of the platform
ODN - Technical introduction of the platformODN - Technical introduction of the platform
ODN - Technical introduction of the platform
 
Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012Drupal and the semantic web - SemTechBiz 2012
Drupal and the semantic web - SemTechBiz 2012
 
Sprint planning
Sprint planningSprint planning
Sprint planning
 
PostgreSQL ecosystem
PostgreSQL ecosystemPostgreSQL ecosystem
PostgreSQL ecosystem
 
PTG recap
PTG recapPTG recap
PTG recap
 
Introduction to Globus: Research Data Management Software at the ALCF
Introduction to Globus: Research Data Management Software at the ALCFIntroduction to Globus: Research Data Management Software at the ALCF
Introduction to Globus: Research Data Management Software at the ALCF
 
OpenMetadata Community Meeting - 14 Dec. 2023
OpenMetadata Community Meeting - 14 Dec. 2023OpenMetadata Community Meeting - 14 Dec. 2023
OpenMetadata Community Meeting - 14 Dec. 2023
 
[scala.by] Launching new application fast
[scala.by] Launching new application fast[scala.by] Launching new application fast
[scala.by] Launching new application fast
 
scale12x
scale12xscale12x
scale12x
 
Database Migrations with Gradle and Liquibase
Database Migrations with Gradle and LiquibaseDatabase Migrations with Gradle and Liquibase
Database Migrations with Gradle and Liquibase
 
Improving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch FixImproving ad hoc and production workflows at Stitch Fix
Improving ad hoc and production workflows at Stitch Fix
 
Integrating Flink with Hive - Flink Forward SF 2019
Integrating Flink with Hive - Flink Forward SF 2019Integrating Flink with Hive - Flink Forward SF 2019
Integrating Flink with Hive - Flink Forward SF 2019
 
Efficient & effective data management for research projects : ILRI's Data Ma...
Efficient & effective  data management for research projects : ILRI's Data Ma...Efficient & effective  data management for research projects : ILRI's Data Ma...
Efficient & effective data management for research projects : ILRI's Data Ma...
 
Deploy Eclipse hawBit in Production
Deploy Eclipse hawBit in ProductionDeploy Eclipse hawBit in Production
Deploy Eclipse hawBit in Production
 
Infrastructure for Deep Learning in Apache Spark
Infrastructure for Deep Learning in Apache SparkInfrastructure for Deep Learning in Apache Spark
Infrastructure for Deep Learning in Apache Spark
 

Más de Chengjen Lee

Preserving Collaborative Documents in Contemporary Events
Preserving Collaborative Documents in Contemporary EventsPreserving Collaborative Documents in Contemporary Events
Preserving Collaborative Documents in Contemporary EventsChengjen Lee
 
Retooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.ioRetooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.ioChengjen Lee
 
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹Chengjen Lee
 
“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKANChengjen Lee
 
CKAN 技術介紹 (開發篇)
CKAN 技術介紹 (開發篇)CKAN 技術介紹 (開發篇)
CKAN 技術介紹 (開發篇)Chengjen Lee
 
CKAN 技術介紹 (基礎篇)
CKAN 技術介紹 (基礎篇)CKAN 技術介紹 (基礎篇)
CKAN 技術介紹 (基礎篇)Chengjen Lee
 
CKAN 應用介紹 - 以台江計畫為例
CKAN 應用介紹 - 以台江計畫為例CKAN 應用介紹 - 以台江計畫為例
CKAN 應用介紹 - 以台江計畫為例Chengjen Lee
 
Introduction to Pelican
Introduction to PelicanIntroduction to Pelican
Introduction to PelicanChengjen Lee
 
ckan 2.0: a deeper look
ckan 2.0: a deeper lookckan 2.0: a deeper look
ckan 2.0: a deeper lookChengjen Lee
 

Más de Chengjen Lee (11)

Preserving Collaborative Documents in Contemporary Events
Preserving Collaborative Documents in Contemporary EventsPreserving Collaborative Documents in Contemporary Events
Preserving Collaborative Documents in Contemporary Events
 
Retooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.ioRetooling a Research Data Repository: data.depositar.io
Retooling a Research Data Repository: data.depositar.io
 
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
跨領域區域研究資料集 (data.depositar.io): CKAN 應用介紹
 
“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN“Open Data Web” – A Linked Open Data Repository Built with CKAN
“Open Data Web” – A Linked Open Data Repository Built with CKAN
 
CKAN 技術介紹 (開發篇)
CKAN 技術介紹 (開發篇)CKAN 技術介紹 (開發篇)
CKAN 技術介紹 (開發篇)
 
CKAN 技術介紹 (基礎篇)
CKAN 技術介紹 (基礎篇)CKAN 技術介紹 (基礎篇)
CKAN 技術介紹 (基礎篇)
 
CKAN 應用介紹 - 以台江計畫為例
CKAN 應用介紹 - 以台江計畫為例CKAN 應用介紹 - 以台江計畫為例
CKAN 應用介紹 - 以台江計畫為例
 
Report 140227
Report 140227Report 140227
Report 140227
 
Report 140213
Report 140213Report 140213
Report 140213
 
Introduction to Pelican
Introduction to PelicanIntroduction to Pelican
Introduction to Pelican
 
ckan 2.0: a deeper look
ckan 2.0: a deeper lookckan 2.0: a deeper look
ckan 2.0: a deeper look
 

Último

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Último (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

ckan 2.0 Introduction (20140522 updated)

  • 1. CKAN 2 Introduction Presenter: 李承錱 Cheng-Jen Lee (Sol) Email: cjlee AT iis.sinica.edu.tw This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Taiwan License.
  • 2. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 2
  • 3. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 3
  • 4. About CKAN ● The Comprehensive Knowledge Archive Network ● A powerful data management system ● Publishing ● Sharing ● Finding ● Using Data 2014/5/22 4
  • 5. About CKAN 83 instances around the world in May 2014 2014/5/22 5
  • 9. Feature Tour (1) Publish & Find Datasets Add Dataset Basic Information 2014/5/22 9
  • 10. Feature Tour (1) Publish & Find Datasets Add Data Under the Dataset 2014/5/22 10
  • 11. Feature Tour (1) Publish & Find Datasets Add Metadata About the Dataset 2014/5/22 11
  • 12. Feature Tour (1) Publish & Find Datasets Filter By Keywords 2014/5/22 12
  • 13. Feature Tour (1) Publish & Find Datasets Filter By Geographical Features 2014/5/22 13
  • 14. Feature Tour (2) Store & Manage Data Data Explorer: recline_preview (csv, xls) json_preview pdf_preview ckanext-spatial 2014/5/22 14
  • 15. Feature Tour (2) Store & Manage Data Graphing data 2014/5/22 15
  • 16. Feature Tour (3) Engage with Users & Others Share 2014/5/22 16
  • 17. Feature Tour (3) Engage with Users & Others Organization 2014/5/22 17
  • 18. Feature Tour (3) Engage with Users & Others Manage Users of an Organization 2014/5/22 18
  • 19. Feature Tour (3) Engage with Users & Others Manage Role of Members Admin: edit datasets & members Editor: edit datasets Viewer: view (private) datasets Note: Public datasets are visible to everyone 2014/5/22 19
  • 20. Feature Tour (3) Engage with Users & Others Harvest and Federation 2014/5/22 20
  • 21. Feature Tour (3) Engage with Users & Others History 2014/5/22 21
  • 22. Feature Tour (4) Customise & Extend ● RESTful JSON APIs – The Action API – The DataStore API – The FileStore API... ● Extensions (over 60) – ckanext-harvest – ckanext-spatial Open source is good! ● Themable ● Integrates with other CMS (ex. Drupal) 2014/5/22 22
  • 23. CKAN and 5 ★ Open Data 2014/5/22 23
  • 24. CKAN and 5 ★ Open Data ● ★ Make your stuff available on the Web (whatever format) under an open license 2014/5/22 24
  • 25. CKAN and 5 ★ Open Data ● ★★ Make it available as structured data (e.g., Excel instead of image scan of a table) ★★★ Use non-proprietary formats (e.g., CSV instead of Excel) – Accept any data format – Beautiful data demonstration – Datastore : Indexing for structure data 2014/5/22 25
  • 26. CKAN and 5 ★ Open Data ● ★★★★ Use URIs to denote things, so that people can point at your stuff – Permanent link for each dataset – Get Dataset URI through API ● ★★★★★ Link your data to other data to provide context – Linked data and RDF for metadata 2014/5/22 26
  • 27. CKAN and 5 ★ Open Data ● RDF for metadata – DCAT and Dublin Core – curl -L -H "Accept:application/rdf+xml" http://thedatahub.org/dataset/gold-prices 2014/5/22 27
  • 28. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 28
  • 30. United States DATA.GOV 2014/5/22 30
  • 33. Geospatial Data Explorer: Lat/Long field 2014/5/22 33
  • 34. Geospatial Data Explorer: GeoJSON 2014/5/22 34
  • 35. Geospatial Data Explorer: WMS 2014/5/22 35
  • 36. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 36
  • 37. Issues ● CJK Support – CJK Search – Some broken translations – Frequent updates ● Extensions compatibilities ● Tons of tweaks needed ● Performance Issue ● Complicated Architecture 2014/5/22 37
  • 39. Issues (Cont'd) ● What You Should Know – Python & Pylons – ckan plugins toolkit – SQLAlchemy & SQL – HTML, JavaScript – Babel – Web Server (Nginx, uWSGI...) 2014/5/22 39
  • 40. Resources ● Official Documents: – http://docs.ckan.org/en/latest/ ● Installation Notes (in Chinese): – https://ckan-docs-tw.readthedocs.org/ ● CKAN Development Discussions: – http://lists.okfn.org/mailman/listinfo/ckan-dev ● CKAN Taiwan Interest Group: – https://groups.google.com/forum/#!forum/ckan-taiwan-interest-group 2014/5/22 40
  • 41. Thanks for your attention! Any Q? Email: u10313335 AT citi.sinica.edu.tw http://about.me/sollee 2014/5/22 41
  • 42. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 42
  • 43. Install from Source ● Virtual environment ● Checkout the source (via GIT) – https://github.com/okfn/ckan ● Create a CKAN config file ● Setup Jetty & Solr ● Initialize Database (user, db) ● Link to who.ini ● Create a Sysadmin User ● Deployment (nginx + uWSGI) ● Install other extensions... 2014/5/22 43
  • 44. Installation Notes ● https://ckan-docs-tw.readthedocs.org/ 2014/5/22 44
  • 45. Agenda ● About CKAN ● Feature Tour – Publish & Find Datasets – Store & Manage Data – Engage with Users & Others – Customise & Extend ● CKAN and 5 ★ Open Data ● Showcase ● Issues ● Extended Topic 1: Installation ● Extended Topic 2: Harvesters 2014/5/22 45
  • 46. Harvesters ● ckanext-harvest – Remote harvesting extension – https://github.com/okfn/ckanext-harvest ● Source Type – CKAN (built-in) – CSW – WAF – Custom (csv/xls/website… etc) 2014/5/22 46
  • 47. Harvested from TGOS CSW service 2014/5/22 47
  • 49. Harvesters Add a new harvest source 2014/5/22 49
  • 50. Harvesters Create a harvest job 2014/5/22 50
  • 51. Harvesters Overview of harvested datasets 2014/5/22 51
  • 52. Harvesters Background Process ● Manually – (pyenv) $ paster --plugin=ckanext-harvest harvester gather_consumer -c /etc/ckan/default/production.ini – (pyenv) $ paster --plugin=ckanext-harvest harvester fetch_consumer -c /etc/ckan/default/production.ini – (pyenv) $ paster --plugin=ckanext-harvest harvester run -c /etc/ckan/default/production.ini 2014/5/22 52
  • 53. Harvesters Background Process ● Automatically – Supervisor (for gather & fetch consumer) – Cron (for run) 2014/5/22 53
  • 54. Harvesters Custom harvester ● Implement the harvester interface to perform harvesting operations ● Three stages ● gather: get the identification ● fetch: fetch the contents ● import: create ckan package(dataset) 2014/5/22 54
  • 55. Harvesters The harvesting interface from base import HarvesterBase class SRDAHarvester(HarvesterBase): def _set_config(self,config_str): def info(self): ... def gather_stage(self, harvest_job): def fetch_stage(self, harvest_object): def import_stage(self, harvest_object): See the extension site for details An example (SRDA): http://goo.gl/ZMnND7 2014/5/22 55

Notas del editor

  1. 由 Open Knowledge Foundation 支持
  2. 由 Open Knowledge Foundation 支持
  3. 公布網址
  4. Data.gov.uk 即是 drupal+CKAN
  5. Store the raw data and metadata. Visualise structured data with interactive tables, graphs and maps.
  6. TGOS: 國土資訊圖資服務平台