SlideShare a Scribd company logo
GREENSTONE
OAI IMPLEMENTATION
Wednesday, May 22, 2013
Kathmandu - Nepal
Diego J. Spano
dspano@prodigioconsultores.com
diegospano
@diegospano
THE OPEN ARCHIVES
APPROACH
The Open Archives Approach
"The Open Archives Initiative develops and promotes
interoperability standards that aim to facilitate the efficient
dissemination of content."
Facilitates access to heterogeneous web-accessible material
A low-barrier interoperability solution
Based on repositories supporting
Metadata sharing
Publishing
Archiving
Arose out of the e-print community
Main features
Open Archives Initiative
OAI Protocol for Metadata Harvesting (OAI-PMH)
To facilitate metadata harvesting there needs to be agreement on:
Transport protocol - HTTP
Metadata format - Dublin Core
Metadata Quality Assurance - mandatory element set, naming and
subject conventions
Intellectual Property and Usage Rights
The Open Archives Approach
OAI-PMH
A mechanism for harvesting
Data providers make metadata available for harvesting
Service Providers harvest metadata
Metadata can be centrally collected or “aggregated”
That’s all it is: a way to bring metadata together in one place!
GaDP
(GREENSTONE AS DATA
PROVIDER)
Greenstone comes with a built-in OAI data provider. This runs as a CGI program called
"oaiserver.cgi“. It can be accessed via the same URL as the Greenstone library (replacing
"library.cgi with "oaiserver.cgi").
GaDP – GREENSTONE AS DATA PROVIDER
http://biblioteca.clacso.edu.ar/gsdl/cgi-bin/library.cgi http://biblioteca.clacso.edu.ar/gsdl/cgi-bin/oaiserver.cgi
Configuration of the server is done via the oai.cfg file in the
Greenstone etc directory.
This file specifies general information about the repository, and lists
collections to be made accessible to OAI clients. By default, collections are not
accessible.
Greenstone's OAI server currently supports Dublin Core, Qualified Dublin
Core, RFC1807 metadata. For collections that use other metadata sets,
including extracted metadata, metadata mapping rules should be provided to
map the existing metadata to Dublin Core.
Next … Main parameters to configure
GaDP – GREENSTONE AS DATA PROVIDER
baseServerURL: the URL of the web server (the domain and port number if necessary)
baseServerURL “http://www.mylibrary.org”
oaiserverPath: the relative path of the oaiserver.cgi (will be added to baseServerURL)
oaiserverPath “/greenstone/cgi-bin/oaiserver.cgi”
libraryPath: the relative path of the Greenstone library.cgi
libraryPath “/greenstone/cgi-bin/library.cgi”
repositoryName: The OAI repository name - human readable
repositoryName “University of Waikato - Institutional Repository”
repositoryId: The OAI repository domain name or unique identifier - used for
generating the document identifiers, which will be like: oai:reposId:coll:oid
repositoryId “waikato”
maintainer: The central administrative email address for the repository
maintainer “george@mylib.edu"
resumeafter: How many records to output before needing a resumption token
resumeafter 250
GaDP – GREENSTONE AS DATA PROVIDER
oaiinfo: Free text items describing the repository.
oaiinfo “Purpose” “This Repository has all the papers …”
oaiinfo “Size” “More than 5000 articles”
oaicollection: A list of collections that should be made available to OAI clients. By
default, collections are not available. There can be more than one line
oaicollection thesis images videos
oaisetname / oaisetdescription: Optional set name and description for each collection.
oaisetname thesis “Thesis of Chemystry subjects”
oaisetdescription thesis “Ph students´papers”
oaimetadata: the list of metadata formats this server supports
oaimetadata oai_dc gsdl_qdc
Next … how to map your own metadata fields
GaDP – GREENSTONE AS DATA PROVIDER
Mapping rules allow you to match your own metadata schema to the right Dublin Core
metadata. The format is:
oaimapping <greenstone fieldname> <metadataPrefix.fieldname>
oaimapping dls.Subject oai_dc.coverage
oaimapping nz.Region oai_dc.coverage
oaimapping nz.Region gsdl_qdc.spatial
Next … Validate your site
GaDP – GREENSTONE AS DATA PROVIDER
Go to http://www.openarchives.org/data/registerasprovider.html and validate your
site to be sure it can be a data provider.
GaDP – GREENSTONE AS DATA PROVIDER
GaDP – GREENSTONE AS DATA PROVIDER
Once your OAI server is operational, to provide maximal international
visibility for your open access collections you should register them in
at least one (and ideally all) of the following:
-the ROAR directory (http://roar.eprints.org/)
- the OAI directory (http://www.openarchives.org/Register/BrowseSites)
- the OpenDOAR directory (http://www.opendoar.org/)
Next … A real case.
repositoryName
baseServerURL + oaiserverPath
repositoryId
maintainer
oaiinfo
http://biblioteca.clacso.edu.ar/gsdl/cgi-bin/oaiserver.cgi
GaDP – GREENSTONE AS DATA PROVIDER
oaisetname
oaisetdescription
oaicollection
GaDP – GREENSTONE AS DATA PROVIDER
oaimapping
GaDP – GREENSTONE AS DATA PROVIDER
GaSP
(GREENSTONE AS
SERVICE PROVIDER)
GLI can serve as an OAI client application: it can connect to a remote OAI server and
retrieve metadata and even download documents.
You have three methods for doing this:
- Using GLI (Greenstone Librarian Interface)
- Using command line from within Greenstone environment
- Using a third party application that allows you to download OAI records.
We´ll focus on the first two methods.
Next … Using GLI to download OAI records
GaSP – GREENSTONE AS SERVICE PROVIDER
GLI offers a “Download” tab for downloading documents and/or records. Assuming
that you know the URL for the external repository, you can know the rest of the
parameters by clicking “Server Information”, i.e.. the name of the sets.
GaSP – GREENSTONE AS SERVICE PROVIDER
Once you filled the required fields just press the Download button.
GaSP – GREENSTONE AS SERVICE PROVIDER
The downloaded records will appear in the “Gather” tab under “Downloaded Files”
section. Drag them to the right panel to get the records ready for building the
collection.
GaSP – GREENSTONE AS SERVICE PROVIDER
OAI records imported into a Greenstone collection are saved as Greenstone archive
format, similar to XML format but with some extra data
GaSP – GREENSTONE AS SERVICE PROVIDER
Dublin Core metadata extracted as “ex”
Schema
“prettymd” metadata: its value is a
preformated table (with style)
Containing the full OAI record
format DocumentText "[prettymd]“ is all you need to display the OAI record…
GaSP – GREENSTONE AS SERVICE PROVIDER
Using command line to download OAI records
Just a few step to do this task:
1. Open a terminal or console window
2. Go to Greenstone root folder
3. Run setup.bat/setup.bash
4. Move to the import folder of the target collection for the records
5. Run perl -S downloadfrom.pl (with the following options)
- download_mode OAI
- url http://biblioteca.clacso.edu.ar/gsdl/cgi-bin/oaiserver.cgi
- set ar/ar-001
- max_record 10
6. Build the collection as usual
The command “downloadfrom” will have the same results as the previous task
executed through GLI
GaSP – GREENSTONE AS SERVICE PROVIDER
Harvester: client application issuing OAI-PMH requests
Repository: network accessible server, able to process OAI-PMH requests
correctly
Resource: object the metadata is “about”, nature of resources is not
defined in the OAI-PMH
Item: component of a repository from which metadata about a resource
can be disseminated. Has an unique identifier
Record: metadata in a specific metadata format
Identifier: unique key for an item in a repository
Set: optional construct for grouping items in a repository
GLOSARY
For GS2, there are 3 tutorials:
http://wiki.greenstone.org/wiki/index.php/Tutorial_exercises#Setting_up_your_Greenstone_OAI_Server
http://wiki.greenstone.org/wiki/index.php/Tutorial_exercises#Connecting_to_an_OAI_server_from_GLI
http://wiki.greenstone.org/wiki/index.php/Tutorial_exercises#Connecting_to_the_Greenstone_OAI_server_fro
m_the_outside_world
http://wiki.greenstone.org/wiki/index.php/2.85_Release_Notes#OAI_downloading
For GS3, the same approximate information is merged into the following 2 tutorials:
http://wiki.greenstone.org/wiki/index.php/GS3_Tutorial_exercises#Setting_up_your_Greenstone_OAI_Server
http://wiki.greenstone.org/wiki/gsdoc/tutorial/gs3-current/en/OAI_downloading.htm
+ INFO...
QUESTIONS
THANKS !!!
Diego Spano
dspano@prodigioconsultores.com
www.prodigioconsultores.com

More Related Content

What's hot

Tracking the Performance of the Web Over Time with the HTTP Archive
Tracking the Performance of the Web Over Time with the HTTP ArchiveTracking the Performance of the Web Over Time with the HTTP Archive
Tracking the Performance of the Web Over Time with the HTTP Archive
Akamai Developers & Admins
 
Tracking the Performance of the Web with HTTP Archive
Tracking the Performance of the Web with HTTP ArchiveTracking the Performance of the Web with HTTP Archive
Tracking the Performance of the Web with HTTP Archive
Rick Viscomi
 
Akamai Edge: Tracking the Performance of the Web with HTTP Archive
Akamai Edge: Tracking the Performance of the Web with HTTP ArchiveAkamai Edge: Tracking the Performance of the Web with HTTP Archive
Akamai Edge: Tracking the Performance of the Web with HTTP Archive
Rick Viscomi
 
HBaseCon 2013: Honeycomb - MySQL Backed by Apache HBase
HBaseCon 2013: Honeycomb - MySQL Backed by Apache HBase HBaseCon 2013: Honeycomb - MySQL Backed by Apache HBase
HBaseCon 2013: Honeycomb - MySQL Backed by Apache HBase
Cloudera, Inc.
 
Leveraging Hadoop in Polyglot Architectures
Leveraging Hadoop in Polyglot ArchitecturesLeveraging Hadoop in Polyglot Architectures
Leveraging Hadoop in Polyglot Architectures
Thanigai Vellore
 
Introduction to Apache Hive | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Apache Hive | Big Data Hadoop Spark Tutorial | CloudxLabIntroduction to Apache Hive | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Apache Hive | Big Data Hadoop Spark Tutorial | CloudxLab
CloudxLab
 
What and Why and How: Apache Drill ! - Tugdual Grall
What and Why and How: Apache Drill ! - Tugdual GrallWhat and Why and How: Apache Drill ! - Tugdual Grall
What and Why and How: Apache Drill ! - Tugdual Grall
distributed matters
 
GlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System AdministratorsGlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System Administrators
Globus
 
NoSQL meets Microservices - Michael Hackstein
NoSQL meets Microservices -  Michael HacksteinNoSQL meets Microservices -  Michael Hackstein
NoSQL meets Microservices - Michael Hackstein
distributed matters
 
From Overnight to Always On @ Jfokus 2019
From Overnight to Always On @ Jfokus 2019From Overnight to Always On @ Jfokus 2019
From Overnight to Always On @ Jfokus 2019
Enno Runne
 
NoSQL Databases
NoSQL DatabasesNoSQL Databases
NoSQL Databases
Amit Kumar Gupta
 
Enabling Secure Data Discoverability (SC21 Tutorial)
Enabling Secure Data Discoverability (SC21 Tutorial)Enabling Secure Data Discoverability (SC21 Tutorial)
Enabling Secure Data Discoverability (SC21 Tutorial)
Globus
 
Browser
BrowserBrowser
Browser
Shweta Oza
 
REST meets Semantic Web
REST meets Semantic WebREST meets Semantic Web
REST meets Semantic Web
Steve Speicher
 
W3C Linked Data Platform Overview
W3C Linked Data Platform OverviewW3C Linked Data Platform Overview
W3C Linked Data Platform Overview
Steve Speicher
 
Data Processing and Aggregation with MongoDB
Data Processing and Aggregation with MongoDB Data Processing and Aggregation with MongoDB
Data Processing and Aggregation with MongoDB MongoDB
 
20090701 Climate Data Staging
20090701 Climate Data Staging20090701 Climate Data Staging
20090701 Climate Data Staging
Henning Bergmeyer
 
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDKGlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
Globus
 
Introduction to Hbase
Introduction to Hbase Introduction to Hbase
Introduction to Hbase
Rupak Roy
 
What's New in Globus - Internet2 TechEXtra
What's New in Globus - Internet2 TechEXtraWhat's New in Globus - Internet2 TechEXtra
What's New in Globus - Internet2 TechEXtra
Globus
 

What's hot (20)

Tracking the Performance of the Web Over Time with the HTTP Archive
Tracking the Performance of the Web Over Time with the HTTP ArchiveTracking the Performance of the Web Over Time with the HTTP Archive
Tracking the Performance of the Web Over Time with the HTTP Archive
 
Tracking the Performance of the Web with HTTP Archive
Tracking the Performance of the Web with HTTP ArchiveTracking the Performance of the Web with HTTP Archive
Tracking the Performance of the Web with HTTP Archive
 
Akamai Edge: Tracking the Performance of the Web with HTTP Archive
Akamai Edge: Tracking the Performance of the Web with HTTP ArchiveAkamai Edge: Tracking the Performance of the Web with HTTP Archive
Akamai Edge: Tracking the Performance of the Web with HTTP Archive
 
HBaseCon 2013: Honeycomb - MySQL Backed by Apache HBase
HBaseCon 2013: Honeycomb - MySQL Backed by Apache HBase HBaseCon 2013: Honeycomb - MySQL Backed by Apache HBase
HBaseCon 2013: Honeycomb - MySQL Backed by Apache HBase
 
Leveraging Hadoop in Polyglot Architectures
Leveraging Hadoop in Polyglot ArchitecturesLeveraging Hadoop in Polyglot Architectures
Leveraging Hadoop in Polyglot Architectures
 
Introduction to Apache Hive | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Apache Hive | Big Data Hadoop Spark Tutorial | CloudxLabIntroduction to Apache Hive | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to Apache Hive | Big Data Hadoop Spark Tutorial | CloudxLab
 
What and Why and How: Apache Drill ! - Tugdual Grall
What and Why and How: Apache Drill ! - Tugdual GrallWhat and Why and How: Apache Drill ! - Tugdual Grall
What and Why and How: Apache Drill ! - Tugdual Grall
 
GlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System AdministratorsGlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System Administrators
 
NoSQL meets Microservices - Michael Hackstein
NoSQL meets Microservices -  Michael HacksteinNoSQL meets Microservices -  Michael Hackstein
NoSQL meets Microservices - Michael Hackstein
 
From Overnight to Always On @ Jfokus 2019
From Overnight to Always On @ Jfokus 2019From Overnight to Always On @ Jfokus 2019
From Overnight to Always On @ Jfokus 2019
 
NoSQL Databases
NoSQL DatabasesNoSQL Databases
NoSQL Databases
 
Enabling Secure Data Discoverability (SC21 Tutorial)
Enabling Secure Data Discoverability (SC21 Tutorial)Enabling Secure Data Discoverability (SC21 Tutorial)
Enabling Secure Data Discoverability (SC21 Tutorial)
 
Browser
BrowserBrowser
Browser
 
REST meets Semantic Web
REST meets Semantic WebREST meets Semantic Web
REST meets Semantic Web
 
W3C Linked Data Platform Overview
W3C Linked Data Platform OverviewW3C Linked Data Platform Overview
W3C Linked Data Platform Overview
 
Data Processing and Aggregation with MongoDB
Data Processing and Aggregation with MongoDB Data Processing and Aggregation with MongoDB
Data Processing and Aggregation with MongoDB
 
20090701 Climate Data Staging
20090701 Climate Data Staging20090701 Climate Data Staging
20090701 Climate Data Staging
 
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDKGlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
 
Introduction to Hbase
Introduction to Hbase Introduction to Hbase
Introduction to Hbase
 
What's New in Globus - Internet2 TechEXtra
What's New in Globus - Internet2 TechEXtraWhat's New in Globus - Internet2 TechEXtra
What's New in Globus - Internet2 TechEXtra
 

Viewers also liked

MTBiz February 2014
MTBiz February 2014MTBiz February 2014
MTBiz February 2014
Mutual Trust Bank Ltd.
 
What's Next with Government Big Data
What's Next with Government Big Data What's Next with Government Big Data
What's Next with Government Big Data
GovLoop
 
AVG Community Powered Threat Report: Q1 2012
AVG Community Powered Threat Report: Q1 2012AVG Community Powered Threat Report: Q1 2012
AVG Community Powered Threat Report: Q1 2012
AVG Technologies AU
 
Visual facilitation for agile BAs
Visual facilitation for agile BAsVisual facilitation for agile BAs
Visual facilitation for agile BAs
IQ Business - agility@IQ
 
Adventures in Digital Forensics
Adventures in Digital ForensicsAdventures in Digital Forensics
Adventures in Digital Forensics
_xhr_
 
Static Detection of Application Backdoors
Static Detection of Application BackdoorsStatic Detection of Application Backdoors
Static Detection of Application BackdoorsTyler Shields
 
DBA Basics guide
DBA Basics guideDBA Basics guide
DBA Basics guide
azoznasser1
 
'A View-Based Approach to Quality of Service Modelling in Service-Oriented En...
'A View-Based Approach to Quality of Service Modelling in Service-Oriented En...'A View-Based Approach to Quality of Service Modelling in Service-Oriented En...
'A View-Based Approach to Quality of Service Modelling in Service-Oriented En...IIBA_Latvia_Chapter
 
基于Android设备的音视频实时监控系统的设计与实现
基于Android设备的音视频实时监控系统的设计与实现基于Android设备的音视频实时监控系统的设计与实现
基于Android设备的音视频实时监控系统的设计与实现
Shawn Zhung
 
Configuration Manager vNext - What to expect
Configuration Manager vNext - What to expectConfiguration Manager vNext - What to expect
Configuration Manager vNext - What to expect
Tim De Keukelaere
 
シラサギハンズオン 東京
シラサギハンズオン 東京シラサギハンズオン 東京
シラサギハンズオン 東京
Yu Ito
 
LISA Qooxdoo Tutorial Handouts
LISA Qooxdoo Tutorial HandoutsLISA Qooxdoo Tutorial Handouts
LISA Qooxdoo Tutorial Handouts
Tobias Oetiker
 
Sistem keamanan komputer
Sistem keamanan komputerSistem keamanan komputer
Sistem keamanan komputer
Rhusman 05
 
Upgrade OBIEE to 11.1.1.7.1
Upgrade OBIEE to 11.1.1.7.1Upgrade OBIEE to 11.1.1.7.1
Upgrade OBIEE to 11.1.1.7.1
Osama Mustafa
 
Anonos U.S. Patent Number 9,087,215
Anonos U.S. Patent Number 9,087,215Anonos U.S. Patent Number 9,087,215
Anonos U.S. Patent Number 9,087,215
Ted Myerson
 
Nosotros Elmedio
Nosotros ElmedioNosotros Elmedio
Nosotros Elmedio
Espacio Público
 
El papel del vídeo en la Web 2.0
El papel del vídeo en la Web 2.0El papel del vídeo en la Web 2.0
El papel del vídeo en la Web 2.0
Pablo Olmeda
 
Alumni Engagement: Gettting Them to Comeback
Alumni Engagement: Gettting Them to ComebackAlumni Engagement: Gettting Them to Comeback
Alumni Engagement: Gettting Them to Comeback
Agilon LLC
 
Engage -- eGovernment strategies for success
Engage -- eGovernment strategies for successEngage -- eGovernment strategies for success
Engage -- eGovernment strategies for success
NIC Inc | EGOV
 

Viewers also liked (20)

MTBiz February 2014
MTBiz February 2014MTBiz February 2014
MTBiz February 2014
 
What's Next with Government Big Data
What's Next with Government Big Data What's Next with Government Big Data
What's Next with Government Big Data
 
AVG Community Powered Threat Report: Q1 2012
AVG Community Powered Threat Report: Q1 2012AVG Community Powered Threat Report: Q1 2012
AVG Community Powered Threat Report: Q1 2012
 
Db2 howto-linux
Db2 howto-linuxDb2 howto-linux
Db2 howto-linux
 
Visual facilitation for agile BAs
Visual facilitation for agile BAsVisual facilitation for agile BAs
Visual facilitation for agile BAs
 
Adventures in Digital Forensics
Adventures in Digital ForensicsAdventures in Digital Forensics
Adventures in Digital Forensics
 
Static Detection of Application Backdoors
Static Detection of Application BackdoorsStatic Detection of Application Backdoors
Static Detection of Application Backdoors
 
DBA Basics guide
DBA Basics guideDBA Basics guide
DBA Basics guide
 
'A View-Based Approach to Quality of Service Modelling in Service-Oriented En...
'A View-Based Approach to Quality of Service Modelling in Service-Oriented En...'A View-Based Approach to Quality of Service Modelling in Service-Oriented En...
'A View-Based Approach to Quality of Service Modelling in Service-Oriented En...
 
基于Android设备的音视频实时监控系统的设计与实现
基于Android设备的音视频实时监控系统的设计与实现基于Android设备的音视频实时监控系统的设计与实现
基于Android设备的音视频实时监控系统的设计与实现
 
Configuration Manager vNext - What to expect
Configuration Manager vNext - What to expectConfiguration Manager vNext - What to expect
Configuration Manager vNext - What to expect
 
シラサギハンズオン 東京
シラサギハンズオン 東京シラサギハンズオン 東京
シラサギハンズオン 東京
 
LISA Qooxdoo Tutorial Handouts
LISA Qooxdoo Tutorial HandoutsLISA Qooxdoo Tutorial Handouts
LISA Qooxdoo Tutorial Handouts
 
Sistem keamanan komputer
Sistem keamanan komputerSistem keamanan komputer
Sistem keamanan komputer
 
Upgrade OBIEE to 11.1.1.7.1
Upgrade OBIEE to 11.1.1.7.1Upgrade OBIEE to 11.1.1.7.1
Upgrade OBIEE to 11.1.1.7.1
 
Anonos U.S. Patent Number 9,087,215
Anonos U.S. Patent Number 9,087,215Anonos U.S. Patent Number 9,087,215
Anonos U.S. Patent Number 9,087,215
 
Nosotros Elmedio
Nosotros ElmedioNosotros Elmedio
Nosotros Elmedio
 
El papel del vídeo en la Web 2.0
El papel del vídeo en la Web 2.0El papel del vídeo en la Web 2.0
El papel del vídeo en la Web 2.0
 
Alumni Engagement: Gettting Them to Comeback
Alumni Engagement: Gettting Them to ComebackAlumni Engagement: Gettting Them to Comeback
Alumni Engagement: Gettting Them to Comeback
 
Engage -- eGovernment strategies for success
Engage -- eGovernment strategies for successEngage -- eGovernment strategies for success
Engage -- eGovernment strategies for success
 

Similar to Configuring Greenstone's OAI server

Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Data Con LA
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Andy Powell
 
Solid fire cloudstack storage overview - CloudStack European User Group
Solid fire cloudstack storage overview - CloudStack European User GroupSolid fire cloudstack storage overview - CloudStack European User Group
Solid fire cloudstack storage overview - CloudStack European User Group
ShapeBlue
 
CloudStack Meetup London - Primary Storage Presentation by SolidFire
CloudStack Meetup London - Primary Storage Presentation by SolidFire CloudStack Meetup London - Primary Storage Presentation by SolidFire
CloudStack Meetup London - Primary Storage Presentation by SolidFire
NetApp
 
Sword 2007 06 22
Sword 2007 06 22Sword 2007 06 22
Sword 2007 06 22
Julie Allinson
 
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsLarge Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
lucenerevolution
 
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsLarge Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
Julien Nioche
 
EWD 3 Training Course Part 1: How Node.js Integrates With Global Storage Data...
EWD 3 Training Course Part 1: How Node.js Integrates With Global Storage Data...EWD 3 Training Course Part 1: How Node.js Integrates With Global Storage Data...
EWD 3 Training Course Part 1: How Node.js Integrates With Global Storage Data...
Rob Tweed
 
CrossRef Technical Basics 2010 CrossRef Workshops
CrossRef Technical Basics 2010 CrossRef WorkshopsCrossRef Technical Basics 2010 CrossRef Workshops
CrossRef Technical Basics 2010 CrossRef WorkshopsCrossref
 
Hadoop_File_Formats_and_Data_Ingestion.pptx
Hadoop_File_Formats_and_Data_Ingestion.pptxHadoop_File_Formats_and_Data_Ingestion.pptx
Hadoop_File_Formats_and_Data_Ingestion.pptx
ShashankSahu34
 
Peter Tiernan - Ceph at the Digital Repository of Ireland
Peter Tiernan - Ceph at the Digital Repository of IrelandPeter Tiernan - Ceph at the Digital Repository of Ireland
Peter Tiernan - Ceph at the Digital Repository of Ireland
dri_ireland
 
Ceph at the Digital Repository of Ireland - Ceph Day Frankfurt
Ceph at the Digital Repository of Ireland - Ceph Day Frankfurt Ceph at the Digital Repository of Ireland - Ceph Day Frankfurt
Ceph at the Digital Repository of Ireland - Ceph Day Frankfurt
Ceph Community
 
The new CIARD RING , a machine-readable directory of datasets for agriculture
The new CIARD RING, a machine-readable directory of datasets for agricultureThe new CIARD RING, a machine-readable directory of datasets for agriculture
The new CIARD RING , a machine-readable directory of datasets for agriculture
Valeria Pesce
 
DataGraft Platform: RDF Database-as-a-Service
DataGraft Platform: RDF Database-as-a-ServiceDataGraft Platform: RDF Database-as-a-Service
DataGraft Platform: RDF Database-as-a-ServiceMarin Dimitrov
 
Integrating with SAP FIX and HL7
Integrating with SAP FIX and HL7Integrating with SAP FIX and HL7
Integrating with SAP FIX and HL7
WSO2
 
Datalake Architecture
Datalake ArchitectureDatalake Architecture
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Cloudera, Inc.
 

Similar to Configuring Greenstone's OAI server (20)

Digitisation and institutional repositories 3
Digitisation and institutional repositories 3Digitisation and institutional repositories 3
Digitisation and institutional repositories 3
 
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
Big Data Day LA 2016/ Big Data Track - Fluentd and Embulk: Collect More Data,...
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
 
Solid fire cloudstack storage overview - CloudStack European User Group
Solid fire cloudstack storage overview - CloudStack European User GroupSolid fire cloudstack storage overview - CloudStack European User Group
Solid fire cloudstack storage overview - CloudStack European User Group
 
CloudStack Meetup London - Primary Storage Presentation by SolidFire
CloudStack Meetup London - Primary Storage Presentation by SolidFire CloudStack Meetup London - Primary Storage Presentation by SolidFire
CloudStack Meetup London - Primary Storage Presentation by SolidFire
 
Sword Crig 2007 12 06
Sword Crig 2007 12 06Sword Crig 2007 12 06
Sword Crig 2007 12 06
 
Sword 2007 06 22
Sword 2007 06 22Sword 2007 06 22
Sword 2007 06 22
 
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsLarge Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
 
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsLarge Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
 
EWD 3 Training Course Part 1: How Node.js Integrates With Global Storage Data...
EWD 3 Training Course Part 1: How Node.js Integrates With Global Storage Data...EWD 3 Training Course Part 1: How Node.js Integrates With Global Storage Data...
EWD 3 Training Course Part 1: How Node.js Integrates With Global Storage Data...
 
CrossRef Technical Basics 2010 CrossRef Workshops
CrossRef Technical Basics 2010 CrossRef WorkshopsCrossRef Technical Basics 2010 CrossRef Workshops
CrossRef Technical Basics 2010 CrossRef Workshops
 
Hadoop_File_Formats_and_Data_Ingestion.pptx
Hadoop_File_Formats_and_Data_Ingestion.pptxHadoop_File_Formats_and_Data_Ingestion.pptx
Hadoop_File_Formats_and_Data_Ingestion.pptx
 
Peter Tiernan - Ceph at the Digital Repository of Ireland
Peter Tiernan - Ceph at the Digital Repository of IrelandPeter Tiernan - Ceph at the Digital Repository of Ireland
Peter Tiernan - Ceph at the Digital Repository of Ireland
 
Ceph at the Digital Repository of Ireland - Ceph Day Frankfurt
Ceph at the Digital Repository of Ireland - Ceph Day Frankfurt Ceph at the Digital Repository of Ireland - Ceph Day Frankfurt
Ceph at the Digital Repository of Ireland - Ceph Day Frankfurt
 
OAISRB
OAISRBOAISRB
OAISRB
 
The new CIARD RING , a machine-readable directory of datasets for agriculture
The new CIARD RING, a machine-readable directory of datasets for agricultureThe new CIARD RING, a machine-readable directory of datasets for agriculture
The new CIARD RING , a machine-readable directory of datasets for agriculture
 
DataGraft Platform: RDF Database-as-a-Service
DataGraft Platform: RDF Database-as-a-ServiceDataGraft Platform: RDF Database-as-a-Service
DataGraft Platform: RDF Database-as-a-Service
 
Integrating with SAP FIX and HL7
Integrating with SAP FIX and HL7Integrating with SAP FIX and HL7
Integrating with SAP FIX and HL7
 
Datalake Architecture
Datalake ArchitectureDatalake Architecture
Datalake Architecture
 
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
 

Recently uploaded

Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 

Recently uploaded (20)

Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 

Configuring Greenstone's OAI server

  • 1. GREENSTONE OAI IMPLEMENTATION Wednesday, May 22, 2013 Kathmandu - Nepal Diego J. Spano dspano@prodigioconsultores.com diegospano @diegospano
  • 3. The Open Archives Approach "The Open Archives Initiative develops and promotes interoperability standards that aim to facilitate the efficient dissemination of content." Facilitates access to heterogeneous web-accessible material A low-barrier interoperability solution Based on repositories supporting Metadata sharing Publishing Archiving Arose out of the e-print community Main features Open Archives Initiative OAI Protocol for Metadata Harvesting (OAI-PMH)
  • 4. To facilitate metadata harvesting there needs to be agreement on: Transport protocol - HTTP Metadata format - Dublin Core Metadata Quality Assurance - mandatory element set, naming and subject conventions Intellectual Property and Usage Rights The Open Archives Approach
  • 5. OAI-PMH A mechanism for harvesting Data providers make metadata available for harvesting Service Providers harvest metadata Metadata can be centrally collected or “aggregated” That’s all it is: a way to bring metadata together in one place!
  • 7. Greenstone comes with a built-in OAI data provider. This runs as a CGI program called "oaiserver.cgi“. It can be accessed via the same URL as the Greenstone library (replacing "library.cgi with "oaiserver.cgi"). GaDP – GREENSTONE AS DATA PROVIDER http://biblioteca.clacso.edu.ar/gsdl/cgi-bin/library.cgi http://biblioteca.clacso.edu.ar/gsdl/cgi-bin/oaiserver.cgi
  • 8. Configuration of the server is done via the oai.cfg file in the Greenstone etc directory. This file specifies general information about the repository, and lists collections to be made accessible to OAI clients. By default, collections are not accessible. Greenstone's OAI server currently supports Dublin Core, Qualified Dublin Core, RFC1807 metadata. For collections that use other metadata sets, including extracted metadata, metadata mapping rules should be provided to map the existing metadata to Dublin Core. Next … Main parameters to configure GaDP – GREENSTONE AS DATA PROVIDER
  • 9. baseServerURL: the URL of the web server (the domain and port number if necessary) baseServerURL “http://www.mylibrary.org” oaiserverPath: the relative path of the oaiserver.cgi (will be added to baseServerURL) oaiserverPath “/greenstone/cgi-bin/oaiserver.cgi” libraryPath: the relative path of the Greenstone library.cgi libraryPath “/greenstone/cgi-bin/library.cgi” repositoryName: The OAI repository name - human readable repositoryName “University of Waikato - Institutional Repository” repositoryId: The OAI repository domain name or unique identifier - used for generating the document identifiers, which will be like: oai:reposId:coll:oid repositoryId “waikato” maintainer: The central administrative email address for the repository maintainer “george@mylib.edu" resumeafter: How many records to output before needing a resumption token resumeafter 250 GaDP – GREENSTONE AS DATA PROVIDER
  • 10. oaiinfo: Free text items describing the repository. oaiinfo “Purpose” “This Repository has all the papers …” oaiinfo “Size” “More than 5000 articles” oaicollection: A list of collections that should be made available to OAI clients. By default, collections are not available. There can be more than one line oaicollection thesis images videos oaisetname / oaisetdescription: Optional set name and description for each collection. oaisetname thesis “Thesis of Chemystry subjects” oaisetdescription thesis “Ph students´papers” oaimetadata: the list of metadata formats this server supports oaimetadata oai_dc gsdl_qdc Next … how to map your own metadata fields GaDP – GREENSTONE AS DATA PROVIDER
  • 11. Mapping rules allow you to match your own metadata schema to the right Dublin Core metadata. The format is: oaimapping <greenstone fieldname> <metadataPrefix.fieldname> oaimapping dls.Subject oai_dc.coverage oaimapping nz.Region oai_dc.coverage oaimapping nz.Region gsdl_qdc.spatial Next … Validate your site GaDP – GREENSTONE AS DATA PROVIDER
  • 12. Go to http://www.openarchives.org/data/registerasprovider.html and validate your site to be sure it can be a data provider. GaDP – GREENSTONE AS DATA PROVIDER
  • 13. GaDP – GREENSTONE AS DATA PROVIDER Once your OAI server is operational, to provide maximal international visibility for your open access collections you should register them in at least one (and ideally all) of the following: -the ROAR directory (http://roar.eprints.org/) - the OAI directory (http://www.openarchives.org/Register/BrowseSites) - the OpenDOAR directory (http://www.opendoar.org/) Next … A real case.
  • 16. oaimapping GaDP – GREENSTONE AS DATA PROVIDER
  • 18. GLI can serve as an OAI client application: it can connect to a remote OAI server and retrieve metadata and even download documents. You have three methods for doing this: - Using GLI (Greenstone Librarian Interface) - Using command line from within Greenstone environment - Using a third party application that allows you to download OAI records. We´ll focus on the first two methods. Next … Using GLI to download OAI records GaSP – GREENSTONE AS SERVICE PROVIDER
  • 19. GLI offers a “Download” tab for downloading documents and/or records. Assuming that you know the URL for the external repository, you can know the rest of the parameters by clicking “Server Information”, i.e.. the name of the sets. GaSP – GREENSTONE AS SERVICE PROVIDER
  • 20. Once you filled the required fields just press the Download button. GaSP – GREENSTONE AS SERVICE PROVIDER
  • 21. The downloaded records will appear in the “Gather” tab under “Downloaded Files” section. Drag them to the right panel to get the records ready for building the collection. GaSP – GREENSTONE AS SERVICE PROVIDER
  • 22. OAI records imported into a Greenstone collection are saved as Greenstone archive format, similar to XML format but with some extra data GaSP – GREENSTONE AS SERVICE PROVIDER Dublin Core metadata extracted as “ex” Schema “prettymd” metadata: its value is a preformated table (with style) Containing the full OAI record
  • 23. format DocumentText "[prettymd]“ is all you need to display the OAI record… GaSP – GREENSTONE AS SERVICE PROVIDER
  • 24. Using command line to download OAI records Just a few step to do this task: 1. Open a terminal or console window 2. Go to Greenstone root folder 3. Run setup.bat/setup.bash 4. Move to the import folder of the target collection for the records 5. Run perl -S downloadfrom.pl (with the following options) - download_mode OAI - url http://biblioteca.clacso.edu.ar/gsdl/cgi-bin/oaiserver.cgi - set ar/ar-001 - max_record 10 6. Build the collection as usual The command “downloadfrom” will have the same results as the previous task executed through GLI GaSP – GREENSTONE AS SERVICE PROVIDER
  • 25. Harvester: client application issuing OAI-PMH requests Repository: network accessible server, able to process OAI-PMH requests correctly Resource: object the metadata is “about”, nature of resources is not defined in the OAI-PMH Item: component of a repository from which metadata about a resource can be disseminated. Has an unique identifier Record: metadata in a specific metadata format Identifier: unique key for an item in a repository Set: optional construct for grouping items in a repository GLOSARY
  • 26. For GS2, there are 3 tutorials: http://wiki.greenstone.org/wiki/index.php/Tutorial_exercises#Setting_up_your_Greenstone_OAI_Server http://wiki.greenstone.org/wiki/index.php/Tutorial_exercises#Connecting_to_an_OAI_server_from_GLI http://wiki.greenstone.org/wiki/index.php/Tutorial_exercises#Connecting_to_the_Greenstone_OAI_server_fro m_the_outside_world http://wiki.greenstone.org/wiki/index.php/2.85_Release_Notes#OAI_downloading For GS3, the same approximate information is merged into the following 2 tutorials: http://wiki.greenstone.org/wiki/index.php/GS3_Tutorial_exercises#Setting_up_your_Greenstone_OAI_Server http://wiki.greenstone.org/wiki/gsdoc/tutorial/gs3-current/en/OAI_downloading.htm + INFO...