How to set up an OAI-PMH
     metadata harvester
(PKP Open Archives Harvester)



            Magnus Enger
    Biblioteklaboratorie-workshop (Library Lab workshop)
   Bergen, 12-13 November 2007
System Requirements
●   PHP >= 4.2.x (including PHP 5.x); Microsoft IIS
    requires PHP 5.x
●   MySQL >= 3.23.23 (including MySQL 4.x/5.x)
    or PostgreSQL >= 7.1 (including PostgreSQL
    8.x)
●   Apache >= 1.3.2x or >= 2.0.4x or Microsoft IIS
    6 (untested)
●   Operating system: Any OS that supports the
    above software, including Linux, BSD, Solaris,
    Mac OS X, Windows
Support
Using SSH (PuTTY)/Telnet
●   Log in to the server
●   Download the file
$ wget http://pkp.sfu.ca/harvester2/download/harvester-2.0.1.tar.gz

●   Unpack the file
$ tar -xvf harvester-2.0.1.tar.gz

●   Enter the directory
$ cd harvester-2.0.1

●   Move the contents to the desired location
$ mv * ~/subdomener/harvester/
Using FTP
●   Download the file to your local machine
●   Unpack the file
●   Log in to the server with an FTP client
●   Upload the unpacked files to the desired location on
    the server
Procedure
●   Described in the file docs/README
Change file permissions
●   Make the following files/directories writable:
    –   config.inc.php (optional -- if not writable you will be
        prompted to manually overwrite this file during
        installation)
    –   public
    –   cache
    –   cache/t_cache
    –   cache/t_config
    –   cache/t_compile
    –   cache/_db
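
The permission changes listed above could be scripted roughly as follows. This is only a sketch: the function name `make_writable` is made up for illustration, and `u+w` assumes the web server runs as the user who owns the files. On many hosts the web server runs as a different user, in which case a group-based or wider permission would be needed instead.

```shell
# Hypothetical helper: make the paths from the list writable.
# $1 is the directory the harvester was unpacked into.
make_writable() {
    dir=$1
    # config.inc.php is optional; skip it quietly if it does not exist.
    if [ -f "$dir/config.inc.php" ]; then
        chmod u+w "$dir/config.inc.php"
    fi
    # -R also covers cache/t_cache, cache/t_config, cache/t_compile and cache/_db.
    chmod -R u+w "$dir/public" "$dir/cache"
}
```

With the install location used earlier in these slides, this would be called as `make_writable ~/subdomener/harvester`.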
Create an upload directory
●   Create a directory for storing uploaded files,
    preferably outside the web server root
●   Make this directory writable
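
A minimal sketch of this step, with the same caveat about which user the web server runs as. The function name `make_upload_dir` and the example path in the usage note are hypothetical.

```shell
# Hypothetical helper: create an upload directory and make it writable.
# $1 is the directory to create, ideally outside the web server root.
make_upload_dir() {
    mkdir -p "$1" && chmod u+rwx "$1"
}
```

For example, `make_upload_dir ~/harvester-files` would create the directory next to, rather than inside, the web-accessible directory.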
Installation
●   Via web browser
    –   http://yourdomain.com/path/to/harvester2/
●   Or via the command line
    –   php tools/install.php
Edit the file config.inc.php


; Use URL parameters instead of CGI PATH_INFO. This is useful for
; broken server setups that don't support the PATH_INFO environment
; variable.
disable_path_info = Off



Change the last line to:

disable_path_info = On
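
If you would rather not edit the file by hand, the same change could be made with `sed`. This is a sketch, not part of the harvester: the function name `enable_url_parameters` is hypothetical, and it assumes the setting is written on a line of its own as in the excerpt above.

```shell
# Hypothetical helper: switch disable_path_info from Off to On.
# $1 is the path to config.inc.php; a backup is kept as $1.bak.
enable_url_parameters() {
    cp "$1" "$1.bak" &&
    sed 's/^disable_path_info[[:space:]]*=[[:space:]]*Off[[:space:]]*$/disable_path_info = On/' \
        "$1.bak" > "$1"
}
```

Called as `enable_url_parameters config.inc.php` from the install directory; all other lines, including the comments, are left untouched.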
Problem
●   «login»:
    http://harvester.collib.info/index.php/login
●   Only the front page is shown
●   Click «HOME»:
    http://harvester.collib.info/index.php?page=index
●   Go to the URL:
    http://harvester.collib.info/index.php?page=login
How to find harvestable archives?
●   «Registered Data Providers» from OAI:
    http://www.openarchives.org/Register/BrowseSites
Example: DUO at UiO
Sets
●   Document types
●   Frequent occurrences of languages in the
    database
●   Documents where online fulltext-versions are
    available
●   Units at the university
Document types
●   Master thesis
●   Dissertation
●   Student thesis
●   Series titles
●   Report
●   Monography
●   Article
Frequent occurrences of languages
         in the database
●   Norwegian
●   English
●   French
●   Norwegian Bokmål
●   Norwegian Nynorsk
●   Swedish
●   German
Documents where online fulltext-versions are available
Units at the university
●   Humanities (2437)
●   Humanities / Media and Communication (358)
●   Humanities / Media and Communication / Media studies (331)
●   Humanities / Media and Communication / Journalism (23)
●   Humanities / Archeology, Conservation and Historical Studies (459)
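
Sets like the ones in this example are what a repository advertises through the OAI-PMH `ListSets` verb, and a harvester can then restrict a `ListRecords` request to one of them. The sketch below only builds the request URLs. The helper name `oai_url`, the base URL and the set name are placeholders, not DUO's real endpoint or set identifiers, and no URL-encoding is done.

```shell
# Hypothetical helper: print an OAI-PMH request URL.
# $1 = base URL of the repository, $2 = verb, remaining args = key=value pairs.
oai_url() {
    url="$1?verb=$2"
    shift 2
    for arg in "$@"; do
        url="$url&$arg"
    done
    printf '%s\n' "$url"
}

# Placeholder base URL, not a real endpoint.
BASE=http://repository.example.org/oai

oai_url "$BASE" Identify
oai_url "$BASE" ListSets
oai_url "$BASE" ListRecords metadataPrefix=oai_dc set=some_set
```

Each printed URL can be opened in a browser or fetched with a tool such as `curl` to inspect the XML response before adding the archive to the harvester.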
«Update Metadata Index»




   Use tools/harvest.php!
Useful «commands»
●   php harvest.php
    –   lists the available options
●   php harvest.php list
    –   lists the archives
●   php harvest.php 1
    –   Harvests metadata from a single archive; the number corresponds to the number in
        the list above
●   php harvest.php 1 verbose
    –   As above, but with detailed progress messages
More useful «commands»
●   php harvest.php all
    –   Harvests data from all archives
●   php harvest.php all from=last
    –   Harvests all metadata that is new since the last harvest
    –   Run this regularly using cron!
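
A crontab entry for the incremental harvest might look like the line printed by the sketch below. The helper name `harvest_cron_line` is hypothetical, the time (03:15 every night) is an arbitrary choice, and the entry assumes, as the commands above suggest, that `harvest.php` is run from inside the `tools` directory.

```shell
# Hypothetical helper: print a crontab entry that runs the incremental
# harvest every night at 03:15. $1 is the harvester install directory.
harvest_cron_line() {
    printf '15 3 * * * cd %s/tools && php harvest.php all from=last\n' "$1"
}

harvest_cron_line "$HOME/subdomener/harvester"
```

The printed line can be pasted into the editor opened by `crontab -e`.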
Hacking the database
●   archive_settings       ●   records
●   archives               ●   rt_contexts
●   captchas               ●   rt_searches
●   crosswalk_fields       ●   rt_versions
●   crosswalks             ●   schema_plugins
●   email_templates        ●   search_keyword_list
●   email_templates_data   ●   search_object_keywords
●   entries                ●   search_objects
●   entry_attributes       ●   sessions
●   plugin_settings        ●   site_settings
●   raw_fields             ●   versions
Table: records
mysql> describe records;
+------------------+--------------+------+-----+---------+----------------+
| Field            | Type         | Null | Key | Default | Extra          |
+------------------+--------------+------+-----+---------+----------------+
| record_id        | int(11)      |      | PRI | NULL    | auto_increment |
| archive_id       | int(11)      |      |     | 0       |                |
| schema_plugin_id | int(11)      |      |     | 0       |                |
| identifier       | varchar(255) | YES  |     | NULL    |                |
| datestamp        | datetime     | YES  |     | NULL    |                |
+------------------+--------------+------+-----+---------+----------------+
Table: entries
mysql> describe entries;
+--------------+---------+------+-----+---------+----------------+
| Field        | Type    | Null | Key | Default | Extra          |
+--------------+---------+------+-----+---------+----------------+
| entry_id     | int(11) |      | PRI | NULL    | auto_increment |
| record_id    | int(11) |      | MUL | 0       |                |
| raw_field_id | int(11) |      | MUL | 0       |                |
| value        | text    | YES  |     | NULL    |                |
+--------------+---------+------+-----+---------+----------------+
Table: raw_fields
mysql> describe raw_fields;
+------------------+-------------+------+-----+---------+----------------+
| Field            | Type        | Null | Key | Default | Extra          |
+------------------+-------------+------+-----+---------+----------------+
| raw_field_id     | int(11)     |      | PRI | NULL    | auto_increment |
| name             | varchar(60) |      | MUL |         |                |
| schema_plugin_id | int(11)     |      |     | 0       |                |
+------------------+-------------+------+-----+---------+----------------+
The structure of a «record»
●   Record
    –   Entry a
         ●   raw_field_id = x
              –  raw_fields name = 'Title'
         ●   value = 'Kasus før og nå'

    –   Entry b
         ●   raw_field_id = y
              –  raw_fields name = 'Author'
         ●   value = 'Hansen, Hans'

    –   Entry c
         ●   raw_field_id = z
              –  raw_fields name = 'Publisher'
         ●   value = 'Universitetet i Bodø'
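
The structure above amounts to a join between `entries` and `raw_fields`. The sketch below prints a query that lists the field names and values of one record, using only the columns shown in the table descriptions. The helper name `record_query` is hypothetical, and the query has not been checked against every supported database version.

```shell
# Hypothetical helper: print a query that lists the field names and values
# of one record. $1 is a record_id from the records table.
record_query() {
    # Accept digits only, so nothing else ends up inside the SQL text.
    case "$1" in
        ''|*[!0-9]*) echo "record_id must be a number" >&2; return 1 ;;
    esac
    cat <<EOF
SELECT rf.name, e.value
FROM entries e
JOIN raw_fields rf ON rf.raw_field_id = e.raw_field_id
WHERE e.record_id = $1
ORDER BY e.entry_id;
EOF
}

record_query 1
```

The output can be piped to the `mysql` client, for example `record_query 1 | mysql -u USER -p DATABASE`, where `USER` and `DATABASE` stand for your own values.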
Hacking the system
●   Interesting directories
    –   classes
    –   locale - languages
    –   pages
    –   plugins
    –   styles – CSS
    –   templates – uses Smarty
Learn more
●   README
    –   http://pkp.sfu.ca/harvester2/README
●   Administrator's Guide (17 pp.)
    –   http://pkp.sfu.ca/harvester2/AdminGuide.pdf
●   Technical Reference (50 pp.)
    –   http://pkp.sfu.ca/harvester2/TechnicalReference.pdf
Questions
●   Use the BibLab wiki (Allmenningen)!
●   or
●   magnus@enger.priv.no
