SlideShare a Scribd company logo
1 of 24
Web-Harvesting
Web-Harvesting ,[object Object],[object Object],[object Object]
Web-Harvesting ,[object Object],[object Object],[object Object]
Concept Web resource Web resource Web resource Web resource Web resource Web resource Web resource Web resource Web resource
Reference to Web Resource ,[object Object]
The Web
The Web
The Web Harvester
3 Major Activities ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Storage
Migration
Retrieval
Developments in Web Archiving ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Web-Harvesting Concept ,[object Object],[object Object],[object Object]
Web-Harvesting ,[object Object],[object Object],[object Object]
Web-Harvesting ,[object Object],[object Object],[object Object]
Issues – Storage ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Issues – Storage ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Issues – Migration ,[object Object],[object Object]
Issues – Retrieval ,[object Object],[object Object],[object Object],[object Object]
Web-Harvesting ,[object Object],[object Object],[object Object]
Web-Harvesting ,[object Object],[object Object],[object Object]
Prospects ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Have a nice day!

More Related Content

What's hot

Web24dev Icrisat 2
Web24dev Icrisat 2Web24dev Icrisat 2
Web24dev Icrisat 2
pritpalkaur
 

What's hot (20)

Load webinar deposit.final
Load webinar deposit.finalLoad webinar deposit.final
Load webinar deposit.final
 
‘PERSIST – UNESCO’s Memory of the World Programme as a catalyst for the deba...
 ‘PERSIST – UNESCO’s Memory of the World Programme as a catalyst for the deba... ‘PERSIST – UNESCO’s Memory of the World Programme as a catalyst for the deba...
‘PERSIST – UNESCO’s Memory of the World Programme as a catalyst for the deba...
 
Web24dev Icrisat 2
Web24dev Icrisat 2Web24dev Icrisat 2
Web24dev Icrisat 2
 
AKstem Service: Supporting the AGRIS Network
AKstem Service: Supporting the AGRIS NetworkAKstem Service: Supporting the AGRIS Network
AKstem Service: Supporting the AGRIS Network
 
Ariadne: Archiving and Repositories
Ariadne: Archiving and RepositoriesAriadne: Archiving and Repositories
Ariadne: Archiving and Repositories
 
iMarine Services
iMarine ServicesiMarine Services
iMarine Services
 
Open Access of Research Data - The Present and Future Situation in Germany
Open Access of Research Data - The Present and Future Situation in GermanyOpen Access of Research Data - The Present and Future Situation in Germany
Open Access of Research Data - The Present and Future Situation in Germany
 
Andrew White's Technical Breakfast Club
Andrew White's Technical Breakfast ClubAndrew White's Technical Breakfast Club
Andrew White's Technical Breakfast Club
 
2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk Cambridge2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk Cambridge
 
IPTC Semantic Web 2012 Spring Working Group
IPTC Semantic Web 2012 Spring Working GroupIPTC Semantic Web 2012 Spring Working Group
IPTC Semantic Web 2012 Spring Working Group
 
ARTiFACTS, Emma Boswood
ARTiFACTS, Emma BoswoodARTiFACTS, Emma Boswood
ARTiFACTS, Emma Boswood
 
Integrating Data for Archaeology
Integrating Data for ArchaeologyIntegrating Data for Archaeology
Integrating Data for Archaeology
 
The OAIS reference model and archaeological data
The OAIS reference model and archaeological dataThe OAIS reference model and archaeological data
The OAIS reference model and archaeological data
 
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
ICOS: Integrated Carbon Observation System Open data to open our eyes to clim...
 
Let's talk about data: Citation and publication
Let's talk about data: Citation and publicationLet's talk about data: Citation and publication
Let's talk about data: Citation and publication
 
Ag Data Commons for AgBioData
Ag Data Commons for AgBioDataAg Data Commons for AgBioData
Ag Data Commons for AgBioData
 
Jisc updates - Jisc research data shared service
Jisc updates - Jisc research data shared service Jisc updates - Jisc research data shared service
Jisc updates - Jisc research data shared service
 
BIBFRAME on its way
BIBFRAME on its wayBIBFRAME on its way
BIBFRAME on its way
 
Using controlled vocabularies to help organize ILRI’s information products
Using controlled vocabularies to help organize ILRI’s information productsUsing controlled vocabularies to help organize ILRI’s information products
Using controlled vocabularies to help organize ILRI’s information products
 
IPTC Semantic Web Working Group 2011 Autumn Working Group
IPTC Semantic Web Working Group 2011 Autumn Working GroupIPTC Semantic Web Working Group 2011 Autumn Working Group
IPTC Semantic Web Working Group 2011 Autumn Working Group
 

Viewers also liked

Hummingbird Banding in Paridise, AZ
Hummingbird  Banding  in Paridise, AZHummingbird  Banding  in Paridise, AZ
Hummingbird Banding in Paridise, AZ
tdainsure
 
M.Tech_Thesis_Presentation
M.Tech_Thesis_PresentationM.Tech_Thesis_Presentation
M.Tech_Thesis_Presentation
Manish Pillai
 

Viewers also liked (20)

Smart Crawler -A Two Stage Crawler For Efficiently Harvesting Deep Web
Smart Crawler -A Two Stage Crawler For Efficiently Harvesting Deep WebSmart Crawler -A Two Stage Crawler For Efficiently Harvesting Deep Web
Smart Crawler -A Two Stage Crawler For Efficiently Harvesting Deep Web
 
Rethink Web Harvesting and Scraping
Rethink Web Harvesting and ScrapingRethink Web Harvesting and Scraping
Rethink Web Harvesting and Scraping
 
Usage of Technology and Digital Resources in the De La Salle University Library
Usage of Technology and Digital Resources in the De La Salle University LibraryUsage of Technology and Digital Resources in the De La Salle University Library
Usage of Technology and Digital Resources in the De La Salle University Library
 
Preaching
PreachingPreaching
Preaching
 
Preaching
PreachingPreaching
Preaching
 
Hummingbird Banding in Paridise, AZ
Hummingbird  Banding  in Paridise, AZHummingbird  Banding  in Paridise, AZ
Hummingbird Banding in Paridise, AZ
 
Preaching
PreachingPreaching
Preaching
 
ProQuest Tutorial Revised
ProQuest Tutorial RevisedProQuest Tutorial Revised
ProQuest Tutorial Revised
 
Electronic Resource Management Systems
Electronic Resource Management SystemsElectronic Resource Management Systems
Electronic Resource Management Systems
 
Colloquim Report on Crawler - 1 Dec 2014
Colloquim Report on Crawler - 1 Dec 2014Colloquim Report on Crawler - 1 Dec 2014
Colloquim Report on Crawler - 1 Dec 2014
 
Library Orientation Revised
Library Orientation RevisedLibrary Orientation Revised
Library Orientation Revised
 
Web crawler
Web crawlerWeb crawler
Web crawler
 
Smart crawler a two stage crawler
Smart crawler a two stage crawlerSmart crawler a two stage crawler
Smart crawler a two stage crawler
 
Smart crawlet A two stage crawler for efficiently harvesting deep web interf...
Smart crawlet A two stage crawler  for efficiently harvesting deep web interf...Smart crawlet A two stage crawler  for efficiently harvesting deep web interf...
Smart crawlet A two stage crawler for efficiently harvesting deep web interf...
 
WebCrawler
WebCrawlerWebCrawler
WebCrawler
 
M.Tech_Thesis_Presentation
M.Tech_Thesis_PresentationM.Tech_Thesis_Presentation
M.Tech_Thesis_Presentation
 
Web crawler
Web crawlerWeb crawler
Web crawler
 
Deep web
Deep webDeep web
Deep web
 
Deep web
Deep webDeep web
Deep web
 
Biomass supported solar thermal power plant
Biomass supported solar thermal power plantBiomass supported solar thermal power plant
Biomass supported solar thermal power plant
 

Similar to Web-Harvesting: concepts, issues, and prospects

The development of web archiving 3
The development of web archiving 3The development of web archiving 3
The development of web archiving 3
Essam Obaid
 
"Woe, Destruction, Ruin, and Decay:" An Introduction to Web Archiving
"Woe, Destruction, Ruin, and Decay:" An Introduction to Web Archiving"Woe, Destruction, Ruin, and Decay:" An Introduction to Web Archiving
"Woe, Destruction, Ruin, and Decay:" An Introduction to Web Archiving
jaime916
 
Analytics and Access to the UK web archive
Analytics and Access to the UK web archiveAnalytics and Access to the UK web archive
Analytics and Access to the UK web archive
Lewis Crawford
 
eHive Open Day - London November 2010
eHive Open Day - London November 2010eHive Open Day - London November 2010
eHive Open Day - London November 2010
Paul Rowe
 
Lsr vpresntation
Lsr vpresntationLsr vpresntation
Lsr vpresntation
jarcherumd
 
Slides anu talkwebarchivingaug2012
Slides anu talkwebarchivingaug2012Slides anu talkwebarchivingaug2012
Slides anu talkwebarchivingaug2012
Roxanne Missingham
 

Similar to Web-Harvesting: concepts, issues, and prospects (20)

Creating and Maintaining Web Archives
Creating and Maintaining Web ArchivesCreating and Maintaining Web Archives
Creating and Maintaining Web Archives
 
Web Archiving Intro (circa 2015)
Web Archiving Intro (circa 2015)Web Archiving Intro (circa 2015)
Web Archiving Intro (circa 2015)
 
The development of web archiving 3
The development of web archiving 3The development of web archiving 3
The development of web archiving 3
 
Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunities
 
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
 
Tools for Managing the Past Web
Tools for Managing the Past WebTools for Managing the Past Web
Tools for Managing the Past Web
 
5463 26 web mining
5463 26 web mining5463 26 web mining
5463 26 web mining
 
"Woe, Destruction, Ruin, and Decay:" An Introduction to Web Archiving
"Woe, Destruction, Ruin, and Decay:" An Introduction to Web Archiving"Woe, Destruction, Ruin, and Decay:" An Introduction to Web Archiving
"Woe, Destruction, Ruin, and Decay:" An Introduction to Web Archiving
 
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...Progress Made and Lessons Learned through Collaborative Web Archiving Proj...
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...
 
CORE Repositories Dashboard
CORE Repositories DashboardCORE Repositories Dashboard
CORE Repositories Dashboard
 
Analytics and Access to the UK web archive
Analytics and Access to the UK web archiveAnalytics and Access to the UK web archive
Analytics and Access to the UK web archive
 
Internet content as research data
Internet content as research dataInternet content as research data
Internet content as research data
 
Detecting Off-Topic Pages in Web Archives
Detecting Off-Topic Pages in Web ArchivesDetecting Off-Topic Pages in Web Archives
Detecting Off-Topic Pages in Web Archives
 
Tuesday 5 May 2020: Contextualizing and engaging with Web domains, Valérie Sc...
Tuesday 5 May 2020: Contextualizing and engaging with Web domains, Valérie Sc...Tuesday 5 May 2020: Contextualizing and engaging with Web domains, Valérie Sc...
Tuesday 5 May 2020: Contextualizing and engaging with Web domains, Valérie Sc...
 
Answers to usual issues in getting started with consuming Linked Data
Answers to usual issues in getting started with consuming Linked DataAnswers to usual issues in getting started with consuming Linked Data
Answers to usual issues in getting started with consuming Linked Data
 
eHive Open Day - London November 2010
eHive Open Day - London November 2010eHive Open Day - London November 2010
eHive Open Day - London November 2010
 
Lsr vpresntation
Lsr vpresntationLsr vpresntation
Lsr vpresntation
 
Digital Infrastructure: Storage and Content Management
Digital Infrastructure: Storage and Content ManagementDigital Infrastructure: Storage and Content Management
Digital Infrastructure: Storage and Content Management
 
Web content mining
Web content miningWeb content mining
Web content mining
 
Slides anu talkwebarchivingaug2012
Slides anu talkwebarchivingaug2012Slides anu talkwebarchivingaug2012
Slides anu talkwebarchivingaug2012
 

More from De La Salle University Library

More from De La Salle University Library (6)

Technical Competencies of Health Librarians in a Library 2.0 Environment
Technical Competencies of Health Librarians in a Library 2.0 EnvironmentTechnical Competencies of Health Librarians in a Library 2.0 Environment
Technical Competencies of Health Librarians in a Library 2.0 Environment
 
De La Salle University Library System Migration: a Strategic Decision
De La Salle University Library System Migration: a Strategic DecisionDe La Salle University Library System Migration: a Strategic Decision
De La Salle University Library System Migration: a Strategic Decision
 
Collaborative Cataloging
Collaborative CatalogingCollaborative Cataloging
Collaborative Cataloging
 
Proquest Tutorial
Proquest TutorialProquest Tutorial
Proquest Tutorial
 
Cataloging At The De La Salle University Library
Cataloging At The De La Salle University LibraryCataloging At The De La Salle University Library
Cataloging At The De La Salle University Library
 
Knowledge Management: the De La Salle University-Manila Library’s Experience
Knowledge Management: the De La Salle University-Manila Library’s ExperienceKnowledge Management: the De La Salle University-Manila Library’s Experience
Knowledge Management: the De La Salle University-Manila Library’s Experience
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 

Web-Harvesting: concepts, issues, and prospects