Nicholas Taylor
Repository Development Group
Using Wayback Machine for Research
What Is the WAYBACK MACHINE?
WABAC Machine?
Internet Archive’s Wayback Machine
not one, but many Wayback Machines
• open source software to “replay” web archives
• rewrites links to point to archived resources
• allows for temporal navigation within archive
• used by many web archiving institutions
• 33 out of 62 initiatives listed on Wikipedia
Government of Canada Web Archive
Portuguese Web Archive
Web Archive Singapore
Catalonian Web Archive
California Digital Library Web Archiving Service
Harvard University Web Archive Collection Service
Common LIMITATIONS AND WORKAROUNDS
limitation: banner displaces page elements
workaround: hide the banner
limitation: AJAX-enabled sites
workaround: disable JavaScript
limitation: nav menu link errors
workaround: insert live site URL in archive
limitation: no full-text search
workaround: none yet, but R&D ongoing
Basic MECHANICS
structure of a Wayback Machine URL
http://webarchiveqr.loc.gov/loc_sites/20120131201510/http://www.loc.gov/index.html
Wayback Machine URL: http://webarchiveqr.loc.gov/
collection: loc_sites
date/timestamp (YYYYMMDDHHMMSS): 20120131201510
URL of archived resource: http://www.loc.gov/index.html
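The URL structure above can be taken apart programmatically. The following is a minimal sketch, assuming the 14-digit timestamp is the only all-digit path segment of that length; the names `WaybackUrl` and `parse_wayback_url` are hypothetical, and the prefix is treated as everything before the timestamp, including any collection path.

```python
import re
from datetime import datetime
from typing import NamedTuple


class WaybackUrl(NamedTuple):
    prefix: str          # Wayback Machine base URL, including any collection path
    timestamp: datetime  # capture time parsed from the 14-digit field
    original: str        # URL of the archived resource


# A 14-digit timestamp between slashes separates the prefix from the archived URL.
_PATTERN = re.compile(r"^(?P<prefix>.+?/)(?P<ts>\d{14})/(?P<original>.+)$")


def parse_wayback_url(url: str) -> WaybackUrl:
    """Split a Wayback Machine URL into prefix, capture time, and archived URL."""
    match = _PATTERN.match(url)
    if match is None:
        raise ValueError(f"not a recognizable Wayback Machine URL: {url}")
    timestamp = datetime.strptime(match.group("ts"), "%Y%m%d%H%M%S")
    return WaybackUrl(match.group("prefix"), timestamp, match.group("original"))


parsed = parse_wayback_url(
    "http://webarchiveqr.loc.gov/loc_sites/20120131201510/http://www.loc.gov/index.html"
)
print(parsed.prefix)
print(parsed.timestamp.isoformat())
print(parsed.original)
```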
URL-based access
date wildcarding
document wildcarding
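This transcript names date and document wildcarding without showing the syntax. The sketch below assumes the convention used by the Internet Archive's Wayback Machine, where `*` after a partial timestamp matches captures from that period and a trailing `*` on the archived URL lists captured documents under that path. Other Wayback deployments may behave differently, and the helper names are hypothetical.

```python
def date_wildcard(prefix: str, partial_timestamp: str, original: str) -> str:
    """Build a URL for all captures whose timestamp starts with partial_timestamp.

    An empty partial_timestamp yields a bare "*", meaning every capture date.
    """
    if partial_timestamp and not partial_timestamp.isdigit():
        raise ValueError("partial timestamp must contain digits only")
    if len(partial_timestamp) > 14:
        raise ValueError("timestamp is at most 14 digits (YYYYMMDDHHMMSS)")
    return f"{prefix.rstrip('/')}/{partial_timestamp}*/{original}"


def document_wildcard(prefix: str, original_path_prefix: str) -> str:
    """Build a URL listing captured documents that begin with original_path_prefix."""
    return f"{prefix.rstrip('/')}/*/{original_path_prefix}*"


IA_PREFIX = "http://web.archive.org/web/"  # assumed example prefix

print(date_wildcard(IA_PREFIX, "2012", "http://www.loc.gov/"))
print(document_wildcard(IA_PREFIX, "http://www.loc.gov/"))
```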
Strategies for FINDING MISSING RESOURCES
removed or moved?
• don’t start with the archive
• missing resources have often just moved (Klein & Nelson, 2010)
• Synchronicity for Firefox helps find new location
• scrapes archived version for “fingerprint” keywords; uses them to query search engines
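The "fingerprint" idea can be illustrated roughly as follows. This is a simplified sketch that uses plain term frequency and a small hand-picked stopword list; it is not a description of how Synchronicity itself selects keywords, and the function name and sample text are invented for illustration.

```python
import re
from collections import Counter
from typing import List

# Small illustrative stopword list; a real tool would use a fuller one.
STOPWORDS = {"the", "and", "for", "that", "with", "this", "from", "are", "was"}


def fingerprint_keywords(text: str, count: int = 5) -> List[str]:
    """Return the most frequent non-stopword terms of three or more letters."""
    words = re.findall(r"[a-z]{3,}", text.lower())
    frequencies = Counter(word for word in words if word not in STOPWORDS)
    return [word for word, _ in frequencies.most_common(count)]


# Invented sample standing in for the text of an archived page.
archived_text = (
    "Federal IT dashboard: the dashboard tracks federal IT spending. "
    "Dashboard data covers federal agencies."
)
query = " ".join(fingerprint_keywords(archived_text, count=2))
print(query)  # a string to paste into a search engine
```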
MementoFox
find archives for a site whose URL has changed
• website URL changed recently
• historical URL is unknown
• solution: use search engine to find historical URL then apply it in the archive
Federal IT Dashboard
check Internet Archive’s Wayback Machine
IA Wayback coverage goes back to July 2010
LCWA only goes back to June 2011
use search engine to find historical URL
White House IT Dashboard announcement
note the redirect from http://it.usaspending.gov/
append URL to IA Wayback URL
append URL to LC Wayback URL
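Appending the historical URL to each archive's access URL might look like the sketch below. The Internet Archive prefix and the `*` (all dates) segment follow the Internet Archive's convention; the Library of Congress prefix is borrowed from the URL example earlier in these slides, and whether it is the right collection for this site, or accepts `*`, is an assumption. `lookup_urls` is a hypothetical helper.

```python
from typing import Dict

# Access prefixes to try; the LC entry is an assumption (see lead-in).
ARCHIVE_PREFIXES: Dict[str, str] = {
    "IA Wayback": "http://web.archive.org/web/",
    "LC Wayback": "http://webarchiveqr.loc.gov/loc_sites/",
}


def lookup_urls(historical_url: str,
                prefixes: Dict[str, str] = ARCHIVE_PREFIXES) -> Dict[str, str]:
    """Append the historical URL to each archive prefix, requesting all capture dates."""
    return {
        name: f"{prefix.rstrip('/')}/*/{historical_url}"
        for name, prefix in prefixes.items()
    }


for name, url in lookup_urls("http://it.usaspending.gov/").items():
    print(f"{name}: {url}")
```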
find archives for a site whose URL has changed
• congressional committee hearings archive
• live site URL doesn’t work in archive
• solution: find a site in the archive that would link to the desired site, then navigate to contemporaneous snapshot
hearings archive only spans 2001-2006
hearings archive URL changed in 2011
truncate archival access URL
snapshot from prior to site change
navigate to appropriate section
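One reading of "truncate archival access URL" is to cut the archived URL back to the site root while keeping the capture timestamp, so that navigation can start from the home page as it was captured. The sketch below takes that reading as an assumption; the function name and the example URL are hypothetical.

```python
import re
from urllib.parse import urlsplit


def truncate_to_site_root(wayback_url: str) -> str:
    """Keep the Wayback prefix and timestamp, but reduce the archived URL to its root."""
    match = re.match(r"^(.+?/\d{14}/)(.+)$", wayback_url)
    if match is None:
        raise ValueError(f"not a recognizable Wayback Machine URL: {wayback_url}")
    prefix, original = match.groups()
    parts = urlsplit(original)
    return f"{prefix}{parts.scheme}://{parts.netloc}/"


# Hypothetical deep link into an archived site.
deep_link = "http://web.archive.org/web/20100715000000/http://example.gov/hearings/archive.html"
print(truncate_to_site_root(deep_link))
```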
find archives for a previously accessible webpage
• records currently stored in password-protected part of site may have previously been publicly accessible
• conceptual site organization lasts longer than exact link construction
• solution: figure out where desired resource would be on the live site, then navigate to analogous section on archived site
location of resources on live site
authentication required
check the site in the archive
navigate to an individual capture
navigate to appropriate section
How You Can GET INVOLVED
• what websites from today would you want to be able to consult in five, ten, twenty years’ time?
• have you told us what is important to capture?
help us to help you
End of Term 2012 Web Archive
Other USEFUL RESOURCES
End of Term 2008 Web Archive
CyberCemetery
LCWA
Project One Web Archives
links
• Library of Congress Web Archiving Program: http://www.loc.gov/webarchiving/
• Library of Congress Web Archives: http://loc.gov/lcwa/
• International Internet Preservation Consortium: http://netpreserve.org/
• National Digital Information Infrastructure and Preservation Program: http://www.digitalpreservation.gov/
questions?
webcapture@loc.gov

 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Using Wayback Machine for Research

Editor's notes

  1. Mr. Peabody and Sherman’s time machine plot device from the television show “Rocky & Bullwinkle.”
  2. The Wayback Machine most people are familiar with.
  3. http://www.collectionscanada.gc.ca/webarchives/20071114183551/http://www.accord-treaty.gc.ca/main.asp?language=0
  4. http://www.collectionscanada.gc.ca/webarchives/*/http://www.accord-treaty.gc.ca/main.asp?language=0
  5. http://www.arquivo.pt/wayback/wayback/id4390263index3?l=en
  6. http://was.nl.sg/wayback/20080404151626/http://www.biosingapore.org.sg/
  7. http://was.nl.sg/wayback/*/http://www.biosingapore.org.sg/
  8. http://www.padi.cat:8080/wayback/20120327044230/http://www.udg.edu/
  9. http://www.padi.cat:8080/wayback/*/http://www.udg.edu/
  10. http://webarchives.cdlib.org/sw16689n33/http://bawsca.org/
  11. http://wax.lib.harvard.edu/collections/wayback.do?stamp=20080714184732&lang=eng&primColl=61&seed=175&liveWebUrl=tiffanni.blogspot.com%2F
  12. When the Twitter link in the footer is clicked…
  13. …the AJAX code truncates the URL, resulting in a blank page.
  14. If you disable JavaScript in the browser and then click on the Twitter link, the page loads fine.
  15. The navigation menu layout is awry and the links aren’t clickable.
  16. Just because Wayback can’t properly rewrite the link doesn’t mean the crawler didn’t capture it. Navigate to the live site.
  17. Find the desired URL.
  18. Append the desired URL to the Wayback URL.
  19. In the Library of Congress Web Archives, it’s only possible to search the bibliographic records.
  20. The British Library and Internet Archive are exploring Lucene/Solr for full-text searching of web archives.
  21. Note the live site URL.
  22. Appending the live site URL to the Wayback URL takes you to a “snapshot” of that page in the archive.
  23. Full date range is wildcarded (any date), so all snapshots for that URL are presented.
  24. Date range is wildcarded to include only those captures from the specified year.
  25. An individual page in the archive.
  26. The time and specific resource are wildcarded, so it shows all resources captured for the specified domain on the specified day.
  27. An example of one of the captured resources in the list.
  28. Example of a live site.
  29. Adjust the slider to request a Memento (i.e. archived resource) for the current URL.
  30. We know that the website existed before then; how do we find it?
  31. Copy the link to the IT Dashboard.
  32. Additional captures from 2009 and 2010 are presented in the archive.
  33. Additional captures from 2009 are presented in the archive.
  34. The teleconference archives are in the events section.
  35. If you click on any of the individual calls…
  36. …you’re taken to an authentication page.
  37. Even though the site URLs changed, there’s a decent chance that the teleconference archives were previously located in the events section.
  38. Sure enough, they’re there, and not password-protected.
  39. http://eotarchive.cdlib.org/2012.html
  40. http://eotarchive.cdlib.org/search?browse-all=yes
  41. http://govinfo.library.unt.edu/
  42. http://lcweb2.loc.gov/diglib/lcwa/html/lcwa-home.html
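The URL mechanics described in notes 18 and 22 to 26 (appending the live site URL to the Wayback URL, then wildcarding the date or the document) can be sketched in a few lines of Python. This is a minimal illustration, not any archive's official API: the function name `wayback_url` is hypothetical, the collection prefix is the Library of Congress example from the slides, and the partial-timestamp form (`2012*`, `20120131*`) and the trailing `/*` document wildcard are assumptions based on the classic Wayback URL convention, which individual archives may or may not support.

```python
import re


def wayback_url(prefix, live_url, timestamp="*"):
    """Join a Wayback collection prefix, a timestamp pattern, and a live URL.

    prefix    -- the archive's Wayback base URL (differs per institution)
    live_url  -- the URL of the resource as it appeared on the live web
    timestamp -- up to 14 digits (YYYYMMDDHHMMSS), optionally ending in "*";
                 a bare "*" means "any date" (assumed convention)
    """
    if not timestamp or not re.fullmatch(r"\d{0,14}\*?", timestamp):
        raise ValueError(f"unsupported timestamp pattern: {timestamp!r}")
    return f"{prefix.rstrip('/')}/{timestamp}/{live_url}"


# Collection prefix and timestamp taken from the example URL in the slides.
PREFIX = "http://webarchiveqr.loc.gov/loc_sites"
PAGE = "http://www.loc.gov/index.html"

# A single snapshot: full 14-digit timestamp plus the live URL.
snapshot = wayback_url(PREFIX, PAGE, "20120131201510")

# Date wildcarding: every capture of the page, or only captures from 2012.
all_captures = wayback_url(PREFIX, PAGE)
captures_2012 = wayback_url(PREFIX, PAGE, "2012*")

# Document wildcarding: everything captured for the domain on one day.
day_listing = wayback_url(PREFIX, "http://www.loc.gov/*", "20120131*")

for url in (snapshot, all_captures, captures_2012, day_listing):
    print(url)
```

Only the prefix changes between archives that follow this pattern, as the Canadian, Singaporean, and Catalonian examples in notes 3 to 9 suggest. The California Digital Library and Harvard URLs in notes 10 and 11 use a different structure, so a helper like this would not apply to them unchanged.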