SlideShare a Scribd company logo
1 of 22
HARD CONTENT, FAB FRONT-END
Archiving websites of the Dutch Public Broadcasters
23-5-2014
Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision
IIPC | 21 May 2014 | BnF, Paris
Nederlands Instituut voor
Beeld en Geluid
Sound and Vision
• 70% of Dutch AV heritage
• > 850,000 hours
• 2M photos
•20,000 objects
• Large paper archives
“The Archive as a Laboratory”
Web archiving since 2008 (LiWA, several pilots) with various objectives
NTR PILOT
(2013-2014)
23-5-2014
WHY:
• Saving websites selected to be taken offline
• Getting insights in user requirements
• Create great front and back-end
• Provide public access
• Shape future plans
WEBSITES
23-5-2014
CRAWLING ISSUES
ACCESS ISSUES
USER REQUIREMENTS, PT. 1
Phase 1: Focus group
USER REQUIREMENTS SUMMARY
• Communication and information
e.g. “As a user, I can suggest a website that should be archived”
• Metadata
e.g. “As a user, I can see the crawl date for each archived URL”
• Searching
e.g. “As a user, I can search full-text through a single archived website”
• Visualisation
e.g. “As a user, I can see side-by-side comparisons of the same URL that was
archived at different moments in time”
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
USER REQUIREMENTS, PT. 2
Phase 2: Usability tests
think-aloud, 60-90 minutes
x 2:
• 37, PostDoc web archive research project
• 58, Multimedia editor at a Dutch public broadcaster
x 3:
• 44, Crawl engineer
• 50, Manager digital projects at a Dutch public broadcaster
• 58, Freelance (archive) researcher & journalist
LESSONS-LEARNED
UI/UX
+ Clean, visual look
- More functionality explanations
COMMUNICATION
+ FAQ contains good info about
web archiving
- Info about status + plans
/ More info about scope and size
of web archive
METADATA
+ Overview of outgoing links
- TMI
/ Creation + last change of
website
SEARCHING
+ Fast!
+ Thumbnail previews
- Search by URL
- More filtering options
- Relevance ranking
VISUALISATION
/ More stats, e.g., % text
- Highlight differences crawls
USERS & USAGE
+ Current groups representative
- No av-streaming big loss for all
/ Add more fine-grained
subgroups
FUTURE WORK WEB ARCHIVES:
CONTEXT COLLECTIONS
“Public broadcaster web archives will help you learn where you come from”
-- Usability test participant
• We need to be more dynamic than the websites we archive
• We can and must achieve public access
• We are moving from pilot to standard practice
• Connect crawls to catalogue
• Increase public broadcaster cooperation
Thanks!
@lottebelice | lbbaltussen@beeldengeluid.nl
@benglabs

More Related Content

Similar to Hard Content, Fab Front-end @ IIPC 2014

Similar to Hard Content, Fab Front-end @ IIPC 2014 (20)

Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016
 
Personal learning environment
Personal learning environmentPersonal learning environment
Personal learning environment
 
AtoM, Authenticity, and the Chain of Custody
AtoM, Authenticity, and the Chain of CustodyAtoM, Authenticity, and the Chain of Custody
AtoM, Authenticity, and the Chain of Custody
 
Conducting User Research
Conducting User ResearchConducting User Research
Conducting User Research
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award Ceremony
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award Ceremony
 
AGM 2013 Task Force meetings
AGM 2013 Task Force meetingsAGM 2013 Task Force meetings
AGM 2013 Task Force meetings
 
LoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud ServicesLoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud Services
 
2009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS88782009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS8878
 
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...
 
Whowas: Historical Whois Service
Whowas: Historical Whois ServiceWhowas: Historical Whois Service
Whowas: Historical Whois Service
 
255 shaw
255 shaw255 shaw
255 shaw
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
 
04_Knutas_DOIT platform as an open educational resource
04_Knutas_DOIT platform as an open educational resource04_Knutas_DOIT platform as an open educational resource
04_Knutas_DOIT platform as an open educational resource
 
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
 
All WP Meeting Athens - Europeana Inside - Gordon McKenna
All WP Meeting Athens - Europeana Inside - Gordon McKennaAll WP Meeting Athens - Europeana Inside - Gordon McKenna
All WP Meeting Athens - Europeana Inside - Gordon McKenna
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...
 
Web 2.0 in Libraries
Web 2.0 in LibrariesWeb 2.0 in Libraries
Web 2.0 in Libraries
 
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
 

More from Lotte Belice Baltussen

Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6
Lotte Belice Baltussen
 

More from Lotte Belice Baltussen (20)

Digitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale DuurzaamheidDigitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
 
Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)
 
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
 
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
 
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
 
Open Cultuur Data België eind-event
Open Cultuur Data België eind-eventOpen Cultuur Data België eind-event
Open Cultuur Data België eind-event
 
DISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataDISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur Data
 
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
 
AVA_net workshop 7 maart 2013
AVA_net workshop 7 maart 2013AVA_net workshop 7 maart 2013
AVA_net workshop 7 maart 2013
 
Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012
 
Open cultuur data - cop gouda gha
Open cultuur data - cop gouda ghaOpen cultuur data - cop gouda gha
Open cultuur data - cop gouda gha
 
Open Cultuur Data - Eth0:2012 Summer
Open Cultuur Data - Eth0:2012 Summer Open Cultuur Data - Eth0:2012 Summer
Open Cultuur Data - Eth0:2012 Summer
 
Workshop DEN Baas over eigen metadata
Workshop DEN Baas over eigen metadataWorkshop DEN Baas over eigen metadata
Workshop DEN Baas over eigen metadata
 
Open Culture Data - PMOD
Open Culture Data - PMODOpen Culture Data - PMOD
Open Culture Data - PMOD
 
Open Cultuur Data competitie 2012
Open Cultuur Data competitie 2012Open Cultuur Data competitie 2012
Open Cultuur Data competitie 2012
 
Open Cultuur Data - hackathon pitches
Open Cultuur Data - hackathon pitchesOpen Cultuur Data - hackathon pitches
Open Cultuur Data - hackathon pitches
 
Open Cultuur Data - KVAN 2012
Open Cultuur Data - KVAN 2012Open Cultuur Data - KVAN 2012
Open Cultuur Data - KVAN 2012
 
Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6
 
Crowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collectionsCrowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collections
 
Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011
 

Recently uploaded

Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 

Recently uploaded (20)

State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 
TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024TopCryptoSupers 12thReport OrionX May2024
TopCryptoSupers 12thReport OrionX May2024
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
Generative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdfGenerative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdf
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 

Hard Content, Fab Front-end @ IIPC 2014

  • 1.
  • 2. HARD CONTENT, FAB FRONT-END Archiving websites of the Dutch Public Broadcasters 23-5-2014 Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision IIPC | 21 May 2014 | BnF, Paris
  • 3. Nederlands Instituut voor Beeld en Geluid Sound and Vision • 70% of Dutch AV heritage • > 850,000 hours • 2M photos •20,000 objects • Large paper archives
  • 4.
  • 5. “The Archive as a Laboratory” Web archiving since 2008 (LiWA, several pilots) with various objectives
  • 6. NTR PILOT (2013-2014) 23-5-2014 WHY: • Saving websites selected to be taken offline • Getting insights in user requirements • Create great front and back-end • Provide public access • Shape future plans
  • 10. USER REQUIREMENTS, PT. 1 Phase 1: Focus group
  • 11.
  • 12.
  • 13. USER REQUIREMENTS SUMMARY • Communication and information e.g. “As a user, I can suggest a website that should be archived” • Metadata e.g. “As a user, I can see the crawl date for each archived URL” • Searching e.g. “As a user, I can search full-text through a single archived website” • Visualisation e.g. “As a user, I can see side-by-side comparisons of the same URL that was archived at different moments in time”
  • 19. USER REQUIREMENTS, PT. 2 Phase 2: Usability tests think-aloud, 60-90 minutes x 2: • 37, PostDoc web archive research project • 58, Multimedia editor at a Dutch public broadcaster x 3: • 44, Crawl engineer • 50, Manager digital projects at a Dutch public broadcaster • 58, Freelance (archive) researcher & journalist
  • 20. LESSONS-LEARNED UI/UX + Clean, visual look - More functionality explanations COMMUNICATION + FAQ contains good info about web archiving - Info about status + plans / More info about scope and size of web archive METADATA + Overview of outgoing links - TMI / Creation + last change of website SEARCHING + Fast! + Thumbnail previews - Search by URL - More filtering options - Relevance ranking VISUALISATION / More stats, e.g., % text - Highlight differences crawls USERS & USAGE + Current groups representative - No av-streaming big loss for all / Add more fine-grained subgroups
  • 21. FUTURE WORK WEB ARCHIVES: CONTEXT COLLECTIONS “Public broadcaster web archives will help you learn where you come from” -- Usability test participant • We need to be more dynamic than the websites we archive • We can and must achieve public access • We are moving from pilot to standard practice • Connect crawls to catalogue • Increase public broadcaster cooperation