SlideShare una empresa de Scribd logo
1 de 22
HARD CONTENT, FAB FRONT-END
Archiving websites of the Dutch Public Broadcasters
23-5-2014
Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision
IIPC | 21 May 2014 | BnF, Paris
Nederlands Instituut voor
Beeld en Geluid
Sound and Vision
• 70% of Dutch AV heritage
• > 850,000 hours
• 2M photos
•20,000 objects
• Large paper archives
“The Archive as a Laboratory”
Web archiving since 2008 (LiWA, several pilots) with various objectives
NTR PILOT
(2013-2014)
23-5-2014
WHY:
• Saving websites selected to be taken offline
• Getting insights in user requirements
• Create great front and back-end
• Provide public access
• Shape future plans
WEBSITES
23-5-2014
CRAWLING ISSUES
ACCESS ISSUES
USER REQUIREMENTS, PT. 1
Phase 1: Focus group
USER REQUIREMENTS SUMMARY
• Communication and information
e.g. “As a user, I can suggest a website that should be archived”
• Metadata
e.g. “As a user, I can see the crawl date for each archived URL”
• Searching
e.g. “As a user, I can search full-text through a single archived website”
• Visualisation
e.g. “As a user, I can see side-by-side comparisons of the same URL that was
archived at different moments in time”
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
FRONT-END AND BACK-END
DEVELOPMENT
USER REQUIREMENTS, PT. 2
Phase 2: Usability tests
think-aloud, 60-90 minutes
x 2:
• 37, PostDoc web archive research project
• 58, Multimedia editor at a Dutch public broadcaster
x 3:
• 44, Crawl engineer
• 50, Manager digital projects at a Dutch public broadcaster
• 58, Freelance (archive) researcher & journalist
LESSONS-LEARNED
UI/UX
+ Clean, visual look
- More functionality explanations
COMMUNICATION
+ FAQ contains good info about
web archiving
- Info about status + plans
/ More info about scope and size
of web archive
METADATA
+ Overview of outgoing links
- TMI
/ Creation + last change of
website
SEARCHING
+ Fast!
+ Thumbnail previews
- Search by URL
- More filtering options
- Relevance ranking
VISUALISATION
/ More stats, e.g., % text
- Highlight differences crawls
USERS & USAGE
+ Current groups representative
- No av-streaming big loss for all
/ Add more fine-grained
subgroups
FUTURE WORK WEB ARCHIVES:
CONTEXT COLLECTIONS
“Public broadcaster web archives will help you learn where you come from”
-- Usability test participant
• We need to be more dynamic than the websites we archive
• We can and must achieve public access
• We are moving from pilot to standard practice
• Connect crawls to catalogue
• Increase public broadcaster cooperation
Thanks!
@lottebelice | lbbaltussen@beeldengeluid.nl
@benglabs

Más contenido relacionado

Similar a Hard Content, Fab Front-end @ IIPC 2014

Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016Anupriy Kanti
 
Conducting User Research
Conducting User ResearchConducting User Research
Conducting User ResearchJeremy Horn
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver
 
AGM 2013 Task Force meetings
AGM 2013 Task Force meetingsAGM 2013 Task Force meetings
AGM 2013 Task Force meetingsEuropeana
 
LoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud ServicesLoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud Serviceslocloud
 
2009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS88782009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS8878Jonathan Hassell
 
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...OpenAIRE
 
Whowas: Historical Whois Service
Whowas: Historical Whois ServiceWhowas: Historical Whois Service
Whowas: Historical Whois ServiceAPNIC
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreAndy Powell
 
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...SSHOC
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1Europeana
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...Krishna-Kumar
 
Web 2.0 in Libraries
Web 2.0 in LibrariesWeb 2.0 in Libraries
Web 2.0 in LibrariesAnupama Saini
 
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...Stefan Buddenbohm
 

Similar a Hard Content, Fab Front-end @ IIPC 2014 (20)

Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016Anupriy Kanti - Content Strategy Portfolio_August 2016
Anupriy Kanti - Content Strategy Portfolio_August 2016
 
Personal learning environment
Personal learning environmentPersonal learning environment
Personal learning environment
 
AtoM, Authenticity, and the Chain of Custody
AtoM, Authenticity, and the Chain of CustodyAtoM, Authenticity, and the Chain of Custody
AtoM, Authenticity, and the Chain of Custody
 
Conducting User Research
Conducting User ResearchConducting User Research
Conducting User Research
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award Ceremony
 
Archiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award CeremonyArchiver pilot phase kick off Award Ceremony
Archiver pilot phase kick off Award Ceremony
 
AGM 2013 Task Force meetings
AGM 2013 Task Force meetingsAGM 2013 Task Force meetings
AGM 2013 Task Force meetings
 
LoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud ServicesLoCloud: overview of LoCloud Services
LoCloud: overview of LoCloud Services
 
2009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS88782009: British Accessibility Standards - PAS-78 to BS8878
2009: British Accessibility Standards - PAS-78 to BS8878
 
A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...A user journey in OpenAIRE services through the lens of repository managers -...
A user journey in OpenAIRE services through the lens of repository managers -...
 
Whowas: Historical Whois Service
Whowas: Historical Whois ServiceWhowas: Historical Whois Service
Whowas: Historical Whois Service
 
255 shaw
255 shaw255 shaw
255 shaw
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
 
04_Knutas_DOIT platform as an open educational resource
04_Knutas_DOIT platform as an open educational resource04_Knutas_DOIT platform as an open educational resource
04_Knutas_DOIT platform as an open educational resource
 
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
SSHOC at EOSC-hub Week - Managing Training Materials Beyond Individual Projec...
 
All WP Meeting Athens - Europeana Inside - Gordon McKenna
All WP Meeting Athens - Europeana Inside - Gordon McKennaAll WP Meeting Athens - Europeana Inside - Gordon McKenna
All WP Meeting Athens - Europeana Inside - Gordon McKenna
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...
 
Web 2.0 in Libraries
Web 2.0 in LibrariesWeb 2.0 in Libraries
Web 2.0 in Libraries
 
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
OA Network: Heading for Joint Standards and Enhancing Cooperation: Value‐Adde...
 

Más de Lotte Belice Baltussen

Digitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale DuurzaamheidDigitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale DuurzaamheidLotte Belice Baltussen
 
Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)Lotte Belice Baltussen
 
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...Lotte Belice Baltussen
 
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...Lotte Belice Baltussen
 
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...Lotte Belice Baltussen
 
DISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataDISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataLotte Belice Baltussen
 
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game onLotte Belice Baltussen
 
Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012Lotte Belice Baltussen
 
Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6Lotte Belice Baltussen
 
Crowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collectionsCrowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collectionsLotte Belice Baltussen
 
Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011Lotte Belice Baltussen
 

Más de Lotte Belice Baltussen (20)

Digitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale DuurzaamheidDigitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
Digitaal duurzame links - NDE Werelddag van de Digitale Duurzaamheid
 
Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)Cultuurmarketing - Digitale Innovatie (19 september 2019)
Cultuurmarketing - Digitale Innovatie (19 september 2019)
 
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
Rechtenregistratie bij de Anne Frank Stichting, Adlib gebruikersdag - 8 novem...
 
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
20160922 Reinwardt Academie - NDE Bruikbaar case study GTAA bij Groninger Arc...
 
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
Netwerk Digitaal Erfgoed - Annotatie webarchief Groninger Archieven - AVA_Net...
 
Open Cultuur Data België eind-event
Open Cultuur Data België eind-eventOpen Cultuur Data België eind-event
Open Cultuur Data België eind-event
 
DISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur DataDISH 2013 Chef's Table - Open Cultuur Data
DISH 2013 Chef's Table - Open Cultuur Data
 
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
18 april - CLICKNL bijeenkomst - Open Cultuur Data: game on
 
AVA_net workshop 7 maart 2013
AVA_net workshop 7 maart 2013AVA_net workshop 7 maart 2013
AVA_net workshop 7 maart 2013
 
Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012Open cultuur data - Wikimedia Conferentie Nederland 2012
Open cultuur data - Wikimedia Conferentie Nederland 2012
 
Open cultuur data - cop gouda gha
Open cultuur data - cop gouda ghaOpen cultuur data - cop gouda gha
Open cultuur data - cop gouda gha
 
Open Cultuur Data - Eth0:2012 Summer
Open Cultuur Data - Eth0:2012 Summer Open Cultuur Data - Eth0:2012 Summer
Open Cultuur Data - Eth0:2012 Summer
 
Workshop DEN Baas over eigen metadata
Workshop DEN Baas over eigen metadataWorkshop DEN Baas over eigen metadata
Workshop DEN Baas over eigen metadata
 
Open Culture Data - PMOD
Open Culture Data - PMODOpen Culture Data - PMOD
Open Culture Data - PMOD
 
Open Cultuur Data competitie 2012
Open Cultuur Data competitie 2012Open Cultuur Data competitie 2012
Open Cultuur Data competitie 2012
 
Open Cultuur Data - hackathon pitches
Open Cultuur Data - hackathon pitchesOpen Cultuur Data - hackathon pitches
Open Cultuur Data - hackathon pitches
 
Open Cultuur Data - KVAN 2012
Open Cultuur Data - KVAN 2012Open Cultuur Data - KVAN 2012
Open Cultuur Data - KVAN 2012
 
Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6Open Cultuur Data / Open Beelden - HackersNL #6
Open Cultuur Data / Open Beelden - HackersNL #6
 
Crowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collectionsCrowdsourcing metadata for audiovisual collections
Crowdsourcing metadata for audiovisual collections
 
Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011Baltussen - AVA_Net Najaarsconferentie 2011
Baltussen - AVA_Net Najaarsconferentie 2011
 

Último

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Hard Content, Fab Front-end @ IIPC 2014

  • 1.
  • 2. HARD CONTENT, FAB FRONT-END Archiving websites of the Dutch Public Broadcasters 23-5-2014 Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision IIPC | 21 May 2014 | BnF, Paris
  • 3. Nederlands Instituut voor Beeld en Geluid Sound and Vision • 70% of Dutch AV heritage • > 850,000 hours • 2M photos •20,000 objects • Large paper archives
  • 4.
  • 5. “The Archive as a Laboratory” Web archiving since 2008 (LiWA, several pilots) with various objectives
  • 6. NTR PILOT (2013-2014) 23-5-2014 WHY: • Saving websites selected to be taken offline • Getting insights in user requirements • Create great front and back-end • Provide public access • Shape future plans
  • 10. USER REQUIREMENTS, PT. 1 Phase 1: Focus group
  • 11.
  • 12.
  • 13. USER REQUIREMENTS SUMMARY • Communication and information e.g. “As a user, I can suggest a website that should be archived” • Metadata e.g. “As a user, I can see the crawl date for each archived URL” • Searching e.g. “As a user, I can search full-text through a single archived website” • Visualisation e.g. “As a user, I can see side-by-side comparisons of the same URL that was archived at different moments in time”
  • 19. USER REQUIREMENTS, PT. 2 Phase 2: Usability tests think-aloud, 60-90 minutes x 2: • 37, PostDoc web archive research project • 58, Multimedia editor at a Dutch public broadcaster x 3: • 44, Crawl engineer • 50, Manager digital projects at a Dutch public broadcaster • 58, Freelance (archive) researcher & journalist
  • 20. LESSONS-LEARNED UI/UX + Clean, visual look - More functionality explanations COMMUNICATION + FAQ contains good info about web archiving - Info about status + plans / More info about scope and size of web archive METADATA + Overview of outgoing links - TMI / Creation + last change of website SEARCHING + Fast! + Thumbnail previews - Search by URL - More filtering options - Relevance ranking VISUALISATION / More stats, e.g., % text - Highlight differences crawls USERS & USAGE + Current groups representative - No av-streaming big loss for all / Add more fine-grained subgroups
  • 21. FUTURE WORK WEB ARCHIVES: CONTEXT COLLECTIONS “Public broadcaster web archives will help you learn where you come from” -- Usability test participant • We need to be more dynamic than the websites we archive • We can and must achieve public access • We are moving from pilot to standard practice • Connect crawls to catalogue • Increase public broadcaster cooperation