Presentation at the International Internet Preservation Consortium conference 2014 in Paris on the web archiving project the Netherlands Institute for Sound and Vision did together with Dutch public broadcaster NTR.
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Hard Content, Fab Front-end @ IIPC 2014
1.
2. HARD CONTENT, FAB FRONT-END
Archiving websites of the Dutch Public Broadcasters
23-5-2014
Lotte Belice Baltussen | R&D, Netherlands Institute for Sound and Vision
IIPC | 21 May 2014 | BnF, Paris
3. Nederlands Instituut voor
Beeld en Geluid
Sound and Vision
• 70% of Dutch AV heritage
• > 850,000 hours
• 2M photos
•20,000 objects
• Large paper archives
4.
5. “The Archive as a Laboratory”
Web archiving since 2008 (LiWA, several pilots) with various objectives
6. NTR PILOT
(2013-2014)
23-5-2014
WHY:
• Saving websites selected to be taken offline
• Getting insights in user requirements
• Create great front and back-end
• Provide public access
• Shape future plans
13. USER REQUIREMENTS SUMMARY
• Communication and information
e.g. “As a user, I can suggest a website that should be archived”
• Metadata
e.g. “As a user, I can see the crawl date for each archived URL”
• Searching
e.g. “As a user, I can search full-text through a single archived website”
• Visualisation
e.g. “As a user, I can see side-by-side comparisons of the same URL that was
archived at different moments in time”
19. USER REQUIREMENTS, PT. 2
Phase 2: Usability tests
think-aloud, 60-90 minutes
x 2:
• 37, PostDoc web archive research project
• 58, Multimedia editor at a Dutch public broadcaster
x 3:
• 44, Crawl engineer
• 50, Manager digital projects at a Dutch public broadcaster
• 58, Freelance (archive) researcher & journalist
20. LESSONS-LEARNED
UI/UX
+ Clean, visual look
- More functionality explanations
COMMUNICATION
+ FAQ contains good info about
web archiving
- Info about status + plans
/ More info about scope and size
of web archive
METADATA
+ Overview of outgoing links
- TMI
/ Creation + last change of
website
SEARCHING
+ Fast!
+ Thumbnail previews
- Search by URL
- More filtering options
- Relevance ranking
VISUALISATION
/ More stats, e.g., % text
- Highlight differences crawls
USERS & USAGE
+ Current groups representative
- No av-streaming big loss for all
/ Add more fine-grained
subgroups
21. FUTURE WORK WEB ARCHIVES:
CONTEXT COLLECTIONS
“Public broadcaster web archives will help you learn where you come from”
-- Usability test participant
• We need to be more dynamic than the websites we archive
• We can and must achieve public access
• We are moving from pilot to standard practice
• Connect crawls to catalogue
• Increase public broadcaster cooperation