3. History
► Library restructure in 1995
► Individual specialist roles dissolved and each
professional member of staff given many hats
► No metadata/cataloguing specialist from that
time until November 2016
► Small group of staff did their best to catalogue
in the interim
4. The state of the catalogue
► Authority control was lacking
► Many fields missing or incorrect
► Local subject index
► Local subject headings
► No LCSH in some records
► Hybrid e-book and print book records
► Split multi-volume works
5. MarcEdit
► First created in 1999 to
enable a data clean-up
project at Oregon State
University.
► Developed by Terry Reese
and updated by him
regularly
► Offered as a free download
► Has an enormous array of
functionality built into it
7. Authority control
► Authority control: established, unique,
consistent forms of terms for
disambiguation and collocation
► Project scope: to authorise the name and
subject headings in Sierra
► All records were in scope except PDA
records as these were not purchased
8. Data extraction and manipulation
1. Extract records in scope from Sierra and
save them locally
2. Use MarcBreaker
3. Validate name headings and embed URIs
10. Data extraction and manipulation
1. Extract records in scope from Sierra and
save them locally
2. Use MarcBreaker
3. Validate name headings and embed URIs
4. Extract 1XX, 7XX headings and URIs and
copy to Notepad++
12. Data extraction and manipulation
1. Extract records in scope from Sierra and
save them locally
2. Use MarcBreaker
3. Validate name headings and embed URIs
4. Extract 1XX, 7XX headings and URIs and
copy to Notepad++
5. Use regular expressions to extract just the
LCCN
6. Make the LCCNs searchable via z39.50
14. To sum up
► MarcBreaker
► Validate headings
► Normalise data for searching
► z39.50
15. Module codes
► Project scope:
Update all reading list items in Sierra with
the current course code
► Concerns these were out of date
► Codes in Sierra still important for current
workflows
23. Finishing
► Load dummy MARC records using custom
load table
► Matches on bibliographic number
► Only importing 980 field
► Use Sierra Global Update function to
update 900 module code field
24. Reclassification
► Some areas of the collection classified to an old
standard
► Split collections with shelf ready records
► Too many to individually reclassify
► MarcEdit function “Generate classification” based
on OCLC's Classify
► Project scope: 301-307, just over 7,000 titles
► Import the classification the same way as module
codes via local field 982
25. Pros …and cons
Pros:
► Tool is fast and easy to use
► Lots of extra functionality such as FAST headings
► Accurate, up-to-date classification (mostly)
Cons:
► It relies on ISBN and author/title matching
► Some errors
► Some things simply not found
26. Metadata enhancement
► Project scope:
Improve the metadata of Aston legacy records
starting with records lacking LCSH by fishing for
records
► Data preparation
► z39.50 search
► Data enhancement
► RDA
► Linked data?
27. Normalise data, z39.50 searching
► Export target records from Sierra
► Search for and extract data points such as
ISBN, title, main author, date of publication
► Normalise data, e.g. remove fluff from 020s,
use only title proper, fixed dates, use
surnames only
► Make these data searchable via z39.50
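The normalisation steps listed above can be sketched in Python. This is a minimal illustration, not the actual workflow (which used Notepad++ and regular expressions); the helper names and sample field values are my own assumptions.

```python
import re

def normalize_isbn(field_020: str) -> str:
    """Strip qualifiers and hyphens from an 020, keeping only the ISBN itself."""
    # Drop a trailing qualifier like "(pbk.)" then remove everything
    # that is not a digit or a check-digit X
    return re.sub(r"[^0-9Xx]", "", field_020.split("(")[0]).upper()

def title_proper(field_245: str) -> str:
    """Keep only the title proper: cut at the first ISBD delimiter (: / = ;)."""
    return re.split(r"\s*[:/=;]", field_245)[0].strip()

def surname_only(field_100: str) -> str:
    """Keep only the surname from an inverted 'Surname, Forename' heading."""
    return field_100.split(",")[0].strip()

def fixed_date(field_260c: str) -> str:
    """Reduce a publication date like 'c1998.' or '[2004]' to four digits."""
    m = re.search(r"\d{4}", field_260c)
    return m.group(0) if m else ""

print(normalize_isbn("978-0-19-852011-9 (pbk.)"))   # -> 9780198520119
print(title_proper("Introduction to cataloguing : a practical guide /"))
print(surname_only("Smith, John,"))                  # -> Smith
print(fixed_date("c1998."))                          # -> 1998
```

Search points normalised like this match far more reliably across catalogues, since transcription practice varies so much between records.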
31. Lessons learned and concluding remarks
► Data normalisation takes time and is important
► Take care to document everything and make your
file metadata clear
► Using Box really helped with reversing mistakes
► Search files need to be manageable (probably no
bigger than 1000 records)
► Trial and error and Google are your friends
► Questions?
Speaker notes
In 1995 Aston Library was restructured. The main outcome of this restructure was to dissolve the different specialist roles within the library. Each member of professional staff in the Information Resources division had multiple roles and in theory did a bit of everything. Cataloguing and metadata were relegated to one member of staff who “minded” the catalogue in this period. A small group of non-professional staff did their best to keep the catalogue together. Aston adopted essentially full shelf-ready supply in 2008 (and the quality of records improved from that time).
This is not a fully comprehensive list of the issues with the catalogue but it gives a good impression. As I discovered these issues, I began to work out ways that I might be able to fix them. Additionally, I was tasked with lightening the burden of cataloguing on the Information Assistants. I prioritised my efforts first on streamlining workflows as these produce tangible results.
For those unfamiliar with MarcEdit, it is free software created and curated by Terry Reese. He initially created it for a metadata project in 1999 and later added a GUI and a whole host of useful functionality.
I will now briefly describe some metadata projects I undertook to improve the legacy data and our workflows.
Authority control is the process of using a set of standards to create an established, unique and consistent form of a term for disambiguation and collocation.
At Aston, there was a scrappy local name index that had been obsolete for 20 years and did not correspond to the NACO forms of names in our vendor and shelf-ready records. My goal was to establish authority control with NACO headings. Every record in the system was in scope for the project, with the exception of our PDA records, as these were not purchased items.
I will describe this process in a little detail as each project follows a similar path.
To begin, I use Sierra's Create Lists function to isolate sets of records that I want to use and save these separately, either on my local machine or in Box (a cloud-based file-sharing site).
I use MarcBreaker, which creates a text-readable form of the MARC records that can be easily manipulated.
I use the Validate Headings function in the text editor and embed URIs, which are my target for this project.
I extract the 1XX and 7XX fields using the “search all” function and put these into Notepad++.
Here is a view of the extracted data. My target in the URI is the LCCN.
I use regular expressions to extract just the LCCN from the URI and then make this searchable via z39.50.
@attr 1=9 will search the 010 MARC field in authority records. I then use MARCEdit z39.50 client to search the NAF (Name Authority File) for these authorities.
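The LCCN extraction step can be sketched in Python. The sample URIs and the exact pattern here are illustrative assumptions; the real work was done with regular expressions in Notepad++.

```python
import re

# Hypothetical sample of embedded authority URIs (id.loc.gov style)
uris = [
    "http://id.loc.gov/authorities/names/n79021164",
    "http://id.loc.gov/authorities/names/no2015123456",
]

# The LCCN is the final path segment: a lowercase prefix plus digits
lccn_pattern = re.compile(r"/([a-z]+\d+)$")

lccns = [m.group(1) for uri in uris if (m := lccn_pattern.search(uri))]
print(lccns)  # -> ['n79021164', 'no2015123456']
```

Each extracted LCCN then goes into the z39.50 search file, queried with @attr 1=9 against the NAF.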
These steps summarise the pattern common to many of the projects: use MarcBreaker to get text records that are easy to manipulate; do something to the records in this form (in this case validating headings); normalise the data using Notepad++ and regular expressions (this step often happens more than once); then use z39.50 or load tables in Sierra to apply the changes.
This project was one to improve the internal housekeeping for our reading list records.
As part of the workflow for adding items to reading lists, module codes for these courses were added to Sierra in a local, indexed field. These were done manually by staff both adding and removing. There was concern that these were out of date and things might be missed. It was determined that this was still a valuable thing to maintain but we wanted to automate it.
Here is a view of the data extracted from Talis. The two pieces of data I'm interested in are the reading list name and code, and the LCN (local control number).
I take these fields and a few others to make an Excel file ready for translating the data from text to MARC. I include the extra fields to make the dummy MARC records friendlier for human readers.
MARCEdit’s delimited text translator can turn tab separated data into MARC records.
Each field (or column) needs an argument so MarcEdit will place each piece of data in the right MARC field.
Once this is done there will be several arguments listed, and these can be more complex than this example.
What I generate at the end is a dummy MARC record. The dummy record contains the reading list code and the record number. Using the record number I can input the reading list code into Sierra.
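A dummy record of this kind can be sketched in MarcBreaker text (.mrk) form. This is only an illustration: the choice of 907 for the Sierra record number and 980 for the module code payload, and the sample data, are my own assumptions, not necessarily the fields used at Aston.

```python
# Hypothetical tab-separated input: (Sierra bib number, module code, list name)
rows = [
    ("b12345678", "BIO101", "Introductory Biology"),
    ("b23456789", "CHE202", "Organic Chemistry"),
]

def dummy_record(bib_number: str, module_code: str, list_name: str) -> str:
    """Build one .mrk-style record holding just the match point and payload."""
    lines = [
        "=LDR  00000nam a2200000 a 4500",
        f"=907  \\\\$a.{bib_number}",                # match point: Sierra bib number
        f"=980  \\\\$a{module_code}$b{list_name}",   # payload: module code + list name
    ]
    return "\n".join(lines)

# Records are separated by a blank line in .mrk files
mrk = "\n\n".join(dummy_record(*row) for row in rows)
print(mrk)
```

A custom load table then matches each dummy record on the bib number and imports only the payload field, ready for Global Update.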
Reclassification is one of the simplest processes I run. In the age of shelf-ready supply it is important to keep collections up to date as new class ranges are formed or discontinued; otherwise this leads to split runs of books on the same topic. I extract the records from Sierra, run MarcEdit's “Generate classification”, and then load the new classmarks back into Sierra into a local field the same way I load the module codes.
The scope of the project is potentially any legacy record that is not as good as it could be for discovery and access. However, to begin with I selected all those records that lacked LCSH. These records had other issues I outlined at the start, such as inaccurate headings (or no headings), missing data and so on. I followed various steps, from normalising the search data to enhancing the records with automatic processes in MarcEdit. It even gave me an opportunity to add some linked data elements to our records.
This summarises the steps I take.
I export the target records from Sierra and put them through MarcBreaker. From here I extract different data points from the records to form a search strategy. This data needs significant normalisation, as metadata transcription practices vary enormously. Once I have the normalised data points I make them searchable via z39.50.
This example shows a sample search for the author (surname only), the ISBN, the title proper and the publication date.
I select sources I have access to where better records are likely to exist, in this case the Library of Congress, and run a custom search. I then analyse the results and see what was accepted and rejected. Sometimes records do exist but the data points don’t match, such as the date or even the title proper. All those that fail to match I save and try another source such as the British Library.
At this point I take the downloaded record and match it to the original Sierra record using the Merge Records function. I insert the Sierra local number into the improved external record and run the MarcEdit automatic transformations, adding FAST headings, making some programmatic RDA changes and adding linked data points.
For all of these projects I have learned the importance of data normalisation. It has made me more aware of consistency in transcription and showed me fairly clearly the limitations of MARC for machine processing.
Documentation is very important: recording what was done and what needs to be done really helps to keep the projects coherent. I place high importance on naming my files as descriptively as possible and, where I can, recording notes or comments.
Another important feature of the projects was using the online file-sharing site Box. It creates a new version every time I save, so any mistakes I make, including when editing MARC records, can be undone if I inadvertently save them.
When dealing with batches of records I find it is vital to break them down into smaller chunks. It adds a bit of time but it makes dealing with the records much more manageable.
And finally, trial and error really does work. I frequently had an idea, tried it out, found it didn't work and amended it until it did.