Who Updates Publisher Metadata & Why? Downstream Vendors On What Happens To Publisher Metadata In The Supply Chain
5/30/2013
Good metadata is the first step to populating industry databases and online sites. Good metadata leads to higher sales! Participate in this session and get critical answers on who updates publisher metadata and why. Barnes & Noble, will share insight into examples of metadata feeds received, and how incomplete or inaccurate information can lead to reduced sales! By using real world examples, speakers from Ingram Content Group, Bowker and Baker & Taylor will explore: Which metadata elements are often changed why are they changed? Which metadata elements are often added and why? How are needed changes or additions identified? What processes are involved in changing or adding metadata? Where does the vendor metadata go after changes and enhancements are made? What are the benefits of changing or enhancing publisher metadata?
2. Moderator and Panelists
Moderator
• Renée Register, DataCurate LLC
Panelists
• Sam Dempsey, VP of Data Management, Baker & Taylor
• Patricia Payton, Senior Manager, Publisher Relations and
Content Development, Bowker
• Richard Stark, Director, Product Data, Barnes & Noble
• George Tattersfield, VP of Merchandising, Ingram Book Group
3.
4. Downstream Metadata Survey
1. Which metadata elements are often changed and why
are they changed?
2. Which metadata elements are often added and why?
3. How are needed changes or additions identified?
4. What processes are involved in adding or changing
metadata?
5. Where does vendor metadata go after changes and
enhancements are made?
6. What are the benefits of changing or enhancing
publisher metadata?
5. Most Common Changes
• Title/Subtitle
• Contributor
• Series
• Subject
• Publication Data
• Status/Availability
6. Why Change Title/Subtitle?
• Non-title information included in the title
field is moved to correct fields to improve
display and search
– Common examples:
• Edition
• Binding
• Series
• Translated title
• Capitalization is standardized
7. Why Change Contributor?
• Make names consistent across titles by
using a name authority file
– Standardize variations in names
– Ensure that titles are associated with the correct
contributor name
– Some contributor work is automated but some
must be reviewed manually
8. Why Change Contributor?
• Move Information included in the
contributor field to correct fields to
correct fields to optimize search, sorting,
and display
– Examples:
• Degrees (PhD, etc.)
• Titles (Dr., Pope, etc.)
• Birth/death dates.
9. Why Change Series?
• Move series names from incorrect fields to the
series field
– Allows search by series
• Regularly review books in series
– Ensure they are connected to the correct series
– Ensure consistent information across titles
10. Why Change Subject?
• Replace general subjects with more specific
subjects
• Replace sent subjects with clearly more
accurate subjects
– Examples:
• Title is coded with both JNF (Juvenile non-fiction) and
JUV (Juvenile fiction) BISAC codes
• Title is coded with both HIS (History) and FIC (Fiction)
BISAC codes
11. Why Change Publication Date?
• Vendors work closely with publishers on
publication date changes
• May update a publication date to reflect the
actual date that the book is available to
consumers
12. Why Change Status/Availability?
• Vendors regularly compare prices received in
metadata files to actual invoiced prices
Why Change Price?
• Updated based on purchase order
acknowledgements and other supply chain
messages
• For example, physical receipt of a product
may cause a change in status/availability
13. Common reasons for changes
• Standard quality checks identify changes needed
– Vendors routinely confirm metadata received against
the published book
– Publisher metadata moved to correct ONIX field
– Periodic data reviews are also common and attempt
to ensure:
• Consistency across different formats of the same title
• Consistency within a series
• Consistency and standardization of contributor names
14. Metadata most often added
• Edition
• Series
• Contributor Biography
• Awards
• Media mentions
• Bestseller mentions
• Title linking across
formats
• Age and grade level
• Metadata licensed or
• Reviews
• Tables of contents
• Descriptions
• Covers
• Proprietary subjects
and descriptors
• Dewey Decimal
Classification or other
library metadata
15. Common reasons for adding metadata
• Known information about the book is not supplied
by publisher
• Elements are added to enhance user experience
and increase discovery and sales
• Proprietary data for the vendor is added
• Additional metadata is needed for the library
market
• Formatting is added to allow titles to link
automatically
16. Ingram Book in Hand changes made (1600 titles tested)
Changes made # Changes %
Features Added 1389 86.8%
Illustrations Added 960 60.0%
Page Count 917 57.3%
Image Scanned 623 38.9%
Country of Manufacture Added 474 29.6%
Contributor Changed 374 23.4%
Contributor Added 329 20.6%
Series Added 261 16.3%
Title Changed 237 14.8%
Contributor Deleted 237 14.8%
Subtitle 205 12.8%
Series Changed 166 10.4%
Edition 161 10.1%
18. <title>
<b202>01</b202>
<b203>The First Phone Call From Heaven CD</b203>
<b029>TheFirst Phone Call From Heaven CD</b029>
</title>
ONIX
- Format in Title field; use <b012>
- Title repeated, with Format, in Subtitle field
Title and Subtitle
19. <title>
<b202>01</b202>
<b203><![CDATA[This Heart Within Me Burns – From
Bedlam to Benidorm (Revised & Updated)]]></b203>
<b030><![CDATA[This]]></b030>
<b031><![CDATA[Heart Within Me Burns - From Bedlam
to Benidorm (Revised & Updated)]]></b031>
</title>
ONIX
- Subtitle included in Title field, no Subtitle field included;
use <b029>
- Edition values included in Title field; use <b056><b058>
21. <Title>How Ottawa Spends Series,Volume 33</Title>
<Sub_Title>HowOttawa Spends, 2012-2013 : The Harper
Majority, Budget Cuts, and the New Opposition</Sub_Title>
Proprietary XML format, not ONIX
- Title includes Series Name, Volume Number; use
<b018><b019>
- Subtitle repeats Series Name, includes distinctive title
elements
22. Excel
- Series and Series Number in Title field
ISBN Title Series Title
9781622861132
Losing Hope: Book
One of the Sienna St.
James Series Sienna St. James
23. Excel
- Series Number in Title field, not in Series Number
ISBN Title Series Title
9780761368175
#06 Sherlock Holmes
and the Adventure of
the Sussex Vampire
On the Case with
Holmes and
Watson
28. Best Practices for Title
- Properly field elements:
- Subtitle, Series, Series #, Edition, Edition #, Volume,
Formats, lead articles, etc.
- Match product’s Title page
- Make consistent across formats, editions, series
- Watch for misspellings, capitalization, abbreviations,
truncations, special characters, etc.
- Send “consumer/patron” ready
30. Good practice: Contributor names
• As distinct fields, the least you can do is:
– First name – all names preceding the indexing
name of a person (include a middle name here)
– Last name – the key or indexing name of a person
OR
• Corporate Contributor – if it’s not a person,
don’t use the fields for people, use this
instead.
31. Good practice: Contributor names
Titles before names or prefixes to entire names
Pope John Paul II
Names before key name (includes given names as appropriate)
Robert Louis Stevenson
Prefixes to key name(s)
Simone de Beauvoir
Key name(s) (usually the family name)
Gabriel García Márquez
Names after key name(s) (including given names where appropriate)
Mao Zedong
Suffix after key name(s)
Reverend Adam Clayton Powell, Jr.
Qualifications and honors after name(s)
Stephen LaBerge, Ph.D.
Titles after name(s)
Sarah Ferguson, Duchess of York
32. Good practice: Multiple contributors
• Support for three to five contributors is a
minimum in trade retail records, but
accurately listing everyone associated with the
book is best.
• Support sequence numbering:
– Author order is important and metadata records
should give a clear order for display.
33. Good practice: Contributor role(s)
• You need to specify who did what
– “author” is code “A01”
– “editor” is “B01”
– “illustrator” is “A12”
• If one contributor fulfilled two roles, repeat
the contributor and assign each entry its own
role code.
34. Contributor Name
Mapping all variants to an authority controlled form of the name ensures that
all titles by Leo Tolstoy will be grouped together rather than scattered across
multiple versions of his name.
35. Contributor Authority Control
Tolstoi , Count Lev
Tolstoi , Graf Leo N.
Tolstoi , León
Tolstoi , Lev Nicolaevici
Tolstoi , Lew Nikolajewitsch
Tolstoi , Liev Nikolaievich
Tolstoi , Count Lvof N.
Tolstoj , Lev Nikolaevitch
Tolstoy, Count Lev N.
Tolstoy, Graf Leo
Tolstoy, Leo Nikolayvich
Tolstoy, Leo Nikolaevich
Tolstoy, Graf Lev
Tolstoy, Lyof N.
Leo Tolstoy
ISNI: 0000 0004 0006 2877
36. • Each Public identity of a given Party gets its own ISNI:
• Lewis Carroll & Charles Lutwidge Dodgson get separate
ISNIs
• Separate identities that are the same person can be
linked
• Authors of the same name get distinct ISNIs:
• Thomas Wolfe
• Tom Wolfe, author of The Bonfire of the Vanities
• Tom Wolfe, author of numerous books on
woodcarving
ISNI: International Standard
Name Identifier
37. • Günter Graß, Guenter Grass and GuenterGraßare
character set variances of the same Public Identity
• Ciaikovsky, PjotrIljcand ПётрИльичЧайковскийare
transliteration variances of the same public identity
and also receive the same ISNI.
• Pyotr Tchaikovsky and Peter Tchaikovsky are
linguistic variances of the same public identity and
again receive the same ISNI
Character-Set & Spelling
Differences Don’t Change the ISNI
38.
39. Good practice: Review quotes
<productidentifier>
<b221>03</b221>
<b244>9780520262508</b244>
</productidentifier>
<othertext>
<d102>08</d102>
<d104>A gorgeously produced collection of maps and essays.</d104>
<d107>NikilSaval</d107>
<d108>The Los Angeles Review of Books</d108>
<d109>20101103</d109>
</othertext>
<othertext>
<d102>08</d102>
<d104>Inventive and affectionate.</d104>
<d107>LiseFunderburg</d107>
<d108>The New York Times Book Review</d108>
<d109>20101205</d109>
</othertext>
<othertext>
<d102>08</d102>
<d104>A many-layered treat for any San Franciscan who thinks she knows her
city.</d104>
<d107>Danielle Sommer</d107>
<d108>KQED</d108>
<d109>20101018</d109>
</othertext>
Infinite City:
A San Francisco Atlas
by Rebecca Solnit
ISBN: 9780520262508
<d102>tag identifies the type
of text provided. Value 08
indicates that the text is a
review quote.
<d107>tag identifies the author of
the provided text. This tag is only
used when there is a named
author/reviewer.
<d108>tag identifies the corporate source of
the provided text. The publication where a
review appeared would be placed here.
<d109>tag identifies the publication date of
the provided text (review).
<d104>tag identifies the actual text of the
review.
40. Bad practice: Review quotes
<othertext>
<d102>18</d102>
<d104><![CDATA[<DIV><P>What makes a place? <I>Infinite City</I>,
Rebecca Solnit’s brilliant reinvention of the traditional atlas, searches out
the answer by examining the many layers of meaning in one place, the San
Francisco Bay Area. Aided by artists, writers, cartographers, and twenty-
two gorgeous color maps, each of which illuminates the city and its
surroundings as experienced by different inhabitants, Solnit takes us on a
tour that will forever change the way we think about place. She explores
the area thematically—connecting, for example, Eadweard Muybridge’s
foundation of motion-picture technology with Alfred Hitchcock’s filming of
Vertigo. Across an urban grid of just seven by seven miles, she finds
seemingly unlimited landmarks and treasures—butterfly habitats, queer
sites, murders, World War II shipyards, blues clubs, Zen Buddhist centers.
She roams the political terrain, both progressive and conservative, and
details the cultural geographies of the Mission District, the culture wars of
the Fillmore, the South of Market world being devoured by
redevelopment, and much, much more. Breathtakingly original, this atlas
of the imagination invites us to search out the layers of San Francisco that
carry meaning for us—or to discover our own infinite city, be it Cleveland,
Toulouse, or Shanghai. </P><P> “A gorgeously produced collection of maps
and essays.” —Nikil Saval, <I>Los Angeles Review of Books</I></P><P>
“Inventive and affectionate.” —Lise Funderburg, <I>New York Times Book
Review</I></P><P> “A many-layered treat for any San Franciscan who
thinks she knows her city.” —Danielle Sommer,
<I>KQED</I></P></DIV>]]></d104>
</othertext>
Infinite City:
A San Francisco Atlas
by Rebecca Solnit
ISBN: 9780520262508
Using CDATA tags is bad
practice. Data providers
should use XHTML to
provide information on
formatting the text.
Using <DIV> tags is bad is
bad practice. Separating
a section of text can be
handled using a <P> tag.
Concatenating review
quotes at the end of the
jacket copy or main
description is BAD
practice.
53. First Book in Series
<othertext>
<d102>08</d102>
<d103>02</d103>
<d104><![CDATA[
"A fantastic new dystopian series...Littlefield's compelling writing will keep
readers turning pages late into the night to find out what happens next.
Outstanding!" Top Pick, 4 1/2 stars<P> -RT Book Reviews]]>
</d104>
</othertext>
<othertext>
<d102>08</d102>
<d103>02</d103>
<d104><![CDATA[
<P>"A series starter that will appeal to fans of Jeff Kinney and DavPilkey . . . A
treat for all . . . And guess what? It makes for good tree house reading." --
<I>Booklist</I><P>"Fans of Jeff Kinney’s “Diary of a Wimpy
Kid”(Abrams) and Lincoln Peirce’s “Big Nate”
series (HarperCollins) will be drawn to this book, while parents will enjoy the
absence of snarky humor." -- <I>School Library Journal</I>]]>
</d104>
</othertext>
54. International Titles in USA Feeds
<title>
<b202>01</b202>
<b203>The Science of Discworld IV</b203>
<b029>Judgement Day: It's Wizards Vs Priets in a Battle
for the Future of Roundworld</b029>
</title>
56. • As a general rule a title cannot have both the parent and child
of a specific subject tree
• We aim for consistency across formats
• We try to assign categories based on content, not on the
merchandising plans of the publisher
• Juvenile and YA books both get JUV or JNF subject headings,
but not adult headings
• A book cannot be both fiction and nonfiction
• The three “ media tie-in” subjects (FIC, JNF, and JUV)
should never be in first position
• We try to avoid the General subjects
Main reasons for changing subjects
57. A deep, thought-provoking novel of love, loss, civil
unrest, and basketball. It is 1989, and Jim
Keating has hit absolute rock bottom. He’s
lost his wife to cancer, his house to
bankruptcy, and his job as a college
basketball coach to what many outsiders
believed to be a racially insensitive career-
ending decision. He has also just about lost
his mind, having slipped into a bout of
serious depression. Attempting to pick up
the pieces and start life over, Jim returns
home to Worcester and rents a small
apartment.
… Soon, Jim finds himself in Burundi, Africa,
where he is to create a basketball league that
will bring two warring tribes—the Hutus and
Example 1
An African Rebound: A Novel by Dan Doyle
Fiction coded as non-fiction
58. Hardcover Ingram
BISAC change
Ingram
Regional Theme
added
HIS001010
History-Africa-Central
FIC038000
Fiction – Sports
Africa
SPO004000
Sports - Basketball
Example 1
Fiction coded as non-fiction
59. A local guide leads adventurers on safaris into
three surprisingly varied worlds: the
bushveld, a river and a desert. Each
environment brings its unique flavor to the
adventure; multiple images (both photos and
illustrations) and interesting text introduce
the region's fascinating wildlife with its
intriguing habits and lifestyles. Readers are
encouraged to play along by means of
engaging activities, games and quizzes, with
answers at the end of each section. Nature's
edible treats and useful props are ...
Example 2
Getting Bushwise on Safari: A Young Explorer’s Guide
by Nadine Clark
Juvenile coded as adult
61. DAVID BUSCH'S CANON POWERSHOT G15 GUIDE
TO DIGITAL PHOTOGRAPHY shows you how,
when, and why to use all the features,
controls, and functions of the G15 to take
great photographs and movies. Introductory
chapters will help you get comfortable with
the basics of your camera before you dive
right into exploring creative ways to apply file
formats, resolution, and exposure. Beautiful,
full-color images illustrate where the
essential buttons and dials are, so you'll
quickly learn how to use the Canon
PowerShot G15, and use it like a pro!
Example 3
David Busch’s Canon Powershot G15
Incorrect BISAC subject
63. Jerry Harris was a self-made California
millionaire who, at age forty-four, had it
all: booming businesses, yachts, a
mansion, a beautiful wife, and a voice to
rival Elvis. No one who knew this well-
liked, generous man could make sense of
his sudden disappearance one autumn
night. …
Example 4
In the Name of Love by Ann Rule
Incorrect BISAC subject
64. Publisher BISAC
Mass market
paperback
Publisher BISAC
Trade paperback
Publisher BISAC
Audio
Ingram BISAC
Change
TRU002000
True Crime – Murder
– General
TRU002000
True Crime – Murder
– General
TRU003000
True Crime –
Organized Crime
TRU002000
True Crime –
Murder –
General
LIT000000
Literary Criticism –
General
LIT000000
Literary Criticism –
General
Example 4
Incorrect BISAC (Literary Criticism)
Audio version different
65. Detailed travel map, scale 1:250,000, with an
inset of central Mbabane, a distance chart
and text information for visitors.
Example 5
International Travel Maps Swaziland
Adult with juvenile BISAC subject
66. Publisher BISAC Ingram BISAC Change
JUV000000 Juvenile
fiction – General
TRV002000 Travel –
Africa – General
TRV027000 Travel –
Maps & Road Atlases
Example 5
Adult with juvenile fiction BISAC subject
67. Identifying Needed Changes
• Strongly tied to processes developed for evaluation
and manipulation of inbound metadata
• Routinely administered audits of the entire
database
• All the vendors maintain specialists for file review
and manual metadata activities
68. Automated Processes
• Audits of inbound or recently added metadata to
detect missing metadata, formatting problems, and
other errors
• Metadata correction, normalization, addition of
proprietary data, and data mapping
• Linking to name authority files
• Establishing links and checking for consistency
across titles, e.g. the same title in different formats,
titles belonging to the same series
• Addition of licensed metadata, e.g. reviews, tables
of contents, etc
69. Staff Processes
• Book-in-hand review of metadata (for the
bookselling vendors)
• Close review of high-demand and high-profile titles
• Manual changes and additions based on audit
reports
• Changes and additions in response to internal staff
(buyers, store personnel, etc.), consumer, and
publisher reports
• Communication with publishers to resolve issues
discovered through automated or manual processes
70. Where Changes Go
Proprietary databases used for:
1. To support internal processes such as ordering,
invoicing, etc
2. To populate consumer-facing websites
3. To create licensed data products and services
such as Books in Print and ongoing data feeds to
subscribing customers
71. Benefits of Changing Metadata
• Vendor panelists invest significant financial,
technical, and staff resources on metadata to:
– Increase sales
– Improve search engine discoverability and customer
experience
– Create efficiencies in
• Internal processes
• Supply chain communication
• Transactions
Good metadata = better sales and reduced
costs
72. Questions?
This presentation will be available from the BEA
website after the conference.
Or contact Renée Register for a PDF of the
presentation.
reneeregister@datacurate.com