How to import diacritics into CONTENTdm 
from a library catalog using Excel and 
MarcEdit
Jill Strass
This talk was inspired by our struggles to digitize some
Nordic Solo Songs as collected by Dan Dressen and
bravely cataloged and uploaded by Kathy Blough.
Jill Strass
St. Olaf College
Upper Midwest Online CONTENTdm Conference
November 8‐9, 2010
The Challenge
• Shortcut to metadata: obtain MARC records
containing diacritics from a library catalog
as a tab‐delimited file for easy import into
CONTENTdm
The Method
• Export our records from the library catalog
as a delimited file
• Use the tab‐delimited file to generate
metadata for CONTENTdm
• Upload as a compound object into
CONTENTdm
The Challenge
• Uh oh, we have an export bug that won’t
allow us to cleanly export fields with
repeating values from the catalog to a
delimited file.
The Workaround – Catalog to MarcEdit
• Uh oh, we have an export bug that won’t allow us to
cleanly export fields with repeating values from the
catalog to a delimited file.
• No worries, we’ll use MarcEdit.
• Convert the tab‐delimited file (.out) from the catalog
into an .mrc format file using MarcEdit.
• Take the .mrc file and export it using MarcEdit’s tool
for tab‐delimited files.
• In MarcEdit, we choose which MARC fields we want
for our metadata in digital collections.
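The export step in the workaround can be pictured in a few lines of Python. This is only a sketch of the idea, not what MarcEdit does internally. The record layout (a dict mapping MARC tags to lists of values), the made-up sample values, the chosen tags and the "; " separator for repeated values are all assumptions for illustration.

```python
import csv
import io

# Hypothetical, already-parsed records with made-up values. Each MARC tag maps
# to a list because fields such as 650 (subject) can repeat within one record.
records = [
    {"245": ["Sånger"], "100": ["Example, Åsa"], "650": ["Songs", "Piano music"]},
    {"245": ["Visor"], "100": ["Example, Nils"]},
]


def to_tab_delimited(records, tags, repeat_sep="; "):
    """Write one row per record, joining repeated values of a tag into one cell."""
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(tags)  # header row: the MARC tags we chose to keep
    for record in records:
        # A tag that is missing from a record becomes an empty cell.
        writer.writerow([repeat_sep.join(record.get(tag, [])) for tag in tags])
    return out.getvalue()


text = to_tab_delimited(records, ["245", "100", "650"])
print(text)
```

Joining repeated values into a single cell keeps one row per record, which is the shape a spreadsheet needs. Any separator works as long as it does not occur in the data.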
The Trick to know in MarcEdit for diacritics
• Use the MarcEdit Characterset Translation
tool, and while breaking the record, select
UTF‐8 as the format, so Excel can recognize
diacritic characters.
Note that the box for Translate to UTF-8 is
checked.
Yippee! If you look real close, you can see
diacritics are showing up in the text editor in
MarcEdit.
Trick for Diacritics in Excel
• Now we have our diacritics within a tab‐delimited
file, courtesy of MarcEdit.
• There is a trick you’ll need to use when you
first open Excel.
When you first open your tab-delimited file from
MarcEdit and Excel takes you through its wizard for
importing the tab-delimited file, select 65001 Unicode
(UTF-8) from the File Origin pull-down menu.
This will allow Excel to “see” the diacritics.
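Why the File Origin choice matters can be shown with a small Python sketch. It assumes the alternative to 65001 is a Windows "ANSI" code page such as 1252, which is common on Western-language Windows but may not match your setup. The sample row is made up.

```python
import csv
import io

# Made-up sample data, encoded the way a UTF-8 export would be.
raw_bytes = "Title\tComposer\nSånger\tÅsa Example\n".encode("utf-8")


def read_rows(data, encoding):
    """Decode tab-delimited bytes with the given encoding and return the rows."""
    text = data.decode(encoding)
    return list(csv.reader(io.StringIO(text), delimiter="\t"))


right = read_rows(raw_bytes, "utf-8")   # comparable to choosing 65001 Unicode (UTF-8)
wrong = read_rows(raw_bytes, "cp1252")  # comparable to an ANSI (Windows-1252) file origin

print(right[1])
print(wrong[1])
```

The bytes are the same in both calls. Only the decoding differs, and the second call turns each diacritic into two unrelated characters.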
Generating Metadata from tab‐delimited files
• We use a tricked‐out spreadsheet that
allows us to take a row from a tab‐delimited
file, copy and paste it into Excel, and then
Excel generates a compound object
template for easy upload into CONTENTdm.
• We do this to avoid manual data entry as
much as possible.
• If you’d like a spreadsheet file and
documentation on how to use it, contact the presenter.
• To convert the .xls file to .txt, we select,
copy and paste from Excel into Notepad++.
• We do this so we can see exactly what
characters are showing up in our text files.
• Note that Notepad++ is so cool, we don’t
need any tricks to use it!
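Checking exactly which characters ended up in a text file can also be done with a short script. The following Python sketch is an alternative to looking at the file in an editor, not part of the workflow described in the talk. The function name and the sample line are made up for illustration.

```python
import unicodedata


def non_ascii_report(text):
    """Return (character, code point, Unicode name) for each distinct non-ASCII character."""
    seen = []
    for ch in text:
        if ord(ch) > 127 and ch not in seen:
            seen.append(ch)
    return [(ch, f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN")) for ch in seen]


sample = "Sånger för en röst"  # made-up sample line
for row in non_ascii_report(sample):
    print(row)
```

A diacritic that went wrong somewhere earlier tends to show up here as an unexpected name, such as a currency sign or a replacement character, rather than a letter with a ring or diaeresis.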
Uploading into CONTENTdm with Diacritics (CDM 5.3)
From Project Client, select Add Multiple
Compound Objects, then select the Map
Fields tab.
Click the Encoding button.
If only it were this simple…. For us, we had
to select ANSI for this to work, but according
to the documentation, UTF-8 as encoding
is supposed to work.
We may never know why this is so for us.
Please share your experiences.
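When experimenting with the two encoding settings, it can help to know whether the metadata survives as ANSI at all. The Python sketch below assumes that "ANSI" means Windows-1252, which is an assumption about the setting and not something the CONTENTdm documentation is quoted on here. It does not explain why one setting worked and the other did not. The function names and sample text are made up.

```python
def fits_cp1252(ch):
    """Return True if the character can be stored in Windows-1252."""
    try:
        ch.encode("cp1252")
        return True
    except UnicodeEncodeError:
        return False


def encode_both(text):
    """Encode text as UTF-8 and as Windows-1252, and list characters lost in the latter."""
    utf8_bytes = text.encode("utf-8")
    # Characters without a Windows-1252 equivalent are replaced by "?".
    ansi_bytes = text.encode("cp1252", errors="replace")
    missing = sorted({ch for ch in text if not fits_cp1252(ch)})
    return utf8_bytes, ansi_bytes, missing


utf8_bytes, ansi_bytes, missing = encode_both("Sånger\tÅsa Example\n")
print(len(utf8_bytes), len(ansi_bytes), missing)
```

An empty `missing` list means the text can be saved in either encoding without loss, so both settings can be tried on equivalent files.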
A Sample of Diacritics on CONTENTdm
And here we are, at journey’s end….
Summary of Diacritics on 
CONTENTdm
• Export MARC records from your catalog or source for
text with diacritics.
• If you need to use MarcEdit in this process, select the 
UTF‐8 box in the Characterset Translation Tool.
• When first opening a tab‐delimited file in Excel, select 
65001 Unicode (UTF‐8) from the File Origin pull‐down 
menu.
• When uploading to CONTENTdm, experiment with the
UTF‐8 vs. ANSI setting under Add Multiple Compound
Objects, Map Fields tab, Encoding button.
How to import Diacritics from a Library 
Catalog into CONTENTdm Using Excel 
and MarcEdit
Jill Strass
Digital Initiatives and Metadata Librarian
St. Olaf College
strass@stolaf.edu
