+
PDF/A
A Preservation Format




Mid-Atlantic Regional Archives Conference
21 October 2011

                                             Geof Huth
                                    geofhuth@gmail.com
+
    File Format Confusion

       From 5,000 to 15,000 extant file formats


       Most are proprietary


       The numbers add complexity to preservation


       Real preservation formats are few in number


       And we can really count on none of them
+
    Two General Classes of Formats

       Proprietary
           Controlled by one company
           Underlying code is a trade secret
           If the company goes under, the file format becomes obsolete

       Open
           Controlled by a standards body, a consortium, wiki-like bodies
           Code is free and open to all
           In absence of an “owner,” can still use the code to make a reader

       Neither Guarantees Preservation
           But open formats give you an opening to preservation
+
    Proprietary Formats


       Tend to be rich in features


       Limited readers for each format


       Limited ability to exchange data


       Difficult for long-term accessibility


       Greater associated costs
+
    Advantages of Open Formats


       More choice in what application to use


       Better exchange of data


       Better support of long-term preservation


       Possible lower costs


       Ability to create own readers
+
    Format/Software Confusion

       Software
           Creates a file in the format
           Reads the file for you
           Allows you to interact with the file

       Format
           Is the specific technical form in which a certain file exists
           Can be created by one software product or many

       Examples
           Adobe Acrobat (and many others) vs PDF
           Microsoft Word vs .doc (and .docx, etc.)
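
The difference between a format and the software that opens it can be shown by looking at a file's first bytes rather than its extension. The sketch below is illustrative only: the function names and the short signature table are assumptions made for this example, not a complete format registry.

```python
from typing import Optional

# Leading byte signatures ("magic numbers") for a few of the formats named above.
# This short table is an illustration, not a complete registry.
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1", "OLE2 compound file (e.g. legacy .doc)"),
    (b"PK\x03\x04", "ZIP container (e.g. .docx, .odt)"),
]


def identify_format(head: bytes) -> Optional[str]:
    """Return a format label based on the first bytes of a file, or None."""
    for magic, label in SIGNATURES:
        if head.startswith(magic):
            return label
    return None


def identify_file(path: str) -> Optional[str]:
    """Read the first 16 bytes of a file and identify its format."""
    with open(path, "rb") as handle:
        return identify_format(handle.read(16))
```

A file named report.doc that begins with `%PDF-` would be reported as PDF here, whatever software created it.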
+
    Criteria for Preservation Formats
    (and Files)
       Ubiquitous

       Long-lived

       Documented

       Metadata-supporting

       Accurate

       Open

       Uncompressed

       Unencrypted
+
    When to Use a Preservation Format

       Creation
           Begin with a format you know will last
           If you do, choose a format that still allows the file to be modified

       Recordation
           When information becomes a record, save it in a chosen format
           This freezes the file and demonstrates it is a record

       Archiving
           Convert to persistent formats those records needed long-term
           The conversion preserves the records and marks them as permanent

       Early Action Can Save Money and Time
+
    Normalization
    (action at the point of archiving)
       Conversion to a format

             Not expected to change

             Not expected to disappear

             Not expected to become unreadable

       Usually conversion to a different format from original

       Generally how preservation formats are used

       Still, may cause data loss or corruption
+
    Options for Preservation of Text

       American Standard Code for Information Interchange (ASCII)

       Unicode

       Portable Document Format / Archive (PDF/A)

       Extensible Markup Language (XML)

           Open Document Format (ODF) (ISO/IEC 26300:2006)

           Office Open XML (OOXML) (ISO/IEC 29500:2008)
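
For the plain-text options, a first practical question is whether a file is already pure ASCII or valid Unicode (UTF-8). A minimal sketch follows; the function name is hypothetical, and bytes that decode under neither encoding are reported as unknown rather than guessed at.

```python
def classify_text_encoding(data: bytes) -> str:
    """Classify raw bytes as 'ascii', 'utf-8', or 'unknown'."""
    try:
        data.decode("ascii")
        return "ascii"
    except UnicodeDecodeError:
        pass
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # Legacy single-byte encodings usually end up here.
        return "unknown"
```

Files reported as unknown need a person, or better evidence, to decide which legacy encoding applies before conversion.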
+
    What is Portable Document
    Format?
        Development began at Adobe in 1991; first released in 1993


        Specifications made available for free in 2001


        Format made an open international standard in 2008


        Includes text and image features
+
    Advantages of PDF

        Has accessibility across platforms

        Saves look and searchability of original

        Embeds fonts (if desired)

        Allows copying of text from files

        Remains fairly stable and universal

        Is difficult to modify

        Has enhanced document security

        Supports authenticity
+
    Disadvantages of PDF

       Won’t always perfectly represent original

       Some files are more difficult to convert

       Some formatting may be lost if saved back to original file format

       Limited ability to modify

       A complex format saving image and text

       Tends to be larger than a word processing document
+
    PDF’s Advantage over Others


       Image and text in one bundle


       Intelligent text


       Accepts importance of format to meaning


       Ubiquity of format and readers
+
    Conversion Practices

        Have necessary fonts installed

        Ensure lossless compression

            Important for embedded images


        When converting PDF to PDF/A

            Eliminate prohibited features

            Check beforehand or fix during
+
    Flavors of the PDF Standard

       PDF (vanilla)

       PDF/A (for archival preservation)

       PDF/X (for publishing)

       PDF/E (for engineering drawings)

       PDF/VT (for variable data and transactional printing)

       PDF/UA (for accessibility—in development)

       PDF/H (for healthcare records—a guide, not a standard)

       GeoPDF (for geospatial records—only based on standards)
+
    Portable Document Format /
    Archive Standards
       PDF/A-1
           ISO Standard 19005-1:2005

           Based on PDF Reference 1.4 (Acrobat 5)

       PDF/A-2
           ISO Standard 19005-2:2011

           Based on PDF Reference 1.7

           Published 20 June 2011

       New versions of PDF/A expected
+
    Uses of PDF/A

        Standard textual documents
            Paper documents
            Word-processing and PDF documents

        Sequences of related digital images


        Documents where appearance matters


        Static documents
+
    Less Appropriate for PDF/A

        Webpages


        Databases


        Spreadsheets


        Dynamic documents
+
    Creating PDF/As

        Need a product that can produce one
            Like Adobe Acrobat 8 Professional

        Can convert documents individually
            Opening and converting one at a time

        Can use batch processing
            Converting multiple documents at once
            Supported by Acrobat 8
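
Batch processing can be sketched independently of any one product. In the example below, `convert` is a caller-supplied function standing in for whatever tool actually produces the PDF/A; it is an assumption of this sketch, not the interface of Acrobat or any other converter.

```python
from pathlib import Path
from typing import Callable, Dict


def batch_convert(src_dir: Path, dst_dir: Path,
                  convert: Callable[[Path, Path], None]) -> Dict[str, str]:
    """Run `convert` on every .pdf in src_dir; return {filename: 'ok' or error text}."""
    dst_dir.mkdir(parents=True, exist_ok=True)
    results: Dict[str, str] = {}
    for source in sorted(src_dir.glob("*.pdf")):
        target = dst_dir / source.name
        try:
            convert(source, target)
            results[source.name] = "ok"
        except Exception as error:
            # Keep going so one bad file does not stop the whole batch.
            results[source.name] = f"failed: {error}"
    return results
```

Returning a per-file result leaves a record of which conversions need a second look.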
+
    General Goals of PDF/A

       Specifies limited stable set of features


           To ensure long-term validity


           Eliminate features that are not “archival”



       An open preservation standard



       Format designed to be a preservation standard
+
    Required in PDF/A

       All fonts embedded


       Unlimited legal use of embedded fonts


       Device-independent color


       Metadata describing the file


           File must self-identify the PDF/A version
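
Self-identification is carried in the file's XMP metadata as the properties `pdfaid:part` and `pdfaid:conformance`. The sketch below looks for those two properties in raw bytes. It assumes the XMP packet is stored uncompressed, the function name is hypothetical, and it reports only what the file claims, which is not the same as validating it.

```python
import re
from typing import Optional, Tuple

# XMP may store the values as attributes (pdfaid:part="1") or as elements
# (<pdfaid:part>1</pdfaid:part>); both forms are matched here.
_PART = re.compile(rb'pdfaid:part(?:\s*=\s*["\']|\s*>)\s*(\d)')
_CONF = re.compile(rb'pdfaid:conformance(?:\s*=\s*["\']|\s*>)\s*([A-Za-z])')


def read_pdfa_claim(data: bytes) -> Optional[Tuple[int, str]]:
    """Return (part, conformance level) as claimed in the XMP, or None if absent."""
    part = _PART.search(data)
    conf = _CONF.search(data)
    if part is None or conf is None:
        return None
    return int(part.group(1)), conf.group(1).decode("ascii").upper()
```

A result of `(1, "B")` means the file claims to be PDF/A-1b; a validation tool is still needed to confirm the claim.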
+
    Excluded from PDF/A-1
       Audio and video content

       JavaScript and executable files

       Encryption

       LZW and JPEG 2000 image compression

       Reference to outside content

       Transparency

       Embedded files
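
Several of the exclusions above leave recognizable name tokens in a PDF, which allows a rough pre-check before conversion. This is a heuristic sketch only: the token table and function name are assumptions for illustration, it does not check transparency or references to outside content, and it can miss tokens or raise false alarms. It is not a substitute for a real validator.

```python
from typing import List

# PDF name tokens that commonly indicate the features listed above.
PROHIBITED_MARKERS = {
    b"/JavaScript": "JavaScript",
    b"/Encrypt": "encryption",
    b"/LZWDecode": "LZW compression",
    b"/JPXDecode": "JPEG 2000 compression",
    b"/EmbeddedFiles": "embedded files",
    b"/Sound": "audio content",
    b"/Movie": "video content",
    b"/Launch": "launching external programs",
}


def flag_prohibited_features(data: bytes) -> List[str]:
    """Return labels for every marker token found in the raw bytes."""
    return sorted(label for token, label in PROHIBITED_MARKERS.items() if token in data)
```

An empty list means no marker was found, not that the file conforms.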
+
    Differences in PDF/A-2
       Allows embedding of OpenType fonts
       Allows JPEG2000 image compression
       Supports transparent objects
       Supports layers, which can be hidden for viewing
       Defines use of digital signatures
           Defines rules via PDF Advanced Electronic Signatures (PAdES)
       Specifies requirements for custom XMP metadata
       Allows embedded files, but in only one context
           In a PDF/A-2 you can embed PDF/A files
           Allows creation of sets of documents in a single file (e.g. emails)
       All PDF/A-1s are compliant with PDF/A-2 standard
           PDF/A-2 is an extension of PDF/A-1
+
    PDF/A-1 Conformance Levels
        PDF/A-1, Level A (full compliance)

            Preserves document’s logical structure

            Preserves text stream in reading order

            Requires language specification

            Requires Unicode mapping


        PDF/A-1, Level B (minimal compliance)

            Preserves visual appearance

            Doesn’t require as much descriptive info

            Less “accessible” format
+
    Flavors of PDF/A

       PDF/A-1a (a = accessible)
           RGB Color
           CMYK Color

       PDF/A-1b (b = basic)
           Same color choices

       PDF/A-2a (extension of A-1a)
       PDF/A-2b (extension of A-1b)
       PDF/A-2u (u = Unicode)
           Must use Unicode
           Does not require representation of logical structure
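
The flavor labels follow a regular pattern: a part number plus a conformance letter. The sketch below parses a label and checks it against the combinations listed on this slide. The function name and table are assumptions of this example, and the table is limited to the flavors named above.

```python
import re
from typing import Tuple

# Conformance levels per part, limited to the flavors listed on this slide.
ALLOWED_LEVELS = {1: {"a", "b"}, 2: {"a", "b", "u"}}


def parse_flavor(label: str) -> Tuple[int, str]:
    """Split a label such as 'PDF/A-2u' into (part, level), rejecting unlisted ones."""
    match = re.fullmatch(r"PDF/A-(\d)([a-z])", label.strip())
    if match is None:
        raise ValueError(f"not a PDF/A flavor label: {label!r}")
    part, level = int(match.group(1)), match.group(2)
    if level not in ALLOWED_LEVELS.get(part, set()):
        raise ValueError(f"PDF/A-{part} has no level {level!r} in this table")
    return part, level
```

For example, `parse_flavor("PDF/A-1u")` raises an error, because level u appears only with PDF/A-2.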
+
    PDF/A Product Lines

        Adobe Acrobat (www.adobe.com)

        Apago (www.apagoinc.com)

        Callas (www.callassoftware.com)

        Compart (www.compart.net)

        PDFlib (www.pdflib.com)

        PDF Tools AG (www.pdf-tools.com)
+
    PDF/A Validation Tools


        Adobe Acrobat Preflight Function (www.adobe.com)



        Callas Software pdfaPilot (www.callassoftware.com)



        PDF Tools AG's 3-Heights PDF Validator (www.pdf-tools.com)
+
    Formats are Not Everything

       Preservation Programs Require Work
           Conversion procedures
           Quality control
           Version control
           Environmental controls
           Metadata creation and maintenance
               Metadata about the records and their information
               Metadata about your preservation actions
           Data management controls (backups, etc.)
           Ensuring that chosen normalized formats are still valid
           Vigilance
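
Metadata about preservation actions and later quality control can both rest on checksums. The following is a minimal sketch, assuming SHA-256 as the checksum and a hypothetical `PreservationEvent` record; a real program would likely follow an established metadata schema.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 checksum of the data as a hex string."""
    return hashlib.sha256(data).hexdigest()


@dataclass
class PreservationEvent:
    action: str          # e.g. "normalize to PDF/A-1b"
    source_sha256: str   # checksum of the original file
    result_sha256: str   # checksum of the converted file
    recorded_at: str     # UTC timestamp in ISO 8601 form


def record_event(action: str, source: bytes, result: bytes) -> PreservationEvent:
    """Describe one preservation action, with checksums of input and output."""
    return PreservationEvent(
        action=action,
        source_sha256=sha256_hex(source),
        result_sha256=sha256_hex(result),
        recorded_at=datetime.now(timezone.utc).isoformat(),
    )


def still_intact(event: PreservationEvent, current: bytes) -> bool:
    """Fixity check: does the stored file still match the recorded checksum?"""
    return sha256_hex(current) == event.result_sha256
```

Running `still_intact` on a schedule is one simple form of the vigilance listed above.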

Editor's notes

  1. JPEG 2000 compression was introduced after release of the PDF/A-1 standard.
     Transparency was not defined well enough by the time of the PDF/A-1 standard.
     Transparency is found in drop shadows, cross fades, and highlighting.
     Layer support allows layers in maps and engineering drawings to be hidden so the viewer can see the data better.