SlideShare una empresa de Scribd logo
1 de 29
Significant characteristics in Planets


            Manfred Thaller
           Universität zu* Köln

          *University at not of Cologne
What are “significant characteristics”?

Those properties of a digital file which have to
  be known to enable the processing of the file
  within a specific setup.
Why extract them by software?

To create technical metadata as required by
  organizational models for long term
  preservation. (NLNZ)
Within Planets …

… served by solutions to identify formats:
  formats registry / PRONOM / DROID.

… and a solution for extracting and processing
  such characteristics: XCL.
A Vision
            Extractor                   tiff XCDL
  tiff

                                  93%
Migrator                                Comparator



  png      tiff XCEL   png XCEL         png XCDL
A Vision
 Extractor




                    Comparator
Appropriate XCELs

                        C-Set
Why automate?

1 million objects: use one second for each.

== 16666.7 minutes == 277.8 hours

== 11.57 working days of a computer

== 34.7 8-hour days for a Human

== 7 working weeks
Why automate?

1 million objects: use five minutes for each.

== 416 666.7 hours

== 52 803.4 8-hour days for a Human
Why automate?

Assumption: Preservation is only feasible, if the
content of two digital objects can be compared
without human intervention, giving a numerical
estimate of their degree of similarity.
Demo
Abstract solution I
(1) Language to represent the complete content of a digital object.
    XCDL
(2) Language to describe any machine readable format in a formal
     language.
    XCEL
(3) Software to extract the content of a file based upon a description
     as under (2) and express it in the language as specified under (1).
    “extractor”
(4) Software to compare two such content descriptions.
     “comparator”
<XCELDocument...>           ...                       <xcdl>
<formatDescription>....                                 <object id=quot;o1quot; >
<symbol identifier=quot;ID01_I01_I01_S02quot;                       <normData id=quot;nd1quot; > ... </normData>
     originalName=quot;height“ interpretation=quot;uint32quot;>
                                                            <property id=quot;p1quot; source=quot;rawquot;
    <range><startposition xsi:type=quot;sequential“>         cat=quot;descrquot; >
     </startposition>
       <length xsi:type=quot;fixedquot;>4</length></range>               <name>   compression</name>
     <name>height</name>                                       <valueSet id=quot;i_i1_s6quot; >
                                                                   <rawValue>0 </rawValue>
</symbol>
                                                                   <labValue>...</labValue>
<symbol identifier=quot;ID01_I01_I01_S04quot;
     originalName=quot;colourTypequot;>                                    <dataRef ind=quot;normAllquot; />
   <range>                                                         <propRel/>
     <startposition xsi:type=quot;sequentialquot;>                     </valueSet>
     </startposition>                                      </property>
     <length xsi:type=quot;fixedquot;>1</length></range>           <property id=quot;p2quot; source=quot;rawquot;
   <valueInterpretation>                                cat=quot;descrquot; >
         <valueLabel>greyscale</valueLabel>                      <name>   height</name>
         <value>0</value></valueinterpretation>
                                                               <valueSet id=quot;i_i1_s3quot; >
    <name>imageType</name>                                         <rawValue>0 0 1 ad </rawValue>
</symbol>                                                          <labValue>
<symbol identifier=quot;ID01_I01_I01_S05quot;                                  <val>429</val>
     originalName=quot;compressionMethodquot;>                                 <type>uint32</type>
   <range>                                                         </labValue>
       <startposition xsi:type=quot;sequential“>                       <dataRef ind=quot;normAllquot; />
     </startposition>                                              <propRel/>
        <length                                                </valueSet>
     xsi:type=quot;fixedquot;>1</length></range>                   </property>
   <valueInterpretation>                                   <property id=quot;p3quot; source=quot;rawquot;
     <valueLabel>zlibDeflateInflate</valueLabel>        cat=quot;descrquot; >
     <value>0</value></valueInterpretation>
                                                                 <name>   imageType</name>
   <name>compression</name>
                                                         .....
</symbol>...
<request2>                                      <property id=quot;2quot;
    <measurementRequest>                           name=quot;imageHeightquot;
          <source name=quot;XCDL1.xmlquot;/>               unit=quot;pixelquot;
          <target name=quot;XCDL2.xmlquot;/>               compStatus=quot;completequot;>
        <property id=quot;45quot; name=quot;rgbPalettequot;>
                                                   <values type=quot;intquot;>
            <metric id=quot;10quot;
     name=quot;hammingDistancequot;/>                         <src>32</src>
     </property>
        <property id=quot;300quot; name=quot;normDataquot;>
                                                      <tar>32</tar>
            <metric id=quot;10quot;                        </values>
     name=quot;hammingDistancequot;/>
            <metric id=quot;50quot; name=quot;RMSEquot;/>          <metric id=quot;200quot;
        </property>                                name=quot;equalquot; result=quot;truequot;/>
         <property id=quot;2quot;                          <metric id=quot;201quot;
     name=quot;imageHeightquot; unit=quot;pixelquot;>              name=quot;intDiffquot; result=quot;0quot;/>
     <metric id=quot;200quot; name=quot;equalquot;/>               <metric id=quot;210quot;
     <metric id=quot;201quot; name=quot;intDiffquot;/>             name=quot;percDevquot;
     <metric id=quot;210quot; name=quot;percDevquot;/>             result=quot;0.000000quot;/>
          </property>                           </property>
        <property id=quot;30quot; name=quot;imageWidthquot;
     unit=quot;pixelquot;>
            <metric id=quot;200quot; name=quot;equalquot;/>
            <metric id=quot;201quot; name=quot;intDiffquot;/>
            <metric id=quot;210quot; name=quot;percDevquot;/>
        </property>
Abstract solution I
(1) Language to represent the complete content of a digital object.
    XCDL
(2) Language to describe any machine readable format in a formal
     language.
    XCEL
(3) Software to extract the content of a file based upon a description
     as under (2) and express it in the language as specified under (1).
    “extractor”
(4) Software to compare two such content descriptions.
     “comparator”
Are the following two items equal:




          VIII  8
eight   eight


VIII  8
otto

       eight   eight


otto   VIII  8
otto                   acht

       eight   eight


otto   VIII  8       acht
8.0
otto                         acht

       eight         eight


otto   VIII  8             acht
Information model: „an image“

otto                                   acht

           eight           eight


otto     VIII  8                     acht
information model: „an image“

 format ontology: „what terms are
used in formats to describe image
            properties“


   VIII  8
Information model: „what is an image“

  Format ontology: „what terms are
  used in formats to describe image
              properties“

Extraction language: “how to get the
terms describing an image out of a file”
Abstract solution II
(1) A theoretical model of information (not: data) types – “image”,
    “text”, “audio” ...

(2) Ontologies, which map existing file format terminologies onto
    these model.

(3) A language – XCDL – which allows to express the content of files
    in different formats using the vocabulary of the ontologies and the
    “grammar” of the information model.
XCDL

eXtensible Characterisation Definition
  Language

Purpose: Describe the contents of a file in
  terms of an abstract model.
XCDL: text model (1)

A text (= <object>) is composed of
data (= <normData>) plus
interpretations of data according to the
underlying format specification (=
<property>).
XCDL: text model (2)

Or, one level of abstraction higher, a text
is composed of content carrying tokens,
accompanied by rendering info plus
deployment info plus historical info.
This text   is a

<refData id=quot;1quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData>
…
<property>
<name>fontsize</name>
<rawVal>
<val>48</val>
<type>unsignedInt8</type>
</rawVal>
<dataRef> <!-- property refers to discrete part of reference data-->
<ref id=quot;1quot; start=quot;0quot; end=quot;3quot;/>
<ref id=quot;1quot; start=“10quot; end=quot;12quot;/>
</dataRef>
</property>
This text   is a

<refData id=quot;1quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData>
…
<property>
<name>fontsize</name>
<rawVal>
<val>48</val>
<type>unsignedInt8</type>
</rawVal>
<dataRef> <!-- property refers to discrete part of reference data-->
<ref id=quot;1quot; start=quot;0quot; end=quot;3quot;/>
<ref id=quot;1quot; start=“10quot; end=quot;12quot;/>
</dataRef>
</property>
Thank you!

     Questions?

Manfred.thaller@uni-koeln.de

Más contenido relacionado

Destacado

Moon Powerpoint
Moon PowerpointMoon Powerpoint
Moon Powerpointmbla3081
 
Moon Formation And Features
Moon Formation And FeaturesMoon Formation And Features
Moon Formation And FeaturesTeach5ch
 
the outer planets
the outer planetsthe outer planets
the outer planetsBob Smullen
 
The Outer Planets
The Outer PlanetsThe Outer Planets
The Outer PlanetsEric C
 
Pluto Example PowerPoint
Pluto Example PowerPointPluto Example PowerPoint
Pluto Example PowerPointjasmu1mn
 
Pluto The Dwarf Planet
Pluto The Dwarf PlanetPluto The Dwarf Planet
Pluto The Dwarf PlanetJon Dav
 
Best ppt on solar system
Best ppt on solar systemBest ppt on solar system
Best ppt on solar systemMake Megenius
 

Destacado (11)

Moon Powerpoint
Moon PowerpointMoon Powerpoint
Moon Powerpoint
 
My planets´activity
My planets´activityMy planets´activity
My planets´activity
 
Moon Formation And Features
Moon Formation And FeaturesMoon Formation And Features
Moon Formation And Features
 
the outer planets
the outer planetsthe outer planets
the outer planets
 
The Outer Planets
The Outer PlanetsThe Outer Planets
The Outer Planets
 
Planets Lesson
Planets LessonPlanets Lesson
Planets Lesson
 
Pluto Example PowerPoint
Pluto Example PowerPointPluto Example PowerPoint
Pluto Example PowerPoint
 
SunCyles_GT
SunCyles_GTSunCyles_GT
SunCyles_GT
 
Pluto The Dwarf Planet
Pluto The Dwarf PlanetPluto The Dwarf Planet
Pluto The Dwarf Planet
 
Best ppt on solar system
Best ppt on solar systemBest ppt on solar system
Best ppt on solar system
 
Our Moon
Our MoonOur Moon
Our Moon
 

Similar a Significant Characteristics In Planets Manfred Thaller

Csphtp1 18
Csphtp1 18Csphtp1 18
Csphtp1 18HUST
 
Everything You Always Wanted To Know About XML But Were Afraid To Ask
Everything You Always Wanted To Know About XML But Were Afraid To AskEverything You Always Wanted To Know About XML But Were Afraid To Ask
Everything You Always Wanted To Know About XML But Were Afraid To AskRichard Davis
 
OpenSocial - GTUG Stockholm Meeting Oct 1 2009
OpenSocial - GTUG Stockholm Meeting Oct 1 2009OpenSocial - GTUG Stockholm Meeting Oct 1 2009
OpenSocial - GTUG Stockholm Meeting Oct 1 2009Jacob Gyllenstierna
 
07 Collada Overview
07 Collada Overview07 Collada Overview
07 Collada Overviewjohny2008
 
3 xml namespaces and xml schema
3   xml namespaces and xml schema3   xml namespaces and xml schema
3 xml namespaces and xml schemagauravashq
 
JWU Guest Talk: JavaScript and AJAX
JWU Guest Talk: JavaScript and AJAXJWU Guest Talk: JavaScript and AJAX
JWU Guest Talk: JavaScript and AJAXHilary Mason
 
CSIS 138 Javascript Class1
CSIS 138 Javascript Class1CSIS 138 Javascript Class1
CSIS 138 Javascript Class1Teresa Pelkie
 
Cleveland Silverlight Firestarter - XAML Basics
Cleveland Silverlight Firestarter - XAML BasicsCleveland Silverlight Firestarter - XAML Basics
Cleveland Silverlight Firestarter - XAML BasicsSarah Dutkiewicz
 
Ajax On S2 Odp
Ajax On S2 OdpAjax On S2 Odp
Ajax On S2 Odpghessler
 
IBM Lotus Notes Domino XPages and XPages for Mobile
IBM Lotus Notes Domino XPages and XPages for MobileIBM Lotus Notes Domino XPages and XPages for Mobile
IBM Lotus Notes Domino XPages and XPages for MobileChris Toohey
 
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010Sergey Ilinsky
 
Java Script
Java ScriptJava Script
Java Scriptsiddaram
 

Similar a Significant Characteristics In Planets Manfred Thaller (20)

Csphtp1 18
Csphtp1 18Csphtp1 18
Csphtp1 18
 
Everything You Always Wanted To Know About XML But Were Afraid To Ask
Everything You Always Wanted To Know About XML But Were Afraid To AskEverything You Always Wanted To Know About XML But Were Afraid To Ask
Everything You Always Wanted To Know About XML But Were Afraid To Ask
 
Ajax ons2
Ajax ons2Ajax ons2
Ajax ons2
 
OpenSocial - GTUG Stockholm Meeting Oct 1 2009
OpenSocial - GTUG Stockholm Meeting Oct 1 2009OpenSocial - GTUG Stockholm Meeting Oct 1 2009
OpenSocial - GTUG Stockholm Meeting Oct 1 2009
 
JQuery 101
JQuery 101JQuery 101
JQuery 101
 
Xml encryption
Xml encryptionXml encryption
Xml encryption
 
07 Collada Overview
07 Collada Overview07 Collada Overview
07 Collada Overview
 
3 xml namespaces and xml schema
3   xml namespaces and xml schema3   xml namespaces and xml schema
3 xml namespaces and xml schema
 
Using Forms in Share
Using Forms in ShareUsing Forms in Share
Using Forms in Share
 
JWU Guest Talk: JavaScript and AJAX
JWU Guest Talk: JavaScript and AJAXJWU Guest Talk: JavaScript and AJAX
JWU Guest Talk: JavaScript and AJAX
 
Front End on Rails
Front End on RailsFront End on Rails
Front End on Rails
 
CSIS 138 Javascript Class1
CSIS 138 Javascript Class1CSIS 138 Javascript Class1
CSIS 138 Javascript Class1
 
Ridingapachecamel
RidingapachecamelRidingapachecamel
Ridingapachecamel
 
Cleveland Silverlight Firestarter - XAML Basics
Cleveland Silverlight Firestarter - XAML BasicsCleveland Silverlight Firestarter - XAML Basics
Cleveland Silverlight Firestarter - XAML Basics
 
Embedded Metadata working group
Embedded Metadata working groupEmbedded Metadata working group
Embedded Metadata working group
 
Ajax On S2 Odp
Ajax On S2 OdpAjax On S2 Odp
Ajax On S2 Odp
 
IBM Lotus Notes Domino XPages and XPages for Mobile
IBM Lotus Notes Domino XPages and XPages for MobileIBM Lotus Notes Domino XPages and XPages for Mobile
IBM Lotus Notes Domino XPages and XPages for Mobile
 
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
Building Complex GUI Apps The Right Way. With Ample SDK - SWDC2010
 
XML and DTD
XML and DTDXML and DTD
XML and DTD
 
Java Script
Java ScriptJava Script
Java Script
 

Más de DigitalPreservationEurope

Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigitalPreservationEurope
 
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...DigitalPreservationEurope
 
Scalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingScalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingDigitalPreservationEurope
 
Preservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallPreservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallDigitalPreservationEurope
 
Preservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MelePreservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MeleDigitalPreservationEurope
 

Más de DigitalPreservationEurope (20)

Infrastructure Training Session
Infrastructure Training SessionInfrastructure Training Session
Infrastructure Training Session
 
Drm Training Session
Drm Training SessionDrm Training Session
Drm Training Session
 
2009 Barcelona Wepreserve Nestor
2009 Barcelona Wepreserve Nestor2009 Barcelona Wepreserve Nestor
2009 Barcelona Wepreserve Nestor
 
Trusted Repositories
Trusted RepositoriesTrusted Repositories
Trusted Repositories
 
Preservation Metadata
Preservation MetadataPreservation Metadata
Preservation Metadata
 
An Introduction to Digital Preservation
An Introduction to Digital PreservationAn Introduction to Digital Preservation
An Introduction to Digital Preservation
 
Introduction to Planets
Introduction to PlanetsIntroduction to Planets
Introduction to Planets
 
Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and Requirements
 
The Planets Preservation Planning workflow
The Planets Preservation Planning workflowThe Planets Preservation Planning workflow
The Planets Preservation Planning workflow
 
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
Building A Sustainable Model for Digital Preservation Services, Clive Billenn...
 
Preservation Metadata, Michael Day, DCC
Preservation Metadata, Michael Day, DCCPreservation Metadata, Michael Day, DCC
Preservation Metadata, Michael Day, DCC
 
PLATTER - Jan Hutar
PLATTER - Jan HutarPLATTER - Jan Hutar
PLATTER - Jan Hutar
 
Sustainability Clive Billenness
Sustainability Clive  BillennessSustainability Clive  Billenness
Sustainability Clive Billenness
 
Shaman Project Hemmje
Shaman Project  HemmjeShaman Project  Hemmje
Shaman Project Hemmje
 
Scalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross KingScalable Services For Digital Preservation Ross King
Scalable Services For Digital Preservation Ross King
 
Risks Benefits And Motivations Seamus Ross
Risks Benefits And Motivations Seamus RossRisks Benefits And Motivations Seamus Ross
Risks Benefits And Motivations Seamus Ross
 
Representation Information Steve Rankin
Representation Information Steve RankinRepresentation Information Steve Rankin
Representation Information Steve Rankin
 
Preservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian UpshallPreservation Challenge Radioactive Waste Ian Upshall
Preservation Challenge Radioactive Waste Ian Upshall
 
Preservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore MelePreservation And Reuse In High Energy Physics Salvatore Mele
Preservation And Reuse In High Energy Physics Salvatore Mele
 
Platter Colin Rosenthal
Platter Colin RosenthalPlatter Colin Rosenthal
Platter Colin Rosenthal
 

Último

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Último (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Significant Characteristics In Planets Manfred Thaller

  • 1. Significant characteristics in Planets Manfred Thaller Universität zu* Köln *University at not of Cologne
  • 2. What are “significant characteristics”? Those properties of a digital file which have to be known to enable the processing of the file within a specific setup.
  • 3. Why extract them by software? To create technical metadata as required by organizational models for long term preservation. (NLNZ)
  • 4. Within Planets … … served by solutions to identify formats: formats registry / PRONOM / DROID. … and a solution for extracting and processing such characteristics: XCL.
  • 5. A Vision Extractor tiff XCDL tiff 93% Migrator Comparator png tiff XCEL png XCEL png XCDL
  • 6. A Vision Extractor Comparator Appropriate XCELs C-Set
  • 7. Why automate? 1 million objects: use one second for each. == 16666.7 minutes == 277.8 hours == 11.57 working days of a computer == 34.7 8-hour days for a Human == 7 working weeks
  • 8. Why automate? 1 million objects: use five minutes for each. == 416 666.7 hours == 52 803.4 8-hour days for a Human
  • 9. Why automate? Assumption: Preservation is only feasible, if the content of two digital objects can be compared without human intervention, giving a numerical estimate of their degree of similarity.
  • 10. Demo
  • 11. Abstract solution I (1) Language to represent the complete content of a digital object. XCDL (2) Language to describe any machine readable format in a formal language. XCEL (3) Software to extract the content of a file based upon a description as under (2) and express it in the language as specified under (1). “extractor” (4) Software to compare two such content descriptions. “comparator”
  • 12. <XCELDocument...> ... <xcdl> <formatDescription>.... <object id=quot;o1quot; > <symbol identifier=quot;ID01_I01_I01_S02quot; <normData id=quot;nd1quot; > ... </normData> originalName=quot;height“ interpretation=quot;uint32quot;> <property id=quot;p1quot; source=quot;rawquot; <range><startposition xsi:type=quot;sequential“> cat=quot;descrquot; > </startposition> <length xsi:type=quot;fixedquot;>4</length></range> <name> compression</name> <name>height</name> <valueSet id=quot;i_i1_s6quot; > <rawValue>0 </rawValue> </symbol> <labValue>...</labValue> <symbol identifier=quot;ID01_I01_I01_S04quot; originalName=quot;colourTypequot;> <dataRef ind=quot;normAllquot; /> <range> <propRel/> <startposition xsi:type=quot;sequentialquot;> </valueSet> </startposition> </property> <length xsi:type=quot;fixedquot;>1</length></range> <property id=quot;p2quot; source=quot;rawquot; <valueInterpretation> cat=quot;descrquot; > <valueLabel>greyscale</valueLabel> <name> height</name> <value>0</value></valueinterpretation> <valueSet id=quot;i_i1_s3quot; > <name>imageType</name> <rawValue>0 0 1 ad </rawValue> </symbol> <labValue> <symbol identifier=quot;ID01_I01_I01_S05quot; <val>429</val> originalName=quot;compressionMethodquot;> <type>uint32</type> <range> </labValue> <startposition xsi:type=quot;sequential“> <dataRef ind=quot;normAllquot; /> </startposition> <propRel/> <length </valueSet> xsi:type=quot;fixedquot;>1</length></range> </property> <valueInterpretation> <property id=quot;p3quot; source=quot;rawquot; <valueLabel>zlibDeflateInflate</valueLabel> cat=quot;descrquot; > <value>0</value></valueInterpretation> <name> imageType</name> <name>compression</name> ..... </symbol>...
  • 13. <request2> <property id=quot;2quot; <measurementRequest> name=quot;imageHeightquot; <source name=quot;XCDL1.xmlquot;/> unit=quot;pixelquot; <target name=quot;XCDL2.xmlquot;/> compStatus=quot;completequot;> <property id=quot;45quot; name=quot;rgbPalettequot;> <values type=quot;intquot;> <metric id=quot;10quot; name=quot;hammingDistancequot;/> <src>32</src> </property> <property id=quot;300quot; name=quot;normDataquot;> <tar>32</tar> <metric id=quot;10quot; </values> name=quot;hammingDistancequot;/> <metric id=quot;50quot; name=quot;RMSEquot;/> <metric id=quot;200quot; </property> name=quot;equalquot; result=quot;truequot;/> <property id=quot;2quot; <metric id=quot;201quot; name=quot;imageHeightquot; unit=quot;pixelquot;> name=quot;intDiffquot; result=quot;0quot;/> <metric id=quot;200quot; name=quot;equalquot;/> <metric id=quot;210quot; <metric id=quot;201quot; name=quot;intDiffquot;/> name=quot;percDevquot; <metric id=quot;210quot; name=quot;percDevquot;/> result=quot;0.000000quot;/> </property> </property> <property id=quot;30quot; name=quot;imageWidthquot; unit=quot;pixelquot;> <metric id=quot;200quot; name=quot;equalquot;/> <metric id=quot;201quot; name=quot;intDiffquot;/> <metric id=quot;210quot; name=quot;percDevquot;/> </property>
  • 14. Abstract solution I (1) Language to represent the complete content of a digital object. XCDL (2) Language to describe any machine readable format in a formal language. XCEL (3) Software to extract the content of a file based upon a description as under (2) and express it in the language as specified under (1). “extractor” (4) Software to compare two such content descriptions. “comparator”
  • 15. Are the following two items equal: VIII  8
  • 16. eight eight VIII  8
  • 17. otto eight eight otto VIII  8
  • 18. otto acht eight eight otto VIII  8 acht
  • 19. 8.0 otto acht eight eight otto VIII  8 acht
  • 20. Information model: „an image“ otto acht eight eight otto VIII  8 acht
  • 21. information model: „an image“ format ontology: „what terms are used in formats to describe image properties“ VIII  8
  • 22. Information model: „what is an image“ Format ontology: „what terms are used in formats to describe image properties“ Extraction language: “how to get the terms describing an image out of a file”
  • 23. Abstract solution II (1) A theoretical model of information (not: data) types – “image”, “text”, “audio” ... (2) Ontologies, which map existing file format terminologies onto these model. (3) A language – XCDL – which allows to express the content of files in different formats using the vocabulary of the ontologies and the “grammar” of the information model.
  • 24. XCDL eXtensible Characterisation Definition Language Purpose: Describe the contents of a file in terms of an abstract model.
  • 25. XCDL: text model (1) A text (= <object>) is composed of data (= <normData>) plus interpretations of data according to the underlying format specification (= <property>).
  • 26. XCDL: text model (2) Or, one level of abstraction higher, a text is composed of content carrying tokens, accompanied by rendering info plus deployment info plus historical info.
  • 27. This text is a <refData id=quot;1quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData> … <property> <name>fontsize</name> <rawVal> <val>48</val> <type>unsignedInt8</type> </rawVal> <dataRef> <!-- property refers to discrete part of reference data--> <ref id=quot;1quot; start=quot;0quot; end=quot;3quot;/> <ref id=quot;1quot; start=“10quot; end=quot;12quot;/> </dataRef> </property>
  • 28. This text is a <refData id=quot;1quot;>54 68 69 73 20 69 73 20 61 20 74 65 78 74</refData> … <property> <name>fontsize</name> <rawVal> <val>48</val> <type>unsignedInt8</type> </rawVal> <dataRef> <!-- property refers to discrete part of reference data--> <ref id=quot;1quot; start=quot;0quot; end=quot;3quot;/> <ref id=quot;1quot; start=“10quot; end=quot;12quot;/> </dataRef> </property>
  • 29. Thank you! Questions? Manfred.thaller@uni-koeln.de