SlideShare una empresa de Scribd logo
1 de 10
Descargar para leer sin conexión
Universal File Format
Converter AKA the NCSA
Polyglot
By: Catherine Bell, Stacy Hays and
Marisa Mendez-Brady
What is Polyglot?
Polyglot is an attempt to create a universal file format
converter through the National Center for Supercomputer
Applications through the University of Illinois Urbana-
Champaign
The National Archives and Records Administration (NARA)
is sponsoring the development of the NCSA Polyglot
Definition of Polyglot - One who speaks many languages
Why make the NCSA Polyglot?
● There are hundreds of thousands of file formats in the world, most of which are
not transferrable between software
● There isn’t any way to convert most file extensions
● Most discrepancy in file formats are the result of proprietary software
companies competing against each other to increase their user base
● Not only does the lack of compatibility between file formats make it hard to
share information, but it makes the task of preservation for born digital
materials increasing difficult as proprietary software constantly develops and
changes
● File incompatibility is most evident in 3D file formats
● Polyglot has focused on finding ways to convert between 3D file formats, as
they provide the most complications
Why are 3D files so Complicated?
● There are over 140 types of 3D file
extensions
● Most 3D viewers are manufactured by
the proprietary software companies that
create the file format
● Different types of file formats supports
different kinds of 3D content
● Extreme amounts of data can loss occurs
when converting 3D files between
formats
● 3D objects point to a need for a
universal file format converter that
produces the least amount of data loss
to ensure preservation quality
Towards a Universal File Format Converter
● Polyglot analyzes and automates the import/export
features of third party software
● Creates an I/O graph weighted tool from information about
the software available on multiple servers
● Uses a quantifiable scale for measuring data loss that
occurs when conversions are done to calculate the path of
conversion for best possible quality
● Submits script using Java to servers to do conversion
● Uses third party applications in the conversion process
So, How does Polyglot Work?
http://isda.ncsa.illinois.
edu/NARA/videos/SoftwareServers/polyglot_convert.avi
Here is video demonstrating how Polyglot Works:
1. Download Polyglot onto institutional servers
2. Polyglot can then take advantage of all of the software
contained on the servers to make conversions
3. Can be utilized through either desktop or web based version
4. Once Polyglot is set up, you can drag and drop files to
convert
NOTE: Anyone can test
Polyglot through the
NCSA website...
theoretically
Problems, Problems Indeed.
Functional File Converters
There are several other file converters out there file converters are usually file
type specific. Many of these converters are proprietary. There is nothing close
to a universal converter.
Examples of file format converters:
Quick 3D
Sourceforge
Okino
Switch
Zamzar
Media Converter
Youconvertit
Converts Anything? Not quite.
● Same people designed web-based Conversion Software Registry (CSR)
for collecting information about software that are capable of file format
conversions
● Motivated by a community need for finding file format conversions
● Create a login and add softwares to the registry
● Currently has over 2000 softwares registered
● Over 260,000 possible conversions
● Contains I/O graph to see best possible conversions
● Also searchable by conversion and software
http://isda.ncsa.illinois.edu/NARA/CSR/php/search/graph.php
Conversion Software Registry
Resources
● Bajcsy P, Kooper R, Marini L, McHenry K, Ondrejcek M. A Framework for Understanding File Format Conversions. In: ACM
ICPS US Workshop on roadmap for Digital Preservation Interoperability Framework.; 2011.
● "ISDA Polyglot." Image and Spatial Data Analasys Division. U of Illinois, 2013. Web. 10 Nov. 2013. <http://isda.ncsa.illinois.
edu/drupal/software/polyglot>.
● Kenton McHenry and Peter Bajcsy "3D+Time File Formats.", Technical Report NCSA-ISDA10-001, October 15, 2010.
● McHenry K, Ondrejcek M, Marini L, Kooper R, Bajcsy P.Towards a Universal Viewer for Digital Content. In: International
Conference on Computer Science, Executable Paper Workshop.; 2011.
● McHenry K, Kooper R, Marini L, Bajcsy P. Designing a Scalable Cross Platform Imposed Code Reuse Framework. In:
Microsoft Research eScience Workshop. Berkeley, CA,; 2010.
● McHenry K, Kooper R, Bajcsy P.Taking Matters into Your Own Hands: Imposing Code Reusability for Universal File Format
Conversion. In: Microsoft Research eScience workshop. Pittsburg, PA,; 2009.
● McHenry K, Kooper R, Bajcsy P.Towards a Universal, Quantifiable, and Scalable File Format Converter. Oxford, UK; 2009.
● McHenry K, Bajcsy P. Framework Converts Files of Any Format.; 2009.
● Ondrejcek M, McHenry K, Bajcsy P. The Conversion Software Registry. Berkeley, CA; 2010
● "Towards a Universal File Format Converter." Analysis of Electronic Records, Document Appraisal Framework. NCSA at the
U of Illinois at Urbana-Champaign, 17 Feb. 2011. Web. 10 Nov. 2013. <http://isda.ncsa.uiuc.edu/NARA/conversion.html>.
● McHenry, K. and Bajcsy P. "An Overview of 3D Data Content, File Formats and Viewers.", Technical Report NCSA-ISDA08-
002, October 31, 2008.

Más contenido relacionado

Similar a New Technology Presentation for the School of Information

IN PARTIAL FULFILLMENT OF POST GRADUATE DIPLOMA IN COMPUTER APPLICATIONS
IN PARTIAL FULFILLMENT OF  POST GRADUATE DIPLOMA IN COMPUTER APPLICATIONSIN PARTIAL FULFILLMENT OF  POST GRADUATE DIPLOMA IN COMPUTER APPLICATIONS
IN PARTIAL FULFILLMENT OF POST GRADUATE DIPLOMA IN COMPUTER APPLICATIONS
ssuserb054d21
 
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
Courtney Mumma
 

Similar a New Technology Presentation for the School of Information (20)

Ballerina cloud native middleware as a programming language | Yenlo - WSO2 In...
Ballerina cloud native middleware as a programming language | Yenlo - WSO2 In...Ballerina cloud native middleware as a programming language | Yenlo - WSO2 In...
Ballerina cloud native middleware as a programming language | Yenlo - WSO2 In...
 
Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!
 
Semantic Web in the Fog of Browsers
Semantic Web in the Fog of BrowsersSemantic Web in the Fog of Browsers
Semantic Web in the Fog of Browsers
 
Archival Technologies
Archival TechnologiesArchival Technologies
Archival Technologies
 
Project On-Science
Project On-ScienceProject On-Science
Project On-Science
 
Shillum "Building for the Future: Interoperability"
Shillum "Building for the Future: Interoperability"Shillum "Building for the Future: Interoperability"
Shillum "Building for the Future: Interoperability"
 
Selling the open-source philosophy - DrupalCon Bogotá 2015
Selling the open-source philosophy - DrupalCon Bogotá 2015Selling the open-source philosophy - DrupalCon Bogotá 2015
Selling the open-source philosophy - DrupalCon Bogotá 2015
 
Selling the open-source philosophy - DrupalCon Latin America 2015
Selling the open-source philosophy - DrupalCon Latin America 2015Selling the open-source philosophy - DrupalCon Latin America 2015
Selling the open-source philosophy - DrupalCon Latin America 2015
 
Selling the Open-Source Philosophy - DrupalCon Latin America
Selling the Open-Source Philosophy - DrupalCon Latin AmericaSelling the Open-Source Philosophy - DrupalCon Latin America
Selling the Open-Source Philosophy - DrupalCon Latin America
 
Selling the Open-Source Philosophy - DrupalCon Latin America
Selling the Open-Source Philosophy - DrupalCon Latin AmericaSelling the Open-Source Philosophy - DrupalCon Latin America
Selling the Open-Source Philosophy - DrupalCon Latin America
 
IN PARTIAL FULFILLMENT OF POST GRADUATE DIPLOMA IN COMPUTER APPLICATIONS
IN PARTIAL FULFILLMENT OF  POST GRADUATE DIPLOMA IN COMPUTER APPLICATIONSIN PARTIAL FULFILLMENT OF  POST GRADUATE DIPLOMA IN COMPUTER APPLICATIONS
IN PARTIAL FULFILLMENT OF POST GRADUATE DIPLOMA IN COMPUTER APPLICATIONS
 
Code, ci, infrastructure - the gophers way
Code, ci, infrastructure - the gophers wayCode, ci, infrastructure - the gophers way
Code, ci, infrastructure - the gophers way
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Video game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidataVideo game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidata
 
TAMU-Corpus Christi Connect and Reflect 2/25/2015
TAMU-Corpus Christi  Connect and Reflect 2/25/2015TAMU-Corpus Christi  Connect and Reflect 2/25/2015
TAMU-Corpus Christi Connect and Reflect 2/25/2015
 
Basics to framework programming
Basics to framework programmingBasics to framework programming
Basics to framework programming
 
Useful Open Source Software
Useful Open Source SoftwareUseful Open Source Software
Useful Open Source Software
 
MediaMosa for Managing Video Content
MediaMosa for Managing Video ContentMediaMosa for Managing Video Content
MediaMosa for Managing Video Content
 
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
2013 05-15 Intro to Archivematica - UBC SLAIS Digital Records Forensics Class
 
TechRadarCon 2022 | Have you built your platform yet ?
TechRadarCon 2022 | Have you built your platform yet ?TechRadarCon 2022 | Have you built your platform yet ?
TechRadarCon 2022 | Have you built your platform yet ?
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

New Technology Presentation for the School of Information

  • 1. Universal File Format Converter AKA the NCSA Polyglot By: Catherine Bell, Stacy Hays and Marisa Mendez-Brady
  • 2. What is Polyglot? Polyglot is an attempt to create a universal file format converter through the National Center for Supercomputer Applications through the University of Illinois Urbana- Champaign The National Archives and Records Administration (NARA) is sponsoring the development of the NCSA Polyglot Definition of Polyglot - One who speaks many languages
  • 3. Why make the NCSA Polyglot? ● There are hundreds of thousands of file formats in the world, most of which are not transferrable between software ● There isn’t any way to convert most file extensions ● Most discrepancy in file formats are the result of proprietary software companies competing against each other to increase their user base ● Not only does the lack of compatibility between file formats make it hard to share information, but it makes the task of preservation for born digital materials increasing difficult as proprietary software constantly develops and changes ● File incompatibility is most evident in 3D file formats ● Polyglot has focused on finding ways to convert between 3D file formats, as they provide the most complications
  • 4. Why are 3D files so Complicated? ● There are over 140 types of 3D file extensions ● Most 3D viewers are manufactured by the proprietary software companies that create the file format ● Different types of file formats supports different kinds of 3D content ● Extreme amounts of data can loss occurs when converting 3D files between formats ● 3D objects point to a need for a universal file format converter that produces the least amount of data loss to ensure preservation quality
  • 5. Towards a Universal File Format Converter ● Polyglot analyzes and automates the import/export features of third party software ● Creates an I/O graph weighted tool from information about the software available on multiple servers ● Uses a quantifiable scale for measuring data loss that occurs when conversions are done to calculate the path of conversion for best possible quality ● Submits script using Java to servers to do conversion ● Uses third party applications in the conversion process
  • 6. So, How does Polyglot Work? http://isda.ncsa.illinois. edu/NARA/videos/SoftwareServers/polyglot_convert.avi Here is video demonstrating how Polyglot Works: 1. Download Polyglot onto institutional servers 2. Polyglot can then take advantage of all of the software contained on the servers to make conversions 3. Can be utilized through either desktop or web based version 4. Once Polyglot is set up, you can drag and drop files to convert NOTE: Anyone can test Polyglot through the NCSA website... theoretically
  • 8. Functional File Converters There are several other file converters out there file converters are usually file type specific. Many of these converters are proprietary. There is nothing close to a universal converter. Examples of file format converters: Quick 3D Sourceforge Okino Switch Zamzar Media Converter Youconvertit Converts Anything? Not quite.
  • 9. ● Same people designed web-based Conversion Software Registry (CSR) for collecting information about software that are capable of file format conversions ● Motivated by a community need for finding file format conversions ● Create a login and add softwares to the registry ● Currently has over 2000 softwares registered ● Over 260,000 possible conversions ● Contains I/O graph to see best possible conversions ● Also searchable by conversion and software http://isda.ncsa.illinois.edu/NARA/CSR/php/search/graph.php Conversion Software Registry
  • 10. Resources ● Bajcsy P, Kooper R, Marini L, McHenry K, Ondrejcek M. A Framework for Understanding File Format Conversions. In: ACM ICPS US Workshop on roadmap for Digital Preservation Interoperability Framework.; 2011. ● "ISDA Polyglot." Image and Spatial Data Analasys Division. U of Illinois, 2013. Web. 10 Nov. 2013. <http://isda.ncsa.illinois. edu/drupal/software/polyglot>. ● Kenton McHenry and Peter Bajcsy "3D+Time File Formats.", Technical Report NCSA-ISDA10-001, October 15, 2010. ● McHenry K, Ondrejcek M, Marini L, Kooper R, Bajcsy P.Towards a Universal Viewer for Digital Content. In: International Conference on Computer Science, Executable Paper Workshop.; 2011. ● McHenry K, Kooper R, Marini L, Bajcsy P. Designing a Scalable Cross Platform Imposed Code Reuse Framework. In: Microsoft Research eScience Workshop. Berkeley, CA,; 2010. ● McHenry K, Kooper R, Bajcsy P.Taking Matters into Your Own Hands: Imposing Code Reusability for Universal File Format Conversion. In: Microsoft Research eScience workshop. Pittsburg, PA,; 2009. ● McHenry K, Kooper R, Bajcsy P.Towards a Universal, Quantifiable, and Scalable File Format Converter. Oxford, UK; 2009. ● McHenry K, Bajcsy P. Framework Converts Files of Any Format.; 2009. ● Ondrejcek M, McHenry K, Bajcsy P. The Conversion Software Registry. Berkeley, CA; 2010 ● "Towards a Universal File Format Converter." Analysis of Electronic Records, Document Appraisal Framework. NCSA at the U of Illinois at Urbana-Champaign, 17 Feb. 2011. Web. 10 Nov. 2013. <http://isda.ncsa.uiuc.edu/NARA/conversion.html>. ● McHenry, K. and Bajcsy P. "An Overview of 3D Data Content, File Formats and Viewers.", Technical Report NCSA-ISDA08- 002, October 31, 2008.