SlideShare una empresa de Scribd logo
1 de 31
Unlock Content




                          First Encounters with Office Open XML


                                                                   Matt Turner
                                                                   Principal Consultant
                                                                   December 3rd, 2007



Copyright © 2007 Mark Logic Corporation. All rights reserved.                             Slide 1
Agenda


      Office Open XML basics
              p
      Office Open XML and XML tools
      Some examples
      Parting Thoughts




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 2
Office Open XML

      Native format of MS Office 2007
              Complete rework of the entire
              productivity suite
              Word, PowerPoint, Excel, etc.
              All have native format of XML!


      OOXML = Office Open XML

      Standard through Ecma International
              Formally k
              F    ll known as E
                               Ecma 376
              Approved in December 2006




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 3
Lets Have a Look


     There is a lot of it . . .

     A Requirement was 100% compatibility
        Layout based
        6500+ page specification
                        ifi ti
        Thousands of elements + attributes

     And speed and space
        Single-character QNames
        Single character
        Single-character namespace prefixes
        No spare whitespace

     But the core element set is manageable . . .

                                                                . . . for simple documents ☺
Copyright © 2007 Mark Logic Corporation. All rights reserved.                                  Slide 4
It’s Not XML, It’s Zipped XML




        Zipped container with content, formatting info and
        manifest
        Payload varies by application but its all XML
        New extension: .docx, pptx .xlsx
                        docx pptx, xlsx

Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 5
Office Open XML Sample




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 6
Runs Present Some Issues

      The core of a Word file comprises text “runs”
         New styles require new runs sometimes they just show up
                                   runs,
         Sometimes split text (!!)




      Needs some special handling which we can do with XQuery

Copyright © 2007 Mark Logic Corporation. All rights reserved.      Slide 7
Hooks to Add Structure


      Spec allows for customXml tag that you can use to add
      structure
      Word also support to let you add structure
               Support for schemas to control editing
               Controls to let you add arbitrary tags
               And flow content from external (XML) sources
               Available from the Developer Tab
                  Enable developer tab from word options under the ‘big button’
      Word itself can be configured with XML to jump start
      custom editing and XML interactions



Copyright © 2007 Mark Logic Corporation. All rights reserved.                 Slide 8
Agenda


      Office Open basics
              p
      Office Open XML and XML tools
      Some examples
      Parting Thoughts




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 9
What can we do with it?

      It’s XML – anything!!
               You
               Y can query it transform it . . . the whole enchilada.
                           it, t   f             th   hl      hil d
      Create it
               MS Word is now (just) an OOXML editor ( )
                              (j )                   (!!)
               There are lots of other ways to edit and create OOXML
      Make the desktop connection
               Drive application context direct from end-user documents
               Output quot;first-draftquot; of end-user documents that work on (real)
               desktops
               Create content apps that work directly on collections of
               Office documents, without conversion
      Simplify the XML-ifying of business processes
                   XML ifying

Copyright © 2007 Mark Logic Corporation. All rights reserved.                   Slide 10
XQuery Makes It Happen

      XQuery - much more than a query language
        W3C standard
        Query, manipulate and render XML

      XML Content Servers (like MarkLogic Server)
        Application ready extensions provide complete
        platform for content applications
        Such as
           Update features to load / maintain content
           HTTP / REST interfaces
           Zip tools to h dl the
           Zi t l t handle th packaging*
                                       ki*
                                                                *MarkLogic Server only

Copyright © 2007 Mark Logic Corporation. All rights reserved.                     Slide 11
Agenda


      Office Open basics
              p
      Office Open XML and XML tools
      Some examples
      Parting Thoughts




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 12
Examples


1. Exploring Office Open XML
     p     g         p
   •         Open up .zip package
   •         Update XML
   •         Repackage

   •         Load into Content Server
   •         Create CustomXml + Controls
   •         Query and Update
   •         Repackage into .docx




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 13
Examples

      Unzip package and edit document.xml to add Custom Structure in
      XML editor




Copyright © 2007 Mark Logic Corporation. All rights reserved.          Slide 14
Examples

      Zip back up, rename to .docx and open in word
      use developer tab to view CustomXml




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 15
Examples

•     Load into MarkLogic Server, unzip and expand to load individual
      XML files




Copyright © 2007 Mark Logic Corporation. All rights reserved.           Slide 16
Examples

•     Query the XML: this XQuery




•     Returns:
      Rt




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 17
Examples

•     Package the files back up to create a new .docx




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 18
Content Server Examples


      Office Open XML = accessible format
         gives you the building blocks to create purpose built
         applications to leverage desktop apps



      Content Servers put it together
        MarkLogic Server combines the XML tools (XQuery)
        to process it and the extension to seamlessly round
        trip content




Copyright © 2007 Mark Logic Corporation. All rights reserved.    Slide 19
Content Server Examples


Generate, Query and Mash-up MS Word
       1. Use XQuery to transform XML into Office Open XML

       2. Use XQuery t access granular content elements in
       2 U XQ        to            l      t tl       ti
          word documents and create new Office Open XML

       3. Customize Word Ribbons to query MarkLogic Server
          to get content and save new content back




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 20
Content Server Example 1

From source XML (shakespeare plays)




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 21
Content Server Example 1

Generate Open Office XML




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 22
Content Server Example 1

Package and open in Word – with structure in customXML elements




Copyright © 2007 Mark Logic Corporation. All rights reserved.     Slide 23
Content Server Example 2

Access granular elements of Office Open XML and create new content




Copyright © 2007 Mark Logic Corporation. All rights reserved.        Slide 24
Content Server Example 3

Add content using custom Mark Logic Ribbon
   Insert new content into
   Shakespeare play
   Query MarkLogic
   Server for content from
   tech support content
   base




 Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 25
Content Server Example 3

Add content using custom Mark Logic Ribbon
   Insert new content into
   Shakespeare play
   Query MarkLogic
   Server for content from
   tech support content
   base
   Insert content into play
   Actions contained in
   MarkLogic Ribbon




 Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 26
Content Server Example 3

Save content back to contentbase
   Select any content
   Create a new
   document with the
   snippet in MarkLogic
   Server contentbase




 Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 27
Agenda


      Open Office basics
       p
      OOXML and XML tools
      Some examples
      Parting Thoughts




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 28
Conclusions

      Office 2007’s native file format is XML! For real!

      XQuery provides powerful tools to ingest, query,
      manipulate and g
          p          generate the format ( XML after all)
                                          (it’s         )

      OOXML provides the building blocks for integrated
      content apps based on desktop content

      XML Content Servers enable these applications so . . .




Copyright © 2007 Mark Logic Corporation. All rights reserved.   Slide 29
Content Application Resources

   OOXML Standard
            http://www.ecma-international.org/publications/standards/Ecma-376.htm
   Small changes – featuring OOXML
            http://developer.marklogic.com/columns/smallchanges
   OOXML Developers
            http://openxmldeveloper.org/
   Discovering XQuery (my blog)
            http://xquery.typepad.com
               p q y yp p
   MarkMail (XML Lists)
            http://markmail.org
   Mark Logic CEO Blog
            http://marklogic.blogspot.com
   XQuery site / developers group
            htt //
            http://x-query.com
   Querying XML (book) Melton and Buxton
Copyright © 2007 Mark Logic Corporation. All rights reserved.                       Slide 30
Unlock Content™




                   Thank You
                    Sample Template
                     Matt Turner
                     Principal Consultant
                    What do we think of this
                     matt.turner@marklogic.com
                                             Sfdhskdfjh kjsfhd
                     http://xquery.typepad.com Sdflkhsdf sdflk



Copyright © 2007 Mark Logic Corporation. All rights reserved.                     Slide 31

Más contenido relacionado

Destacado

Educación inclusiva, diversidad y discursividad.aldo ocampo gonzález
Educación inclusiva, diversidad y discursividad.aldo ocampo gonzálezEducación inclusiva, diversidad y discursividad.aldo ocampo gonzález
Educación inclusiva, diversidad y discursividad.aldo ocampo gonzálezhipocampus0727
 
Kte helye es szerepe lazarne szanadi csilla
Kte helye es szerepe lazarne szanadi csillaKte helye es szerepe lazarne szanadi csilla
Kte helye es szerepe lazarne szanadi csillaktegyesulet
 
Colaborativo 1 fase_1_rosa_elena_rodriguez
Colaborativo 1 fase_1_rosa_elena_rodriguezColaborativo 1 fase_1_rosa_elena_rodriguez
Colaborativo 1 fase_1_rosa_elena_rodriguezRositar66
 
LASU 2016/2017 Comprehensive Merit Admission List www.alluniversitynews.com
LASU 2016/2017 Comprehensive Merit Admission List www.alluniversitynews.comLASU 2016/2017 Comprehensive Merit Admission List www.alluniversitynews.com
LASU 2016/2017 Comprehensive Merit Admission List www.alluniversitynews.comMbadinuju Chiemeka Valentine B.Sc
 

Destacado (10)

Cole-Week 4-COM411A
Cole-Week 4-COM411ACole-Week 4-COM411A
Cole-Week 4-COM411A
 
Educación inclusiva, diversidad y discursividad.aldo ocampo gonzález
Educación inclusiva, diversidad y discursividad.aldo ocampo gonzálezEducación inclusiva, diversidad y discursividad.aldo ocampo gonzález
Educación inclusiva, diversidad y discursividad.aldo ocampo gonzález
 
Kte helye es szerepe lazarne szanadi csilla
Kte helye es szerepe lazarne szanadi csillaKte helye es szerepe lazarne szanadi csilla
Kte helye es szerepe lazarne szanadi csilla
 
Colaborativo 1 fase_1_rosa_elena_rodriguez
Colaborativo 1 fase_1_rosa_elena_rodriguezColaborativo 1 fase_1_rosa_elena_rodriguez
Colaborativo 1 fase_1_rosa_elena_rodriguez
 
Saranyaaaaaaaa
SaranyaaaaaaaaSaranyaaaaaaaa
Saranyaaaaaaaa
 
Digital Decimal Counter
Digital Decimal CounterDigital Decimal Counter
Digital Decimal Counter
 
здоровье
здоровьездоровье
здоровье
 
Resume - Alejandro Mercado
Resume - Alejandro MercadoResume - Alejandro Mercado
Resume - Alejandro Mercado
 
TDAH
TDAHTDAH
TDAH
 
LASU 2016/2017 Comprehensive Merit Admission List www.alluniversitynews.com
LASU 2016/2017 Comprehensive Merit Admission List www.alluniversitynews.comLASU 2016/2017 Comprehensive Merit Admission List www.alluniversitynews.com
LASU 2016/2017 Comprehensive Merit Admission List www.alluniversitynews.com
 

Similar a First Encounters With Office Open Xml

epicenter2010 Open Xml
epicenter2010   Open Xmlepicenter2010   Open Xml
epicenter2010 Open XmlCraig Murphy
 
Office OpenXML: a technical approach for OOo.
Office OpenXML: a technical approach for OOo.Office OpenXML: a technical approach for OOo.
Office OpenXML: a technical approach for OOo.Alexandro Colorado
 
Document Sucuess With Office 2007
Document Sucuess With Office 2007 Document Sucuess With Office 2007
Document Sucuess With Office 2007 Severus Prime
 
Xml and xml processor
Xml and xml processorXml and xml processor
Xml and xml processorHimanshu Soni
 
Xml and xml processor
Xml and xml processorXml and xml processor
Xml and xml processorHimanshu Soni
 
A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML
A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML
A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML Alexandro Colorado
 
Working with xml data
Working with xml dataWorking with xml data
Working with xml dataaspnet123
 
Xml in bio medical field
Xml in bio medical fieldXml in bio medical field
Xml in bio medical fieldJuman Ghazi
 
Lotusphere 2007 AD507 Leveraging the Power of Object Oriented Programming in ...
Lotusphere 2007 AD507 Leveraging the Power of Object Oriented Programming in ...Lotusphere 2007 AD507 Leveraging the Power of Object Oriented Programming in ...
Lotusphere 2007 AD507 Leveraging the Power of Object Oriented Programming in ...Bill Buchan
 
LINQ to XML
LINQ to XMLLINQ to XML
LINQ to XMLukdpe
 
Creating xml publisher documents with people code
Creating xml publisher documents with people codeCreating xml publisher documents with people code
Creating xml publisher documents with people codeRandall Groncki
 
ILUG 2007 - Notes and Office Integration
ILUG 2007 - Notes and Office IntegrationILUG 2007 - Notes and Office Integration
ILUG 2007 - Notes and Office IntegrationJohn Head
 
Introduction to .net and asp
Introduction to .net and aspIntroduction to .net and asp
Introduction to .net and aspPrachi Agarwal
 

Similar a First Encounters With Office Open Xml (20)

epicenter2010 Open Xml
epicenter2010   Open Xmlepicenter2010   Open Xml
epicenter2010 Open Xml
 
BusinessAnalyst_Training.pptx
BusinessAnalyst_Training.pptxBusinessAnalyst_Training.pptx
BusinessAnalyst_Training.pptx
 
Office OpenXML: a technical approach for OOo.
Office OpenXML: a technical approach for OOo.Office OpenXML: a technical approach for OOo.
Office OpenXML: a technical approach for OOo.
 
X Usax Pdf
X Usax PdfX Usax Pdf
X Usax Pdf
 
Document Sucuess With Office 2007
Document Sucuess With Office 2007 Document Sucuess With Office 2007
Document Sucuess With Office 2007
 
Xml and xml processor
Xml and xml processorXml and xml processor
Xml and xml processor
 
Xml and xml processor
Xml and xml processorXml and xml processor
Xml and xml processor
 
Java Web Services
Java Web ServicesJava Web Services
Java Web Services
 
sidje
sidjesidje
sidje
 
A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML
A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML
A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML
 
Working with xml data
Working with xml dataWorking with xml data
Working with xml data
 
Xml in bio medical field
Xml in bio medical fieldXml in bio medical field
Xml in bio medical field
 
Lotusphere 2007 AD507 Leveraging the Power of Object Oriented Programming in ...
Lotusphere 2007 AD507 Leveraging the Power of Object Oriented Programming in ...Lotusphere 2007 AD507 Leveraging the Power of Object Oriented Programming in ...
Lotusphere 2007 AD507 Leveraging the Power of Object Oriented Programming in ...
 
LINQ to XML
LINQ to XMLLINQ to XML
LINQ to XML
 
Creating xml publisher documents with people code
Creating xml publisher documents with people codeCreating xml publisher documents with people code
Creating xml publisher documents with people code
 
ILUG 2007 - Notes and Office Integration
ILUG 2007 - Notes and Office IntegrationILUG 2007 - Notes and Office Integration
ILUG 2007 - Notes and Office Integration
 
treeview
treeviewtreeview
treeview
 
treeview
treeviewtreeview
treeview
 
UNIT-1 Web services
UNIT-1 Web servicesUNIT-1 Web services
UNIT-1 Web services
 
Introduction to .net and asp
Introduction to .net and aspIntroduction to .net and asp
Introduction to .net and asp
 

Más de Matt Turner

Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of DataMatt Turner
 
Data2030 Summit MEA: Data Chaos to Data Culture March 2023
Data2030 Summit MEA: Data Chaos to Data Culture March 2023Data2030 Summit MEA: Data Chaos to Data Culture March 2023
Data2030 Summit MEA: Data Chaos to Data Culture March 2023Matt Turner
 
Data2030 Summit Data Megatrends Turner Sept 2022.pptx
Data2030 Summit Data Megatrends Turner Sept 2022.pptxData2030 Summit Data Megatrends Turner Sept 2022.pptx
Data2030 Summit Data Megatrends Turner Sept 2022.pptxMatt Turner
 
From Data Chaos to Data Culture
From Data Chaos to Data CultureFrom Data Chaos to Data Culture
From Data Chaos to Data CultureMatt Turner
 
How Data is Driving AI Innovation
How Data is Driving AI InnovationHow Data is Driving AI Innovation
How Data is Driving AI InnovationMatt Turner
 
Principles of Information Access
Principles of Information AccessPrinciples of Information Access
Principles of Information AccessMatt Turner
 
Securing the Right Metadata and Making it Work for You
Securing the Right Metadata and Making it Work for YouSecuring the Right Metadata and Making it Work for You
Securing the Right Metadata and Making it Work for YouMatt Turner
 
Operationalize Your Data and Lead Your Business Transformation
Operationalize Your Data and Lead Your Business TransformationOperationalize Your Data and Lead Your Business Transformation
Operationalize Your Data and Lead Your Business TransformationMatt Turner
 
Three Cool Things You Can Do with Standards
Three Cool Things You Can Do with StandardsThree Cool Things You Can Do with Standards
Three Cool Things You Can Do with StandardsMatt Turner
 
Mark logic Industrialize Your Data IOT Berlin Sept 2019
Mark logic Industrialize Your Data IOT Berlin Sept 2019Mark logic Industrialize Your Data IOT Berlin Sept 2019
Mark logic Industrialize Your Data IOT Berlin Sept 2019Matt Turner
 
BBC olympics 2012 experience oct18
BBC olympics 2012 experience oct18BBC olympics 2012 experience oct18
BBC olympics 2012 experience oct18Matt Turner
 
Operationalize Your Linked Data
Operationalize Your Linked DataOperationalize Your Linked Data
Operationalize Your Linked DataMatt Turner
 
Smart Content Summit: Unlock the Value with the Right Data Pattern
Smart Content Summit: Unlock the Value with the Right Data PatternSmart Content Summit: Unlock the Value with the Right Data Pattern
Smart Content Summit: Unlock the Value with the Right Data PatternMatt Turner
 
Data Security and the Hard Outer Shell
Data Security and the Hard Outer ShellData Security and the Hard Outer Shell
Data Security and the Hard Outer ShellMatt Turner
 
Media publishing meetup ocean of data july 2016
Media publishing meetup ocean of data july 2016Media publishing meetup ocean of data july 2016
Media publishing meetup ocean of data july 2016Matt Turner
 
Northeastern DB Class Introduction to Marklogic NoSQL april 2016
Northeastern DB Class Introduction to Marklogic NoSQL april 2016Northeastern DB Class Introduction to Marklogic NoSQL april 2016
Northeastern DB Class Introduction to Marklogic NoSQL april 2016Matt Turner
 
The Impact of Smart Content
The Impact of Smart ContentThe Impact of Smart Content
The Impact of Smart ContentMatt Turner
 
Metadata Madness: Semantics Takes Center Stage
Metadata Madness: Semantics Takes Center StageMetadata Madness: Semantics Takes Center Stage
Metadata Madness: Semantics Takes Center StageMatt Turner
 
New Trends in Data Management in the Information Industries
New Trends in Data Management in the Information Industries New Trends in Data Management in the Information Industries
New Trends in Data Management in the Information Industries Matt Turner
 
Smart Content Summit - Unlocking Content With Semantics and Metadata
Smart Content Summit - Unlocking Content With Semantics and MetadataSmart Content Summit - Unlocking Content With Semantics and Metadata
Smart Content Summit - Unlocking Content With Semantics and MetadataMatt Turner
 

Más de Matt Turner (20)

Data In Action: Business Value of Data
Data In Action: Business Value of DataData In Action: Business Value of Data
Data In Action: Business Value of Data
 
Data2030 Summit MEA: Data Chaos to Data Culture March 2023
Data2030 Summit MEA: Data Chaos to Data Culture March 2023Data2030 Summit MEA: Data Chaos to Data Culture March 2023
Data2030 Summit MEA: Data Chaos to Data Culture March 2023
 
Data2030 Summit Data Megatrends Turner Sept 2022.pptx
Data2030 Summit Data Megatrends Turner Sept 2022.pptxData2030 Summit Data Megatrends Turner Sept 2022.pptx
Data2030 Summit Data Megatrends Turner Sept 2022.pptx
 
From Data Chaos to Data Culture
From Data Chaos to Data CultureFrom Data Chaos to Data Culture
From Data Chaos to Data Culture
 
How Data is Driving AI Innovation
How Data is Driving AI InnovationHow Data is Driving AI Innovation
How Data is Driving AI Innovation
 
Principles of Information Access
Principles of Information AccessPrinciples of Information Access
Principles of Information Access
 
Securing the Right Metadata and Making it Work for You
Securing the Right Metadata and Making it Work for YouSecuring the Right Metadata and Making it Work for You
Securing the Right Metadata and Making it Work for You
 
Operationalize Your Data and Lead Your Business Transformation
Operationalize Your Data and Lead Your Business TransformationOperationalize Your Data and Lead Your Business Transformation
Operationalize Your Data and Lead Your Business Transformation
 
Three Cool Things You Can Do with Standards
Three Cool Things You Can Do with StandardsThree Cool Things You Can Do with Standards
Three Cool Things You Can Do with Standards
 
Mark logic Industrialize Your Data IOT Berlin Sept 2019
Mark logic Industrialize Your Data IOT Berlin Sept 2019Mark logic Industrialize Your Data IOT Berlin Sept 2019
Mark logic Industrialize Your Data IOT Berlin Sept 2019
 
BBC olympics 2012 experience oct18
BBC olympics 2012 experience oct18BBC olympics 2012 experience oct18
BBC olympics 2012 experience oct18
 
Operationalize Your Linked Data
Operationalize Your Linked DataOperationalize Your Linked Data
Operationalize Your Linked Data
 
Smart Content Summit: Unlock the Value with the Right Data Pattern
Smart Content Summit: Unlock the Value with the Right Data PatternSmart Content Summit: Unlock the Value with the Right Data Pattern
Smart Content Summit: Unlock the Value with the Right Data Pattern
 
Data Security and the Hard Outer Shell
Data Security and the Hard Outer ShellData Security and the Hard Outer Shell
Data Security and the Hard Outer Shell
 
Media publishing meetup ocean of data july 2016
Media publishing meetup ocean of data july 2016Media publishing meetup ocean of data july 2016
Media publishing meetup ocean of data july 2016
 
Northeastern DB Class Introduction to Marklogic NoSQL april 2016
Northeastern DB Class Introduction to Marklogic NoSQL april 2016Northeastern DB Class Introduction to Marklogic NoSQL april 2016
Northeastern DB Class Introduction to Marklogic NoSQL april 2016
 
The Impact of Smart Content
The Impact of Smart ContentThe Impact of Smart Content
The Impact of Smart Content
 
Metadata Madness: Semantics Takes Center Stage
Metadata Madness: Semantics Takes Center StageMetadata Madness: Semantics Takes Center Stage
Metadata Madness: Semantics Takes Center Stage
 
New Trends in Data Management in the Information Industries
New Trends in Data Management in the Information Industries New Trends in Data Management in the Information Industries
New Trends in Data Management in the Information Industries
 
Smart Content Summit - Unlocking Content With Semantics and Metadata
Smart Content Summit - Unlocking Content With Semantics and MetadataSmart Content Summit - Unlocking Content With Semantics and Metadata
Smart Content Summit - Unlocking Content With Semantics and Metadata
 

Último

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 

Último (20)

The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 

First Encounters With Office Open Xml

  • 1. Unlock Content First Encounters with Office Open XML Matt Turner Principal Consultant December 3rd, 2007 Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 1
  • 2. Agenda Office Open XML basics p Office Open XML and XML tools Some examples Parting Thoughts Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 2
  • 3. Office Open XML Native format of MS Office 2007 Complete rework of the entire productivity suite Word, PowerPoint, Excel, etc. All have native format of XML! OOXML = Office Open XML Standard through Ecma International Formally k F ll known as E Ecma 376 Approved in December 2006 Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 3
  • 4. Lets Have a Look There is a lot of it . . . A Requirement was 100% compatibility Layout based 6500+ page specification ifi ti Thousands of elements + attributes And speed and space Single-character QNames Single character Single-character namespace prefixes No spare whitespace But the core element set is manageable . . . . . . for simple documents ☺ Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 4
  • 5. It’s Not XML, It’s Zipped XML Zipped container with content, formatting info and manifest Payload varies by application but its all XML New extension: .docx, pptx .xlsx docx pptx, xlsx Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 5
  • 6. Office Open XML Sample Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 6
  • 7. Runs Present Some Issues The core of a Word file comprises text “runs” New styles require new runs sometimes they just show up runs, Sometimes split text (!!) Needs some special handling which we can do with XQuery Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 7
  • 8. Hooks to Add Structure Spec allows for customXml tag that you can use to add structure Word also support to let you add structure Support for schemas to control editing Controls to let you add arbitrary tags And flow content from external (XML) sources Available from the Developer Tab Enable developer tab from word options under the ‘big button’ Word itself can be configured with XML to jump start custom editing and XML interactions Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 8
  • 9. Agenda Office Open basics p Office Open XML and XML tools Some examples Parting Thoughts Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 9
  • 10. What can we do with it? It’s XML – anything!! You Y can query it transform it . . . the whole enchilada. it, t f th hl hil d Create it MS Word is now (just) an OOXML editor ( ) (j ) (!!) There are lots of other ways to edit and create OOXML Make the desktop connection Drive application context direct from end-user documents Output quot;first-draftquot; of end-user documents that work on (real) desktops Create content apps that work directly on collections of Office documents, without conversion Simplify the XML-ifying of business processes XML ifying Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 10
  • 11. XQuery Makes It Happen XQuery - much more than a query language W3C standard Query, manipulate and render XML XML Content Servers (like MarkLogic Server) Application ready extensions provide complete platform for content applications Such as Update features to load / maintain content HTTP / REST interfaces Zip tools to h dl the Zi t l t handle th packaging* ki* *MarkLogic Server only Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 11
  • 12. Agenda Office Open basics p Office Open XML and XML tools Some examples Parting Thoughts Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 12
  • 13. Examples 1. Exploring Office Open XML p g p • Open up .zip package • Update XML • Repackage • Load into Content Server • Create CustomXml + Controls • Query and Update • Repackage into .docx Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 13
  • 14. Examples Unzip package and edit document.xml to add Custom Structure in XML editor Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 14
  • 15. Examples Zip back up, rename to .docx and open in word use developer tab to view CustomXml Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 15
  • 16. Examples • Load into MarkLogic Server, unzip and expand to load individual XML files Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 16
  • 17. Examples • Query the XML: this XQuery • Returns: Rt Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 17
  • 18. Examples • Package the files back up to create a new .docx Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 18
  • 19. Content Server Examples Office Open XML = accessible format gives you the building blocks to create purpose built applications to leverage desktop apps Content Servers put it together MarkLogic Server combines the XML tools (XQuery) to process it and the extension to seamlessly round trip content Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 19
  • 20. Content Server Examples Generate, Query and Mash-up MS Word 1. Use XQuery to transform XML into Office Open XML 2. Use XQuery t access granular content elements in 2 U XQ to l t tl ti word documents and create new Office Open XML 3. Customize Word Ribbons to query MarkLogic Server to get content and save new content back Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 20
  • 21. Content Server Example 1 From source XML (shakespeare plays) Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 21
  • 22. Content Server Example 1 Generate Open Office XML Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 22
  • 23. Content Server Example 1 Package and open in Word – with structure in customXML elements Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 23
  • 24. Content Server Example 2 Access granular elements of Office Open XML and create new content Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 24
  • 25. Content Server Example 3 Add content using custom Mark Logic Ribbon Insert new content into Shakespeare play Query MarkLogic Server for content from tech support content base Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 25
  • 26. Content Server Example 3 Add content using custom Mark Logic Ribbon Insert new content into Shakespeare play Query MarkLogic Server for content from tech support content base Insert content into play Actions contained in MarkLogic Ribbon Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 26
  • 27. Content Server Example 3 Save content back to contentbase Select any content Create a new document with the snippet in MarkLogic Server contentbase Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 27
  • 28. Agenda Open Office basics p OOXML and XML tools Some examples Parting Thoughts Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 28
  • 29. Conclusions Office 2007’s native file format is XML! For real! XQuery provides powerful tools to ingest, query, manipulate and g p generate the format ( XML after all) (it’s ) OOXML provides the building blocks for integrated content apps based on desktop content XML Content Servers enable these applications so . . . Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 29
  • 30. Content Application Resources OOXML Standard http://www.ecma-international.org/publications/standards/Ecma-376.htm Small changes – featuring OOXML http://developer.marklogic.com/columns/smallchanges OOXML Developers http://openxmldeveloper.org/ Discovering XQuery (my blog) http://xquery.typepad.com p q y yp p MarkMail (XML Lists) http://markmail.org Mark Logic CEO Blog http://marklogic.blogspot.com XQuery site / developers group htt // http://x-query.com Querying XML (book) Melton and Buxton Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 30
  • 31. Unlock Content™ Thank You Sample Template Matt Turner Principal Consultant What do we think of this matt.turner@marklogic.com Sfdhskdfjh kjsfhd http://xquery.typepad.com Sdflkhsdf sdflk Copyright © 2007 Mark Logic Corporation. All rights reserved. Slide 31