SlideShare una empresa de Scribd logo
1 de 22
Kyong-Ha Lee and Bongki Moon The University of Arizona Bitmap Indexes For Relational XML Twig Query Processing
CIKM'09, Hong Kong 2 XML Data and Queries a1 0 (1, 32,1) <a>    <a>        <b>t1</b>       <c>          <d>t2</d>          <e>t3</e>       </c>   </a>   <a>     <b>         <e>t4</e>     </b>     <d>         <c>t5</c>     </d>   </a> . . . . . </a>  a2 a3 a4 1 6 11 (2,11,2) (12,21,2) (22,31,2) 9 c1 b1 7 12 15 2 3 d2 b2 e3 b3 (13,16,3) (17,20,3) (23,28,3) (29,30,3) (5,10,3) (3,4,3) 10 c2 e2 d3 d1 e1 8 c3 13 14 4 5 (26,27,4) (24,25,4) (18,19,4) (6,7,4) (8,9,4) (14,15,4) //A/B/C //A[//B]//C //A[./B/C]//E A A A B C E B B C C
CIKM'09, Hong Kong 3 XML Stored in RDB NODE table PATH table . . . . . . . . .
To answer a twig query A twig pattern is decomposed into several path patterns. Path solutions are joined together to compose a final result.	 Holistic Twig Join(HTJ) algorithm Specialized multi-way& sort-merge join guarantees I/O optimality for a certain subset of XML query. The optimality depends on how the elements are partitioned. uses stacks and streams in which elements are sorted in an order. CIKM'09, Hong Kong 4 Twig  Join A A E B C  SA A A SE SB B E  SC C Stacks Streams
Discrepancy between XML in RDB and conventional HTJ algorithms Logical: Streams vs. Table Physical: partitioned vs. record-oriented Supporting actual data including a large volume of texts requires references to records. How to feed tuples to HTJ algorithm? What’s the best partitioning scheme for XML stored in RDB? Bitmap index, a conventional index in RDBMS An efficient way to indicate tuples. Efficient support for logical operations Can we use the bitmap index for supporting HTJ? CIKM'09, Hong Kong 5 Motivation
Tag-based partitioning Simple, and skipping technique can be used to read useful elements only.  For a query node, only one stream is accessed Tag+Level partitioning More I/O optimality, suitable for deep XML Some streams may be accessed for a single query node Path-based partitioning More I/O optimality, suitable for shallow XML A path with //-axes may require accessing many streams for a single query node CIKM'09, Hong Kong 6 HTJ on Different Partitioning Schemes
CIKM'09, Hong Kong 7 Bitmap Index How to partition tuples in NODE table  By building a bitmap index on certain column(s) in the table. bitTag for tagName,  bitTag+ for (tagName, Level),  bitPath for pathId column Determines I/O optimality of holistic twig join algorithms. During twig join process, useful tuples are accessed via the bitmap index.   A B E . . .  110000 1 0 0 0 0010000100 0000010000 Bit-vectors . . . disk blocks
bitAnc : A bit-vector represents terminal elements corr. to a certain path and all their ancestors.  bitDesc: A bit–vector represents terminal elements corr. to a certain path and all their descendants. CIKM'09, Hong Kong 8 Additional Indexes a1 0 a2 a3 a4 1 6 11 b1 2 7 12 b2 b3 14 e2 d3 8 c3 13 A subtree covered by the left 3 bit-vectors bitPath,bitAnc, andbitDescfor PathId=2,  i.e.  /A/A/B
Basic index Bit-vectors are built on a single column or a group of columns Requires labeled values, and reading records  Hybrid index A Combination of two different indexes descTag : bitDesc & bitTag bitTwig : bitPath & bitAnc does not require labeled values to compute twig solution CIKM'09, Hong Kong 9 Two Types of Indexes
CIKM'09, Hong Kong 10 Identifying Element Relationship with Bit-vectors a1 1 1 1 0 0 0 1 1 0 0 0 1 1 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1100001000010000 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 a2 b1 ,[object Object],b2 a1 0 a2 a3 a4 1 6 11 b1 2 7 12 b2 b3 P2: /A/A/B P0: /A P1: /A/A
Choose the minimum position value among the current 1’s as a current element for a query node Check if 1 exists in an interval, pos(a) and pos(d)? looking-ahead at the next 1 CIKM'09, Hong Kong 11 Advancing Cursors 0  eov P0 : /A  P1 : /A/A q : //A  (0,0,1)  6 1 Currq Current1 Next1
Early detection with a bit-vector absence Condensing query nodes For path-based partition Reduces |INDEX| and |RECORD| Skipping reading obsolete records  with advance(k) For tag, (tag, level)-based partition Reduces |RECORD| Moving cursors over compressed bit-vectors with no decompression A composite cursor moving over a bit-vector compressed by run-length encoding scheme Reduces |INDEX| CIKM'09, Hong Kong 12 Optimizations A A E B E C C P: //A/B/C CA = 11 10000000000100000 CB = 4 advance(11) 00001000010000100
CIKM'09, Hong Kong 13 Compressed Bit-vector 000100000000100000000000000011  00000000000 . . . 00000000000000    0000000000000000000000000000001  00 (a) An original bit-vector with 8,000 bits 31 bits 2 bits 256* 31 bits 31 bits (b) Grouping as a unit of 31 bits and Merging identical groups 000010…010…011 100… 0100000000 000…001 000…000 Run-length is 256 31 literal bits Remaining word Uncompressed word  Compressed word (c) Encoding each group as 1 word (4byte on a 32-bit machine) Cursor C    ={ C.position,  //Integer position value (Logical address)        C. word, // The current word C is located at.        C.bit, // The position of the bit C is visiting, in C.word        C. rest } //The bit position in the remaining word
CIKM'09, Hong Kong 14 Moving A Cursor over A Compressed Bit-vector a) Get the position of the next 1 C = {31, 0, 31,0} Skip to examine  31* 256 bits C={7998, 2, 31, 0} 000010…010…011 100… 0100000000 000…001 000…000 Remaining word Run-length is 256 b) Check a bit value at the position 3,000 C = {31, 0, 31,0} with distance to move,  2,869=(3000-31) Since 31* 256 > 2,869, The bit we find is within the word 1.  000010…010…011 100… 0100000000 000…001 000…000
CIKM'09, Hong Kong 15 Experiments Datasets  Synthetic : XMark Real : DBLP, Treebank, Swiss-prot Query sets
CIKM'09, Hong Kong 16 Statistics of Dataset and Indexes ,[object Object]
# of distinct tag names are not  much different
Index build time is largelyaffected by attribute cardinality ,[object Object], labeled value size in most cases
CIKM'09, Hong Kong 17 Query Execution Time
CIKM'09, Hong Kong 18 Input Data Size
Merging used bit-vectors for a path pattern with //-axes and putting it into a bitmap index for the next time for a given path //A//B, P:/A/A/B P:/A/B acts like a pre-computed join index A path pattern with //-axes can be represented by a single bit-vector. Logical operations: OR, NOT  are simply supported by bitwise-logical operations: &, |, ^ CIKM'09, Hong Kong 19 Other Features on bitPath
CIKM'09, Hong Kong 20 Twig Queries with Logical Operations P//A, P//A//B//X ≡P//A//B//C V P//A//B//D , P//A//E A A A A B E B E X (C|D) //A[./B/C or ./B/D]//E P//A , P//A//E , P//A/B ⓧ(P//A/B ⊙A//A/B/C) A A A A A B B E E B C ¬ C //A[./B/not(C)]//E

Más contenido relacionado

La actualidad más candente

FINDING FREQUENT SUBPATHS IN A GRAPH
FINDING FREQUENT SUBPATHS IN A GRAPHFINDING FREQUENT SUBPATHS IN A GRAPH
FINDING FREQUENT SUBPATHS IN A GRAPHIJDKP
 
Code optimization in compiler design
Code optimization in compiler designCode optimization in compiler design
Code optimization in compiler designKuppusamy P
 
Optimization of basic blocks
Optimization of basic blocksOptimization of basic blocks
Optimization of basic blocksishwarya516
 
Chapter Eight(1)
Chapter Eight(1)Chapter Eight(1)
Chapter Eight(1)bolovv
 
Assignment on different types of addressing modes
Assignment on different types of addressing modesAssignment on different types of addressing modes
Assignment on different types of addressing modesNusratJahan263
 
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMY
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMYComputer Science Engineering : Data structure & algorithm, THE GATE ACADEMY
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMYklirantga
 
Inductive Triple Graphs: A purely functional approach to represent RDF
Inductive Triple Graphs: A purely functional approach to represent RDFInductive Triple Graphs: A purely functional approach to represent RDF
Inductive Triple Graphs: A purely functional approach to represent RDFJose Emilio Labra Gayo
 
1327 Addressing Modes Of 8086
1327 Addressing Modes Of 80861327 Addressing Modes Of 8086
1327 Addressing Modes Of 8086techbed
 
Assembly language programming
Assembly language programming Assembly language programming
Assembly language programming Gaurav Takrani
 
Notes 8086 instruction format
Notes 8086 instruction formatNotes 8086 instruction format
Notes 8086 instruction formatHarshitParkar6677
 
1984 Article on An Application of AI to Operations Reserach
1984 Article on An Application of AI to Operations Reserach1984 Article on An Application of AI to Operations Reserach
1984 Article on An Application of AI to Operations ReserachBob Marcus
 
Microprocessor Basics CH-3
Microprocessor Basics CH-3Microprocessor Basics CH-3
Microprocessor Basics CH-3Neelam Kapoor
 
Design of High speed Low Power Reversible Vedic multiplier and Reversible Div...
Design of High speed Low Power Reversible Vedic multiplier and Reversible Div...Design of High speed Low Power Reversible Vedic multiplier and Reversible Div...
Design of High speed Low Power Reversible Vedic multiplier and Reversible Div...IJERA Editor
 
Advanced computer architect lesson 3 and 4
Advanced computer architect lesson 3 and 4Advanced computer architect lesson 3 and 4
Advanced computer architect lesson 3 and 4Ismail Mukiibi
 

La actualidad más candente (20)

Assembly Language -I
Assembly Language -IAssembly Language -I
Assembly Language -I
 
FINDING FREQUENT SUBPATHS IN A GRAPH
FINDING FREQUENT SUBPATHS IN A GRAPHFINDING FREQUENT SUBPATHS IN A GRAPH
FINDING FREQUENT SUBPATHS IN A GRAPH
 
Code optimization in compiler design
Code optimization in compiler designCode optimization in compiler design
Code optimization in compiler design
 
Lo18
Lo18Lo18
Lo18
 
3. adressingmodes1
3. adressingmodes13. adressingmodes1
3. adressingmodes1
 
Optimization of basic blocks
Optimization of basic blocksOptimization of basic blocks
Optimization of basic blocks
 
Chapter Eight(1)
Chapter Eight(1)Chapter Eight(1)
Chapter Eight(1)
 
Assignment on different types of addressing modes
Assignment on different types of addressing modesAssignment on different types of addressing modes
Assignment on different types of addressing modes
 
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMY
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMYComputer Science Engineering : Data structure & algorithm, THE GATE ACADEMY
Computer Science Engineering : Data structure & algorithm, THE GATE ACADEMY
 
Inductive Triple Graphs: A purely functional approach to represent RDF
Inductive Triple Graphs: A purely functional approach to represent RDFInductive Triple Graphs: A purely functional approach to represent RDF
Inductive Triple Graphs: A purely functional approach to represent RDF
 
1327 Addressing Modes Of 8086
1327 Addressing Modes Of 80861327 Addressing Modes Of 8086
1327 Addressing Modes Of 8086
 
Assembly language programming
Assembly language programming Assembly language programming
Assembly language programming
 
Notes 8086 instruction format
Notes 8086 instruction formatNotes 8086 instruction format
Notes 8086 instruction format
 
1984 Article on An Application of AI to Operations Reserach
1984 Article on An Application of AI to Operations Reserach1984 Article on An Application of AI to Operations Reserach
1984 Article on An Application of AI to Operations Reserach
 
Compiler unit 2&3
Compiler unit 2&3Compiler unit 2&3
Compiler unit 2&3
 
Lecture1
Lecture1Lecture1
Lecture1
 
Unit iv(simple code generator)
Unit iv(simple code generator)Unit iv(simple code generator)
Unit iv(simple code generator)
 
Microprocessor Basics CH-3
Microprocessor Basics CH-3Microprocessor Basics CH-3
Microprocessor Basics CH-3
 
Design of High speed Low Power Reversible Vedic multiplier and Reversible Div...
Design of High speed Low Power Reversible Vedic multiplier and Reversible Div...Design of High speed Low Power Reversible Vedic multiplier and Reversible Div...
Design of High speed Low Power Reversible Vedic multiplier and Reversible Div...
 
Advanced computer architect lesson 3 and 4
Advanced computer architect lesson 3 and 4Advanced computer architect lesson 3 and 4
Advanced computer architect lesson 3 and 4
 

Destacado

Kutadgu Corporate Profile
Kutadgu Corporate ProfileKutadgu Corporate Profile
Kutadgu Corporate ProfileAhter Kutadgu
 
Top down xml keyword query processing
Top down xml keyword query processingTop down xml keyword query processing
Top down xml keyword query processingShakas Technologies
 
Definitive Consulting E Brochure
Definitive Consulting E BrochureDefinitive Consulting E Brochure
Definitive Consulting E Brochurerichardschaefer
 
エンジニア目線での対外ブランディング ~ヌーラボ編~
エンジニア目線での対外ブランディング ~ヌーラボ編~エンジニア目線での対外ブランディング ~ヌーラボ編~
エンジニア目線での対外ブランディング ~ヌーラボ編~ikikko
 
Ruby 1.9 And Rails 3.0
Ruby 1.9 And Rails 3.0Ruby 1.9 And Rails 3.0
Ruby 1.9 And Rails 3.0ArrrrCamp
 
Wellspiration 6 - Fighting Heart Disease Naturally
Wellspiration 6  - Fighting Heart Disease NaturallyWellspiration 6  - Fighting Heart Disease Naturally
Wellspiration 6 - Fighting Heart Disease NaturallyYafa Sakkejha
 
What’S The Plan.Web
What’S The Plan.WebWhat’S The Plan.Web
What’S The Plan.Webnancybostock
 
mobipoll @ РИФ+КИБ
mobipoll @ РИФ+КИБmobipoll @ РИФ+КИБ
mobipoll @ РИФ+КИБAnton Kuchumov
 
The Effects of Work Habits Around Agility Through Simulations
The Effects of Work Habits Around Agility Through SimulationsThe Effects of Work Habits Around Agility Through Simulations
The Effects of Work Habits Around Agility Through SimulationsPaul Boos
 
【労働者健康福祉機構】平成17年度環境報告書
【労働者健康福祉機構】平成17年度環境報告書【労働者健康福祉機構】平成17年度環境報告書
【労働者健康福祉機構】平成17年度環境報告書env25
 
Derecho métodos filosofía_musa_majad
Derecho métodos filosofía_musa_majadDerecho métodos filosofía_musa_majad
Derecho métodos filosofía_musa_majadMusa Majad
 
Customer development oxford 14.02.2015
Customer development   oxford 14.02.2015Customer development   oxford 14.02.2015
Customer development oxford 14.02.2015Lino Velev
 

Destacado (20)

Kutadgu Corporate Profile
Kutadgu Corporate ProfileKutadgu Corporate Profile
Kutadgu Corporate Profile
 
Top down xml keyword query processing
Top down xml keyword query processingTop down xml keyword query processing
Top down xml keyword query processing
 
Definitive Consulting E Brochure
Definitive Consulting E BrochureDefinitive Consulting E Brochure
Definitive Consulting E Brochure
 
エンジニア目線での対外ブランディング ~ヌーラボ編~
エンジニア目線での対外ブランディング ~ヌーラボ編~エンジニア目線での対外ブランディング ~ヌーラボ編~
エンジニア目線での対外ブランディング ~ヌーラボ編~
 
Picking the right business model
Picking the right business modelPicking the right business model
Picking the right business model
 
Ruby 1.9 And Rails 3.0
Ruby 1.9 And Rails 3.0Ruby 1.9 And Rails 3.0
Ruby 1.9 And Rails 3.0
 
Prawn
PrawnPrawn
Prawn
 
World AIDS Day
World AIDS DayWorld AIDS Day
World AIDS Day
 
At the Crossroads
At the CrossroadsAt the Crossroads
At the Crossroads
 
Wellspiration 6 - Fighting Heart Disease Naturally
Wellspiration 6  - Fighting Heart Disease NaturallyWellspiration 6  - Fighting Heart Disease Naturally
Wellspiration 6 - Fighting Heart Disease Naturally
 
What’S The Plan.Web
What’S The Plan.WebWhat’S The Plan.Web
What’S The Plan.Web
 
mobipoll @ РИФ+КИБ
mobipoll @ РИФ+КИБmobipoll @ РИФ+КИБ
mobipoll @ РИФ+КИБ
 
Save antarctica
Save antarcticaSave antarctica
Save antarctica
 
The Effects of Work Habits Around Agility Through Simulations
The Effects of Work Habits Around Agility Through SimulationsThe Effects of Work Habits Around Agility Through Simulations
The Effects of Work Habits Around Agility Through Simulations
 
【労働者健康福祉機構】平成17年度環境報告書
【労働者健康福祉機構】平成17年度環境報告書【労働者健康福祉機構】平成17年度環境報告書
【労働者健康福祉機構】平成17年度環境報告書
 
Derecho métodos filosofía_musa_majad
Derecho métodos filosofía_musa_majadDerecho métodos filosofía_musa_majad
Derecho métodos filosofía_musa_majad
 
Cd8 y cmv en vih
Cd8 y cmv en vihCd8 y cmv en vih
Cd8 y cmv en vih
 
Leadership: The Warrior's Art
Leadership: The Warrior's ArtLeadership: The Warrior's Art
Leadership: The Warrior's Art
 
Customer development oxford 14.02.2015
Customer development   oxford 14.02.2015Customer development   oxford 14.02.2015
Customer development oxford 14.02.2015
 
Truvada kaletra o maraviroc PrEp
Truvada kaletra o maraviroc PrEpTruvada kaletra o maraviroc PrEp
Truvada kaletra o maraviroc PrEp
 

Similar a Bitmap Indexes for Relational XML Twig Query Processing

Systematic error codes implimentation for matched data encoded 47405
Systematic error codes implimentation for matched data encoded 47405Systematic error codes implimentation for matched data encoded 47405
Systematic error codes implimentation for matched data encoded 47405EditorIJAERD
 
128-Bit Area Efficient Reconfigurable Carry Select Adder
128-Bit Area Efficient Reconfigurable Carry Select Adder 128-Bit Area Efficient Reconfigurable Carry Select Adder
128-Bit Area Efficient Reconfigurable Carry Select Adder ijcisjournal
 
Leakage power optimization for ripple carry adder
Leakage power optimization for ripple carry adder Leakage power optimization for ripple carry adder
Leakage power optimization for ripple carry adder NAVEEN TOKAS
 
Implementation and Comparison of Efficient 16-Bit SQRT CSLA Using Parity Pres...
Implementation and Comparison of Efficient 16-Bit SQRT CSLA Using Parity Pres...Implementation and Comparison of Efficient 16-Bit SQRT CSLA Using Parity Pres...
Implementation and Comparison of Efficient 16-Bit SQRT CSLA Using Parity Pres...IJERA Editor
 
Design of high speed adders for efficient digital design blocks
Design of high speed adders for efficient digital design blocksDesign of high speed adders for efficient digital design blocks
Design of high speed adders for efficient digital design blocksBharath Chary
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdfFrangoCamila
 
Reversed-Trellis Tail-Biting Convolutional Code (RT-TBCC) Decoder Architectur...
Reversed-Trellis Tail-Biting Convolutional Code (RT-TBCC) Decoder Architectur...Reversed-Trellis Tail-Biting Convolutional Code (RT-TBCC) Decoder Architectur...
Reversed-Trellis Tail-Biting Convolutional Code (RT-TBCC) Decoder Architectur...IJECEIAES
 
Grasp the Critical Issues for a Functioning JESD204B Interface
Grasp the Critical Issues for a Functioning JESD204B InterfaceGrasp the Critical Issues for a Functioning JESD204B Interface
Grasp the Critical Issues for a Functioning JESD204B InterfaceAnalog Devices, Inc.
 
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Intel® Software
 
Design and Estimation of delay, power and area for Parallel prefix adders
Design and Estimation of delay, power and area for Parallel prefix addersDesign and Estimation of delay, power and area for Parallel prefix adders
Design and Estimation of delay, power and area for Parallel prefix addersIJERA Editor
 
Evaluation of High Speed and Low Memory Parallel Prefix Adders
Evaluation of High Speed and Low Memory Parallel Prefix AddersEvaluation of High Speed and Low Memory Parallel Prefix Adders
Evaluation of High Speed and Low Memory Parallel Prefix AddersIOSR Journals
 
Boolean algebra
Boolean algebraBoolean algebra
Boolean algebraAswiniT3
 
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...IRJET Journal
 
Introduction number systems and conversion
 Introduction number systems and conversion Introduction number systems and conversion
Introduction number systems and conversionkanyuma jitjumnong
 
Correctness and Performance of Apache Spark SQL with Bogdan Ghit and Nicolas ...
Correctness and Performance of Apache Spark SQL with Bogdan Ghit and Nicolas ...Correctness and Performance of Apache Spark SQL with Bogdan Ghit and Nicolas ...
Correctness and Performance of Apache Spark SQL with Bogdan Ghit and Nicolas ...Databricks
 
Correctness and Performance of Apache Spark SQL
Correctness and Performance of Apache Spark SQLCorrectness and Performance of Apache Spark SQL
Correctness and Performance of Apache Spark SQLNicolas Poggi
 

Similar a Bitmap Indexes for Relational XML Twig Query Processing (20)

Systematic error codes implimentation for matched data encoded 47405
Systematic error codes implimentation for matched data encoded 47405Systematic error codes implimentation for matched data encoded 47405
Systematic error codes implimentation for matched data encoded 47405
 
128-Bit Area Efficient Reconfigurable Carry Select Adder
128-Bit Area Efficient Reconfigurable Carry Select Adder 128-Bit Area Efficient Reconfigurable Carry Select Adder
128-Bit Area Efficient Reconfigurable Carry Select Adder
 
Leakage power optimization for ripple carry adder
Leakage power optimization for ripple carry adder Leakage power optimization for ripple carry adder
Leakage power optimization for ripple carry adder
 
Implementation and Comparison of Efficient 16-Bit SQRT CSLA Using Parity Pres...
Implementation and Comparison of Efficient 16-Bit SQRT CSLA Using Parity Pres...Implementation and Comparison of Efficient 16-Bit SQRT CSLA Using Parity Pres...
Implementation and Comparison of Efficient 16-Bit SQRT CSLA Using Parity Pres...
 
Design of high speed adders for efficient digital design blocks
Design of high speed adders for efficient digital design blocksDesign of high speed adders for efficient digital design blocks
Design of high speed adders for efficient digital design blocks
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdf
 
Reversed-Trellis Tail-Biting Convolutional Code (RT-TBCC) Decoder Architectur...
Reversed-Trellis Tail-Biting Convolutional Code (RT-TBCC) Decoder Architectur...Reversed-Trellis Tail-Biting Convolutional Code (RT-TBCC) Decoder Architectur...
Reversed-Trellis Tail-Biting Convolutional Code (RT-TBCC) Decoder Architectur...
 
Survey on Prefix adders
Survey on Prefix addersSurvey on Prefix adders
Survey on Prefix adders
 
Grasp the Critical Issues for a Functioning JESD204B Interface
Grasp the Critical Issues for a Functioning JESD204B InterfaceGrasp the Critical Issues for a Functioning JESD204B Interface
Grasp the Critical Issues for a Functioning JESD204B Interface
 
dfl
dfldfl
dfl
 
Kailash(13EC35032)_mtp.pptx
Kailash(13EC35032)_mtp.pptxKailash(13EC35032)_mtp.pptx
Kailash(13EC35032)_mtp.pptx
 
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
 
Design and Estimation of delay, power and area for Parallel prefix adders
Design and Estimation of delay, power and area for Parallel prefix addersDesign and Estimation of delay, power and area for Parallel prefix adders
Design and Estimation of delay, power and area for Parallel prefix adders
 
Evaluation of High Speed and Low Memory Parallel Prefix Adders
Evaluation of High Speed and Low Memory Parallel Prefix AddersEvaluation of High Speed and Low Memory Parallel Prefix Adders
Evaluation of High Speed and Low Memory Parallel Prefix Adders
 
Boolean algebra
Boolean algebraBoolean algebra
Boolean algebra
 
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
 
Introduction number systems and conversion
 Introduction number systems and conversion Introduction number systems and conversion
Introduction number systems and conversion
 
Correctness and Performance of Apache Spark SQL with Bogdan Ghit and Nicolas ...
Correctness and Performance of Apache Spark SQL with Bogdan Ghit and Nicolas ...Correctness and Performance of Apache Spark SQL with Bogdan Ghit and Nicolas ...
Correctness and Performance of Apache Spark SQL with Bogdan Ghit and Nicolas ...
 
Correctness and Performance of Apache Spark SQL
Correctness and Performance of Apache Spark SQLCorrectness and Performance of Apache Spark SQL
Correctness and Performance of Apache Spark SQL
 
E010422834
E010422834E010422834
E010422834
 

Más de Kyong-Ha Lee

SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...
SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...
SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...Kyong-Ha Lee
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceKyong-Ha Lee
 
좋은 논문 찾기
좋은 논문 찾기좋은 논문 찾기
좋은 논문 찾기Kyong-Ha Lee
 
A poster version of HadoopXML
A poster version of HadoopXMLA poster version of HadoopXML
A poster version of HadoopXMLKyong-Ha Lee
 
HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...
HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...
HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...Kyong-Ha Lee
 
KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.Kyong-Ha Lee
 
MapReduce: A useful parallel tool that still has room for improvement
MapReduce: A useful parallel tool that still has room for improvementMapReduce: A useful parallel tool that still has room for improvement
MapReduce: A useful parallel tool that still has room for improvementKyong-Ha Lee
 
Parallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyParallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyKyong-Ha Lee
 
Database Research on Modern Computing Architecture
Database Research on Modern Computing ArchitectureDatabase Research on Modern Computing Architecture
Database Research on Modern Computing ArchitectureKyong-Ha Lee
 
Bitmap Indexes for Relational XML Twig Query Processing
Bitmap Indexes for Relational XML Twig Query ProcessingBitmap Indexes for Relational XML Twig Query Processing
Bitmap Indexes for Relational XML Twig Query ProcessingKyong-Ha Lee
 

Más de Kyong-Ha Lee (10)

SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...
SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...
SASUM: A Sharing-based Approach to Fast Approximate Subgraph Matching for Lar...
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduce
 
좋은 논문 찾기
좋은 논문 찾기좋은 논문 찾기
좋은 논문 찾기
 
A poster version of HadoopXML
A poster version of HadoopXMLA poster version of HadoopXML
A poster version of HadoopXML
 
HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...
HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...
HadoopXML: A Suite for Parallel Processing of Massive XML Data with Multiple ...
 
KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.
 
MapReduce: A useful parallel tool that still has room for improvement
MapReduce: A useful parallel tool that still has room for improvementMapReduce: A useful parallel tool that still has room for improvement
MapReduce: A useful parallel tool that still has room for improvement
 
Parallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A SurveyParallel Data Processing with MapReduce: A Survey
Parallel Data Processing with MapReduce: A Survey
 
Database Research on Modern Computing Architecture
Database Research on Modern Computing ArchitectureDatabase Research on Modern Computing Architecture
Database Research on Modern Computing Architecture
 
Bitmap Indexes for Relational XML Twig Query Processing
Bitmap Indexes for Relational XML Twig Query ProcessingBitmap Indexes for Relational XML Twig Query Processing
Bitmap Indexes for Relational XML Twig Query Processing
 

Último

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 

Último (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 

Bitmap Indexes for Relational XML Twig Query Processing

  • 1. Kyong-Ha Lee and Bongki Moon The University of Arizona Bitmap Indexes For Relational XML Twig Query Processing
  • 2. CIKM'09, Hong Kong 2 XML Data and Queries a1 0 (1, 32,1) <a> <a> <b>t1</b> <c> <d>t2</d> <e>t3</e> </c> </a> <a> <b> <e>t4</e> </b> <d> <c>t5</c> </d> </a> . . . . . </a> a2 a3 a4 1 6 11 (2,11,2) (12,21,2) (22,31,2) 9 c1 b1 7 12 15 2 3 d2 b2 e3 b3 (13,16,3) (17,20,3) (23,28,3) (29,30,3) (5,10,3) (3,4,3) 10 c2 e2 d3 d1 e1 8 c3 13 14 4 5 (26,27,4) (24,25,4) (18,19,4) (6,7,4) (8,9,4) (14,15,4) //A/B/C //A[//B]//C //A[./B/C]//E A A A B C E B B C C
  • 3. CIKM'09, Hong Kong 3 XML Stored in RDB NODE table PATH table . . . . . . . . .
  • 4. To answer a twig query A twig pattern is decomposed into several path patterns. Path solutions are joined together to compose a final result. Holistic Twig Join(HTJ) algorithm Specialized multi-way& sort-merge join guarantees I/O optimality for a certain subset of XML query. The optimality depends on how the elements are partitioned. uses stacks and streams in which elements are sorted in an order. CIKM'09, Hong Kong 4 Twig Join A A E B C SA A A SE SB B E SC C Stacks Streams
  • 5. Discrepancy between XML in RDB and conventional HTJ algorithms Logical: Streams vs. Table Physical: partitioned vs. record-oriented Supporting actual data including a large volume of texts requires references to records. How to feed tuples to HTJ algorithm? What’s the best partitioning scheme for XML stored in RDB? Bitmap index, a conventional index in RDBMS An efficient way to indicate tuples. Efficient support for logical operations Can we use the bitmap index for supporting HTJ? CIKM'09, Hong Kong 5 Motivation
  • 6. Tag-based partitioning Simple, and skipping technique can be used to read useful elements only. For a query node, only one stream is accessed Tag+Level partitioning More I/O optimality, suitable for deep XML Some streams may be accessed for a single query node Path-based partitioning More I/O optimality, suitable for shallow XML A path with //-axes may require accessing many streams for a single query node CIKM'09, Hong Kong 6 HTJ on Different Partitioning Schemes
  • 7. CIKM'09, Hong Kong 7 Bitmap Index How to partition tuples in NODE table By building a bitmap index on certain column(s) in the table. bitTag for tagName, bitTag+ for (tagName, Level), bitPath for pathId column Determines I/O optimality of holistic twig join algorithms. During twig join process, useful tuples are accessed via the bitmap index. A B E . . . 110000 1 0 0 0 0010000100 0000010000 Bit-vectors . . . disk blocks
  • 8. bitAnc : A bit-vector represents terminal elements corr. to a certain path and all their ancestors. bitDesc: A bit–vector represents terminal elements corr. to a certain path and all their descendants. CIKM'09, Hong Kong 8 Additional Indexes a1 0 a2 a3 a4 1 6 11 b1 2 7 12 b2 b3 14 e2 d3 8 c3 13 A subtree covered by the left 3 bit-vectors bitPath,bitAnc, andbitDescfor PathId=2, i.e. /A/A/B
  • 9. Basic index Bit-vectors are built on a single column or a group of columns Requires labeled values, and reading records Hybrid index A Combination of two different indexes descTag : bitDesc & bitTag bitTwig : bitPath & bitAnc does not require labeled values to compute twig solution CIKM'09, Hong Kong 9 Two Types of Indexes
  • 10.
  • 11. Choose the minimum position value among the current 1’s as a current element for a query node Check if 1 exists in an interval, pos(a) and pos(d)? looking-ahead at the next 1 CIKM'09, Hong Kong 11 Advancing Cursors 0 eov P0 : /A P1 : /A/A q : //A (0,0,1) 6 1 Currq Current1 Next1
  • 12. Early detection with a bit-vector absence Condensing query nodes For path-based partition Reduces |INDEX| and |RECORD| Skipping reading obsolete records with advance(k) For tag, (tag, level)-based partition Reduces |RECORD| Moving cursors over compressed bit-vectors with no decompression A composite cursor moving over a bit-vector compressed by run-length encoding scheme Reduces |INDEX| CIKM'09, Hong Kong 12 Optimizations A A E B E C C P: //A/B/C CA = 11 10000000000100000 CB = 4 advance(11) 00001000010000100
  • 13. CIKM'09, Hong Kong 13 Compressed Bit-vector 000100000000100000000000000011 00000000000 . . . 00000000000000 0000000000000000000000000000001 00 (a) An original bit-vector with 8,000 bits 31 bits 2 bits 256* 31 bits 31 bits (b) Grouping as a unit of 31 bits and Merging identical groups 000010…010…011 100… 0100000000 000…001 000…000 Run-length is 256 31 literal bits Remaining word Uncompressed word Compressed word (c) Encoding each group as 1 word (4byte on a 32-bit machine) Cursor C ={ C.position, //Integer position value (Logical address) C. word, // The current word C is located at. C.bit, // The position of the bit C is visiting, in C.word C. rest } //The bit position in the remaining word
  • 14. CIKM'09, Hong Kong 14 Moving A Cursor over A Compressed Bit-vector a) Get the position of the next 1 C = {31, 0, 31,0} Skip to examine 31* 256 bits C={7998, 2, 31, 0} 000010…010…011 100… 0100000000 000…001 000…000 Remaining word Run-length is 256 b) Check a bit value at the position 3,000 C = {31, 0, 31,0} with distance to move, 2,869=(3000-31) Since 31* 256 > 2,869, The bit we find is within the word 1. 000010…010…011 100… 0100000000 000…001 000…000
  • 15. CIKM'09, Hong Kong 15 Experiments Datasets Synthetic : XMark Real : DBLP, Treebank, Swiss-prot Query sets
  • 16.
  • 17. # of distinct tag names are not much different
  • 18.
  • 19. CIKM'09, Hong Kong 17 Query Execution Time
  • 20. CIKM'09, Hong Kong 18 Input Data Size
  • 21. Merging used bit-vectors for a path pattern with //-axes and putting it into a bitmap index for the next time for a given path //A//B, P:/A/A/B P:/A/B acts like a pre-computed join index A path pattern with //-axes can be represented by a single bit-vector. Logical operations: OR, NOT are simply supported by bitwise-logical operations: &, |, ^ CIKM'09, Hong Kong 19 Other Features on bitPath
  • 22. CIKM'09, Hong Kong 20 Twig Queries with Logical Operations P//A, P//A//B//X ≡P//A//B//C V P//A//B//D , P//A//E A A A A B E B E X (C|D) //A[./B/C or ./B/D]//E P//A , P//A//E , P//A/B ⓧ(P//A/B ⊙A//A/B/C) A A A A A B B E E B C ¬ C //A[./B/not(C)]//E
  • 23. We investigated the possibilities of bitmap indexes for XML query processing Partitioning XML stored in RDB in various ways Cursor movements do not require decompression of bit-vectors We devised a way to identify element relationship with only bitmap index, bitTwig Our experiments showed that bitTwig was best for queries against shallow XML documents For deep XML documents, bitTag/w advance(k) showed the best performance. Future work: evaluating our system with more HTJ algorithms and other indexes CIKM'09, Hong Kong 21 Conclusions