SlideShare a Scribd company logo
1 of 21
Download to read offline
Designing Aggregates

Ing. Julio Ernesto Carreño Vargas
Designing Aggregates
   Once you have chosen dimensional aggregates, they must
    be designed and documented. This is the point of greatest
    risk for aggregate implementation.




       2
Definig The Base Schema




3
The Base Schema
   Declaration of grain is an essential part of schema design.
    Proper definition of grain not only enables the future
    identification of aggregates, it is crucial to the success of
    the base schema itself.




        4
Rollup dimensions
   Conforming rollup dimensions and their natural keys




       5
Rollup dimensions
   Rollup dimensions should be sourced from the base
    dimensions, and their attributes must follow the same
    rules for slow change processing.




       6
Hierarchies
   Documenting dimensional hierarchies
    may be important for business
    intelligence software and database
    features such as materialized views and
    materialized query tables.
   The hierarchies identify potential
    aggregation points and can aid in
    estimating degree of summarization.




        7
Housekeeping Columns
   they are present for a purely technical reason




        8
Design Principles for the Aggregate
                               Schema




9
A Separate Star for Each Aggregation
   Dimensional aggregates should be stored in separate
    tables for each aggregation.




       10
A Separate Star for Each Aggregation
   Do not store different levels of aggregation in the same
    schema. The schema will be capable of providing wrong
    results.




       11
Aggregate facts
   Aggregate facts should be stored in separate tables for
    each level of aggregation. These may be separate
    aggregate fact tables or separate prejoined aggregate
    tables




       12
Naming Conventions
   Facts and dimensional attributes should receive the same
    name in anaggregate schema as they do in the base
    schema.
   The name of an aggregate dimension table should
    describe the contents of its rows.
   The names of aggregate fact tables are always
    problematic. The best you can do is establish a convention
    and stick to it.




       13
Aggregate Dimension Design
   Attributes of the aggregate dimension must be identical
    to those in the base dimension in name and data type.
    Slow change processing rules must be identical. The
    natural key of an aggregate dimension will be different
    from the base dimension.
   Source aggregate dimensions from the base dimension,
    rather than the original source system. This eliminates
    redundant processing, and ensures uniform presentation
    of data values.




       14
Aggregate Dimension Design
   Aggregate dimension tables are often shared by multiple
    aggregates, and sometimes used by base fact tables. These
    shared dimension tables do not need to be built
    redundantly; the various fact tables can use the same
    dimension table. If the shared table is to be instantiated
    more than once, build it a single time and then replicate
    it.
       The documentation for a shared dimension must enumerate all
        dependent fact tables, whether part of the base schema or
        aggregates. In some cases, frequent updates to a dimension may
        require updates to fact tables outside their normal load
        windows.

         15
Aggregate Fact Table Design
   Aggregate Facts: Names and Data Types
       The aggregate fact should have the same business definition and
        column name as the base fact
       Unlike dimensional attributes, the aggregate fact may have a different
        data type than its counterpart in the base schema
   No New Facts, Including Counts
       Counts cannot be accurately performed against aggregate schemas,
        even if all attributes are the same. All counts must be performed
        against the base schema.
       As a general rule of thumb, the only count to be added to an
        aggregate should show the number of base rows summarized. If this
        fact is added to the aggregate, it should also appear in the base fact
        table with a constant value of 1. Counts of any other attribute should
        be directed to the base schema only.


         16
Aggregate Fact Table Design
   Audit Dimension:
       The audit record associated with a row in the aggregate fact
        table does not summarize the audit data associated with the
        base fact table. It describes the process by which the aggregate
        row was inserted or updated.
   Sourcing Aggregate Fact Tables
       Facts will be sourced from the base fact table and aggregated
        by the load process as appropriate.




         17
Documenting the Aggregate Schema




18
Documenting the Aggregate Schema
   Identify Schema Families
   Identify Dimensional Conformance




       19
Documenting the Aggregate Schema
   Documenting Aggregate Dimension Tables
   Documenting Aggregate Fact Tables




       20
Bibliografía
   Mastering Data Warehouse Aggregates.Solutions for Star
    Schema Performance. Christopher Adamson.




       21

More Related Content

What's hot

Schemas for multidimensional databases
Schemas for multidimensional databasesSchemas for multidimensional databases
Schemas for multidimensional databasesyazad dumasia
 
BW Multi-Dimensional Model
BW Multi-Dimensional ModelBW Multi-Dimensional Model
BW Multi-Dimensional Modelyujesh
 
Multidimensional data models
Multidimensional data  modelsMultidimensional data  models
Multidimensional data models774474
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Shani729
 
080811 Miller Inventory Pilot Study
080811 Miller Inventory Pilot Study080811 Miller Inventory Pilot Study
080811 Miller Inventory Pilot Studyrwmill9716
 
Twp Upgrading 10g To 11g What To Expect From Optimizer
Twp Upgrading 10g To 11g What To Expect From OptimizerTwp Upgrading 10g To 11g What To Expect From Optimizer
Twp Upgrading 10g To 11g What To Expect From Optimizerqiw
 
Dwh lecture 08-denormalization tech
Dwh   lecture 08-denormalization techDwh   lecture 08-denormalization tech
Dwh lecture 08-denormalization techSulman Ahmed
 
Star ,Snow and Fact-Constullation Schemas??
Star ,Snow and  Fact-Constullation Schemas??Star ,Snow and  Fact-Constullation Schemas??
Star ,Snow and Fact-Constullation Schemas??Abdul Aslam
 

What's hot (11)

Schemas for multidimensional databases
Schemas for multidimensional databasesSchemas for multidimensional databases
Schemas for multidimensional databases
 
BW Multi-Dimensional Model
BW Multi-Dimensional ModelBW Multi-Dimensional Model
BW Multi-Dimensional Model
 
Multidimensional data models
Multidimensional data  modelsMultidimensional data  models
Multidimensional data models
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6
 
080811 Miller Inventory Pilot Study
080811 Miller Inventory Pilot Study080811 Miller Inventory Pilot Study
080811 Miller Inventory Pilot Study
 
Twp Upgrading 10g To 11g What To Expect From Optimizer
Twp Upgrading 10g To 11g What To Expect From OptimizerTwp Upgrading 10g To 11g What To Expect From Optimizer
Twp Upgrading 10g To 11g What To Expect From Optimizer
 
Dwh lecture 08-denormalization tech
Dwh   lecture 08-denormalization techDwh   lecture 08-denormalization tech
Dwh lecture 08-denormalization tech
 
Star schema PPT
Star schema PPTStar schema PPT
Star schema PPT
 
Star ,Snow and Fact-Constullation Schemas??
Star ,Snow and  Fact-Constullation Schemas??Star ,Snow and  Fact-Constullation Schemas??
Star ,Snow and Fact-Constullation Schemas??
 
Cs437 lecture 7-8
Cs437 lecture 7-8Cs437 lecture 7-8
Cs437 lecture 7-8
 
Dw concepts
Dw conceptsDw concepts
Dw concepts
 

Viewers also liked

Distributed systems
Distributed systemsDistributed systems
Distributed systemsMudur Alkan
 
Rowena chiu 2 (embedded music)
Rowena chiu 2 (embedded music)Rowena chiu 2 (embedded music)
Rowena chiu 2 (embedded music)S459tan
 
Data Warehouse Solution - EFICAZ
Data Warehouse Solution - EFICAZData Warehouse Solution - EFICAZ
Data Warehouse Solution - EFICAZLera Technologies
 
Online Analytical Processing
Online Analytical  ProcessingOnline Analytical  Processing
Online Analytical ProcessingMudur Alkan
 

Viewers also liked (7)

Creació de l’Associated Press
Creació de l’Associated PressCreació de l’Associated Press
Creació de l’Associated Press
 
Distributed systems
Distributed systemsDistributed systems
Distributed systems
 
Rowena chiu 2 (embedded music)
Rowena chiu 2 (embedded music)Rowena chiu 2 (embedded music)
Rowena chiu 2 (embedded music)
 
Data Warehouse Solution - EFICAZ
Data Warehouse Solution - EFICAZData Warehouse Solution - EFICAZ
Data Warehouse Solution - EFICAZ
 
Online Analytical Processing
Online Analytical  ProcessingOnline Analytical  Processing
Online Analytical Processing
 
USDA
USDAUSDA
USDA
 
3 dw architectures
3 dw architectures3 dw architectures
3 dw architectures
 

Similar to Agreggates ii

Dw design 4_bus_architecture
Dw design 4_bus_architectureDw design 4_bus_architecture
Dw design 4_bus_architectureClaudia Gomez
 
A Gentle Introduction to Microsoft SSAS
A Gentle Introduction to Microsoft SSASA Gentle Introduction to Microsoft SSAS
A Gentle Introduction to Microsoft SSASJohn Paredes
 
Info cube modeling_dimension_design_erada_bw_infoalert
Info cube modeling_dimension_design_erada_bw_infoalertInfo cube modeling_dimension_design_erada_bw_infoalert
Info cube modeling_dimension_design_erada_bw_infoalertPhani Kumar
 
Performance By Design
Performance By DesignPerformance By Design
Performance By DesignGuy Harrison
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
Essbase aso a quick reference guide part i
Essbase aso a quick reference guide part iEssbase aso a quick reference guide part i
Essbase aso a quick reference guide part iAmit Sharma
 
World2016_T1_S8_How to upgrade your cubes from 9.x to 10 and turn on optimize...
World2016_T1_S8_How to upgrade your cubes from 9.x to 10 and turn on optimize...World2016_T1_S8_How to upgrade your cubes from 9.x to 10 and turn on optimize...
World2016_T1_S8_How to upgrade your cubes from 9.x to 10 and turn on optimize...Karthik K Iyengar
 
Obiee interview questions and answers faq
Obiee interview questions and answers faqObiee interview questions and answers faq
Obiee interview questions and answers faqmaheshboggula
 
Bigtable
BigtableBigtable
Bigtableptdorf
 
Maintaining aggregates
Maintaining aggregatesMaintaining aggregates
Maintaining aggregatesSirisha Kumari
 
Cost Based Optimizer - Part 2 of 2
Cost Based Optimizer - Part 2 of 2Cost Based Optimizer - Part 2 of 2
Cost Based Optimizer - Part 2 of 2Mahesh Vallampati
 
Business Analytics 1 Module 4.pdf
Business Analytics 1 Module 4.pdfBusiness Analytics 1 Module 4.pdf
Business Analytics 1 Module 4.pdfJayanti Pande
 
Asset accounting config steps
Asset accounting config stepsAsset accounting config steps
Asset accounting config stepskrishnaKumarK33
 
Asset accounting config steps
Asset accounting config stepsAsset accounting config steps
Asset accounting config stepskrishnaKumarK33
 
LECTURE 7.ppt.pdf
LECTURE 7.ppt.pdfLECTURE 7.ppt.pdf
LECTURE 7.ppt.pdfcikajen791
 
Teradata Aggregate Join Indices And Dimensional Models
Teradata Aggregate Join Indices And Dimensional ModelsTeradata Aggregate Join Indices And Dimensional Models
Teradata Aggregate Join Indices And Dimensional Modelspepeborja
 

Similar to Agreggates ii (20)

Agreggates iii
Agreggates iiiAgreggates iii
Agreggates iii
 
Dw design 4_bus_architecture
Dw design 4_bus_architectureDw design 4_bus_architecture
Dw design 4_bus_architecture
 
A Gentle Introduction to Microsoft SSAS
A Gentle Introduction to Microsoft SSASA Gentle Introduction to Microsoft SSAS
A Gentle Introduction to Microsoft SSAS
 
Info cube modeling_dimension_design_erada_bw_infoalert
Info cube modeling_dimension_design_erada_bw_infoalertInfo cube modeling_dimension_design_erada_bw_infoalert
Info cube modeling_dimension_design_erada_bw_infoalert
 
Performance By Design
Performance By DesignPerformance By Design
Performance By Design
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
Dbms schemas for decision support
Dbms schemas for decision supportDbms schemas for decision support
Dbms schemas for decision support
 
Essbase aso a quick reference guide part i
Essbase aso a quick reference guide part iEssbase aso a quick reference guide part i
Essbase aso a quick reference guide part i
 
World2016_T1_S8_How to upgrade your cubes from 9.x to 10 and turn on optimize...
World2016_T1_S8_How to upgrade your cubes from 9.x to 10 and turn on optimize...World2016_T1_S8_How to upgrade your cubes from 9.x to 10 and turn on optimize...
World2016_T1_S8_How to upgrade your cubes from 9.x to 10 and turn on optimize...
 
Obiee interview questions and answers faq
Obiee interview questions and answers faqObiee interview questions and answers faq
Obiee interview questions and answers faq
 
Bigtable
BigtableBigtable
Bigtable
 
Maintaining aggregates
Maintaining aggregatesMaintaining aggregates
Maintaining aggregates
 
Cost Based Optimizer - Part 2 of 2
Cost Based Optimizer - Part 2 of 2Cost Based Optimizer - Part 2 of 2
Cost Based Optimizer - Part 2 of 2
 
Database aggregation using metadata
Database aggregation using metadataDatabase aggregation using metadata
Database aggregation using metadata
 
Business Analytics 1 Module 4.pdf
Business Analytics 1 Module 4.pdfBusiness Analytics 1 Module 4.pdf
Business Analytics 1 Module 4.pdf
 
Asset accounting config steps
Asset accounting config stepsAsset accounting config steps
Asset accounting config steps
 
Asset accounting config steps
Asset accounting config stepsAsset accounting config steps
Asset accounting config steps
 
LECTURE 7.ppt.pdf
LECTURE 7.ppt.pdfLECTURE 7.ppt.pdf
LECTURE 7.ppt.pdf
 
Teradata Aggregate Join Indices And Dimensional Models
Teradata Aggregate Join Indices And Dimensional ModelsTeradata Aggregate Join Indices And Dimensional Models
Teradata Aggregate Join Indices And Dimensional Models
 
spss Help
spss Helpspss Help
spss Help
 

More from Claudia Gomez

More from Claudia Gomez (19)

Olapsql
OlapsqlOlapsql
Olapsql
 
3 olap storage
3 olap storage3 olap storage
3 olap storage
 
3 olap storage
3 olap storage3 olap storage
3 olap storage
 
2 olap operaciones
2 olap operaciones2 olap operaciones
2 olap operaciones
 
1 introba
1 introba1 introba
1 introba
 
Diseño fisico particiones_3
Diseño fisico particiones_3Diseño fisico particiones_3
Diseño fisico particiones_3
 
Diseño fisico indices_2
Diseño fisico indices_2Diseño fisico indices_2
Diseño fisico indices_2
 
Diseño fisico 1
Diseño fisico 1Diseño fisico 1
Diseño fisico 1
 
Dw design hierarchies_7
Dw design hierarchies_7Dw design hierarchies_7
Dw design hierarchies_7
 
Dw design fact_tables_types_6
Dw design fact_tables_types_6Dw design fact_tables_types_6
Dw design fact_tables_types_6
 
Dw design date_dimension_1_1
Dw design date_dimension_1_1Dw design date_dimension_1_1
Dw design date_dimension_1_1
 
Dw design 3_surro_keys
Dw design 3_surro_keysDw design 3_surro_keys
Dw design 3_surro_keys
 
Dw design 2_conceptual_model
Dw design 2_conceptual_modelDw design 2_conceptual_model
Dw design 2_conceptual_model
 
Dw design 1_dim_facts
Dw design 1_dim_factsDw design 1_dim_facts
Dw design 1_dim_facts
 
2 dw requeriments
2 dw requeriments2 dw requeriments
2 dw requeriments
 
1 dw projectplanning
1 dw projectplanning1 dw projectplanning
1 dw projectplanning
 
0 dw process
0 dw process0 dw process
0 dw process
 
Clase2 introdw
Clase2 introdwClase2 introdw
Clase2 introdw
 
Intro bi
Intro biIntro bi
Intro bi
 

Recently uploaded

Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 

Recently uploaded (20)

Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 

Agreggates ii

  • 1. Designing Aggregates Ing. Julio Ernesto Carreño Vargas
  • 2. Designing Aggregates  Once you have chosen dimensional aggregates, they must be designed and documented. This is the point of greatest risk for aggregate implementation. 2
  • 3. Definig The Base Schema 3
  • 4. The Base Schema  Declaration of grain is an essential part of schema design. Proper definition of grain not only enables the future identification of aggregates, it is crucial to the success of the base schema itself. 4
  • 5. Rollup dimensions  Conforming rollup dimensions and their natural keys 5
  • 6. Rollup dimensions  Rollup dimensions should be sourced from the base dimensions, and their attributes must follow the same rules for slow change processing. 6
  • 7. Hierarchies  Documenting dimensional hierarchies may be important for business intelligence software and database features such as materialized views and materialized query tables.  The hierarchies identify potential aggregation points and can aid in estimating degree of summarization. 7
  • 8. Housekeeping Columns  they are present for a purely technical reason 8
  • 9. Design Principles for the Aggregate Schema 9
  • 10. A Separate Star for Each Aggregation  Dimensional aggregates should be stored in separate tables for each aggregation. 10
  • 11. A Separate Star for Each Aggregation  Do not store different levels of aggregation in the same schema. The schema will be capable of providing wrong results. 11
  • 12. Aggregate facts  Aggregate facts should be stored in separate tables for each level of aggregation. These may be separate aggregate fact tables or separate prejoined aggregate tables 12
  • 13. Naming Conventions  Facts and dimensional attributes should receive the same name in anaggregate schema as they do in the base schema.  The name of an aggregate dimension table should describe the contents of its rows.  The names of aggregate fact tables are always problematic. The best you can do is establish a convention and stick to it. 13
  • 14. Aggregate Dimension Design  Attributes of the aggregate dimension must be identical to those in the base dimension in name and data type. Slow change processing rules must be identical. The natural key of an aggregate dimension will be different from the base dimension.  Source aggregate dimensions from the base dimension, rather than the original source system. This eliminates redundant processing, and ensures uniform presentation of data values. 14
  • 15. Aggregate Dimension Design  Aggregate dimension tables are often shared by multiple aggregates, and sometimes used by base fact tables. These shared dimension tables do not need to be built redundantly; the various fact tables can use the same dimension table. If the shared table is to be instantiated more than once, build it a single time and then replicate it.  The documentation for a shared dimension must enumerate all dependent fact tables, whether part of the base schema or aggregates. In some cases, frequent updates to a dimension may require updates to fact tables outside their normal load windows. 15
  • 16. Aggregate Fact Table Design  Aggregate Facts: Names and Data Types  The aggregate fact should have the same business definition and column name as the base fact  Unlike dimensional attributes, the aggregate fact may have a different data type than its counterpart in the base schema  No New Facts, Including Counts  Counts cannot be accurately performed against aggregate schemas, even if all attributes are the same. All counts must be performed against the base schema.  As a general rule of thumb, the only count to be added to an aggregate should show the number of base rows summarized. If this fact is added to the aggregate, it should also appear in the base fact table with a constant value of 1. Counts of any other attribute should be directed to the base schema only. 16
  • 17. Aggregate Fact Table Design  Audit Dimension:  The audit record associated with a row in the aggregate fact table does not summarize the audit data associated with the base fact table. It describes the process by which the aggregate row was inserted or updated.  Sourcing Aggregate Fact Tables  Facts will be sourced from the base fact table and aggregated by the load process as appropriate. 17
  • 19. Documenting the Aggregate Schema  Identify Schema Families  Identify Dimensional Conformance 19
  • 20. Documenting the Aggregate Schema  Documenting Aggregate Dimension Tables  Documenting Aggregate Fact Tables 20
  • 21. Bibliografía  Mastering Data Warehouse Aggregates.Solutions for Star Schema Performance. Christopher Adamson. 21