SlideShare una empresa de Scribd logo
1 de 51
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Databases are developed on the IDEA that
DATA is one of the critical materials of the
Information Age
 Information, which is created by data,
becomes the bases for decision making
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Created to facilitate the decision making
process
 So much information that it is difficult to
extract it all from a traditional database
 Need for a more comprehensive data storage
facility
 Data Warehouse
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Extract Information from data to use as
the basis for decision making
 Used at all levels of the Organization
 Tailored to specific business areas
 Interactive
 Ad Hoc queries to retrieve and display
information
 Combines historical operation data with
business activities
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Data Store – The DSS Database
 Business Data
 Business Model Data
 Internal and External Data
 Data Extraction and Filtering
 Extract and validate data from the operational
database and the external data sources
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 End-User Query Tool
 Create Queries that access either the
Operational or the DSS database
 End User Presentation Tools
 Organize and Present the Data
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Operational
 Stored in Normalized Relational Database
 Support transactions that represent daily
operations (Not Query Friendly)
 3 Main Differences
 Time Span
 Granularity
 Dimensionality
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Operational
 Real Time
 Current Transactions
 Short Time Frame
 Specific Data Facts
 DSS
 Historic
 Long Time Frame (Months/Quarters/Years)
 Patterns
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Operational
 Specific Transactions that occur at a given time
 DSS
 Shown at different levels of aggregation
 Different Summary Levels
 Decompose (drill down)
 Summarize (roll up)
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Most distinguishing characteristic of DSS data
 Operational
 Represents atomic transactions
 DSS
 Data is related in Many ways
 Develop the larger picture
 Multi-dimensional view of data
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 DSS Database Scheme
 Support Complex and Non-Normalized data
 Summarized and Aggregate data
 Multiple Relationships
 Queries must extract multi-dimensional time slices
 Redundant Data
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Data Extraction and Filtering
DSS databases are created mainly by extracting
data from operational databases combined
with data imported from external source
 Need for advanced data extraction & filtering tools
 Allow batch / scheduled data extraction
 Support different types of data sources
 Check for inconsistent data / data validation rules
 Support advanced data integration / data formatting
conflicts
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 End User Analytical Interface
Must support advanced data modeling and data
presentation tools
Data analysis tools
Query generation
Must Allow the User to Navigate through the
DSS
 Size Requirements
VERY Large – Terabytes
Advanced Hardware (Multiple processors,
multiple disk arrays, etc.)
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 DSS – friendly data repository for the DSS is
the DATA WAREHOUSE
 Definition: Integrated, Subject-Oriented,
Time-Variant, Nonvolatile database that
provides support for decision making
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 The data warehouse is a centralized,
consolidated database that integrated data
derived from the entire organization
 Multiple Sources
 Diverse Sources
 Diverse Formats
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Data is arranged and optimized to provide
answer to questions from diverse functional
areas
 Data is organized and summarized by topic
 Sales / Marketing / Finance / Distribution / Etc.
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 The Data Warehouse represents the flow of
data through time
 Can contain projected data from statistical
models
 Data is periodically uploaded then time-
dependent data is recomputed
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Once data is entered it is NEVER removed
 Represents the company’s entire history
 Near term history is continually added to it
 Always growing
 Must support terabyte databases and
multiprocessors
 Read-Only database for data analysis and
query processing
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Small Data Stores
 More manageable data sets
 Targeted to meet the needs of small groups
within the organization
 Small, Single-Subject data warehouse subset
that provides decision support to a small
group of people
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Online Analytical Processing Tools
 DSS tools that use multidimensional data
analysis techniques
 Support for a DSS data store
 Data extraction and integration filter
 Specialized presentation interface
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Data Warehouse and Operational
Environments are Separated
 Data is integrated
 Contains historical data over a long period of
time
 Data is a snapshot data captured at a given
point in time
 Data is subject-oriented
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Mainly read-only with periodic batch updates
 Development Life Cycle has a data driven
approach versus the traditional process-
driven approach
 Data contains several levels of detail
 Current, Old, Lightly Summarized, Highly
Summarized
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Environment is characterized by Read-
only transactions to very large data sets
 System that traces data sources,
transformations, and storage
 Metadata is a critical component
Source, transformation, integration, storage,
relationships, history, etc
 Contains a chargeback mechanism for
resource usage that enforces optimal use
of data by end users
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Need for More Intensive Decision Support
 4 Main Characteristics
 Multidimensional data analysis
 Advanced Database Support
 Easy-to-use end-user interfaces
 Support Client/Server architecture
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Advanced Data Presentation Functions
 3-D graphics, Pivot Tables, Crosstabs, etc.
 Compatible with Spreadsheets & Statistical
packages
 Advanced data aggregations, consolidation and
classification across time dimensions
 Advanced computational functions
 Advanced data modeling functions
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Advanced Data Access Features
 Access to many kinds of DBMS’s, flat files, and
internal and external data sources
 Access to aggregated data warehouse data
 Advanced data navigation (drill-downs and roll-
ups)
 Ability to map end-user requests to the
appropriate data source
 Support for Very Large Databases
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Graphical User Interfaces
 Much more useful if access is kept simple
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Framework for the new systems to be
designed, developed and implemented
 Divide the OLAP system into several
components that define its architecture
 Same Computer
 Distributed among several computer
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 3 Main Modules
 GUI
 Analytical Processing Logic
 Data-processing Logic
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
OLAP Client/ServerOLAP Client/Server
ArchitectureArchitecture
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Relational Online Analytical Processing
 OLAP functionality using relational database and
familiar query tools to store and analyze
multidimensional data
 Multidimensional data schema support
 Data access language & query performance
for multidimensional data
 Support for Very Large Databases
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Decision Support Data tends to be
 Nonnormalized
 Duplicated
 Preaggregated
 Star Schema
 Special Design technique for multidimensional
data representations
 Optimize data query operations instead of data
update operations
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Data Modeling Technique to map
multidimensional decision support data into
a relational database
 Current Relational modeling techniques do
not serve the needs of advanced data
requirements
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 4 Components
 Facts
 Dimensions
 Attributes
 Attribute Hierarchies
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Numeric measurements (values) that
represent a specific business aspect or
activity
 Stored in a fact table at the center of the
star scheme
 Contains facts that are linked through
their dimensions
 Can be computed or derived at run time
 Updated periodically with data from
operational databases
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Qualifying characteristics that provide
additional perspectives to a given fact
 DSS data is almost always viewed in relation to
other data
 Dimensions are normally stored in dimension
tables
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Dimension Tables contain Attributes
 Attributes are used to search, filter, or
classify facts
 Dimensions provide descriptive
characteristics about the facts through
their attributed
 Must define common business attributes
that will be used to narrow a search,
group information, or describe
dimensions. (ex.: Time / Location /
Product)
 No mathematical limit to the number of
dimensions (3-D makes it easy to model)
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Provides a Top-Down data organization
 Aggregation
 Drill-down / Roll-Up data analysis
 Attributes from different dimensions can be
grouped to form a hierarchy
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
Fact Table
Dimension
Tables
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Fact and Dimensions are represented by
physical tables in the data warehouse
database
 Fact tables are related to each dimension
table in a Many to One relationship
(Primary/Foreign Key Relationships)
 Fact Table is related to many dimension
tables
The primary key of the fact table is a
composite primary key from the dimension
tables
 Each fact table is designed to answer a
specific DSS question
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 The fact table is always the larges table in
the star schema
 Each dimension record is related to thousand
of fact records
 Star Schema facilitated data retrieval
functions
 DBMS first searches the Dimension Tables
before the larger fact table
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 An Active Decision Support Framework
 Not a Static Database
 Always a Work in Process
 Complete Infrastructure for Company-Wide
decision support
 Hardware / Software / People / Procedures /
Data
 Data Warehouse is a critical component of the
Modern DSS – But not the Only critical component
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Discover Previously unknown data
characteristics, relationships, dependencies,
or trends
 Typical Data Analysis Relies on end users
 Define the Problem
 Select the Data
 Initial the Data Analysis
 Reacts to External Stimulus
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Proactive
 Automatically searches
Anomalies
Possible Relationships
Identify Problems before the end-user
 Data Mining tools analyze the data,
uncover problems or opportunities hidden
in data relationships, form computer
models based on their findings, and then
user the models to predict business
behavior – with minimal end-user
intervention
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 A methodology designed to perform
knowledge-discovery expeditions over the
database data with minimal end-user
intervention
 3 Stages of Data
 Data
 Information
 Knowledge
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Data Preparation
 Identify the main data sets to be used by the
data mining operation (usually the data
warehouse)
 Data Analysis and Classification
 Study the data to identify common data
characteristics or patterns
 Data groupings, classifications, clusters, sequences
 Data dependencies, links, or relationships
 Data patterns, trends, deviation
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Knowledge Acquisition
 Uses the Results of the Data Analysis and Classification
phase
 Data mining tool selects the appropriate modeling or
knowledge-acquisition algorithms
 Neural Networks
 Decision Trees
 Rules Induction
 Genetic algorithms
 Memory-Based Reasoning
 Prognosis
 Predict Future Behavior
 Forecast Business Outcomes
 65% of customers who did not use a particular credit card in
the last 6 months are 88% likely to cancel the account.
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
 Still a New Technique
 May find many Unmeaningful Relationships
 Good at finding Practical Relationships
 Define Customer Buying Patterns
 Improve Product Development and Acceptance
 Etc.
 Potential of becoming the next frontier in
database development
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in
+919892900103 | info@vibranttechnologies.co.in | www.
Vibranttechnologies.co.in

Más contenido relacionado

La actualidad más candente

Big Data in Manufacturing Final PPT
Big Data in Manufacturing Final PPTBig Data in Manufacturing Final PPT
Big Data in Manufacturing Final PPT
Nikhil Atkuri
 

La actualidad más candente (20)

Harmonize Your Enterprise Processes with Product Master Data Management Solut...
Harmonize Your Enterprise Processes with Product Master Data Management Solut...Harmonize Your Enterprise Processes with Product Master Data Management Solut...
Harmonize Your Enterprise Processes with Product Master Data Management Solut...
 
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
 
Crowdsourcing Data Governance
Crowdsourcing Data GovernanceCrowdsourcing Data Governance
Crowdsourcing Data Governance
 
Neo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in GraphdatenbankenNeo4j GraphTalks - Einführung in Graphdatenbanken
Neo4j GraphTalks - Einführung in Graphdatenbanken
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 
A technical Introduction to Big Data Analytics
A technical Introduction to Big Data AnalyticsA technical Introduction to Big Data Analytics
A technical Introduction to Big Data Analytics
 
Enterprise Architecture - An Introduction
Enterprise Architecture - An Introduction Enterprise Architecture - An Introduction
Enterprise Architecture - An Introduction
 
Applying Big Data
Applying Big DataApplying Big Data
Applying Big Data
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Big data
Big dataBig data
Big data
 
How businesses can benefit from privacy preserving synthetic data
How businesses can benefit from privacy preserving synthetic dataHow businesses can benefit from privacy preserving synthetic data
How businesses can benefit from privacy preserving synthetic data
 
Big Data in Manufacturing Final PPT
Big Data in Manufacturing Final PPTBig Data in Manufacturing Final PPT
Big Data in Manufacturing Final PPT
 
Data Analysis in Manufacturing Application to Steel Industry
Data Analysis in Manufacturing Application to Steel IndustryData Analysis in Manufacturing Application to Steel Industry
Data Analysis in Manufacturing Application to Steel Industry
 
Data warehousev2.1
Data warehousev2.1Data warehousev2.1
Data warehousev2.1
 
Estimating the Total Costs of Your Cloud Analytics Platform 
Estimating the Total Costs of Your Cloud Analytics Platform Estimating the Total Costs of Your Cloud Analytics Platform 
Estimating the Total Costs of Your Cloud Analytics Platform 
 
The opportunity of the business data lake
The opportunity of the business data lakeThe opportunity of the business data lake
The opportunity of the business data lake
 
United Technologies, Hands On Reference Data Management For Corporate Finance...
United Technologies, Hands On Reference Data Management For Corporate Finance...United Technologies, Hands On Reference Data Management For Corporate Finance...
United Technologies, Hands On Reference Data Management For Corporate Finance...
 
Foundational Strategies for Trust in Big Data Part 3: Data Lineage
Foundational Strategies for Trust in Big Data Part 3: Data LineageFoundational Strategies for Trust in Big Data Part 3: Data Lineage
Foundational Strategies for Trust in Big Data Part 3: Data Lineage
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
 
Why You Need to Govern Big Data
Why You Need to Govern Big DataWhy You Need to Govern Big Data
Why You Need to Govern Big Data
 

Similar a Corporate-data-warehousing-training

Thought leadership Oct2015 selfserve
Thought leadership Oct2015 selfserveThought leadership Oct2015 selfserve
Thought leadership Oct2015 selfserve
Ron Krzoska
 
solution-brief-tibco-mdm-platform
solution-brief-tibco-mdm-platformsolution-brief-tibco-mdm-platform
solution-brief-tibco-mdm-platform
Matt Jaworovich
 
Running head Database and Data Warehousing design1Database and.docx
Running head Database and Data Warehousing design1Database and.docxRunning head Database and Data Warehousing design1Database and.docx
Running head Database and Data Warehousing design1Database and.docx
healdkathaleen
 
Running head Database and Data Warehousing design1Database and.docx
Running head Database and Data Warehousing design1Database and.docxRunning head Database and Data Warehousing design1Database and.docx
Running head Database and Data Warehousing design1Database and.docx
todd271
 

Similar a Corporate-data-warehousing-training (20)

A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
 
Retail Design
Retail DesignRetail Design
Retail Design
 
Making the Case for Legacy Data in Modern Data Analytics Platforms
Making the Case for Legacy Data in Modern Data Analytics PlatformsMaking the Case for Legacy Data in Modern Data Analytics Platforms
Making the Case for Legacy Data in Modern Data Analytics Platforms
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Why Data Virtualization? An Introduction
Why Data Virtualization? An IntroductionWhy Data Virtualization? An Introduction
Why Data Virtualization? An Introduction
 
Accelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationAccelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data Virtualization
 
3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio
 
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
TIBCO Innovation Workshop Series: Reducing Decision Latency with Streaming An...
 
Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization (US)Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization (US)
 
Connecting Silos in Real Time with Data Virtualization
Connecting Silos in Real Time with Data VirtualizationConnecting Silos in Real Time with Data Virtualization
Connecting Silos in Real Time with Data Virtualization
 
Thought leadership Oct2015 selfserve
Thought leadership Oct2015 selfserveThought leadership Oct2015 selfserve
Thought leadership Oct2015 selfserve
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit UnionsWebinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
solution-brief-tibco-mdm-platform
solution-brief-tibco-mdm-platformsolution-brief-tibco-mdm-platform
solution-brief-tibco-mdm-platform
 
Cloud Data Integration Best Practices
Cloud Data Integration Best PracticesCloud Data Integration Best Practices
Cloud Data Integration Best Practices
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
 
Running head Database and Data Warehousing design1Database and.docx
Running head Database and Data Warehousing design1Database and.docxRunning head Database and Data Warehousing design1Database and.docx
Running head Database and Data Warehousing design1Database and.docx
 
Running head Database and Data Warehousing design1Database and.docx
Running head Database and Data Warehousing design1Database and.docxRunning head Database and Data Warehousing design1Database and.docx
Running head Database and Data Warehousing design1Database and.docx
 

Más de Unmesh Baile

Más de Unmesh Baile (20)

java-corporate-training-institute-in-mumbai
java-corporate-training-institute-in-mumbaijava-corporate-training-institute-in-mumbai
java-corporate-training-institute-in-mumbai
 
Php mysql training-in-mumbai
Php mysql training-in-mumbaiPhp mysql training-in-mumbai
Php mysql training-in-mumbai
 
Java course-in-mumbai
Java course-in-mumbaiJava course-in-mumbai
Java course-in-mumbai
 
Robotics corporate-training-in-mumbai
Robotics corporate-training-in-mumbaiRobotics corporate-training-in-mumbai
Robotics corporate-training-in-mumbai
 
Corporate-training-for-msbi-course-in-mumbai
Corporate-training-for-msbi-course-in-mumbaiCorporate-training-for-msbi-course-in-mumbai
Corporate-training-for-msbi-course-in-mumbai
 
Linux corporate-training-in-mumbai
Linux corporate-training-in-mumbaiLinux corporate-training-in-mumbai
Linux corporate-training-in-mumbai
 
Professional dataware-housing-training-in-mumbai
Professional dataware-housing-training-in-mumbaiProfessional dataware-housing-training-in-mumbai
Professional dataware-housing-training-in-mumbai
 
Best-embedded-corporate-training-in-mumbai
Best-embedded-corporate-training-in-mumbaiBest-embedded-corporate-training-in-mumbai
Best-embedded-corporate-training-in-mumbai
 
Selenium-corporate-training-in-mumbai
Selenium-corporate-training-in-mumbaiSelenium-corporate-training-in-mumbai
Selenium-corporate-training-in-mumbai
 
Weblogic-clustering-failover-and-load-balancing-training
Weblogic-clustering-failover-and-load-balancing-trainingWeblogic-clustering-failover-and-load-balancing-training
Weblogic-clustering-failover-and-load-balancing-training
 
Advance-excel-professional-trainer-in-mumbai
Advance-excel-professional-trainer-in-mumbaiAdvance-excel-professional-trainer-in-mumbai
Advance-excel-professional-trainer-in-mumbai
 
Best corporate-r-programming-training-in-mumbai
Best corporate-r-programming-training-in-mumbaiBest corporate-r-programming-training-in-mumbai
Best corporate-r-programming-training-in-mumbai
 
R-programming-training-in-mumbai
R-programming-training-in-mumbaiR-programming-training-in-mumbai
R-programming-training-in-mumbai
 
Sas-training-in-mumbai
Sas-training-in-mumbaiSas-training-in-mumbai
Sas-training-in-mumbai
 
Microsoft-business-intelligence-training-in-mumbai
Microsoft-business-intelligence-training-in-mumbaiMicrosoft-business-intelligence-training-in-mumbai
Microsoft-business-intelligence-training-in-mumbai
 
Linux-training-for-beginners-in-mumbai
Linux-training-for-beginners-in-mumbaiLinux-training-for-beginners-in-mumbai
Linux-training-for-beginners-in-mumbai
 
Corporate-informatica-training-in-mumbai
Corporate-informatica-training-in-mumbaiCorporate-informatica-training-in-mumbai
Corporate-informatica-training-in-mumbai
 
Corporate-informatica-training-in-mumbai
Corporate-informatica-training-in-mumbaiCorporate-informatica-training-in-mumbai
Corporate-informatica-training-in-mumbai
 
Best-robotics-training-in-mumbai
Best-robotics-training-in-mumbaiBest-robotics-training-in-mumbai
Best-robotics-training-in-mumbai
 
Best-embedded-system-classes-in-mumbai
Best-embedded-system-classes-in-mumbaiBest-embedded-system-classes-in-mumbai
Best-embedded-system-classes-in-mumbai
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Último (20)

Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Corporate-data-warehousing-training

  • 1. +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 2.  Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age  Information, which is created by data, becomes the bases for decision making +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 3.  Created to facilitate the decision making process  So much information that it is difficult to extract it all from a traditional database  Need for a more comprehensive data storage facility  Data Warehouse +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 4.  Extract Information from data to use as the basis for decision making  Used at all levels of the Organization  Tailored to specific business areas  Interactive  Ad Hoc queries to retrieve and display information  Combines historical operation data with business activities +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 5.  Data Store – The DSS Database  Business Data  Business Model Data  Internal and External Data  Data Extraction and Filtering  Extract and validate data from the operational database and the external data sources +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 6.  End-User Query Tool  Create Queries that access either the Operational or the DSS database  End User Presentation Tools  Organize and Present the Data +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 7.  Operational  Stored in Normalized Relational Database  Support transactions that represent daily operations (Not Query Friendly)  3 Main Differences  Time Span  Granularity  Dimensionality +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 8.  Operational  Real Time  Current Transactions  Short Time Frame  Specific Data Facts  DSS  Historic  Long Time Frame (Months/Quarters/Years)  Patterns +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 9.  Operational  Specific Transactions that occur at a given time  DSS  Shown at different levels of aggregation  Different Summary Levels  Decompose (drill down)  Summarize (roll up) +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 10.  Most distinguishing characteristic of DSS data  Operational  Represents atomic transactions  DSS  Data is related in Many ways  Develop the larger picture  Multi-dimensional view of data +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 11.  DSS Database Scheme  Support Complex and Non-Normalized data  Summarized and Aggregate data  Multiple Relationships  Queries must extract multi-dimensional time slices  Redundant Data +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 12.  Data Extraction and Filtering DSS databases are created mainly by extracting data from operational databases combined with data imported from external source  Need for advanced data extraction & filtering tools  Allow batch / scheduled data extraction  Support different types of data sources  Check for inconsistent data / data validation rules  Support advanced data integration / data formatting conflicts +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 13.  End User Analytical Interface Must support advanced data modeling and data presentation tools Data analysis tools Query generation Must Allow the User to Navigate through the DSS  Size Requirements VERY Large – Terabytes Advanced Hardware (Multiple processors, multiple disk arrays, etc.) +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 14.  DSS – friendly data repository for the DSS is the DATA WAREHOUSE  Definition: Integrated, Subject-Oriented, Time-Variant, Nonvolatile database that provides support for decision making +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 15.  The data warehouse is a centralized, consolidated database that integrated data derived from the entire organization  Multiple Sources  Diverse Sources  Diverse Formats +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 16.  Data is arranged and optimized to provide answer to questions from diverse functional areas  Data is organized and summarized by topic  Sales / Marketing / Finance / Distribution / Etc. +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 17.  The Data Warehouse represents the flow of data through time  Can contain projected data from statistical models  Data is periodically uploaded then time- dependent data is recomputed +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 18.  Once data is entered it is NEVER removed  Represents the company’s entire history  Near term history is continually added to it  Always growing  Must support terabyte databases and multiprocessors  Read-Only database for data analysis and query processing +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 19.  Small Data Stores  More manageable data sets  Targeted to meet the needs of small groups within the organization  Small, Single-Subject data warehouse subset that provides decision support to a small group of people +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 20.  Online Analytical Processing Tools  DSS tools that use multidimensional data analysis techniques  Support for a DSS data store  Data extraction and integration filter  Specialized presentation interface +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 21.  Data Warehouse and Operational Environments are Separated  Data is integrated  Contains historical data over a long period of time  Data is a snapshot data captured at a given point in time  Data is subject-oriented +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 22.  Mainly read-only with periodic batch updates  Development Life Cycle has a data driven approach versus the traditional process- driven approach  Data contains several levels of detail  Current, Old, Lightly Summarized, Highly Summarized +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 23.  Environment is characterized by Read- only transactions to very large data sets  System that traces data sources, transformations, and storage  Metadata is a critical component Source, transformation, integration, storage, relationships, history, etc  Contains a chargeback mechanism for resource usage that enforces optimal use of data by end users +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 24.  Need for More Intensive Decision Support  4 Main Characteristics  Multidimensional data analysis  Advanced Database Support  Easy-to-use end-user interfaces  Support Client/Server architecture +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 25.  Advanced Data Presentation Functions  3-D graphics, Pivot Tables, Crosstabs, etc.  Compatible with Spreadsheets & Statistical packages  Advanced data aggregations, consolidation and classification across time dimensions  Advanced computational functions  Advanced data modeling functions +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 26.  Advanced Data Access Features  Access to many kinds of DBMS’s, flat files, and internal and external data sources  Access to aggregated data warehouse data  Advanced data navigation (drill-downs and roll- ups)  Ability to map end-user requests to the appropriate data source  Support for Very Large Databases +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 27.  Graphical User Interfaces  Much more useful if access is kept simple +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 28.  Framework for the new systems to be designed, developed and implemented  Divide the OLAP system into several components that define its architecture  Same Computer  Distributed among several computer +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 29.  3 Main Modules  GUI  Analytical Processing Logic  Data-processing Logic +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 30. OLAP Client/ServerOLAP Client/Server ArchitectureArchitecture +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 31.  Relational Online Analytical Processing  OLAP functionality using relational database and familiar query tools to store and analyze multidimensional data  Multidimensional data schema support  Data access language & query performance for multidimensional data  Support for Very Large Databases +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 32.  Decision Support Data tends to be  Nonnormalized  Duplicated  Preaggregated  Star Schema  Special Design technique for multidimensional data representations  Optimize data query operations instead of data update operations +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 33.  Data Modeling Technique to map multidimensional decision support data into a relational database  Current Relational modeling techniques do not serve the needs of advanced data requirements +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 34.  4 Components  Facts  Dimensions  Attributes  Attribute Hierarchies +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 35.  Numeric measurements (values) that represent a specific business aspect or activity  Stored in a fact table at the center of the star scheme  Contains facts that are linked through their dimensions  Can be computed or derived at run time  Updated periodically with data from operational databases +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 36.  Qualifying characteristics that provide additional perspectives to a given fact  DSS data is almost always viewed in relation to other data  Dimensions are normally stored in dimension tables +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 37.  Dimension Tables contain Attributes  Attributes are used to search, filter, or classify facts  Dimensions provide descriptive characteristics about the facts through their attributed  Must define common business attributes that will be used to narrow a search, group information, or describe dimensions. (ex.: Time / Location / Product)  No mathematical limit to the number of dimensions (3-D makes it easy to model) +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 38.  Provides a Top-Down data organization  Aggregation  Drill-down / Roll-Up data analysis  Attributes from different dimensions can be grouped to form a hierarchy +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 39. Fact Table Dimension Tables +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 40.  Fact and Dimensions are represented by physical tables in the data warehouse database  Fact tables are related to each dimension table in a Many to One relationship (Primary/Foreign Key Relationships)  Fact Table is related to many dimension tables The primary key of the fact table is a composite primary key from the dimension tables  Each fact table is designed to answer a specific DSS question +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 41.  The fact table is always the larges table in the star schema  Each dimension record is related to thousand of fact records  Star Schema facilitated data retrieval functions  DBMS first searches the Dimension Tables before the larger fact table +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 42.  An Active Decision Support Framework  Not a Static Database  Always a Work in Process  Complete Infrastructure for Company-Wide decision support  Hardware / Software / People / Procedures / Data  Data Warehouse is a critical component of the Modern DSS – But not the Only critical component +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 43.  Discover Previously unknown data characteristics, relationships, dependencies, or trends  Typical Data Analysis Relies on end users  Define the Problem  Select the Data  Initial the Data Analysis  Reacts to External Stimulus +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 44.  Proactive  Automatically searches Anomalies Possible Relationships Identify Problems before the end-user  Data Mining tools analyze the data, uncover problems or opportunities hidden in data relationships, form computer models based on their findings, and then user the models to predict business behavior – with minimal end-user intervention +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 45.  A methodology designed to perform knowledge-discovery expeditions over the database data with minimal end-user intervention  3 Stages of Data  Data  Information  Knowledge +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 46. +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 47.  Data Preparation  Identify the main data sets to be used by the data mining operation (usually the data warehouse)  Data Analysis and Classification  Study the data to identify common data characteristics or patterns  Data groupings, classifications, clusters, sequences  Data dependencies, links, or relationships  Data patterns, trends, deviation +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 48.  Knowledge Acquisition  Uses the Results of the Data Analysis and Classification phase  Data mining tool selects the appropriate modeling or knowledge-acquisition algorithms  Neural Networks  Decision Trees  Rules Induction  Genetic algorithms  Memory-Based Reasoning  Prognosis  Predict Future Behavior  Forecast Business Outcomes  65% of customers who did not use a particular credit card in the last 6 months are 88% likely to cancel the account. +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 49.  Still a New Technique  May find many Unmeaningful Relationships  Good at finding Practical Relationships  Define Customer Buying Patterns  Improve Product Development and Acceptance  Etc.  Potential of becoming the next frontier in database development +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 50. +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in
  • 51. +919892900103 | info@vibranttechnologies.co.in | www. Vibranttechnologies.co.in