SlideShare una empresa de Scribd logo
1 de 28
The Concepts of Business
Intelligence
Microsoft® Business Intelligence Solutions
Roadmap
 BI Concepts slides (this PowerPoint)
 BI Concepts Video
 Cubes Demo Video
 Dashboards Demo Video
 Data Mining Video
 Additional slides
 Introduction
 Consolidating Data from Multiple
Sources
 Supporting Different Types of Users
 Identifying Elements to Support
Analysis
DATA WAREHOUSING AND BUSINESS INTELLIGENCE SKILLS FOR INFORMATION SYSTEMS
GRADUATES: ANALYSIS BASED ON MARKETPLACE DEMAND
Ashraf Shirani, Malu Roldan
Issues in Information Systems, 2009
http://www.iacis.org/iis/2009_iis/pdf/P2009_1265.pdf
OLAP vs. Business Intelligence
Online analytical processing, or OLAP
 It is an approach to quickly answer
multi-dimensional analytical queries.
 OLAP is part of the broader category
of business intelligence, which also
encompasses reporting, data mining,
and analytics.
The Challenges of Building BI
Solutions
 There are several issues inherent to
any BI project:
 Data exists in multiple places
 Data is not formatted to support complex
analysis
 Different kinds of workers have different
data needs
 What data should be examined and in what
detail
 How will users interact with that data
Consolidation of Data
 The process of consolidating data
means moving it, making it consistent,
and cleaning up the data as much as
possible
 Data is frequently stored in different
formats
 Data is frequently inconsistent between
sources
 Data may be dirty
 Internally inconsistent or missing values
Disparate Data
 Data in a variety of locations and
formats:
 Relational databases (operational data
systems)
 XML files
 Desktop databases
 Microsoft ® Excel™ spreadsheets
 The data may also be in databases on
different operating system and
hardware platforms
Inconsistent Data
 Data may be inconsistent
 Two plants might have different part
numbers for the same physical part
 To represent True and False, one system
may use 1 and 0, while another system
may use T and F
 Data stored in different countries will likely
store sales in their local currency
 These sales must be converted to a common
currency
Data Quality Issues
 Clean data facilitates more accurate
analysis
 Many data entry systems allow free-
form data entry of text values
 For example, the same city might be
entered as Louisville, Lewisville, and
Luisville
 Routines to clean up data need to take
into account all possible variations of
bad data
Extraction, Transformation, and
Loading (ETL)
 The process of data consolidation is
often called Extraction, Transformation,
and Loading (ETL)
 The ETL process extracts data from the
various source systems
 Data is then transformed to make it
consistent and improve data quality
 The consolidated, consistent, and cleaned
data is then loaded into a data repository
 Developing the ETL process often
consumes 80% of the development
time
Extraction, Transformation, and
Loading (ETL) Tools
 Some ETL Tools
 Oracle Data Integrator (ODI)
 Informatica
 IBM Ascential
 Abinitio
Technical Issues with Data
Consolidation
 Access to different data sources can be
problematic
 Servers may be geographically distributed
and have inconsistent network connectivity
 Different data formats may require
different drivers and data access
methodologies
 Data access permissions may present
issues
 Data cleanup may require complex
transformation logic
Business Issues with Data
Consolidation
 Business users must drive what should
be in the data warehouse
 Someone in the business must decide
how to consolidate inconsistent data
 If True is 1 in one system and T in another,
what should the value be once the data is
consolidated from the two systems?
 The business must decide how to
handle other necessary items - such as
currency conversions
Supporting Different Types of
Users
 One of the great benefits of BI is that it
can support the data needs of the
entire business
 This support comes from the many
different ways that users can consume BI
data
 Different tools exist to support these
different data needs
The Users of Business
Intelligence
 Executives and business decision
makers look at the business from a
high level, performing limited analysis
 Analysts perform complex, detailed
data analysis
 Information workers need static reports
or limited analytic power
 Line workers need no analytic
capabilities as BI is presented to them
as part of their job
The Users of Business
Intelligence
The Approaches to Consuming
Business Intelligence
 Scorecards
 Customized high-level views with limited
analytic capabilities
 Reports
 Standardized reports aimed at a large
audience, with no or limited analytic
capabilities
 Analytics Applications
 Applications designed to allow complex
data analysis
 Custom Applications
 Embed BI data within an application
The Components of a Data
Warehouse
 There are several items that make up a
data warehouse
 Cubes
 Measures
 Key Performance Indicators
 Dimensions
 Attributes
 Hierarchies
Asking a BI Question
 Humans tend to think in a
multidimensional way, even if they
don’t realize it
 We often want to see a particular value
in a certain context
 Show me sales by month by product for
North America
 “What” you want to see (sales in this
case) is called a measure
 How you want to see it (month,
product, and North America) is called a
dimension
Cubes
 Cubes are the structures in which data
is stored
 Users access data in the cubes by
navigating through various dimensions
Measures
 Measures are what you want to see
 They are almost always numeric
 They are often additive
 Dollar sales, unit sales, profit, expenses,
and more
 Some measures are not additive
 Date of last shipment
 Inventory counts and number of unique
customers
Key Performance Indicators
 Key Performance Indicators (KPIs) are
typically a special type of measure
 A KPI might be Customer Retention, which
is a calculation of customer churn
 A KPI may be Customer satisfaction
derived from one or more measures
(ratings in a survey or product returns +
number of repeat customers).
 KPIs are often what are shown on scorecards
 KPIs often contain not just the number, but
also a target number
 Used to evaluate the “health” of the value
Dimensions
 Dimensions are how you want to see
the data
 You usually want to see data by time,
geography, product, account,
employee, …
 Dimensions are made up of attributes
and may or may not include hierarchies
 Year – Semester – Quarter – Month – Day
 Product Category – Product Subcategory -
Product
Attributes
 Attributes are individual values that
make up dimensions
 A Time dimension may have a Month
attribute, a Year attribute, and so forth
 A Geography dimension may have a
Country attribute, a Region attribute, a
City attribute, and so on
 A Product dimension may have a Part
Number attribute, a size attribute, a color
attribute, a manufacturer attribute, and
more
Hierarchies
 You can put attributes into a
hierarchical structure to assist user
analysis
 One of the most common functions in
BI is to “drill down” to a more detailed
level
 For example, Time hierarchy might be
to go from Year to Quarter to Month to
Day
 Another Time hierarchy might go from
Year to Month to Week to Day to Hour
Summary
 The ETL process extracts data from
source systems, transforms it and
then loads it to a data warehouse or
a data mart.
 Using reports and dashboards, BI
looks at data as a collection of
measures and KPIs viewed by
dimensions.
Oracle DW/BI Products
 OBIEE – mainly based on Siebel
technology.
 Oracle Hyperion Essbase

Más contenido relacionado

La actualidad más candente

Business intelligence overview
Business intelligence overviewBusiness intelligence overview
Business intelligence overviewCanara bank
 
Business Intelligence Module 5
Business Intelligence Module 5Business Intelligence Module 5
Business Intelligence Module 5Home
 
Business intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and ApplicationsBusiness intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and Applicationsraj
 
Business intelligence in the real time economy
Business intelligence in the real time economyBusiness intelligence in the real time economy
Business intelligence in the real time economyJohan Blomme
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemKiran kumar
 
Business intelligence implementation case study
Business intelligence implementation case studyBusiness intelligence implementation case study
Business intelligence implementation case studyJennie Chen, CTP
 
Introduction to Business Intelligence (SAP BI)
Introduction to Business Intelligence (SAP BI)Introduction to Business Intelligence (SAP BI)
Introduction to Business Intelligence (SAP BI)salam muthuswamy Shiva
 
Top 15 Business Intelligence (BI) Software
Top 15 Business Intelligence (BI) SoftwareTop 15 Business Intelligence (BI) Software
Top 15 Business Intelligence (BI) SoftwareMopinion
 
BI slides_Viatec DTAPP
BI slides_Viatec DTAPPBI slides_Viatec DTAPP
BI slides_Viatec DTAPPEric Anderson
 
Business Intelligence and Business Analytics
Business Intelligence and Business AnalyticsBusiness Intelligence and Business Analytics
Business Intelligence and Business Analyticssnehal_152
 
Business Intelligence Module 2
Business Intelligence Module 2Business Intelligence Module 2
Business Intelligence Module 2Home
 
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Optimus BT
 
Business Intelligence - A Management Perspective
Business Intelligence - A Management PerspectiveBusiness Intelligence - A Management Perspective
Business Intelligence - A Management Perspectivevinaya.hs
 

La actualidad más candente (20)

Business intelligence overview
Business intelligence overviewBusiness intelligence overview
Business intelligence overview
 
Business Intelligence Module 5
Business Intelligence Module 5Business Intelligence Module 5
Business Intelligence Module 5
 
Business intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and ApplicationsBusiness intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and Applications
 
Business intelligence in the real time economy
Business intelligence in the real time economyBusiness intelligence in the real time economy
Business intelligence in the real time economy
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse System
 
Business Intelligence concepts
Business Intelligence conceptsBusiness Intelligence concepts
Business Intelligence concepts
 
Introduction to business intelligence
Introduction to business intelligenceIntroduction to business intelligence
Introduction to business intelligence
 
Business intelligence implementation case study
Business intelligence implementation case studyBusiness intelligence implementation case study
Business intelligence implementation case study
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligence
 
BUSINESS INTELLIGENCE
BUSINESS INTELLIGENCEBUSINESS INTELLIGENCE
BUSINESS INTELLIGENCE
 
Introduction to Business Intelligence (SAP BI)
Introduction to Business Intelligence (SAP BI)Introduction to Business Intelligence (SAP BI)
Introduction to Business Intelligence (SAP BI)
 
Top 15 Business Intelligence (BI) Software
Top 15 Business Intelligence (BI) SoftwareTop 15 Business Intelligence (BI) Software
Top 15 Business Intelligence (BI) Software
 
Business analytics
Business analyticsBusiness analytics
Business analytics
 
Ahmed Rami Elsherif, CV
Ahmed Rami Elsherif, CVAhmed Rami Elsherif, CV
Ahmed Rami Elsherif, CV
 
BI slides_Viatec DTAPP
BI slides_Viatec DTAPPBI slides_Viatec DTAPP
BI slides_Viatec DTAPP
 
Business Intelligence and Business Analytics
Business Intelligence and Business AnalyticsBusiness Intelligence and Business Analytics
Business Intelligence and Business Analytics
 
Business Intelligence Module 2
Business Intelligence Module 2Business Intelligence Module 2
Business Intelligence Module 2
 
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligence
 
Business Intelligence - A Management Perspective
Business Intelligence - A Management PerspectiveBusiness Intelligence - A Management Perspective
Business Intelligence - A Management Perspective
 

Destacado

Distance & midpoint formulas 11.5
Distance & midpoint formulas 11.5Distance & midpoint formulas 11.5
Distance & midpoint formulas 11.5bweldon
 
Pythagorean theorem and distance formula
Pythagorean theorem and distance formulaPythagorean theorem and distance formula
Pythagorean theorem and distance formulaporscha227
 
Passive house integrated design bratislava
Passive house integrated design bratislavaPassive house integrated design bratislava
Passive house integrated design bratislavaCarl-peter Goossen
 
Beyond CRM - Customer Lifecycle Management
Beyond CRM - Customer Lifecycle ManagementBeyond CRM - Customer Lifecycle Management
Beyond CRM - Customer Lifecycle ManagementCollabor Inc.
 
Final pizza hut 2010
Final pizza hut 2010Final pizza hut 2010
Final pizza hut 2010Quvol.com
 
Customer Lifecycle Management
Customer Lifecycle ManagementCustomer Lifecycle Management
Customer Lifecycle ManagementDispatch
 
Pizza hut final ppt
Pizza hut final pptPizza hut final ppt
Pizza hut final pptWasim Akram
 

Destacado (10)

Satyam case study
Satyam case studySatyam case study
Satyam case study
 
Distance & midpoint formulas 11.5
Distance & midpoint formulas 11.5Distance & midpoint formulas 11.5
Distance & midpoint formulas 11.5
 
Pythagorean theorem and distance formula
Pythagorean theorem and distance formulaPythagorean theorem and distance formula
Pythagorean theorem and distance formula
 
CRM & Customer Relationship Lifecycle
CRM & Customer Relationship  LifecycleCRM & Customer Relationship  Lifecycle
CRM & Customer Relationship Lifecycle
 
Amazing Customer Journey
Amazing Customer Journey Amazing Customer Journey
Amazing Customer Journey
 
Passive house integrated design bratislava
Passive house integrated design bratislavaPassive house integrated design bratislava
Passive house integrated design bratislava
 
Beyond CRM - Customer Lifecycle Management
Beyond CRM - Customer Lifecycle ManagementBeyond CRM - Customer Lifecycle Management
Beyond CRM - Customer Lifecycle Management
 
Final pizza hut 2010
Final pizza hut 2010Final pizza hut 2010
Final pizza hut 2010
 
Customer Lifecycle Management
Customer Lifecycle ManagementCustomer Lifecycle Management
Customer Lifecycle Management
 
Pizza hut final ppt
Pizza hut final pptPizza hut final ppt
Pizza hut final ppt
 

Similar a Bi concepts

Business Intelligence Challenges 2009
Business Intelligence Challenges 2009Business Intelligence Challenges 2009
Business Intelligence Challenges 2009Lonnell Branch
 
Bi presentation
Bi presentationBi presentation
Bi presentationbani1322
 
SSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business IntelligenceSSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business IntelligenceSlava Kokaev
 
Business Intelligence and Analytics Capability
Business Intelligence and Analytics CapabilityBusiness Intelligence and Analytics Capability
Business Intelligence and Analytics CapabilityALTEN Calsoft Labs
 
SBOeCube
SBOeCubeSBOeCube
SBOeCubeNASSCOM
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligenceAhsan Kabir
 
Business Intelligence: Data Warehouses
Business Intelligence: Data WarehousesBusiness Intelligence: Data Warehouses
Business Intelligence: Data WarehousesMichael Lamont
 
Business Intelligence Industry Perspective Session I
Business Intelligence   Industry Perspective Session IBusiness Intelligence   Industry Perspective Session I
Business Intelligence Industry Perspective Session IPrithwis Mukerjee
 
Business intelligence(bi)
Business intelligence(bi)Business intelligence(bi)
Business intelligence(bi)ArvindDubey36
 
Bi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkBi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkSlava Kokaev
 
E comm final review
E comm final reviewE comm final review
E comm final review200253049
 
Business intellegence erp
Business intellegence erpBusiness intellegence erp
Business intellegence erpSargam Sethi
 
Business Intelligence and Analytics Services
Business Intelligence and Analytics Services  Business Intelligence and Analytics Services
Business Intelligence and Analytics Services Thinklayer
 
Sean\'s EBI Introduction Presentation
Sean\'s EBI Introduction PresentationSean\'s EBI Introduction Presentation
Sean\'s EBI Introduction Presentationseanmayers
 

Similar a Bi concepts (20)

Business Intelligence Challenges 2009
Business Intelligence Challenges 2009Business Intelligence Challenges 2009
Business Intelligence Challenges 2009
 
Bi presentation
Bi presentationBi presentation
Bi presentation
 
businessintelligence.pptx
businessintelligence.pptxbusinessintelligence.pptx
businessintelligence.pptx
 
SSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business IntelligenceSSAS R2 and SharePoint 2010 – Business Intelligence
SSAS R2 and SharePoint 2010 – Business Intelligence
 
Business Intelligence and Analytics Capability
Business Intelligence and Analytics CapabilityBusiness Intelligence and Analytics Capability
Business Intelligence and Analytics Capability
 
SBOeCube
SBOeCubeSBOeCube
SBOeCube
 
bi
bibi
bi
 
Technologies
TechnologiesTechnologies
Technologies
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligence
 
Business Intelligence: Data Warehouses
Business Intelligence: Data WarehousesBusiness Intelligence: Data Warehouses
Business Intelligence: Data Warehouses
 
BI
BIBI
BI
 
Business Intelligence Industry Perspective Session I
Business Intelligence   Industry Perspective Session IBusiness Intelligence   Industry Perspective Session I
Business Intelligence Industry Perspective Session I
 
Business intelligence(bi)
Business intelligence(bi)Business intelligence(bi)
Business intelligence(bi)
 
Bi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkBi Architecture And Conceptual Framework
Bi Architecture And Conceptual Framework
 
E comm final review
E comm final reviewE comm final review
E comm final review
 
Business intellegence erp
Business intellegence erpBusiness intellegence erp
Business intellegence erp
 
Business Analytics
Business AnalyticsBusiness Analytics
Business Analytics
 
Business Intelligence and Analytics Services
Business Intelligence and Analytics Services  Business Intelligence and Analytics Services
Business Intelligence and Analytics Services
 
Sean\'s EBI Introduction Presentation
Sean\'s EBI Introduction PresentationSean\'s EBI Introduction Presentation
Sean\'s EBI Introduction Presentation
 
BA MODULE1.pdf
BA MODULE1.pdfBA MODULE1.pdf
BA MODULE1.pdf
 

Último

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 

Último (20)

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 

Bi concepts

  • 1. The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions
  • 2. Roadmap  BI Concepts slides (this PowerPoint)  BI Concepts Video  Cubes Demo Video  Dashboards Demo Video  Data Mining Video  Additional slides
  • 3.  Introduction  Consolidating Data from Multiple Sources  Supporting Different Types of Users  Identifying Elements to Support Analysis
  • 4. DATA WAREHOUSING AND BUSINESS INTELLIGENCE SKILLS FOR INFORMATION SYSTEMS GRADUATES: ANALYSIS BASED ON MARKETPLACE DEMAND Ashraf Shirani, Malu Roldan Issues in Information Systems, 2009 http://www.iacis.org/iis/2009_iis/pdf/P2009_1265.pdf
  • 5. OLAP vs. Business Intelligence Online analytical processing, or OLAP  It is an approach to quickly answer multi-dimensional analytical queries.  OLAP is part of the broader category of business intelligence, which also encompasses reporting, data mining, and analytics.
  • 6. The Challenges of Building BI Solutions  There are several issues inherent to any BI project:  Data exists in multiple places  Data is not formatted to support complex analysis  Different kinds of workers have different data needs  What data should be examined and in what detail  How will users interact with that data
  • 7. Consolidation of Data  The process of consolidating data means moving it, making it consistent, and cleaning up the data as much as possible  Data is frequently stored in different formats  Data is frequently inconsistent between sources  Data may be dirty  Internally inconsistent or missing values
  • 8. Disparate Data  Data in a variety of locations and formats:  Relational databases (operational data systems)  XML files  Desktop databases  Microsoft ® Excel™ spreadsheets  The data may also be in databases on different operating system and hardware platforms
  • 9. Inconsistent Data  Data may be inconsistent  Two plants might have different part numbers for the same physical part  To represent True and False, one system may use 1 and 0, while another system may use T and F  Data stored in different countries will likely store sales in their local currency  These sales must be converted to a common currency
  • 10. Data Quality Issues  Clean data facilitates more accurate analysis  Many data entry systems allow free- form data entry of text values  For example, the same city might be entered as Louisville, Lewisville, and Luisville  Routines to clean up data need to take into account all possible variations of bad data
  • 11. Extraction, Transformation, and Loading (ETL)  The process of data consolidation is often called Extraction, Transformation, and Loading (ETL)  The ETL process extracts data from the various source systems  Data is then transformed to make it consistent and improve data quality  The consolidated, consistent, and cleaned data is then loaded into a data repository  Developing the ETL process often consumes 80% of the development time
  • 12. Extraction, Transformation, and Loading (ETL) Tools  Some ETL Tools  Oracle Data Integrator (ODI)  Informatica  IBM Ascential  Abinitio
  • 13. Technical Issues with Data Consolidation  Access to different data sources can be problematic  Servers may be geographically distributed and have inconsistent network connectivity  Different data formats may require different drivers and data access methodologies  Data access permissions may present issues  Data cleanup may require complex transformation logic
  • 14. Business Issues with Data Consolidation  Business users must drive what should be in the data warehouse  Someone in the business must decide how to consolidate inconsistent data  If True is 1 in one system and T in another, what should the value be once the data is consolidated from the two systems?  The business must decide how to handle other necessary items - such as currency conversions
  • 15. Supporting Different Types of Users  One of the great benefits of BI is that it can support the data needs of the entire business  This support comes from the many different ways that users can consume BI data  Different tools exist to support these different data needs
  • 16. The Users of Business Intelligence  Executives and business decision makers look at the business from a high level, performing limited analysis  Analysts perform complex, detailed data analysis  Information workers need static reports or limited analytic power  Line workers need no analytic capabilities as BI is presented to them as part of their job
  • 17. The Users of Business Intelligence
  • 18. The Approaches to Consuming Business Intelligence  Scorecards  Customized high-level views with limited analytic capabilities  Reports  Standardized reports aimed at a large audience, with no or limited analytic capabilities  Analytics Applications  Applications designed to allow complex data analysis  Custom Applications  Embed BI data within an application
  • 19. The Components of a Data Warehouse  There are several items that make up a data warehouse  Cubes  Measures  Key Performance Indicators  Dimensions  Attributes  Hierarchies
  • 20. Asking a BI Question  Humans tend to think in a multidimensional way, even if they don’t realize it  We often want to see a particular value in a certain context  Show me sales by month by product for North America  “What” you want to see (sales in this case) is called a measure  How you want to see it (month, product, and North America) is called a dimension
  • 21. Cubes  Cubes are the structures in which data is stored  Users access data in the cubes by navigating through various dimensions
  • 22. Measures  Measures are what you want to see  They are almost always numeric  They are often additive  Dollar sales, unit sales, profit, expenses, and more  Some measures are not additive  Date of last shipment  Inventory counts and number of unique customers
  • 23. Key Performance Indicators  Key Performance Indicators (KPIs) are typically a special type of measure  A KPI might be Customer Retention, which is a calculation of customer churn  A KPI may be Customer satisfaction derived from one or more measures (ratings in a survey or product returns + number of repeat customers).  KPIs are often what are shown on scorecards  KPIs often contain not just the number, but also a target number  Used to evaluate the “health” of the value
  • 24. Dimensions  Dimensions are how you want to see the data  You usually want to see data by time, geography, product, account, employee, …  Dimensions are made up of attributes and may or may not include hierarchies  Year – Semester – Quarter – Month – Day  Product Category – Product Subcategory - Product
  • 25. Attributes  Attributes are individual values that make up dimensions  A Time dimension may have a Month attribute, a Year attribute, and so forth  A Geography dimension may have a Country attribute, a Region attribute, a City attribute, and so on  A Product dimension may have a Part Number attribute, a size attribute, a color attribute, a manufacturer attribute, and more
  • 26. Hierarchies  You can put attributes into a hierarchical structure to assist user analysis  One of the most common functions in BI is to “drill down” to a more detailed level  For example, Time hierarchy might be to go from Year to Quarter to Month to Day  Another Time hierarchy might go from Year to Month to Week to Day to Hour
  • 27. Summary  The ETL process extracts data from source systems, transforms it and then loads it to a data warehouse or a data mart.  Using reports and dashboards, BI looks at data as a collection of measures and KPIs viewed by dimensions.
  • 28. Oracle DW/BI Products  OBIEE – mainly based on Siebel technology.  Oracle Hyperion Essbase

Notas del editor

  1. The section, though somewhat technical, it avoids a deep technical discussion as it's aimed at both technology and business students. It seeks to prepare the technology students for the concepts in the session about the technical aspects of BI, while providing business students with an understanding of the challenges and steps necessary to create a BI solution that delivers business value.
  2. There are a number of challenges with BI. First, there are technical issues: disparate data, different operating systems, different database platforms, and more. This data is usually stored in an OLTP format designed for fast inserts, updates, and deletes, not analysis. Some challenges are human issues: different workers have different levels of expertise for working with data and therefore need different tools. Other challenges are business challenges: what data should be available and to whom and what level.
  3. The first step in building a BI solution is to consolidate data. There are many challenges here, such as distributed data being inconsistent. Data is often “dirty” which means bad data has crept into the system. These data challenges are covered on the next three slides.
  4. The data is often distributed throughout an organization. A company might have custom applications that use Microsoft SQL Server, purchased applications that use Oracle, employees storing data in Microsoft Excel, and data arriving in XML format. In addition, SQL Server may be running on Windows Server 2003 while Oracle may be running on a Unix server. In order to perform any analysis, data must be consolidated. In the vast majority of cases, the data is physically consolidated on a single server; however, in some rare cases, it is accessed directly where it is. This is rarely done because accessing the data where it resides has an adverse impact on the performance of those OLTP system.
  5. Data is often stored in inconsistent formats. True and False are often represented differently in different systems; True may be 1, 0, T, True, or some binary value that means “true.” Revenues and expenses are often recorded in the local currency, creating a challenge for multinational corporations. Different systems may have different part number or employee IDs. In order to do any sort of analysis, all of these issues must be corrected; data must be made consistent before analysis can be performed.
  6. Virtually all companies have dirty data. Dirty data are bad values that enter a system. It can be a simple typo when entering a number, but it is often a bad value entered into a free-form text field. For example, sales may be attributed to employee IDs that don’t exist in the Employees system, city names might be spelled many different ways, and so forth. Cleaning up bad data can require extensive routines that require updating each time a new bad value is encountered. Organizations can also use data mining algorithms to help clean up data; for example, a fuzzy lookup can be used to help match text values that are similar.
  7. The process of moving data from its source systems, consolidating it in a central location, and fixing data inconsistencies is called Extraction, Transformation, and Loading, or ETL. The extraction step is pulling the data from various source systems. It is then transformed, or made consistent (“True” values are all set to the same, currencies are converted, and so forth.) Finally, it is loaded into a data repository (often called a data factory or data warehouse.)
  8. Performing the ETL process is one of the most difficult tasks of building a BI solution; in most cases, it is the most difficult task. The first challenge is simply identifying the data that is needed and where it resides. This is not always as easy as it sounds, as business users may know what they want to see but know where the data is stored. Once data sources have been identified, these sources may be on systems that are difficult to reach or that have inconsistent connections. Data can be stored in different formats, making consolidation a challenge. Many systems store dates in different formats and converting dates from one system to another can be problematic. Not only can data be stored in different places and formats, but each system can have its own security and permissions requirements, greatly complicating access. Finally, cleaning up bad values (such as misspelled city names) can require complex logic that must be updated each time a new misspelling appears.
  9. There are a number of business issues with data consolidation, and unfortunately politics can come into play. Different groups always want to argue that “their” data is correct and no one else’s is. The business must decide what values to use when different systems store the same value differently, and this is especially contentious if different systems store the same product or part with different values. Currency conversions must be made to a common currency, and the format of this currency is another business decision.
  10. There are many different tools that can be used against a common BI database. These different tools support a wide variety of users with different needs when working with data.
  11. The users of BI can be broken down into: High-level users with the need for a broad view and limited analytics capabilities Those specialized users who perform detailed data analysis and need powerful tools Workers who need basic reports with possible analytic features Workers who have BI built into the systems they use without realizing it is BI
  12. The users of BI can be broken down into: High-level users with the need for a broad view and limited analytics capabilities Those specialized users who perform detailed data analysis and need powerful tools Workers who need basic reports with possible analytic features Workers who have BI built into the systems they use without realizing it is BI
  13. There are many different approaches to consuming BI. Different approaches can map to multiple groups of users.
  14. There are a number of components that make up a “data warehouse.” A “data warehouse” and “data mart” are built exactly the same way; the only difference is the scope (a “warehouse” is for the entire business, a “mart” is for a business functional area.) The terms listed here are defined on the next few slides.
  15. In order to retrieve data from a warehouse, it helps to know the components of a question. Typically users ask to see “something” (sales, expenses, number of units, etc.) segmented “by” certain things (time, location, salesperson, and so forth.) What people want to see are usually numeric and they are called measures. Measures are the basis for KPIs. How the data should be segmented are called dimensions . These measures, and dimensions, are stored in cubes.
  16. A cube is the basic building block of a data warehouse. A warehouse may contain one or more cubes. A cube is a multidimensional structure that holds data based on dimensions. Consider an example of a data warehouse for a shipping company/organization such as FedEx, UPS, or USPS. In this diagram, there are three dimensions shown: Time, Source, and Route. At each intersection of Time, Source, and Route is a cell. Within that cell are two measures: the number of packages and the date one was shipped. This is far different from a relational setup: relational databases are two-dimensional (rows and columns) and each cell can have only a single value.
  17. Measures are the “what” people want to see. They are almost always numeric. They are often additive, but not always. Measures may be KPIs or serve as the basis of KPIs. Unlike in a relational schema, in a cube you would typically want to store calculated values in order to make retrieval faster, and most cubes include the concept of calculated measures.
  18. KPIs may be regular measures or special measures. These special measures may require calculations based on existing measures. Where KPIs are often different is that some engines allow KPIs to be stored with references to historical values (for trend analysis) and references to budget numbers (to determine the health of the value.)
  19. Dimensions are how people like to segment, or slice, the data. Almost anytime someone asks a question, they describe how they want to see it. For example, sales by store by month. Cubes may contain many dimensions, but the more dimensions that are available, the more challenging it is for non-technical users to explore.
  20. Attributes represent different ways of looking at something in a dimension. For example, in a product dimension, a user might want to compare the sales of a product by color; is the red product selling better than the blue? Does it depend on which area of the country is examined? Many of the columns in a relational table can become attributes in a warehouse. When looking at employees, attributes such as age, sex, race, postal code, and more, all make sense for performing analysis.
  21. Most dimensions contain hierarchies which allow users to drill down on data. For example, a Time dimension often has a Year level which can then be broken down into Quarters. Quarters can then be broken into Months, and finally Days. Values in the cube are physically stored at the lowest level of granularity but summarized values are stored at each higher level of the dimension, so when a user asks to see Quarterly data, the value is already stored and retrieval is nearly instantaneous.