5. Phases of Enterprise Information Management
Project Barcelona* Integration Services
Discover Origins & Market-leading ETL
Relationships between and data integration
artifacts tool
Easy-start solution for Knowledge-based Data
master and reference Cleansing & Matching
data management
Master Data Services Data Quality
Services
* Will be shipped separately from SQL Server 2012 and is subject to change
Preliminary Information Subject to Change
6. Integrated Data Management Scenario
Cleanse,
Data Sources match
DQS
Acquire
Discover SSIS
Barcelona
Curate
MDS
Publish Match, de-
SSIS duplicate
DQS
Preliminary Information Subject to Change
9. Project Barcelona Key Features
• No up front planning, • As more of the • New crawlers, UIs
modeling, ongoing enterprise is crawled, added over time
maintenance more and more
• In-house customized
dependencies are
• Out of the box, just solutions
uncovered
starts working
• Opportunity for
partners
• Crawlers shipped out of
band
10. Why is Data Quality Important?
Data quality problems cost U.S. businesses more
than $600 billion a year.
Data Warehousing Institute (TDWI)
Costs associated with bad data include:
• Excess inventory
• Higher supply chain costs
• higher direct marketing costs
• Billing
• And more…
Preliminary Information Subject to Change
11. Common Data Quality Issues
Data Quality Issue Sample Data Problem
Format Do values follow consistent formatting Telephone number formats:
standards ? xxxxxxxxxx,
(xxx) xxx-xxxx
1.xxx.xxx.xxxx, etc.
Standard Are data elements consistently defined and ‘Gender code’ = M, F, U
understood ?
‘Gender code’ = 0, 1, 2
Consistent Do values represent the same meaning ? How is revenue presented ?
Dollars, Euro, Both?
Complete Is all necessary data present ? 20% of customers’ last name is blank,
50% of zip-codes are 99999
Accurate Does the data accurately represent reality or a A Supplier is listed as ‘Active’ but went out
verifiable source? of business six years ago
Valid Do data values fall within acceptable ranges? Salary values should be between
60,000-120,000
Duplicates Data appears several times Both John Ryan and Jack Ryan appear in the
system – are they the same person?
Preliminary Information Subject to Change
12. How to Manage Data Quality?
People Technology Processes
Preliminary Information Subject to Change
13. Make Data Quality Approachable to Everyone
Preliminary Information Subject to Change
14. DQS Solution Concepts
Knowledge-Driven
Based on a Data Quality Knowledge Base (DQKB) that is reusable for a variety of
data quality improvements
Semantics
Data is mapped into Data Domains, which capture its Semantics
Knowledge Discovery
Acquire additional knowledge through data samples and user feedback
Open and Extendible
Support use of user-generated knowledge and IP by 3rd party reference data
providers
Easy to Use
Compelling user experience designed for increased productivity
Preliminary Information Subject to Change
15. DQS Architecture Overview
DQS Clients DQS Cloud Services
DQS Client DataMarket - Categorized Reference Data DQS Store - KB, Domains
Knowledge
Discovery and
Management
DQS Server 3rd Party
Reference
Interactive DQ Data
Projects Reference Data API Reference Data API
(Browse, Set, Validate…) (Browse, Get, Update…)
DQS Engine Reference
Cleansing
Administration Data
Knowledge Data Profiling Services
Discovery Matching Reference Data
Exploration
Other DQS Clients
DQ Projects Store Common Knowledge Store
SSIS DQS Cleansing
Component
Future Clients: Excel, DQ Active Published
SharePoint, Projects KBs
MDS…
Preliminary Information Subject to Change
16. dqs demo flow
Create DQKB
Knowledge
Create DQS
Discovery to
Project
create a domain
Manage Domains
Preliminary Information Subject to Change
17. What is Master Data?
Preliminary Information Subject to Change
18. Data Solutions Data Warehouse / Operational Data
Data Marts Mgmt Management
Provides storage and Enables business users to Central data records mgmt
management of the objects manage the dimensions and and consumption sourced by
and metadata used as the hierarchies of DW / Data other operational systems
application knowledge Marts
• Object mappings • BI scenarios
A company has adopted 6 “best of
breed” systems from different
• Reference Data / managed vendors. They need to be able to
object lists propagate the correct customer
information to each system in a
• Metadata management / consistent way.
data dictionary MDS provides a platform for
central schema, integration
points and validation for
SI/ISV/Internal IT to develop a
custom solution1
MDS focus
Partners Value Add
Preliminary Information Subject to Change
19. MDS Capabilities
Validation
Modeling Authoring business rules to ensure
Entities, Attributes, Hierarchies data correctness
Excel Add-In MDS Web UI Data Matching
Role-based Security and
Transaction Annotation Master Data
Stewardship
Versioning
Enabling Integration & Sharing
Loading batched data Registering to changes Consuming data Workflow /
through Staging Tables through APIs through Views Notifications
External
Excel DWH (CRM, ..)
Preliminary Information Subject to Change
20. Empowering IWs through
Excel Add-in and improved
Web UI
Enhanced performance and
scalability
Improved quality (usability,
Focus on Foundational robustness, security)
Platform
V1 product
21. Empowering IW: MDS Excel Add-in
Preliminary Information Subject to Change
22. mds demo flow
Create MDS
Model
Create
Create Entity's in
subscription
XLS
views
Explore/Update Publish data
data updates in XLS
Preliminary Information Subject to Change
23. Whats new in SQL Server 2012
Preliminary Information Subject to Change
24. Analysis Services: Tomorrow
Build on the strengths
Embrace the relational
and success of Analysis
data model – well
Services and expand its
understood by
reach to a much
developers and IT Pros
broader user base
Bring together the
relational and Provide flexibility in the
multidimensional platform to suit the
models under a single diverse needs of BI
unified BI platform – applications
best of both worlds!
Preliminary Information Subject to Change
25. BI Semantic Model: Architecture
Reporting
Third-party SharePoint
Services & Power Excel
applications PowerPivot Insights
View
Databases LOB Applications Files OData Feeds Cloud Services
Preliminary Information Subject to Change
26. How Should I Build my Model?
Depends on the application needs for each layer
Data model
Business logic
Data access & storage
Two Visual Studio (BIDS) project types in SQL Server 2012
Multidimensional project – with MDX and MOLAP/ROLAP
Tabular project – with DAX and VertiPaq/DirectQuery
Preliminary Information Subject to Change
27. SSAS: Multidimensional Model Improvements
Multidimensional projects received over 300 improvements
across the board for performance, supportability, reliability
and functionality
Almost 100 were reported directly by customers
Major new features include:
Visual Studio 2010 designers
Removal of 4GB string store limit for attributes
New events for monitoring lock usage and contentions
New messages for tracking resources used per command
New PowerShell support
Preliminary Information Subject to Change
28. What’s New in PowerPivot for Excel Add-in
Includes the same designer and the same features
available to the IT Pro in the Tabular Project in BIDS,
except:
Table partitions
Security roles
Configuring Direct Query mode
Note it is possible to restore a PowerPivot workbook on a
tabular instance of Analysis Services and then create and
manage table partitions
Preliminary Information Subject to Change
29. What’s New in PowerPivot for Excel Add-in
Preliminary Information Subject to Change
30. Excel’s PowerPivot ribbon tab includes the ability to create,
edit and delete KPIs
The PowerPivot Field List has been updated to:
Allow perspective selection
Display hierarchies and KPIs
Create a KPI based on a measure
The Measure Settings window, used to create and edit
measures, supports the configuration of formatting options
Preliminary Information Subject to Change
31. SharePoint Server 2010 SP1 is a prerequisite
The add-in has update to include:
New administrative capabilities
New setup experience
Power View authoring from the PowerPivot Galley
Preliminary Information Subject to Change
32. Whats new in Reporting Services
Preliminary Information Subject to Change
36. What is Power View
Power View is an interactive
data exploration and visual
presentation experience.
37. SQL Server 2012 Power View
Highly Visual Design Experience
• Interactive, web-based authoring and sharing of information
• Familiar Microsoft Office design patterns
• Powerful data layout with banding, callout and small multiples
visualizations
Rich metadata-driven interactivity
• Fully integrated with PowerPivot
• Drive greater insight through smart and powerful querying
• Zero configuration highlighting and filtering
• Animated trending and comparisons
Presentation-ready at all times
• Interactive Presentation turns pervasive information into
persuasive information
• Deliver and collaborate through SharePoint
• Full screen presentation mode for interactive boardroom session
Preliminary Information Subject to Change
38. Power View Architecture
SharePoint Farm
SQL Server BIDS
Web Front End App Server BISM Model
RS Shared
SSRS
Service
Power View client Addin for
SharePoint
AS
PowerPivot Server
System Tabular
PowerPivot Service
Web Service
Analysis
Services SP
Integrated
Excel PowerPivot
Model
Data sources
Preliminary Information Subject to Change
Start by addressing islands of problems within the org – bottoms up approach (analogy Data Marts vs Data Warehouses)Enable data experts to create and modify their data in a controlled and secure wayEase Admins tasksSimple deploymentEasy to define and manage data models
Where do we want to take this product. Multi-release, multi-year vision for the productSometimes you want fast time to solution, sometimes you want complex calculations and huge scaleBISM is the name we give to Analysis Services that encompasses these goals