Jim Boriotti presents an overview and demo of Azure Synapse Analytics, an integrated data platform for business intelligence, artificial intelligence, and continuous intelligence. Azure Synapse Analytics includes Synapse SQL for querying with T-SQL, Synapse Spark for notebooks in Python, Scala, and .NET, and Synapse Pipelines for data workflows. The demo shows how Azure Synapse Analytics provides a unified environment for all data tasks through the Synapse Studio interface.
6. Azure Synapse Analytics
Integrated data platform for BI, AI and continuous intelligence
Platform
Azure
Data Lake Storage
Common Data Model
Enterprise Security
Optimized for Analytics
DATA INTEGRATION
Analytics Runtimes
PROVISIONED ON-DEMAND
Form Factors
SQL
Languages
Python .NET Java Scala R
Experience Azure Synapse Studio
Artificial Intelligence / Machine Learning / Internet of Things
Intelligent Apps / Business Intelligence
METASTORE
SECURITY
MANAGEMENT
MONITORING
7. Synapse SQL Synapse Spark Synapse Pipelines Synapse Studio
Azure Synapse Analytics
Query and analyze data with
T-SQL using both provisioned
and serverless models
Quickly create notebooks with
your choice of Python, Scala,
SparkSQL, and .NET for Spark
Build end-to-end data-driven
workflows for your data
movement and data
processing scenario
Execute all data tasks with a
simple UI and unified
environment
8. Synapse Analytics – Timeline
• Today - Azure SQL DW “Azure Synapse” SQL Pools SQL Provisioned
• Materialized Views, Result Set Caching, Ordered CCI’s
• Next Week – Azure Synapse Analytics (Public Preview)
• Azure Synapse Studio – single console
• ADF integration – migration considerations
• Serverless on-demand query
• Big data processing
• Embedded Apache Spark with Notebooks
• Hybrid data integration: CSV, RCFile, ORC, Parquet
• Workspaces – security & mgmt. boundary
• End of Summer - SQL Gen 3
• Engine updates – Merge, Update delete with join, multi-column dist, updatable dists…
• Multi-cluster support, Online scaling, Cross DB Support, Time Travel
Only pay for the
capabilities you use!
10. Q & A ?
Jim Boriotti – Data & AI Cloud Solution Architect
jim.boriotti@microsoft.com
Link to me at: www.linkedin.com/in/JimBoriotti
11. Azure Synapse Analytics
Build announcements
GA
• Workload Isolation
• COPY Data Loading
• Updatable Hash Key
• Materialized View Improvement
Public Preview
• PREDICT Scoring
• Bulk Load & External Table Wizards
• Serverless Query Perf Enhancements
• Pay-per-query consumption model
• CSV Schema Inference
Private Preview
• SQL MERGE support, ANSI Joins
• Column Encryption
• Multi-Column Hash Distribution
Public Preview
• DeltaLake Tables v0.6
• CDM Support
• .NET for Apache Spark 0.11
• Built-in Samples
• Templated Code Gen for Notebooks
• Statistical Sample Visualization of
Data
• Spark Job Graph Debugging
Public Preview
• CosmosDB with Synapse Link
• Managed Virtual Networks & Private
Endpoints
• Improved Notebook Usability
• More Granular Workspace RBAC
• Getting Started
• SQL & Spark Pool Monitoring and
Management
Public Preview
• Trusted Service for Azure Storage
and Azure Key Vault
• Managed Identity for Mapping Data
Flows
• Static IP ranges Azure Integration
Runtime
• Checkpoint and resume for binary
file copy
Private Preview
• Data Flow CDM Support
Query and analyze data with
T-SQL using both provisioned
and serverless models
Quickly create notebooks with
your choice of Python, Scala,
SparkSQL, and .NET for Spark
Build end-to-end data-driven
workflows for your data
movement and data
processing scenarios
Execute all data tasks with a
simple UI and unified
workspace environment
Synapse SQL Synapse Spark Synapse Pipelines Synapse Studio
12. Performance, Performance, Performance
•
Intelligent data warehouse
•
Heterogeneous data loading
•
Azure Synapse Analytics
the next version
Advanced workload management
•
•
•
•
Premium Databricks integration
•
•
Smart & auto system tuning
•
•
•
Execution models to fit the need
•
Confidential material – covered by Microsoft NDA
13. APACHE SPARKSQL ANALYTICS STUDIO DATA INTEGRATION
Synapse Analytics (PREVIEW)
Synapse Analytics (GA)
(formerly SQL DW)
Synapse Analytics (GA)
NOV 2019
New GA features
• Resultset caching
• Materialized Views
• Ordered Columnstore
• JSON support
• Dynamic Data Masking
• SSDT support
• Read committed snapshot isolation
Preview features
• Workload Isolation
• Simple ingestion with COPY
• Share DW data with Azure Data Share
• Private LINK support
Private preview features
• Streaming ingestion & analytics in DW
• Native Prediction/Scoring
• Fast query over Parquet files
• FROM clause with joins
NOV 2019
Preview features
• Synapse Studio
• Collaborative workspaces
• Distributed T-SQL Query service
• SQL Script editor
• Unified security model
• Notebooks
• Apache Spark
• Code-free data flows
• Orchestration Pipelines
• Data movement