SlideShare una empresa de Scribd logo
1 de 34
Descargar para leer sin conexión
© 2020 Snowflake Inc. All Rights Reserved
FROM DATA TO INSIGHTS:
DATA ENGINEERING
MIT SNOWFLAKE
ScaleUp 360° Smart Data
29. Sept. 2020
Harald Erb | harald.erb@snowflake.com
Sr. Solutions Engineer, Central Europe
© 2020 Snowflake Computing Inc. All Rights Reserved
ABOUT ME
Sr. Solutions Engineer
Central Europe
harald.erb@snowflake.com
Llinkedin.com/in/haralderb
Enthusiastic about Business Analytics &
Data Management for 20+ years
> Consulting: Delivered large-scale Data
Warehouse and BI projects as Developer,
Information Analyst, Solution Architect,
Project Lead at Oracle D/A/CH
> Presales #2 at Snowflake in Central
Europe with focus on Modern Data
Management & Analytics
> Worked with clients on Big Data & IoT
solutions as Architect and Solutions
Engineer at Oracle EMEA, Pentaho and
Hitachi Vantara
© 2020 Snowflake Computing Inc. All Rights Reserved
AGENDA
> Snowflake Cloud Data Platform – for Data Engineering
> Solution Study: Let‘s Build Something!
> Session Takeaway
© 2020 Snowflake Inc. All Rights Reserved.
SNOWFLAKE FOR
DATA ENGINEERING
© 2020 Snowflake Computing Inc. All Rights Reserved
© 2020 Snowflake Inc. All Rights Reserved
SNOWFLAKE CLOUD DATA PLATFORM
5
OLTP DATABASES
ENTERPRISE
APPLICATIONS
THIRD-PARTY
WEB/LOG DATA
IoT
DATA MONETIZATION
OPERATIONAL
REPORTING
AD HOC ANALYSIS
REAL-TIME ANALYTICS
DATA
SOURCES
DATA
CONSUMERS
Thema heute
© 2020 Snowflake Inc. All Rights Reserved
Rethink
transformation
with robust
and integrated
data pipelines
Simplify and
accelerate your
data lake with
one platform for
all your data
Develop apps
with fast and
scalable analytics
that delight
customers
Deliver
analytics at
scale with
a modern
data warehouse
Empower your
ecosystem
to securely
collaborate
across all data
Simplify and
accelerate
machine learning
and artificial
intelligence
ONE PLATFORM, ONE COPY OF DATA,
MANY WORKLOADS
6
© 2020 Snowflake Computing Inc. All Rights Reserved
OVERCOMING DATA SILOS WITH SNOWFLAKE
Data Sources Data Consumers
Structured Data
Semi-Structured Data
Web APIs
IoT Data
Data Visualization /
Reporting
Data Science
Ad hoc Queries
Data Zones
Enterprise data in one place (as much as possible), organized (e.g. in logical Data Zones) and accessible for all users
Work Area (Exploratory, AI / ML)
Persistent, user/team space, one or more Databases
Landing Zone
Transient, ELT processes, truncate/reload
Raw
Raw data, schema-
less (JSON…): no
transformations,
matches source data
Conformed
Raw +
de-duplicated, data
type standardization
(dates)
Reference
Master data, ,
manual mappings,
Business hierarchies
Modeled
Integrated, cleansed,
modeled data (3NF,
Data Vault,
Dimensional Model)
“Data Lake" “Data Warehouse”
© 2020 Snowflake Computing Inc. All Rights Reserved
ELASTIC SERVICE, SUPPORT FOR MULTIPLE WORKLOADS
8
Continuous
Loading (4TB/day)
S3
<5min SLA
Compute Cluster
“Medium”
Batch Data Loads
& Transformations
Compute Cluster
"Large”
Compute
Cluster
"2X-Large”
Customer
Analytics &
Segmentation
Interactive
Dashboard
50% < 1s
85% < 2s
95% < 5s
Compute Cluster
Auto Scale –
”X-Large” x 5
Prod DB
Snowflake Shared Data, Multi-Cluster Architecture: All data available in a central repository,
major workloads isolated, performance on demand, and easy data access for everybody via SQL
Benefit:
Deliver Reporting
SLA’s
Benefit:
Add teams as needed,
support agile development &
a data driven culture
Benefit:
Always fresh data
Benefit:
Complete more tasks
within same time frame
Structured & Semi-structured Data at Petabyte-Scale
(all encrypted, compressed)
© 2020 Snowflake Inc. All Rights Reserved
SUPPORTING CAPABILITIES FOR DATA ENGINEERING
Thema heute
© 2020 Snowflake Inc. All Rights Reserved.
SOLUTION STUDY:
LET‘S BUILD SOMETHING!
© 2020 Snowflake Computing Inc. All Rights Reserved
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
SOLUTION SCENARIOSCENARIO: INGESTING FUEL PRICE DATA FOR ANALYIS
Source: tankerkoenig.de
© 2020 Snowflake Inc. All Rights Reserved
SOLUTION
ARCHITECTURE
© 2020 Snowflake Inc. All Rights Reserved
Thema heute
© 2020 Snowflake Inc. All Rights Reserved 13
Key Steps
>Integrate with AWS S3 and connect
Snowflake via External Stage
>Create a Pipe for Automatic Data Ingestion
> Test Snowpipe with new data
SCENARIO - Part #1
DATA INGESTION WITH
SNOWPIPE
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
INTEGRATE AWS S3 WITH VIA EXTERNAL STAGE
What is… a Storage Integration and (External) Stage?
> Storage Integration: is a Snowflake object that stores a generated identity and access management (IAM)
entity for external cloud storage, along with an optional set of allowed or blocked storage locations (Amazon
S3, Google Cloud Storage, or Microsoft Azure)
> (External) Stage: a Snowflake object which encapsulates all of the required information for staging files: S3
bucket where the files are staged; the named storage integration object or S3 credentials for the bucket (if it
is protected); an encryption key (if the files in the bucket have been encrypted)
v
v
SF Admin Task, typically
not done by developers!
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
IDENTIFY DATA TO BE LOADED FROM EXTERNAL STAGE
v
List content of a S3 bucket directly
from Snowflake, navigate subfolder
structure.
Identify, inspect and select files to be
loaded using “ * ” and RegExp etc.
Compute statistics
on files to be loaded
into Snowflake
© 2020 Snowflake Inc. All Rights Reserved
AUTOMATIC DATA INGESTION WITH SNOWPIPE
v
v
Bulk load
command
v
Target table to be updated
What is… Snowpipe?
> Snowpipe enables loading data from files as soon as
they’re available in a stage. Data can be loaded from files
in micro-batches, making it available to users within
minutes, rather than manually executing COPY statements
on a schedule to load larger batches.
> Alternative: Clients can call public Snowpipe REST
endpoints to load data and retrieve load history reports
Source location,
external stage
(e.g. S3 Bucket)
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
UPLOAD NEW DATA TO S3 & CHECK STATUS OF SNOWPIPE
v
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
VALIDATE RESULT IN SNOWSIGHT DASHBOARD
© 2020 Snowflake Inc. All Rights Reserved 19
Key Steps
>Integrate AWS Lambda Function
>Automate API Calls + store Payloads (JSON)
> Implement Change Data Capture
> Automate JSON flattening + Data Loading
SCENARIO - Part #2
AUTOMATED RETRIEVAL +
PROCESSING OF API DATA
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
INTEGRATE AWS LAMBDA WITH VIA EXTERNAL FUNCTION
What is… an API Integration and External Function?
> API Integration (Preview Feature): object stores information about an HTTPS proxy service, including information
about: The cloud platform provider (e.g. Amazon AWS); type of proxy service (in case the cloud platform provider
offers more than one type of proxy service); identifier and access credentials
> External Function (Preview Feature): Snowflake does not call a remote service directly. Instead, Snowflake calls
the remote service through a cloud provider’s native HTTPS proxy service, for example API Gateway on AWS
SF Admin Task, typically
not done by developers
v
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
AUTOMATE API REQUESTS WITH TASK #1
V Automation Task with dedicated
compute (TASK_WAREHOUSE),
schedule and no dependencies
V
What is… a Task?
> User-defined tasks allow scheduled execution of SQL statements. Tasks run according
to a specified execution configuration, using any combination of a set interval and/or a
flexible schedule using a subset of familiar cron utility syntax.
> There is no event source that can trigger a task; instead, a task runs on a schedule,
which can be defined when creating a task (using CREATE TASK) or later
(using ALTER TASK)
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
API PAYLOAD RETRIEVED
V
Fuel price data of multiple
gas stations
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
API PAYLOAD RETRIEVED
V
Fuel price data of multiple
gas stations
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
ACTIVATE CHANGE DATA CAPTURE WITH STREAMS
V
V
Source table where
data record changes
should be tracked
V
SQL query on a table stream
to view which records have
been added, changed, deleted
V
What is… a Stream?
> An individual table stream
tracks the changes made
to rows in a source table. A
table stream makes a
“change table” available of
what changed, at the row
level, between two
transactional points of time
in a table.
> a stream itself does not
contain any table data, it
only stores the offset for
the source table and
returns CDC records by
leveraging the versioning
history.
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
AUTOMATE DELTA LOAD WITH STREAMS AND TASK #2
V
Task will only start if table stream
has new data records to process
à saves compute resources!
Only CDC data
records of interest will
be processed and then
cleared from stream
when committed
V
Lateral view and flatten table function
used to split price data by Gas Station
and store as separate records in the
target table REMOTE_FUEL_PRICES
V
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
STREAM CLEARED & PRICE DATA READY FOR ANALYSIS!
New fuel prices prepared
and stored in target table
REMOTE_FUEL_PRICES
(still in JSON format)
V
V
Query of table stream returns no
rows because the stream was
cleared after successful INSERT
into target table (Auto committed)
© 2020 Snowflake Inc. All Rights Reserved 27
Key Steps
>Consolidate Data for Analysis
>Query + visualize data for a given Gas Station
in Germany
>Analyze Snowflake Consumption
SCENARIO - Part #3
DATA CONSOLIDATION +
VISUALIZATION
© 2020 Snowflake Inc. All Rights Reserved
COMBINING
HISTORIC DATA
WITH API DATA
© 2020 Snowflake Inc. All Rights Reserved
V
Reading, formatting and
joining JSON price data
directly with master data
V
Putting all together:
Historic data from
dimensional model
combined with
current price data
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
ANALYSIS & VISUALIZATION FOR A GIVEN GAS STATION
© 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved
PAY AS YOU USE + BUILT-IN COST TRANSPARENCY
Snowflake Default Billing & Usage Dashboard Snowpipe Usage History queried via SQL
© 2020 Snowflake Inc. All Rights Reserved.
SESSION TAKEAWAY
© 2020 Snowflake Computing Inc. All Rights Reserved
© 2020 Snowflake Inc. All Rights Reserved
A COMPLETE AND EASY-TO-USE DATA PLATFORM
Structured Data
Semi-Structured Data
Web APIs
IoT Data
Visualization /
Reporting
Data Science
Ad hoc Queries
Data Sources Stage
Presentation /
Consumers
JSON, AVRO
(VARIANT)
Hive Metastore
Integration
External Tables
Parquet
Load/Unload
ANSI SQL
Data Lake Warehouse Aggregation
Semantic /
Federated
Elastic Multi-
Cluster Compute
Data Vault,
3NF Modeling
ACID
Transactional
Consistency
Secure Views /
Data Masking
Materialized
Views
Zero Copy
Cloning
SSO
LDAP
OAUTH
SCIM
ODBC/JDBC
Python/R/Spark
Connector
End-to-End Security (RBAC, Encryption at Rest/in Motion)
Web UI
External
Functions
Data Sharing /
Marketplace
Streams (CDC) &
Tasks (Scheduler)
Time Travel
Kafka-Connector /
Snowpipe
Stored Procs /
UDFs
Geospatial
Snowflake supports Data Lake, Data Warehouse, and Data Engineering workloads
Dimensional
Modeling
32
Information
Schema
© 2020 Snowflake Inc. All Rights Reserved
SNOWFLAKE FOR DATA ENGINEERING
ALL DATA,
ANY SPEED
BETTER PRICE &
PERFORMANCE
NO SUPER POWERS
REQUIRED
Structured & Semi-Structured Data
Batch & Continuous Data Ingestion
Partner Ecosystems
Dedicated Resources
Auto Scaling
SQL-based
Single Platform with Near-Zero
Maintenance
Streams & Tasks
© 2020 Snowflake Inc. All Rights Reserved
THANK YOU

Más contenido relacionado

La actualidad más candente

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 

La actualidad más candente (20)

Snowflake Datawarehouse Architecturing
Snowflake Datawarehouse ArchitecturingSnowflake Datawarehouse Architecturing
Snowflake Datawarehouse Architecturing
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
 
Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - Snowflake
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
 
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the CloudHow to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
 
Snowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data WarehousingSnowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data Warehousing
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Altis: AWS Snowflake Practice
Altis: AWS Snowflake PracticeAltis: AWS Snowflake Practice
Altis: AWS Snowflake Practice
 
Snowflake Overview
Snowflake OverviewSnowflake Overview
Snowflake Overview
 
Snowflake Data Loading.pptx
Snowflake Data Loading.pptxSnowflake Data Loading.pptx
Snowflake Data Loading.pptx
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
 
Snowflake Data Governance
Snowflake Data GovernanceSnowflake Data Governance
Snowflake Data Governance
 
Let’s get to know Snowflake
Let’s get to know SnowflakeLet’s get to know Snowflake
Let’s get to know Snowflake
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
BDA311 Introduction to AWS Glue
BDA311 Introduction to AWS GlueBDA311 Introduction to AWS Glue
BDA311 Introduction to AWS Glue
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 

Similar a Snowflake for Data Engineering

Similar a Snowflake for Data Engineering (20)

Actionable Insights with AI - Snowflake for Data Science
Actionable Insights with AI - Snowflake for Data ScienceActionable Insights with AI - Snowflake for Data Science
Actionable Insights with AI - Snowflake for Data Science
 
IBM THINK 2018 - IBM Cloud SQL Query Introduction
IBM THINK 2018 - IBM Cloud SQL Query IntroductionIBM THINK 2018 - IBM Cloud SQL Query Introduction
IBM THINK 2018 - IBM Cloud SQL Query Introduction
 
Snowflake’s Cloud Data Platform and Modern Analytics
Snowflake’s Cloud Data Platform and Modern AnalyticsSnowflake’s Cloud Data Platform and Modern Analytics
Snowflake’s Cloud Data Platform and Modern Analytics
 
IBM THINK 2020 - Cloud Data Lake with IBM Cloud Data Services
IBM THINK 2020 - Cloud Data Lake with IBM Cloud Data Services IBM THINK 2020 - Cloud Data Lake with IBM Cloud Data Services
IBM THINK 2020 - Cloud Data Lake with IBM Cloud Data Services
 
Delivering Data Democratization in the Cloud with Snowflake
Delivering Data Democratization in the Cloud with SnowflakeDelivering Data Democratization in the Cloud with Snowflake
Delivering Data Democratization in the Cloud with Snowflake
 
Continuous Data Replication into Cloud Storage with Oracle GoldenGate
Continuous Data Replication into Cloud Storage with Oracle GoldenGateContinuous Data Replication into Cloud Storage with Oracle GoldenGate
Continuous Data Replication into Cloud Storage with Oracle GoldenGate
 
Peteris Arajs - Where is my data
Peteris Arajs - Where is my dataPeteris Arajs - Where is my data
Peteris Arajs - Where is my data
 
Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)
 
Loading Data into Redshift: Data Analytics Week at the SF Loft
Loading Data into Redshift: Data Analytics Week at the SF LoftLoading Data into Redshift: Data Analytics Week at the SF Loft
Loading Data into Redshift: Data Analytics Week at the SF Loft
 
ME_Snowflake_Introduction_for new students.pptx
ME_Snowflake_Introduction_for new students.pptxME_Snowflake_Introduction_for new students.pptx
ME_Snowflake_Introduction_for new students.pptx
 
From Data Warehouse to Lakehouse
From Data Warehouse to LakehouseFrom Data Warehouse to Lakehouse
From Data Warehouse to Lakehouse
 
Loading Data into Amazon Redshift
Loading Data into Amazon RedshiftLoading Data into Amazon Redshift
Loading Data into Amazon Redshift
 
Delivering rapid-fire Analytics with Snowflake and Tableau
Delivering rapid-fire Analytics with Snowflake and TableauDelivering rapid-fire Analytics with Snowflake and Tableau
Delivering rapid-fire Analytics with Snowflake and Tableau
 
Azure Data Factory for Azure Data Week
Azure Data Factory for Azure Data WeekAzure Data Factory for Azure Data Week
Azure Data Factory for Azure Data Week
 
Loading Data into Redshift: Data Analytics Week SF
Loading Data into Redshift: Data Analytics Week SFLoading Data into Redshift: Data Analytics Week SF
Loading Data into Redshift: Data Analytics Week SF
 
Cloud-based Data Lake for Analytics and AI
Cloud-based Data Lake for Analytics and AICloud-based Data Lake for Analytics and AI
Cloud-based Data Lake for Analytics and AI
 
Building Serverless ETL Pipelines
Building Serverless ETL PipelinesBuilding Serverless ETL Pipelines
Building Serverless ETL Pipelines
 
Loading Data into Redshift
Loading Data into RedshiftLoading Data into Redshift
Loading Data into Redshift
 
Loading Data into Redshift
Loading Data into RedshiftLoading Data into Redshift
Loading Data into Redshift
 
MongoDB .local Houston 2019: Wide Ranging Analytical Solutions on MongoDB
MongoDB .local Houston 2019: Wide Ranging Analytical Solutions on MongoDBMongoDB .local Houston 2019: Wide Ranging Analytical Solutions on MongoDB
MongoDB .local Houston 2019: Wide Ranging Analytical Solutions on MongoDB
 

Más de Harald Erb

Más de Harald Erb (11)

Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020
 
Does it only have to be ML + AI?
Does it only have to be ML + AI?Does it only have to be ML + AI?
Does it only have to be ML + AI?
 
Machine Learning - Eine Challenge für Architekten
Machine Learning - Eine Challenge für ArchitektenMachine Learning - Eine Challenge für Architekten
Machine Learning - Eine Challenge für Architekten
 
DOAG Big Data Days 2017 - Cloud Journey
DOAG Big Data Days 2017 - Cloud JourneyDOAG Big Data Days 2017 - Cloud Journey
DOAG Big Data Days 2017 - Cloud Journey
 
Do you know what k-Means? Cluster-Analysen
Do you know what k-Means? Cluster-Analysen Do you know what k-Means? Cluster-Analysen
Do you know what k-Means? Cluster-Analysen
 
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
 
Big Data Discovery + Analytics = Datengetriebene Innovation!
Big Data Discovery + Analytics = Datengetriebene Innovation!Big Data Discovery + Analytics = Datengetriebene Innovation!
Big Data Discovery + Analytics = Datengetriebene Innovation!
 
Big Data Discovery
Big Data DiscoveryBig Data Discovery
Big Data Discovery
 
DOAG News 2012 - Analytische Mehrwerte mit Big Data
DOAG News 2012 - Analytische Mehrwerte mit Big DataDOAG News 2012 - Analytische Mehrwerte mit Big Data
DOAG News 2012 - Analytische Mehrwerte mit Big Data
 
Oracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleOracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by Example
 
Endeca Web Acquisition Toolkit - Integration verteilter Web-Anwendungen und a...
Endeca Web Acquisition Toolkit - Integration verteilter Web-Anwendungen und a...Endeca Web Acquisition Toolkit - Integration verteilter Web-Anwendungen und a...
Endeca Web Acquisition Toolkit - Integration verteilter Web-Anwendungen und a...
 

Último

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
amitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
karishmasinghjnh
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
gajnagarg
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
gajnagarg
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 

Último (20)

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls Palakkad Escorts ☎️9352988975 Two shot with one girl...
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 

Snowflake for Data Engineering

  • 1. © 2020 Snowflake Inc. All Rights Reserved FROM DATA TO INSIGHTS: DATA ENGINEERING MIT SNOWFLAKE ScaleUp 360° Smart Data 29. Sept. 2020 Harald Erb | harald.erb@snowflake.com Sr. Solutions Engineer, Central Europe
  • 2. © 2020 Snowflake Computing Inc. All Rights Reserved ABOUT ME Sr. Solutions Engineer Central Europe harald.erb@snowflake.com Llinkedin.com/in/haralderb Enthusiastic about Business Analytics & Data Management for 20+ years > Consulting: Delivered large-scale Data Warehouse and BI projects as Developer, Information Analyst, Solution Architect, Project Lead at Oracle D/A/CH > Presales #2 at Snowflake in Central Europe with focus on Modern Data Management & Analytics > Worked with clients on Big Data & IoT solutions as Architect and Solutions Engineer at Oracle EMEA, Pentaho and Hitachi Vantara
  • 3. © 2020 Snowflake Computing Inc. All Rights Reserved AGENDA > Snowflake Cloud Data Platform – for Data Engineering > Solution Study: Let‘s Build Something! > Session Takeaway
  • 4. © 2020 Snowflake Inc. All Rights Reserved. SNOWFLAKE FOR DATA ENGINEERING © 2020 Snowflake Computing Inc. All Rights Reserved
  • 5. © 2020 Snowflake Inc. All Rights Reserved SNOWFLAKE CLOUD DATA PLATFORM 5 OLTP DATABASES ENTERPRISE APPLICATIONS THIRD-PARTY WEB/LOG DATA IoT DATA MONETIZATION OPERATIONAL REPORTING AD HOC ANALYSIS REAL-TIME ANALYTICS DATA SOURCES DATA CONSUMERS Thema heute
  • 6. © 2020 Snowflake Inc. All Rights Reserved Rethink transformation with robust and integrated data pipelines Simplify and accelerate your data lake with one platform for all your data Develop apps with fast and scalable analytics that delight customers Deliver analytics at scale with a modern data warehouse Empower your ecosystem to securely collaborate across all data Simplify and accelerate machine learning and artificial intelligence ONE PLATFORM, ONE COPY OF DATA, MANY WORKLOADS 6
  • 7. © 2020 Snowflake Computing Inc. All Rights Reserved OVERCOMING DATA SILOS WITH SNOWFLAKE Data Sources Data Consumers Structured Data Semi-Structured Data Web APIs IoT Data Data Visualization / Reporting Data Science Ad hoc Queries Data Zones Enterprise data in one place (as much as possible), organized (e.g. in logical Data Zones) and accessible for all users Work Area (Exploratory, AI / ML) Persistent, user/team space, one or more Databases Landing Zone Transient, ELT processes, truncate/reload Raw Raw data, schema- less (JSON…): no transformations, matches source data Conformed Raw + de-duplicated, data type standardization (dates) Reference Master data, , manual mappings, Business hierarchies Modeled Integrated, cleansed, modeled data (3NF, Data Vault, Dimensional Model) “Data Lake" “Data Warehouse”
  • 8. © 2020 Snowflake Computing Inc. All Rights Reserved ELASTIC SERVICE, SUPPORT FOR MULTIPLE WORKLOADS 8 Continuous Loading (4TB/day) S3 <5min SLA Compute Cluster “Medium” Batch Data Loads & Transformations Compute Cluster "Large” Compute Cluster "2X-Large” Customer Analytics & Segmentation Interactive Dashboard 50% < 1s 85% < 2s 95% < 5s Compute Cluster Auto Scale – ”X-Large” x 5 Prod DB Snowflake Shared Data, Multi-Cluster Architecture: All data available in a central repository, major workloads isolated, performance on demand, and easy data access for everybody via SQL Benefit: Deliver Reporting SLA’s Benefit: Add teams as needed, support agile development & a data driven culture Benefit: Always fresh data Benefit: Complete more tasks within same time frame Structured & Semi-structured Data at Petabyte-Scale (all encrypted, compressed)
  • 9. © 2020 Snowflake Inc. All Rights Reserved SUPPORTING CAPABILITIES FOR DATA ENGINEERING Thema heute
  • 10. © 2020 Snowflake Inc. All Rights Reserved. SOLUTION STUDY: LET‘S BUILD SOMETHING! © 2020 Snowflake Computing Inc. All Rights Reserved
  • 11. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved SOLUTION SCENARIOSCENARIO: INGESTING FUEL PRICE DATA FOR ANALYIS Source: tankerkoenig.de
  • 12. © 2020 Snowflake Inc. All Rights Reserved SOLUTION ARCHITECTURE © 2020 Snowflake Inc. All Rights Reserved Thema heute
  • 13. © 2020 Snowflake Inc. All Rights Reserved 13 Key Steps >Integrate with AWS S3 and connect Snowflake via External Stage >Create a Pipe for Automatic Data Ingestion > Test Snowpipe with new data SCENARIO - Part #1 DATA INGESTION WITH SNOWPIPE
  • 14. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved INTEGRATE AWS S3 WITH VIA EXTERNAL STAGE What is… a Storage Integration and (External) Stage? > Storage Integration: is a Snowflake object that stores a generated identity and access management (IAM) entity for external cloud storage, along with an optional set of allowed or blocked storage locations (Amazon S3, Google Cloud Storage, or Microsoft Azure) > (External) Stage: a Snowflake object which encapsulates all of the required information for staging files: S3 bucket where the files are staged; the named storage integration object or S3 credentials for the bucket (if it is protected); an encryption key (if the files in the bucket have been encrypted) v v SF Admin Task, typically not done by developers!
  • 15. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved IDENTIFY DATA TO BE LOADED FROM EXTERNAL STAGE v List content of a S3 bucket directly from Snowflake, navigate subfolder structure. Identify, inspect and select files to be loaded using “ * ” and RegExp etc. Compute statistics on files to be loaded into Snowflake
  • 16. © 2020 Snowflake Inc. All Rights Reserved AUTOMATIC DATA INGESTION WITH SNOWPIPE v v Bulk load command v Target table to be updated What is… Snowpipe? > Snowpipe enables loading data from files as soon as they’re available in a stage. Data can be loaded from files in micro-batches, making it available to users within minutes, rather than manually executing COPY statements on a schedule to load larger batches. > Alternative: Clients can call public Snowpipe REST endpoints to load data and retrieve load history reports Source location, external stage (e.g. S3 Bucket)
  • 17. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved UPLOAD NEW DATA TO S3 & CHECK STATUS OF SNOWPIPE v
  • 18. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved VALIDATE RESULT IN SNOWSIGHT DASHBOARD
  • 19. © 2020 Snowflake Inc. All Rights Reserved 19 Key Steps >Integrate AWS Lambda Function >Automate API Calls + store Payloads (JSON) > Implement Change Data Capture > Automate JSON flattening + Data Loading SCENARIO - Part #2 AUTOMATED RETRIEVAL + PROCESSING OF API DATA
  • 20. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved INTEGRATE AWS LAMBDA WITH VIA EXTERNAL FUNCTION What is… an API Integration and External Function? > API Integration (Preview Feature): object stores information about an HTTPS proxy service, including information about: The cloud platform provider (e.g. Amazon AWS); type of proxy service (in case the cloud platform provider offers more than one type of proxy service); identifier and access credentials > External Function (Preview Feature): Snowflake does not call a remote service directly. Instead, Snowflake calls the remote service through a cloud provider’s native HTTPS proxy service, for example API Gateway on AWS SF Admin Task, typically not done by developers v
  • 21. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved AUTOMATE API REQUESTS WITH TASK #1 V Automation Task with dedicated compute (TASK_WAREHOUSE), schedule and no dependencies V What is… a Task? > User-defined tasks allow scheduled execution of SQL statements. Tasks run according to a specified execution configuration, using any combination of a set interval and/or a flexible schedule using a subset of familiar cron utility syntax. > There is no event source that can trigger a task; instead, a task runs on a schedule, which can be defined when creating a task (using CREATE TASK) or later (using ALTER TASK)
  • 22. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved API PAYLOAD RETRIEVED V Fuel price data of multiple gas stations
  • 23. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved API PAYLOAD RETRIEVED V Fuel price data of multiple gas stations
  • 24. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved ACTIVATE CHANGE DATA CAPTURE WITH STREAMS V V Source table where data record changes should be tracked V SQL query on a table stream to view which records have been added, changed, deleted V What is… a Stream? > An individual table stream tracks the changes made to rows in a source table. A table stream makes a “change table” available of what changed, at the row level, between two transactional points of time in a table. > a stream itself does not contain any table data, it only stores the offset for the source table and returns CDC records by leveraging the versioning history.
  • 25. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved AUTOMATE DELTA LOAD WITH STREAMS AND TASK #2 V Task will only start if table stream has new data records to process à saves compute resources! Only CDC data records of interest will be processed and then cleared from stream when committed V Lateral view and flatten table function used to split price data by Gas Station and store as separate records in the target table REMOTE_FUEL_PRICES V
  • 26. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved STREAM CLEARED & PRICE DATA READY FOR ANALYSIS! New fuel prices prepared and stored in target table REMOTE_FUEL_PRICES (still in JSON format) V V Query of table stream returns no rows because the stream was cleared after successful INSERT into target table (Auto committed)
  • 27. © 2020 Snowflake Inc. All Rights Reserved 27 Key Steps >Consolidate Data for Analysis >Query + visualize data for a given Gas Station in Germany >Analyze Snowflake Consumption SCENARIO - Part #3 DATA CONSOLIDATION + VISUALIZATION
  • 28. © 2020 Snowflake Inc. All Rights Reserved COMBINING HISTORIC DATA WITH API DATA © 2020 Snowflake Inc. All Rights Reserved V Reading, formatting and joining JSON price data directly with master data V Putting all together: Historic data from dimensional model combined with current price data
  • 29. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved ANALYSIS & VISUALIZATION FOR A GIVEN GAS STATION
  • 30. © 2020 Snowflake Inc. All Rights Reserved© 2020 Snowflake Inc. All Rights Reserved PAY AS YOU USE + BUILT-IN COST TRANSPARENCY Snowflake Default Billing & Usage Dashboard Snowpipe Usage History queried via SQL
  • 31. © 2020 Snowflake Inc. All Rights Reserved. SESSION TAKEAWAY © 2020 Snowflake Computing Inc. All Rights Reserved
  • 32. © 2020 Snowflake Inc. All Rights Reserved A COMPLETE AND EASY-TO-USE DATA PLATFORM Structured Data Semi-Structured Data Web APIs IoT Data Visualization / Reporting Data Science Ad hoc Queries Data Sources Stage Presentation / Consumers JSON, AVRO (VARIANT) Hive Metastore Integration External Tables Parquet Load/Unload ANSI SQL Data Lake Warehouse Aggregation Semantic / Federated Elastic Multi- Cluster Compute Data Vault, 3NF Modeling ACID Transactional Consistency Secure Views / Data Masking Materialized Views Zero Copy Cloning SSO LDAP OAUTH SCIM ODBC/JDBC Python/R/Spark Connector End-to-End Security (RBAC, Encryption at Rest/in Motion) Web UI External Functions Data Sharing / Marketplace Streams (CDC) & Tasks (Scheduler) Time Travel Kafka-Connector / Snowpipe Stored Procs / UDFs Geospatial Snowflake supports Data Lake, Data Warehouse, and Data Engineering workloads Dimensional Modeling 32 Information Schema
  • 33. © 2020 Snowflake Inc. All Rights Reserved SNOWFLAKE FOR DATA ENGINEERING ALL DATA, ANY SPEED BETTER PRICE & PERFORMANCE NO SUPER POWERS REQUIRED Structured & Semi-Structured Data Batch & Continuous Data Ingestion Partner Ecosystems Dedicated Resources Auto Scaling SQL-based Single Platform with Near-Zero Maintenance Streams & Tasks
  • 34. © 2020 Snowflake Inc. All Rights Reserved THANK YOU