Pentaho Data Integration Introduction
mattcasters
A gentle, short introduction to Pentaho Data Integration, a.k.a. Kettle
1.
2.
3.
Project manager
4.
5.
6.
650 pages
7.
Pentaho Data Integration for BI. Business Intelligence! That's what we do.
8.
Pentaho Data Integration (Kettle): Kettle Extraction, Transportation, Transformation, Loading Environment
9.
10.
11.
XML files
12.
XLS files
13.
Xbase files (dBase, FoxPro, etc.)
14.
File systems information
15.
Generated data
16.
MS Access files
17.
LDAP
18.
Geo-data
19.
...
20.
21.
22.
partitioning
23.
merging
24.
joining
25.
duplicating
26.
clustering (MPP)
27.
28.
files
29.
30.
31.
Mapping
32.
Selecting
33.
Filtering
34.
Pivoting ...
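The four row operations named on the slides above (mapping, selecting, filtering, pivoting) can be illustrated with a small plain-Python sketch. This is not Kettle code and does not use any Pentaho API; the function names, field names and sample rows are all made up for illustration, and rows are assumed to be simple dictionaries.

```python
from collections import defaultdict

def map_values(rows, field, mapping):
    """Mapping: translate the values of one field through a lookup table."""
    return [{**r, field: mapping.get(r[field], r[field])} for r in rows]

def select_fields(rows, fields):
    """Selecting: keep only the named fields, in the given order."""
    return [{f: r[f] for f in fields} for r in rows]

def filter_rows(rows, predicate):
    """Filtering: keep only the rows for which the predicate is true."""
    return [r for r in rows if predicate(r)]

def pivot(rows, key, column, value):
    """Pivoting: turn distinct values of `column` into fields, one row per `key`."""
    out = defaultdict(dict)
    for r in rows:
        out[r[key]][r[column]] = r[value]
    return [{key: k, **cols} for k, cols in out.items()]

# Hypothetical sample data.
rows = [
    {"region": "EU", "quarter": "Q1", "sales": 10, "status": "A"},
    {"region": "EU", "quarter": "Q2", "sales": 12, "status": "A"},
    {"region": "US", "quarter": "Q1", "sales": 7, "status": "X"},
]

mapped = map_values(rows, "status", {"A": "active", "X": "closed"})
active = filter_rows(mapped, lambda r: r["status"] == "active")
slim = select_fields(active, ["region", "quarter", "sales"])
pivoted = pivot(slim, "region", "quarter", "sales")
print(pivoted)
```

In Kettle these operations are configured as steps in a graphical transformation rather than written by hand; the sketch only shows what each operation does to a stream of rows.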
35.
36.
Data warehouse population
37.
Partitioned loading
38.
Bulk loading
39.
Parallel loading
40.
Clustering
41.
42.
Debugger
43.
44.
45.
46.
Plugin eco-system
47.
...
48.
49.
50.
All regions on Earth
51.
Meet on our forum: 40,000+ posts in 10,000 threads in 4 years
52.
Use our JIRA case tracking system
53.
Download more than 10,000 copies of Kettle per month: http://www.ohloh.net/projects/3624?p=Kettle http://www.softpedia.com/progClean/Kettle-Clean-80094.html
54.
55.
Export data from databases to text files or to other databases
56.
Data migration between database applications
57.
Exploration of data in existing databases (tables, views, etc.)
58.
Information improvement using lookups
59.
Data cleaning
60.
Application integration
61.
Data warehouse population
62.
Application integration
63.
Report data generation
64.
...
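As a rough illustration of two of the use cases listed above (exporting database rows to a text file, and improving information with a lookup), here is a minimal plain-Python sketch. It does not use Kettle or any Pentaho API; the `customers` table, its columns, the `export_with_lookup` helper and the country lookup are all hypothetical, and an in-memory SQLite database stands in for a real source.

```python
import csv
import io
import sqlite3

def export_with_lookup(conn, country_names):
    """Read rows from a (hypothetical) customers table, enrich each row
    with a country name from a lookup dict, and return CSV text."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["id", "name", "country"])
    query = "SELECT id, name, country_code FROM customers ORDER BY id"
    for cid, name, code in conn.execute(query):
        # Lookup step: fall back to "unknown" when the code is not found.
        writer.writerow([cid, name, country_names.get(code, "unknown")])
    return out.getvalue()

# Hypothetical source data in an in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, name TEXT, country_code TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [(1, "Ann", "BE"), (2, "Bob", "ZZ")],
)

text = export_with_lookup(conn, {"BE": "Belgium"})
print(text)
```

Writing to an in-memory string keeps the example self-contained; a real export would write to a file or to a second database connection.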
65.
66.
67.
68.
Natural fit for additional data sources, targets and transformations
69.
70.
Download free study at pentaho.com
71.
72.
73.
From terabytes to petabytes
74.
Big Data stored in Hadoop (MapReduce) / HDFS / Hive
75.
Reduces complexity for developers
76.
Leverages standard components like Pentaho Data Integration
77.
Drag & drop creation of map and reduce transformations
78.
Cooperation with Apache
79.
Presentation + Demo: http://vimeo.com/14641559
80.
81.
Forum: http://forums.pentaho.org/forumdisplay.php?f=69
82.
Case tracker: http://jira.pentaho.org/browse/PDI
83.
Continuous Integration Server: http://ci.pentaho.com/job/Kettle
84.
Wiki: http://wiki.pentaho.org/display/EAI
85.
IRC channel: ##pentaho (on Freenode)
86.
Mailing list: http://groups.google.com/group/kettle-developers
87.
My blog: http://www.ibridge.be
88.
My coordinates: mcasters at pentaho dot org
89.
Pentaho Books
90.