1. Pentaho
● What is it ?
● Pentaho Server
● Pentaho Client
● Plugins
● Big Data / Hadoop
● Architecture
● Screen Shots
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
2. Pentaho – What is it ?
● It is a business intelligence system
● It offers
– Analytics
– Visual data integration
– OLAP
– Reports
– Dashboards
– Data mining
– ETL
● Written in Java
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
3. Pentaho – What is it ?
● Offered as
– Free community edition
– Purchased enterprise edition
● Available for
– Windows
– Linux
– Mac OSX
● Community supported
● Open source plugins available
● Uses the Apache Java Application Server
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
4. Pentaho – Server
Pentaho server components
● BI / BA Platform - the core platform, hosts contents and offers services
● Analysis (Mondrian) – an OLAP server, query via MDX, XML
– In memory data aggregation
● Dashboard Designer (PDD) – supports monitoring / decision making
● Analysis (PAZ) – drag and drop OLAP viewer, create MDX queries
● Reporting (PIR) – plugin for adhoc reports
● Data Access – create data sources / define data models
● Mobile – extends Pentaho to the mobile world
– Dashboards and reports on a small screen
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
5. Pentaho – Client
Pentaho client components
● Data Integration (PDI) – Kettle ETL engine, single node / cloud
● Big Data – ETL for Hadoop / NoSQL databases
● Report Designer – codeless report creation
● Data Mining – uses Weka for pattern searching and trend prediction
● Meta Data (PME) – create business models of data sources
● Aggregate Designer (PAD) – pre calculation for aggregation
● Schema Workbench (PSW) – OLAP cube analysis
● Design Studio (PDS) – automation / business logic task support
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
6. Pentaho – Plugins
Pentaho plugins
● Ctools – free / open source tools by webdetails
● Charting (CCC) – create dashboard charts
● Build Framework (CBF) – release management for Pentaho apps
● Data Access (CDA) – common layer for data access
● Data Browser (CDB) – visual OLAP data browser
● Distributed Cache (CDC) – high performance, scalable shared cache
● Data Generator (CDG) – for dashboard creation
– Creates tables, data and Mondrian schema
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
7. Pentaho – Plugins
Pentaho plugins
● Data Validator (CDV) – data integrity via validation tests
● Graphics Gen (CGG) – server side rendering of charts as images
● Dashboard Editor (CDE) – sophisticated community dashboard editor
● Dashboard Framework (CDF)
– extendable, AJAX based dashboard framework
● Startup Tabs (CST) – define start up tabs by user
● Saiku – analysis suite / open source
● Saiku Reporting – drag & drop report design tool
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
8. Pentaho – Big Data
What are Hadoop's characteristics ?
– Mainly command line driven
– Minimal built in processing / filtering / validation options
– No drag & drop ETL like job creation
What can Pentaho bring to Hadoop ?
– Visual drag & drop job creation
– Visual reports
– Ad / Hoc querying
– Dashboards
– And more …
It would be a good extension to Hadoop's core functionality
www.semtech-solutions.co.nz info@semtech-solutions.co.nz
13. Contact Us
● Feel free to contact us at
– www.semtech-solutions.co.nz
– info@semtech-solutions.co.nz
● We offer IT project consultancy
● We are happy to hear about your problems
● You can just pay for those hours that you need
● To solve your problems