SlideShare una empresa de Scribd logo
1 de 13
Introduction to
January 23, 2017
Ali Arabi – Bernie Najlis
Agenda
• What is Knime
• Where to get it and online resources
• What can I do with Knime
• How does it compare with similar tools
• Knime Lingo
• Knime Workbench
• How to build a Workflow
• Samples and Questions
What is Knime?
• KNIME stands for Konstanz Information Miner
• It is an Open Source Data Analytics, Reporting and Integration platform
• Use a GUI to assembly ‘nodes’ for data preprocessing (ETL), modelling
and data analysis and visualization
• Modules for:
• Data Mining
• Data Analysis
• Data Manipulation
• More modules and extensions can be added!
• Written in Java and based on Eclipse
Where to get it and other online resources
• http://knime.org/downloads/overview
• Skip the registration form, go straight to step (2) and download the version with all free
extensions (~2Gb)
• Community Forum and Online Self Training
• Books
KNIME Essentials
By: Gábor Bakos
Publisher: Packt Publishing
Pub. Date: October 16, 2013
Print ISBN-13: 978-1-84969-921-1
Web ISBN-13: 978-1-84969-922-8
Pages in Print Edition: 148
• Videos
Introduction to Data Analytics
with KNIME
By: Rosaria Silipo
Publisher: Infinite Skills
Publication Date: 20-SEP-2016
Insert Date: 26-SEP-2016
What can I do with Knime?
• Data Access
• File
• Database I/O
• Transformation
• Filtering, Grouping, Joining
• Analyze and Data mining
• Weka
• R
• Python
• Mathlab
• Visualization
• Different types of charts
• Deployment
• Text mining
How does Knime compare with others?
• Gartner’s Magic Quadrant for
Advance Analytics Platforms
• Leaders quadrant in 2016 with SAS, IBM
and Dell
• Strong Performer / Contender in
Forrester’s Wave
Knime Lingo
• Store your work in a workspace
• Workspace can contain workflow groups built using the workflow editor
• Workflows can contain nodes, meta nodes, connections, workflow
variables, workflow credentials and annotations
• Each node has a type, which identifies the algorithm associated with it
• Nodes have parameters, inports and
outports, and can have any of these states:
• Misconfigured
• Configured
• Queued for Execution
• Running
• Executed
Knime Workbench
• Workflow Projects
• Favorite Nodes
• Node Repository
• Workflow Editor
• Outline
• Node Description
• Console
How to Build a Knime Workflow
• Search in Node Repository
• Dragging nodes into Workflow Editor
• Connecting Nodes
• Configuring Nodes
• Executing (per node or one-shot)
=> Configure => => Execute =>
Simple Model Training for Classification
Performing k-means Clustering
Example for Data Preprocessing
Example of R Snippet

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

What's New in KNIME Analytics Platform 4.0 and KNIME Server 4.9
What's New in KNIME Analytics Platform 4.0 and KNIME Server 4.9What's New in KNIME Analytics Platform 4.0 and KNIME Server 4.9
What's New in KNIME Analytics Platform 4.0 and KNIME Server 4.9
 
Your Roadmap for An Enterprise Graph Strategy
Your Roadmap for An Enterprise Graph StrategyYour Roadmap for An Enterprise Graph Strategy
Your Roadmap for An Enterprise Graph Strategy
 
Productionizing Machine Learning with a Microservices Architecture
Productionizing Machine Learning with a Microservices ArchitectureProductionizing Machine Learning with a Microservices Architecture
Productionizing Machine Learning with a Microservices Architecture
 
Introduction to Data Engineer and Data Pipeline at Credit OK
Introduction to Data Engineer and Data Pipeline at Credit OKIntroduction to Data Engineer and Data Pipeline at Credit OK
Introduction to Data Engineer and Data Pipeline at Credit OK
 
End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
End to end Machine Learning using Kubeflow - Build, Train, Deploy and ManageEnd to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage
 
What is Machine Learning
What is Machine LearningWhat is Machine Learning
What is Machine Learning
 
Machine Learning Model Deployment: Strategy to Implementation
Machine Learning Model Deployment: Strategy to ImplementationMachine Learning Model Deployment: Strategy to Implementation
Machine Learning Model Deployment: Strategy to Implementation
 
Ml ops intro session
Ml ops   intro sessionMl ops   intro session
Ml ops intro session
 
Machine Learning using Kubeflow and Kubernetes
Machine Learning using Kubeflow and KubernetesMachine Learning using Kubeflow and Kubernetes
Machine Learning using Kubeflow and Kubernetes
 
Introduction to MLflow
Introduction to MLflowIntroduction to MLflow
Introduction to MLflow
 
MLOps.pptx
MLOps.pptxMLOps.pptx
MLOps.pptx
 
Google Data Studio for business
Google Data Studio for businessGoogle Data Studio for business
Google Data Studio for business
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
 
MLOps with Azure DevOps
MLOps with Azure DevOpsMLOps with Azure DevOps
MLOps with Azure DevOps
 
Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)
 
Deploy and Serve Model from Azure Databricks onto Azure Machine Learning
Deploy and Serve Model from Azure Databricks onto Azure Machine LearningDeploy and Serve Model from Azure Databricks onto Azure Machine Learning
Deploy and Serve Model from Azure Databricks onto Azure Machine Learning
 
Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
 
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
 
MLOps with Kubeflow
MLOps with Kubeflow MLOps with Kubeflow
MLOps with Kubeflow
 

Destacado

KNIME - Create Workflow with KNIME
KNIME - Create Workflow with KNIMEKNIME - Create Workflow with KNIME
KNIME - Create Workflow with KNIME
Billy Wong
 

Destacado (20)

CURRICULO_LeonardoLopes _20160623
CURRICULO_LeonardoLopes _20160623CURRICULO_LeonardoLopes _20160623
CURRICULO_LeonardoLopes _20160623
 
KNIME - Create Workflow with KNIME
KNIME - Create Workflow with KNIMEKNIME - Create Workflow with KNIME
KNIME - Create Workflow with KNIME
 
Knime Evaluation Smaller
Knime Evaluation SmallerKnime Evaluation Smaller
Knime Evaluation Smaller
 
Big Data at your Desk with KNIME
Big Data at your Desk with KNIMEBig Data at your Desk with KNIME
Big Data at your Desk with KNIME
 
Big Data with KNIME is as easy as 1, 2, 3, ...4!
Big Data with KNIME is as easy as 1, 2, 3, ...4!Big Data with KNIME is as easy as 1, 2, 3, ...4!
Big Data with KNIME is as easy as 1, 2, 3, ...4!
 
Text Processing with KNIME
Text Processing with KNIMEText Processing with KNIME
Text Processing with KNIME
 
Knime (presentacion grupal)
Knime (presentacion grupal)Knime (presentacion grupal)
Knime (presentacion grupal)
 
Knime
Knime Knime
Knime
 
KNIME Meetup 2016-04-16
KNIME Meetup 2016-04-16KNIME Meetup 2016-04-16
KNIME Meetup 2016-04-16
 
Manual Básico Knime
Manual Básico KnimeManual Básico Knime
Manual Básico Knime
 
Machine Learning with Spark
Machine Learning with SparkMachine Learning with Spark
Machine Learning with Spark
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Open Analytics - Minería de Datos con Knime
Open Analytics - Minería de Datos con KnimeOpen Analytics - Minería de Datos con Knime
Open Analytics - Minería de Datos con Knime
 
The Web Analytics Swiss Army Knife
The Web Analytics Swiss Army KnifeThe Web Analytics Swiss Army Knife
The Web Analytics Swiss Army Knife
 
virtualization in cloud technology
virtualization in cloud technologyvirtualization in cloud technology
virtualization in cloud technology
 
Sentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine LearningSentiment Analysis Using Machine Learning
Sentiment Analysis Using Machine Learning
 
Just add Imagination
Just add ImaginationJust add Imagination
Just add Imagination
 
Big data, predictive modeling and analytics in online marketing
Big data, predictive modeling and analytics in online marketingBig data, predictive modeling and analytics in online marketing
Big data, predictive modeling and analytics in online marketing
 
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning AlgorithmsSentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
 
Web mining (1)
Web mining (1)Web mining (1)
Web mining (1)
 

Similar a Introduction to knime

Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...
Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...
Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...
Flink Forward
 

Similar a Introduction to knime (20)

Zero to Sixty with Oracle ApEx
Zero to Sixty with Oracle ApExZero to Sixty with Oracle ApEx
Zero to Sixty with Oracle ApEx
 
Presto Summit 2018 - 02 - LinkedIn
Presto Summit 2018  - 02 - LinkedInPresto Summit 2018  - 02 - LinkedIn
Presto Summit 2018 - 02 - LinkedIn
 
Serverless brewbox
Serverless   brewboxServerless   brewbox
Serverless brewbox
 
Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...
Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...
Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...
 
7 steps to simplifying your AI workflows
7 steps to simplifying your AI workflows7 steps to simplifying your AI workflows
7 steps to simplifying your AI workflows
 
Rootconf 2017 - State of the Open Source monitoring landscape
Rootconf 2017 - State of the Open Source monitoring landscape Rootconf 2017 - State of the Open Source monitoring landscape
Rootconf 2017 - State of the Open Source monitoring landscape
 
Adf and ala design c sharp corner toronto chapter feb 2019 meetup nik shahriar
Adf and ala design c sharp corner toronto chapter feb 2019 meetup nik shahriarAdf and ala design c sharp corner toronto chapter feb 2019 meetup nik shahriar
Adf and ala design c sharp corner toronto chapter feb 2019 meetup nik shahriar
 
.NET per la Data Science e oltre
.NET per la Data Science e oltre.NET per la Data Science e oltre
.NET per la Data Science e oltre
 
Making Data Scientists Productive in Azure
Making Data Scientists Productive in AzureMaking Data Scientists Productive in Azure
Making Data Scientists Productive in Azure
 
Big Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsBig Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI Pros
 
Windows Azure introduction
Windows Azure introductionWindows Azure introduction
Windows Azure introduction
 
AD1545 - Extending the XPages Extension Library
AD1545 - Extending the XPages Extension LibraryAD1545 - Extending the XPages Extension Library
AD1545 - Extending the XPages Extension Library
 
OpenStack Documentation in the Open
OpenStack Documentation in the OpenOpenStack Documentation in the Open
OpenStack Documentation in the Open
 
OpenStack Doc Overview for Boot Camp
OpenStack Doc Overview for Boot CampOpenStack Doc Overview for Boot Camp
OpenStack Doc Overview for Boot Camp
 
Text Mining & Sentiment Analysis with Power BI & Azure
Text Mining & Sentiment Analysis with Power BI & AzureText Mining & Sentiment Analysis with Power BI & Azure
Text Mining & Sentiment Analysis with Power BI & Azure
 
GraphDb in XPages
GraphDb in XPagesGraphDb in XPages
GraphDb in XPages
 
SPSNYC2019 - What is Common Data Model and how to use it?
SPSNYC2019 - What is Common Data Model and how to use it?SPSNYC2019 - What is Common Data Model and how to use it?
SPSNYC2019 - What is Common Data Model and how to use it?
 
W3C Digital Publishing Interest Group Update
W3C Digital Publishing Interest Group UpdateW3C Digital Publishing Interest Group Update
W3C Digital Publishing Interest Group Update
 
UI Dev in Big data world using open source
UI Dev in Big data world using open sourceUI Dev in Big data world using open source
UI Dev in Big data world using open source
 
Lantea platform
Lantea platformLantea platform
Lantea platform
 

Más de Bernardo Najlis

Toastmasters speech #7 - Research your Subject
Toastmasters speech #7  - Research your SubjectToastmasters speech #7  - Research your Subject
Toastmasters speech #7 - Research your Subject
Bernardo Najlis
 
Toastmasters project #5 - Just a jump
Toastmasters project #5  - Just a jumpToastmasters project #5  - Just a jump
Toastmasters project #5 - Just a jump
Bernardo Najlis
 

Más de Bernardo Najlis (12)

Named Entity Recognition from Online News
Named Entity Recognition from Online NewsNamed Entity Recognition from Online News
Named Entity Recognition from Online News
 
Named Entity Recognition from Online News
Named Entity Recognition from Online NewsNamed Entity Recognition from Online News
Named Entity Recognition from Online News
 
Investment Fund Analytics
Investment Fund AnalyticsInvestment Fund Analytics
Investment Fund Analytics
 
Social Media World News Impact on Stock Index Values - Investment Fund Analyt...
Social Media World News Impact on Stock Index Values - Investment Fund Analyt...Social Media World News Impact on Stock Index Values - Investment Fund Analyt...
Social Media World News Impact on Stock Index Values - Investment Fund Analyt...
 
Social Media Analytics on Canadian Airlines
Social Media Analytics on Canadian AirlinesSocial Media Analytics on Canadian Airlines
Social Media Analytics on Canadian Airlines
 
#FluxFlow
#FluxFlow#FluxFlow
#FluxFlow
 
Toastmasters speech #7 - Research your Subject
Toastmasters speech #7  - Research your SubjectToastmasters speech #7  - Research your Subject
Toastmasters speech #7 - Research your Subject
 
Toastmasters project #5 - Just a jump
Toastmasters project #5  - Just a jumpToastmasters project #5  - Just a jump
Toastmasters project #5 - Just a jump
 
What is lomography?
What is lomography?What is lomography?
What is lomography?
 
Plethora
PlethoraPlethora
Plethora
 
Business Intelligence Presentation - Data Mining (2/2)
Business Intelligence Presentation - Data Mining (2/2)Business Intelligence Presentation - Data Mining (2/2)
Business Intelligence Presentation - Data Mining (2/2)
 
Business Intelligence Presentation (1/2)
Business Intelligence Presentation (1/2)Business Intelligence Presentation (1/2)
Business Intelligence Presentation (1/2)
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Introduction to knime

  • 1. Introduction to January 23, 2017 Ali Arabi – Bernie Najlis
  • 2. Agenda • What is Knime • Where to get it and online resources • What can I do with Knime • How does it compare with similar tools • Knime Lingo • Knime Workbench • How to build a Workflow • Samples and Questions
  • 3. What is Knime? • KNIME stands for Konstanz Information Miner • It is an Open Source Data Analytics, Reporting and Integration platform • Use a GUI to assembly ‘nodes’ for data preprocessing (ETL), modelling and data analysis and visualization • Modules for: • Data Mining • Data Analysis • Data Manipulation • More modules and extensions can be added! • Written in Java and based on Eclipse
  • 4. Where to get it and other online resources • http://knime.org/downloads/overview • Skip the registration form, go straight to step (2) and download the version with all free extensions (~2Gb) • Community Forum and Online Self Training • Books KNIME Essentials By: Gábor Bakos Publisher: Packt Publishing Pub. Date: October 16, 2013 Print ISBN-13: 978-1-84969-921-1 Web ISBN-13: 978-1-84969-922-8 Pages in Print Edition: 148 • Videos Introduction to Data Analytics with KNIME By: Rosaria Silipo Publisher: Infinite Skills Publication Date: 20-SEP-2016 Insert Date: 26-SEP-2016
  • 5. What can I do with Knime? • Data Access • File • Database I/O • Transformation • Filtering, Grouping, Joining • Analyze and Data mining • Weka • R • Python • Mathlab • Visualization • Different types of charts • Deployment • Text mining
  • 6. How does Knime compare with others? • Gartner’s Magic Quadrant for Advance Analytics Platforms • Leaders quadrant in 2016 with SAS, IBM and Dell • Strong Performer / Contender in Forrester’s Wave
  • 7. Knime Lingo • Store your work in a workspace • Workspace can contain workflow groups built using the workflow editor • Workflows can contain nodes, meta nodes, connections, workflow variables, workflow credentials and annotations • Each node has a type, which identifies the algorithm associated with it • Nodes have parameters, inports and outports, and can have any of these states: • Misconfigured • Configured • Queued for Execution • Running • Executed
  • 8. Knime Workbench • Workflow Projects • Favorite Nodes • Node Repository • Workflow Editor • Outline • Node Description • Console
  • 9. How to Build a Knime Workflow • Search in Node Repository • Dragging nodes into Workflow Editor • Connecting Nodes • Configuring Nodes • Executing (per node or one-shot) => Configure => => Execute =>
  • 10. Simple Model Training for Classification
  • 12. Example for Data Preprocessing
  • 13. Example of R Snippet