A Seminar on
Machine Learning Using Big Data
Seminar Guide: Prof. A.K. Hase
Presenter: Mr. Vaibhav Kurkute
15-04-2017
Contents
1. History & Traditional Databases
2. Introduction
3. Data Mining
4. What Is Machine Learning?
5. Types of Learning
6. Supervised Learning Algorithms
7. Unsupervised Learning Algorithms
8. Case Studies
9. Future Scope & Tools
10. Conclusion
History
• Old sources of data: telephone (text or voice)
• Invention of the computer and its business uses
• Old data storage
• 21st-century evolution
• Traditional databases and their drawbacks
• Structured data
• Use of MySQL databases
• Machine-generated data
• Unstructured data
• Use of MongoDB, i.e. a NoSQL database
• Hadoop Distributed File System (HDFS), HBase, Hive
Introduction to Big Data
• Generated quickly, often in unstructured form
• Continuously processed and analyzed
• Large amounts of data, like a million rows in an Excel sheet
• Different types of data, mostly unstructured
• The goal is to get knowledge out of this data
• Examples:
1. Google processes 20 petabytes of data every day
2. Facebook receives thousands of status updates every hour
How Big Is Big Data?
• Web: Google's index is estimated at 45 billion pages
• Transaction data: 5-50 TB/day
• Satellite image feeds: ~1 TB/day/satellite
• Sensor networks/arrays
– CERN Large Hadron Collider: ~100 petabytes/day
• Biological data: 1-10 TB/day/sequencer
• TV: 2 TB/day/channel; YouTube: 4 TB/day uploaded
• Digitized telephony: ~100 petabytes/day
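To put these daily volumes in perspective, the sketch below converts a few of the figures above into sustained per-second rates. It assumes decimal units (1 TB = 10^12 bytes, 1 PB = 10^15 bytes); the helper name `per_second` is made up for illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400

TB = 10**12  # decimal terabyte in bytes (assumption)
PB = 10**15  # decimal petabyte in bytes (assumption)


def per_second(bytes_per_day):
    """Average bytes per second needed to sustain a daily volume."""
    return bytes_per_day / SECONDS_PER_DAY


# Daily volumes taken from the figures quoted above.
daily_volumes = {
    "Satellite feed (1 TB/day)": 1 * TB,
    "Transaction data (50 TB/day)": 50 * TB,
    "Digitized telephony (100 PB/day)": 100 * PB,
}

for name, volume in daily_volumes.items():
    rate_mb = per_second(volume) / 10**6  # megabytes per second
    print(f"{name}: {rate_mb:,.1f} MB/s")
```

Even the smallest figure here works out to a continuous stream of several megabytes every second, which is why such data is processed continuously rather than in occasional batches.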
Data Mining
• Collected data is of no use unless we can get useful information out of it
• The goal is to mine insights from the data and make it useful
• Turns previously unknown patterns in the data into knowledge
• What can it be used for?
1. Predicting future trends
2. Allowing businesses to make proactive, knowledge-driven decisions
• Examples:
1. From your travel history on Yatra.com, one can infer your hometown
2. Snyder & Vini Facebook status
Machine Learning
• The machine learns on its own
• No need to tell the machine explicitly what to do
• No need to hand-code the rules for each task
• We provide what we call the training data set
• Algorithms are used to learn patterns from this data
• Knowledge is created from data
• Example: if we give sample inputs and outputs like
2 -> f(x) -> 4, 3 -> f(x) -> 9 and 4 -> f(x) -> 16,
then 5 -> f(x) -> ?
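The input/output example above can be sketched in code. This is a toy illustration that assumes a tiny, hand-picked set of candidate functions (the `candidates` dictionary is hypothetical); "learning" here simply means keeping the candidate that best reproduces the training examples. Real learning algorithms search far larger spaces of functions.

```python
# Training examples from the slide: (input, output) pairs.
training_data = [(2, 4), (3, 9), (4, 16)]

# A small, hand-picked set of candidate functions (assumption for illustration).
candidates = {
    "x + 2": lambda x: x + 2,
    "2 * x": lambda x: 2 * x,
    "x ** 2": lambda x: x ** 2,
    "x ** 3": lambda x: x ** 3,
}


def squared_error(f, data):
    """Sum of squared differences between predictions and known outputs."""
    return sum((f(x) - y) ** 2 for x, y in data)


# Keep the candidate with the smallest error on the training data.
best_name = min(
    candidates,
    key=lambda name: squared_error(candidates[name], training_data),
)
learned_f = candidates[best_name]

print(best_name, learned_f(5))  # prints: x ** 2 25
```

Note that the first example alone (2 -> 4) fits three of the four candidates; only the additional examples single out squaring, which is why more training data generally helps.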
Machine Learning
• Here are a few examples:
1. Google’s self-driving cars
2. Blocking of suspicious credit card transactions and spam e-mails
3. Recommendation engines on e-commerce sites
4. Facebook friend suggestions
“People worry that computers will get too smart and take over the world, but the real problem is that they're too stupid and they've already taken over the world” (Pedro Domingos)
Types of Learning
Type: Supervised Learning
• The training data comes with the correct answers, i.e. labeled examples for the computer
• The training data is used to fit (train) the algorithm
• The trained model is then applied to new data that has no correct answer attached
• Used for predictive tasks
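The train-then-apply workflow described above can be sketched with a k-nearest-neighbors classifier, one of the simplest supervised methods. This is a minimal sketch under stated assumptions: the feature values and the labels `"small"` and `"large"` are made up, and `predict` is a hypothetical helper name.

```python
from collections import Counter
import math

# Labeled training data: (feature vector, correct answer). Values are made up.
training_data = [
    ((150, 50), "small"),
    ((155, 55), "small"),
    ((160, 58), "small"),
    ((180, 85), "large"),
    ((185, 90), "large"),
    ((190, 95), "large"),
]


def predict(point, data, k=3):
    """Label a new point by majority vote among its k nearest labeled examples."""
    by_distance = sorted(data, key=lambda item: math.dist(point, item[0]))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Apply the model to data that has no correct answer attached.
print(predict((158, 54), training_data))  # small
print(predict((183, 88), training_data))  # large
```

Here "training" amounts to storing the labeled examples; other supervised algorithms compress them into parameters instead.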
Type: Unsupervised Learning
• No labeled examples for the computer, i.e. the data carries no correct answers
• We give only the data to the algorithm
• We still choose which algorithm to use
• Used for exploratory analysis
• We provide only the inputs, not the outputs
• Example: telling apart the faces of horses, cats and humans by grouping similar images (clustering of data)
Unsupervised Algorithms
• Clustering:
– Splitting records into groups that are not defined in advance
– Records in the same group have similar properties
– E.g. k-means clustering
• Association:
– Seeing what often appears together with what
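As an illustration of clustering, here is a minimal sketch of k-means in plain Python. The points and the starting centroids are made-up values, and the starting centroids are fixed by hand only to keep the run deterministic; practical implementations usually pick them randomly and restart several times.

```python
import math


def k_means(points, centroids, iterations=10):
    """Plain k-means: alternate between assigning points and moving centroids."""
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(
                range(len(centroids)),
                key=lambda i: math.dist(p, centroids[i]),
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = []
        for cluster, old in zip(clusters, centroids):
            if cluster:
                mean = tuple(sum(axis) / len(cluster) for axis in zip(*cluster))
                new_centroids.append(mean)
            else:
                new_centroids.append(old)  # keep the centroid of an empty cluster
        if new_centroids == centroids:
            break  # assignments no longer change
        centroids = new_centroids
    return centroids, clusters


# Unlabeled 2-D points (made-up values) forming two visible groups.
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = k_means(points, centroids=[(0, 0), (10, 10)])
print(centroids)
print(clusters)
```

No labels are supplied anywhere: the two groups emerge only from the distances between the points.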
Supervised Algorithms
• Classification:
– Assigning records to predefined groups
– E.g. recognizing handwritten digits, or classifying e-mails as spam or not spam
• Regression (predictive analysis):
– Predicting a numeric output value using training data
• Common algorithms:
– Naïve Bayes classifier
– Decision trees
– Nearest neighbors (kNN)
– Neural networks
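The spam example can be sketched with a naïve Bayes classifier. This is a minimal word-count version with add-one (Laplace) smoothing, assuming a tiny made-up training set; the function names `train` and `classify` are hypothetical, and real spam filters use far more data and features.

```python
import math
from collections import Counter

# Tiny labeled training set (made-up messages).
training_data = [
    ("win money now", "spam"),
    ("free prize win", "spam"),
    ("claim free money", "spam"),
    ("meeting at noon", "ham"),
    ("lunch at noon tomorrow", "ham"),
    ("project meeting notes", "ham"),
]


def train(data):
    """Count words per class and messages per class."""
    word_counts = {}          # label -> Counter of words
    label_counts = Counter()  # label -> number of messages
    for text, label in data:
        label_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(text.split())
    vocabulary = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocabulary


def classify(text, model):
    """Return the label with the highest log-posterior under naive Bayes."""
    word_counts, label_counts, vocabulary = model
    total_messages = sum(label_counts.values())
    scores = {}
    for label in label_counts:
        score = math.log(label_counts[label] / total_messages)  # log P(label)
        total_words = sum(word_counts[label].values())
        for word in text.split():
            # log P(word | label) with add-one (Laplace) smoothing
            count = word_counts[label][word]
            score += math.log((count + 1) / (total_words + len(vocabulary)))
        scores[label] = score
    return max(scores, key=scores.get)


model = train(training_data)
print(classify("free money", model))        # spam
print(classify("meeting tomorrow", model))  # ham
```

The "naïve" part is the assumption that words occur independently given the class, which lets the per-word probabilities simply be multiplied (added, in log space).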
Supervised Algorithms: Example
• E.g. data used by a motor vehicle company to decide where to sell its vehicles
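Regression, listed above among the supervised tasks, predicts a numeric output from training data. The sketch below fits a straight line by ordinary least squares. The data (number of showrooms in a region versus vehicles sold per month) is entirely made up to echo the motor vehicle example, and `fit_line` is a hypothetical helper name.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up training data: showrooms in a region vs. vehicles sold per month.
showrooms = [1, 2, 3, 4, 5]
vehicles_sold = [12, 19, 31, 42, 48]

slope, intercept = fit_line(showrooms, vehicles_sold)
predicted = slope * 6 + intercept  # prediction for an input not in the data
print(f"slope={slope:.2f}, intercept={intercept:.2f}, prediction={predicted:.1f}")
```

Unlike classification, the output is a number on a continuous scale rather than one of a fixed set of groups.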
Apriori Algorithm
• A type of unsupervised learning
• Predictions are derived from past transaction data
• Association rule mining: rules take the form of if-then conditions
• Case study 1: How does Amazon predict which products will be sold together?
Case Study (Amazon)
• It is a type of market basket analysis
• Information of this type is expressed in the form of “if–then” statements
• Rules are computed from the data
• All possible rules for the items are examined in if–then format
• Only those rules most likely to be indicators of true dependence are selected
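One common way to judge whether an if-then rule indicates a true dependence is to measure its support and confidence. The sketch below assumes a handful of made-up shopping baskets; it illustrates the idea only and does not describe Amazon's actual system.

```python
# Made-up shopping baskets; each transaction is a set of items.
transactions = [
    {"phone", "case", "charger"},
    {"phone", "case"},
    {"phone", "charger"},
    {"case", "screen guard"},
    {"phone", "case", "screen guard"},
]


def count(itemset):
    """Number of transactions that contain every item in `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def support(itemset):
    """Fraction of all transactions that contain the item set."""
    return count(itemset) / len(transactions)


def confidence(if_items, then_items):
    """Of the transactions with the 'if' part, the fraction that also have the 'then' part."""
    return count(if_items | then_items) / count(if_items)


# Rule: if a basket contains a phone, then it also contains a case.
print(support({"phone", "case"}))       # 0.6
print(confidence({"phone"}, {"case"}))  # 0.75
```

A rule is usually kept only when both numbers clear chosen thresholds: support filters out combinations that are too rare to matter, and confidence filters out rules that fail too often.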
Case Study (Amazon)
• Generate frequent item sets
• First with two items, then with three items
• Based on support, i.e. how many transactions in the database include the item set
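The level-wise generation of frequent item sets can be sketched as follows. This is a minimal, unoptimized Apriori-style search, not a description of Amazon's system; the grocery baskets and the `min_count` threshold are assumptions chosen for illustration.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_count):
    """Level-wise search for item sets found in at least `min_count` transactions."""

    def count(itemset):
        return sum(1 for t in transactions if itemset <= t)

    # Level 1: frequent single items.
    items = sorted({item for t in transactions for item in t})
    current = [frozenset([i]) for i in items if count(frozenset([i])) >= min_count]
    result = {}
    size = 1
    while current:
        for itemset in current:
            result[itemset] = count(itemset)
        size += 1
        # Candidates: unions of frequent sets that have exactly `size` items.
        candidates = {a | b for a in current for b in current if len(a | b) == size}
        # Apriori pruning: every (size - 1)-subset must itself be frequent.
        frequent_prev = set(current)
        candidates = [
            c for c in candidates
            if all(frozenset(s) in frequent_prev for s in combinations(c, size - 1))
        ]
        current = [c for c in candidates if count(c) >= min_count]
    return result


transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"butter", "milk"},
    {"bread", "butter", "milk"},
]
result = frequent_itemsets(transactions, min_count=3)
for itemset, n in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), n)
```

With these baskets every single item and every pair is frequent, while the three-item set appears in only two transactions and is dropped, so the search stops at that level.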
Tools
1. R
2. Python (SciPy, scikit-learn)
3. MATLAB (to generate graphical output)
4. SPSS
5. SAS
Real-Life Applications
• Some real-life applications of machine learning:
– Recommender systems: suggesting similar people on Facebook/LinkedIn, or similar movies, books etc. on Amazon
– Business applications: customer segmentation, customer retention, targeted marketing etc.
– Medical applications: disease diagnosis
– Banking: credit card issuance, fraud detection etc.
– Language: translation, text-to-speech and speech-to-text
Future Scope
• Companies such as Google, FB, Microsoft and BoA already use ML; those that do not are at a disadvantage.
• With the growing use of IoT (households, businesses, industries etc.), there is a need to analyze data continuously and draw conclusions from it using machine learning.
• With connected devices, we now have access to much more data and, along with it, an increased need to manage and understand what we know.
• In the future, users will receive more precise recommendations, and ads will become both more effective and less annoying.
Conclusion
• Machine learning can efficiently support fraud and error detection systems.
• Association rules are often the most accurate approach for suggesting products in market basket analysis.
• ML can play a useful role in the different phases of software engineering, such as planning, analysis, design and testing.
• It is especially useful for analyzing data generated by the sensors used in IoT.
“Machine Learning is like magic where you can get an answer to any question”
Thank You
Any Questions?
