SlideShare una empresa de Scribd logo
1 de 21
Amarjit Kaur
Samiksha Sharma
 What is data mining ?
 Why data mining ?
 Data mining as a necessity
 Evolution of database
 Origin of data mining
 Data mining : A KDD process
 Applications
 Management areas
 Examples
 techniques
 Extration of implicit, previously unknown and
potentially useful information from data
 Exploration & analysis by automatic and semi
automatic means of large quantities of data
in order to discover meaningful patterns
 Lots of data is being collected and
warehoused
 Web data , e commerce
 Purchases at departmental store/ groceries
store
 Bank/credit card transactions
 Computers have become cheaper and more
powerful
 Competitve pressure is strong
 Provide better,customized services for an
edge (e.g in customer relationship
management)
DATA MINING –
AS A
NECESSITY
DATA EXPLOSION PROBLEM
 Automated data collection tools and mature
database technology lead to tremendous
amounts of data stored in databases, data
warehouses and other information
repositories
 We are drowning in data, but starving for
knowledge!
 Solution : data mining
 Extraction of interesting knowledge(rules,
regularities,patterns,constraints) from data
in large databases
 1960s:
Data collection, database creation, IMS and
network DBMS.
 1970s:
Relational data model, relational DBMS
implementation
 1980s:
RDBMS, advanced data models( extended
relational, OO, deductive sets) and application-
oriented DBMS(spatial,scientific,engineering etc)
 1990s-2000s:
Data mining and data warehousing, multimedia
databases, and web databases
 The term “data mining” was introduced in the
1990s. Data mining roots are traced back along
three family lines:
Classical
statistics
Artificial
intelligence
Machine
learning
 Statistical are the foundations of most technologies on
which data mining is built, e.g. regression analysis,
standard deviation etc. All these are used to study data
and data relationships.
 Artificial intelligence which is built upon heuristics as
opposed to statistics, attempts to apply human-
thoughts like processing to statistical problems. E.g.
RDBMS.
 Machine learning is to union of statistics and AI.
 DATA MINING therefore uses AI and statistical approach
together. It blends AI heuristics with advanced
statistical analysis to study data and find previously –
hidden trends or patterns within company using
statistical fundamental concepts and adding more
advanced AI algorithms to achieve the goal.
 Database analysis and decision support
 Market analysis and management
 Target marketing, customer relation
management, market basket analysis, cross
selling, market segmentation
 Risk analysis and management
 Forecasting, customer retention,improved
underwriting, quality control,competitive
analysis
 Fraud detection and management
 Other applications
 Text mining(news group,email,documents) and
web analysis
 Intelligent query answering
 Sports
 IBM Advanced Scout analyzed NBA game statistics to
gain competitive advantage for New York Knicks and
Miami Heat
 Astronomy
 JPL and the Palomar Observatory discovered 22
quasars with the help of data-mining.
 Internet Web Surf-Aid
 IBM Surf-Aid applies data mining algorithms to web
access logs for market-related pages to discover
customer preference and behaviour pages, analyzing
effectiveness of web marketing, improving web site
organizations etc
 Cross-market analysis
 Associations/Co-relations between product sales
 Prediction based on the associations information
 Customer profiling
 Data mining can tell you what types of customers
buy what products.
 Identifying customer requirements
 Identifying the best products for different
customers.
 Use prediction to find what factors will attract new
customers.
 Provides summary information
 Various multidimensional summary reports
 Statistical summary information
 Finance planning and asset evaluation
i. Cash flow analysis and prediction
ii. Contingent claim analysis to evaluate assets
iii. Cross-sectional and time series analysis
 Resources planning
i. Summarize and compare the resources and
spending
 Competition
i. Monitor competition and market directions
ii. Set price strategy in the market
iii. Grouping of customer into classes
 Applications
 Widely used in health care, retail, credit card
services, telecommunications etc.
 Approach
 Use historical data to build models of fraudulent
behaviour and use data mining to help identify
similar instances.
 Examples
 Auto insurance : detect a group of people who
stage accidents to collect on insurance.
 Money laundering : detect suspicious money
transactions
Techniques
CLASSIFICATION
ASSOCIATION
SEQUENCE
CLUSTER
Information Technology Data Mining

Más contenido relacionado

La actualidad más candente

Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
Saif Ullah
 
Chapter 1. Introduction
Chapter 1. IntroductionChapter 1. Introduction
Chapter 1. Introduction
butest
 

La actualidad más candente (20)

Medhya Rasayana Chathuskaya
Medhya Rasayana ChathuskayaMedhya Rasayana Chathuskaya
Medhya Rasayana Chathuskaya
 
Data mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniquesData mining (lecture 1 & 2) conecpts and techniques
Data mining (lecture 1 & 2) conecpts and techniques
 
Alopecia and ayurveda
Alopecia and ayurvedaAlopecia and ayurveda
Alopecia and ayurveda
 
Data mining on Social Media
Data mining on Social MediaData mining on Social Media
Data mining on Social Media
 
Research ayurveda
Research  ayurvedaResearch  ayurveda
Research ayurveda
 
Multimedia Information Retrieval
Multimedia Information RetrievalMultimedia Information Retrieval
Multimedia Information Retrieval
 
Ethics and consent for data sharing
Ethics and consent for data sharingEthics and consent for data sharing
Ethics and consent for data sharing
 
Chapter 1. Introduction
Chapter 1. IntroductionChapter 1. Introduction
Chapter 1. Introduction
 
Data mining presentation.ppt
Data mining presentation.pptData mining presentation.ppt
Data mining presentation.ppt
 
Data mining introduction
Data mining introductionData mining introduction
Data mining introduction
 
introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
 
Text MIning
Text MIningText MIning
Text MIning
 
Nasya karma
Nasya karmaNasya karma
Nasya karma
 
2.1 Data Mining-classification Basic concepts
2.1 Data Mining-classification Basic concepts2.1 Data Mining-classification Basic concepts
2.1 Data Mining-classification Basic concepts
 
Chapter 12 outlier
Chapter 12 outlierChapter 12 outlier
Chapter 12 outlier
 
Data mining
Data miningData mining
Data mining
 
Decision tree
Decision treeDecision tree
Decision tree
 
Karsya & Mamsasosha
Karsya & MamsasoshaKarsya & Mamsasosha
Karsya & Mamsasosha
 
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive AnalyticsDI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
 
Case Sheet in Ayurveda
Case Sheet in AyurvedaCase Sheet in Ayurveda
Case Sheet in Ayurveda
 

Destacado

Communication Research Methods
Communication Research MethodsCommunication Research Methods
Communication Research Methods
Jenny Donley
 
Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data mining
Slideshare
 
Mr. Charles Kangethe cv 2016
Mr. Charles Kangethe cv 2016Mr. Charles Kangethe cv 2016
Mr. Charles Kangethe cv 2016
charles kangethe
 
PfizerProposal_Final-1
PfizerProposal_Final-1PfizerProposal_Final-1
PfizerProposal_Final-1
Robert Vasquez
 
Flick Film FestivalInternship
Flick Film FestivalInternshipFlick Film FestivalInternship
Flick Film FestivalInternship
Kayla Kehler
 
BVPresentation
BVPresentationBVPresentation
BVPresentation
Ben Arditi
 

Destacado (15)

SAS M2006 Presentation
SAS M2006 PresentationSAS M2006 Presentation
SAS M2006 Presentation
 
SAS M2007 Presentation
SAS M2007 PresentationSAS M2007 Presentation
SAS M2007 Presentation
 
Macquarie University Workshop on Text Mining and Health
Macquarie University Workshop on Text Mining and HealthMacquarie University Workshop on Text Mining and Health
Macquarie University Workshop on Text Mining and Health
 
The Do's and Don'ts of Data Mining
The Do's and Don'ts of Data MiningThe Do's and Don'ts of Data Mining
The Do's and Don'ts of Data Mining
 
Using Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime PatternUsing Data Mining Techniques to Analyze Crime Pattern
Using Data Mining Techniques to Analyze Crime Pattern
 
Communication Research Methods
Communication Research MethodsCommunication Research Methods
Communication Research Methods
 
Statistical software
Statistical softwareStatistical software
Statistical software
 
Major issues in data mining
Major issues in data miningMajor issues in data mining
Major issues in data mining
 
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
4 Essential Lessons for Adopting Predictive Analytics in Healthcare4 Essential Lessons for Adopting Predictive Analytics in Healthcare
4 Essential Lessons for Adopting Predictive Analytics in Healthcare
 
Mr. Charles Kangethe cv 2016
Mr. Charles Kangethe cv 2016Mr. Charles Kangethe cv 2016
Mr. Charles Kangethe cv 2016
 
PfizerProposal_Final-1
PfizerProposal_Final-1PfizerProposal_Final-1
PfizerProposal_Final-1
 
klinik medika edukasi PJK
klinik medika edukasi PJKklinik medika edukasi PJK
klinik medika edukasi PJK
 
Flick Film FestivalInternship
Flick Film FestivalInternshipFlick Film FestivalInternship
Flick Film FestivalInternship
 
BVPresentation
BVPresentationBVPresentation
BVPresentation
 
Top 8 Arquitectura europea
Top 8 Arquitectura europeaTop 8 Arquitectura europea
Top 8 Arquitectura europea
 

Similar a Information Technology Data Mining

Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
Rohit Kumar
 
Data mining by_ashok
Data mining by_ashokData mining by_ashok
Data mining by_ashok
Ashok Kumar
 

Similar a Information Technology Data Mining (20)

Data mining and its applications!
Data mining and its applications!Data mining and its applications!
Data mining and its applications!
 
Data mining final year project in jalandhar
Data mining final year project in jalandharData mining final year project in jalandhar
Data mining final year project in jalandhar
 
Data mining final year project in ludhiana
Data mining final year project in ludhianaData mining final year project in ludhiana
Data mining final year project in ludhiana
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
 
Data Mining
Data MiningData Mining
Data Mining
 
Data mining 1 - Introduction (cheat sheet - printable)
Data mining 1 - Introduction (cheat sheet - printable)Data mining 1 - Introduction (cheat sheet - printable)
Data mining 1 - Introduction (cheat sheet - printable)
 
6months industrial training in data mining,ludhiana
6months industrial training in data mining,ludhiana6months industrial training in data mining,ludhiana
6months industrial training in data mining,ludhiana
 
6months industrial training in data mining, jalandhar
6months industrial training in data mining, jalandhar6months industrial training in data mining, jalandhar
6months industrial training in data mining, jalandhar
 
6 weeks summer training in data mining,ludhiana
6 weeks summer training in data mining,ludhiana6 weeks summer training in data mining,ludhiana
6 weeks summer training in data mining,ludhiana
 
6 weeks summer training in data mining,jalandhar
6 weeks summer training in data mining,jalandhar6 weeks summer training in data mining,jalandhar
6 weeks summer training in data mining,jalandhar
 
Data mining
Data miningData mining
Data mining
 
Data mining-basic
Data mining-basicData mining-basic
Data mining-basic
 
Data mining 1
Data mining 1Data mining 1
Data mining 1
 
Data Mining and Business Analytics by Seyed Ziae Mousavi Mojab
Data Mining and Business Analytics by Seyed Ziae Mousavi MojabData Mining and Business Analytics by Seyed Ziae Mousavi Mojab
Data Mining and Business Analytics by Seyed Ziae Mousavi Mojab
 
Data mining
Data miningData mining
Data mining
 
Data mining by_ashok
Data mining by_ashokData mining by_ashok
Data mining by_ashok
 
Introduction.ppt
Introduction.pptIntroduction.ppt
Introduction.ppt
 
1intro
1intro1intro
1intro
 
Data Mining vs. Machine Learning Unveiling Major Differences
Data Mining vs. Machine Learning Unveiling Major DifferencesData Mining vs. Machine Learning Unveiling Major Differences
Data Mining vs. Machine Learning Unveiling Major Differences
 
Introduction
IntroductionIntroduction
Introduction
 

Information Technology Data Mining

  • 2.  What is data mining ?  Why data mining ?  Data mining as a necessity  Evolution of database  Origin of data mining  Data mining : A KDD process  Applications  Management areas  Examples  techniques
  • 3.  Extration of implicit, previously unknown and potentially useful information from data  Exploration & analysis by automatic and semi automatic means of large quantities of data in order to discover meaningful patterns
  • 4.  Lots of data is being collected and warehoused  Web data , e commerce  Purchases at departmental store/ groceries store  Bank/credit card transactions  Computers have become cheaper and more powerful  Competitve pressure is strong  Provide better,customized services for an edge (e.g in customer relationship management)
  • 5. DATA MINING – AS A NECESSITY
  • 6. DATA EXPLOSION PROBLEM  Automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories  We are drowning in data, but starving for knowledge!  Solution : data mining  Extraction of interesting knowledge(rules, regularities,patterns,constraints) from data in large databases
  • 7.  1960s: Data collection, database creation, IMS and network DBMS.  1970s: Relational data model, relational DBMS implementation  1980s: RDBMS, advanced data models( extended relational, OO, deductive sets) and application- oriented DBMS(spatial,scientific,engineering etc)  1990s-2000s: Data mining and data warehousing, multimedia databases, and web databases
  • 8.  The term “data mining” was introduced in the 1990s. Data mining roots are traced back along three family lines: Classical statistics Artificial intelligence Machine learning
  • 9.  Statistical are the foundations of most technologies on which data mining is built, e.g. regression analysis, standard deviation etc. All these are used to study data and data relationships.  Artificial intelligence which is built upon heuristics as opposed to statistics, attempts to apply human- thoughts like processing to statistical problems. E.g. RDBMS.  Machine learning is to union of statistics and AI.  DATA MINING therefore uses AI and statistical approach together. It blends AI heuristics with advanced statistical analysis to study data and find previously – hidden trends or patterns within company using statistical fundamental concepts and adding more advanced AI algorithms to achieve the goal.
  • 10.
  • 11.  Database analysis and decision support  Market analysis and management  Target marketing, customer relation management, market basket analysis, cross selling, market segmentation  Risk analysis and management  Forecasting, customer retention,improved underwriting, quality control,competitive analysis  Fraud detection and management  Other applications  Text mining(news group,email,documents) and web analysis  Intelligent query answering
  • 12.  Sports  IBM Advanced Scout analyzed NBA game statistics to gain competitive advantage for New York Knicks and Miami Heat  Astronomy  JPL and the Palomar Observatory discovered 22 quasars with the help of data-mining.  Internet Web Surf-Aid  IBM Surf-Aid applies data mining algorithms to web access logs for market-related pages to discover customer preference and behaviour pages, analyzing effectiveness of web marketing, improving web site organizations etc
  • 13.  Cross-market analysis  Associations/Co-relations between product sales  Prediction based on the associations information  Customer profiling  Data mining can tell you what types of customers buy what products.  Identifying customer requirements  Identifying the best products for different customers.  Use prediction to find what factors will attract new customers.  Provides summary information  Various multidimensional summary reports  Statistical summary information
  • 14.  Finance planning and asset evaluation i. Cash flow analysis and prediction ii. Contingent claim analysis to evaluate assets iii. Cross-sectional and time series analysis  Resources planning i. Summarize and compare the resources and spending  Competition i. Monitor competition and market directions ii. Set price strategy in the market iii. Grouping of customer into classes
  • 15.  Applications  Widely used in health care, retail, credit card services, telecommunications etc.  Approach  Use historical data to build models of fraudulent behaviour and use data mining to help identify similar instances.  Examples  Auto insurance : detect a group of people who stage accidents to collect on insurance.  Money laundering : detect suspicious money transactions
  • 16.
  • 17.
  • 18.
  • 19.