Data Mining: Concepts and Approaches
Ordibehesht 16th
Professor: Dr. Hossein Siadat
By: Mahsa Rezaei
Presentation on topics of the “IT” course
IT Management - Shahid Beheshti University - Management and Accounting Department
Introduction:
Noisy Data, Data, Information, Knowledge
Knowledge Discovery from Data (KDD)
What is data mining?
Necessity of data mining:
World Wide Web
Engineering and Medical Sciences
Stock exchange Data
Banking Data
Chain Markets
Training Centers
Etc.
Example:
Evolution of Data-based Systems:
Before 1960 • Creation of Databases and Keeping Data
1970-mid 1980 • Creation of Database Management Systems
Mid 1980-now • Advanced Database Systems
Late 1980-now • Advanced Data Analysis (including Data Mining)
After that …
Applications of Data Mining:
Economy and job related cases
Commercial affairs and financial/economic analysis
Human Societies (social networks like Facebook, …)
Banking
Communication over the internet (like Skype, Google Talk, …) and without the internet (like mobile phones, …)
Engineering Sciences
Other fields of science
Knowledge Discovery Steps:
Data Cleaning
Data Integration
Data Reduction or Data Selection
Data Transformation
Data Mining
Pattern Evaluation
Knowledge Presentation
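The seven steps above can be sketched as a chain of small functions. This is only an illustrative sketch, not a method taken from the presentation: the toy purchase records, the segment lookup, the 0.5 threshold, and every function name are assumptions made up for the example.

```python
from collections import defaultdict

# Hypothetical toy sources (assumed for illustration only).
purchases = [
    {"customer": "a", "amount": 120.0},
    {"customer": "b", "amount": None},  # noisy record: missing value
    {"customer": "c", "amount": 80.0},
    {"customer": "d", "amount": 200.0},
    {"customer": "a", "amount": 40.0},
]
segments = {"a": "retail", "b": "retail", "c": "retail", "d": "corporate"}


def clean(records):
    """Data cleaning: drop records whose amount is missing."""
    return [r for r in records if r["amount"] is not None]


def integrate(records, segment_of):
    """Data integration: attach the segment from the second source."""
    return [{**r, "segment": segment_of[r["customer"]]} for r in records]


def select(records):
    """Data selection: keep only the fields the analysis needs."""
    return [(r["segment"], r["amount"]) for r in records]


def transform(rows):
    """Data transformation: min-max scale amounts to [0, 1]."""
    amounts = [a for _, a in rows]
    lo, hi = min(amounts), max(amounts)
    span = (hi - lo) or 1.0  # avoid division by zero when all amounts are equal
    return [(s, (a - lo) / span) for s, a in rows]


def mine(rows):
    """Data mining: average scaled amount per segment."""
    groups = defaultdict(list)
    for s, a in rows:
        groups[s].append(a)
    return {s: sum(v) / len(v) for s, v in groups.items()}


def evaluate(patterns, threshold=0.5):
    """Pattern evaluation: keep segments whose average exceeds the threshold."""
    return {s: v for s, v in patterns.items() if v > threshold}


def present(patterns):
    """Knowledge presentation: one readable line per surviving pattern."""
    return [f"{s}: average scaled amount {v:.2f}" for s, v in sorted(patterns.items())]


def run_kdd():
    rows = transform(select(integrate(clean(purchases), segments)))
    return present(evaluate(mine(rows)))


print(run_kdd())
```

In practice each step is far more involved, and the steps are usually iterated rather than run once in sequence; the sketch only shows how the output of one stage feeds the next.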
Data mining tools:
IBM SPSS Modeler
Oracle
Neuro Solutions
Weka (Java based)
Microsoft SQL server
Matlab, C++, Perl, Python
Many other open source and commercial software packages
Refer to Wikipedia for the complete list of tools: http://en.wikipedia.org/wiki/Data_mining
What kind of data can be used as data mining input?
Simple Data
• Database Data
• Data Warehouse Data
• Transactional Data
Complicated Data
• Voice
• Picture
Data Mining Outputs: Patterns
Descriptive Pattern
Predictive Pattern
Pattern Specification:
• Understandable for humans
• Valid for new sets of data
• Potentially useful
• Not evident
Data mining outputs:
Data mining involves six common classes of tasks:
Anomaly Detection (Outlier/Change/Deviation Detection)
Association Rule Learning (Dependency Modelling)
Clustering
Classification
Regression
Summarization
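Two of the six task classes can be illustrated with a few lines of standard-library Python. This is a minimal sketch under stated assumptions: the shopping baskets, the sensor-like values, the 2.0 z-score threshold, and the function names are all hypothetical, and a z-score cut-off is only one simple way to flag anomalies.

```python
from statistics import mean, pstdev

# Hypothetical market baskets (assumed for illustration only).
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]


def support(itemset, baskets):
    """Fraction of baskets that contain every item in itemset."""
    return sum(1 for b in baskets if itemset <= b) / len(baskets)


def confidence(antecedent, consequent, baskets):
    """Association rule learning: confidence of antecedent -> consequent."""
    return support(antecedent | consequent, baskets) / support(antecedent, baskets)


def zscore_outliers(values, threshold=2.0):
    """Anomaly detection: values more than threshold std. deviations from the mean."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return []  # all values identical, nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]


rule_conf = confidence({"bread"}, {"milk"}, transactions)
outliers = zscore_outliers([10, 11, 9, 10, 12, 10, 11, 50])
print(f"bread -> milk confidence: {rule_conf:.2f}")
print("outliers:", outliers)
```

Here the rule "bread implies milk" holds in two of the three baskets that contain bread, and only the value 50 lies more than two standard deviations from the mean.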
Difficulties of data mining:
Data Mining Approaches
Efficiency and Scalability
Variety of Data to Investigate
Interactive Data
Process mining:
Business Intelligence and Data Mining:
Conclusion:
• Data mining: Discovering interesting patterns from large amounts of data
• A KDD process includes data cleaning, data integration, data selection,
transformation, data mining, pattern evaluation, and knowledge
presentation
• Mining can be performed in a variety of information repositories
• Data mining functionalities: characterization, discrimination, association,
classification, clustering, outlier and trend analysis, etc.
• Major issues in data mining
Conferences and Journals on Data Mining:
• KDD Conferences
  • ACM SIGKDD Int. Conf. on Knowledge Discovery in Databases and Data Mining (KDD)
  • SIAM Data Mining Conf. (SDM)
  • (IEEE) Int. Conf. on Data Mining (ICDM)
  • Conf. on Principles and Practices of Knowledge Discovery and Data Mining (PKDD)
  • Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD)
• Other related conferences
  • ACM SIGMOD
  • VLDB
  • (IEEE) ICDE
  • WWW, SIGIR
  • ICML, CVPR, NIPS
• Journals
  • Data Mining and Knowledge Discovery (DAMI or DMKD)
  • IEEE Trans. on Knowledge and Data Eng. (TKDE)
  • KDD Explorations
  • ACM Trans. on KDD
Recommended Reference Books:
• S. Chakrabarti. Mining the Web: Statistical Analysis of Hypertext and Semi-Structured Data. Morgan Kaufmann, 2002
• R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification, 2nd ed. Wiley-Interscience, 2000
• T. Dasu and T. Johnson. Exploratory Data Mining and Data Cleaning. John Wiley & Sons, 2003
• U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy. Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press, 1996
• U. Fayyad, G. Grinstein, and A. Wierse. Information Visualization in Data Mining and Knowledge Discovery. Morgan Kaufmann, 2001
• J. Han and M. Kamber. Data Mining: Concepts and Techniques, 2nd ed. Morgan Kaufmann, 2006
• D. J. Hand, H. Mannila, and P. Smyth. Principles of Data Mining. MIT Press, 2001
• T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer-Verlag, 2001
• T. M. Mitchell. Machine Learning. McGraw Hill, 1997
• G. Piatetsky-Shapiro and W. J. Frawley. Knowledge Discovery in Databases. AAAI/MIT Press, 1991
• P.-N. Tan, M. Steinbach, and V. Kumar. Introduction to Data Mining. Wiley, 2005
• S. M. Weiss and N. Indurkhya. Predictive Data Mining. Morgan Kaufmann, 1998
• I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, 2nd ed. Morgan Kaufmann, 2005