Data mining - Process, Techniques and Research Topics
1. P R O C E S S A N D T E C H N I Q U E S
Data Mining
2. What is Data Mining?
Data Mining is the process of transforming
unprocessed data to useful one by use certain
methodologies and tactics. Data Mining involves
discovering and identifying patterns in large data
sets which is used by large companies to
anticipate the future trends.
3. Process of Data Mining
The Procsss of Data Mining is based on the following
phases:
Problem Definition
Data understanding and exploration
Data Preparation
Modeling
Evaluation
Deployment
4. Data Mining Techniques
Following techniques are employed for the process of data
mining:
Association
Classification
Clustering
Decision Trees
Prediction
Sequential Analysis
5. Association - In this technique, a pattern is
identified based on the relationship between items of
similar proceedings.
Classification - This technique of data mining is
based on machine learning using the concepts of
decision trees, linear programming, neural networks,
and statistics.
Clustering - Clustering is the process of making a
cluster of abstract objects having similar
characteristics.
6. Decision Trees - It is a graphical technique of data
mining in which root of the tree is a condition and its
branches are its solutions.
Prediction - This data mining technique identifies
the relationship between independent and
dependent variables and is mainly used in predicting
the future for a sale.
Sequential Analysis - Sequential analysis is a
technique that discovers and identifies similar
patterns, events, and trends in transactional data
over a certain period of time.
7. Application Areas of Data Mining
In Medical Science
In Banking/Finance
In Marketing and Sales
In Science and Engineering
8. Thesis and Research Areas in Data Mining
Web Mining
Predictive Analysis
Oracle Data Mining
Clustering
Text Mining
Fraud Detection
Data Mining as a Service
Graph Mining
9. Web Mining
Web Mining is an application of Data Mining and an
important topic for research and thesis. It is a
technique to discover patterns from WWW i.e World
Wide Web. The information for web mining is
collected through browser activities, page content
and server logins. It is a very good area for master
thesis data mining. There are three types of Web
Mining:
Web Usage Mining
Web Content Mining
Web Structure Mining
10. Text Mining
It is an important field of Data Mining. It refers to
the process of extracting valuable information from
text and is also referred to as text analytics. This
high-quality information is extracted through
patterns and methods like statistical pattern
learning. It is another good area for the Ph.D. thesis
on Data Mining. In Text Mining, input data is
structured and patterns are derived from this
structured data. There are various research areas and
thesis topics in the field of text mining.