SlideShare una empresa de Scribd logo
1 de 4
Descargar para leer sin conexión
Code No: R05410503                                           Set No. 1
      IV B.Tech I Semester Regular Examinations, November 2008
             DATA WAREHOUSING AND DATA MINING
                   (Computer Science & Engineering)
Time: 3 hours                                         Max Marks: 80
                      Answer any FIVE Questions
                    All Questions carry equal marks
                                ⋆⋆⋆⋆⋆


  1. (a) Explain data mining as a step in the process of knowledge discovery.
     (b) Differentiate operational database systems and data warehousing.          [8+8]

  2. (a) Discuss in detail about data transformation.
     (b) Explain about concept hierarchy generation for categorical data.         [8+8]

  3. Discuss the importance of establishing a standardized data mining query language.
     What are some of the potential benefits and challenges involved in such a task?
     List and explain a few of the recent proposals in this area.                 [16]

  4. Write short notes for the following in detail:

      (a) Attribute-oriented induction.
     (b) Efficient implementation of Attribute-oriented induction.                  [8+8]

  5. (a) What is an iceberg query? Give an example.
     (b) Explain about mining distance based association rules.
      (c) How are meta rules useful? Explain with example.                      [5+6+5]

  6. (a) Why naive Bayesian classification called “naive”? Briefly outline the major
         ideas of naive Bayesian classification.
     (b) Define regression. Briefly explain about linear, non-linear and multiple regres-
         sions.                                                                  [8+8]

  7. (a) What is Cluster Analysis? What are some typical applications of clustering?
         What are some typical requirements of clustering in data mining?
     (b) Discuss about model-based clustering methods.                      [2+2+5+7]

  8. Explain the following:

      (a) Mining time-series and sequence data
     (b) Mining text databases.                                                   [8+8]


                                          ⋆⋆⋆⋆⋆




                                          1 of 1
Code No: R05410503                                             Set No. 2
      IV B.Tech I Semester Regular Examinations, November 2008
             DATA WAREHOUSING AND DATA MINING
                   (Computer Science & Engineering)
Time: 3 hours                                         Max Marks: 80
                      Answer any FIVE Questions
                    All Questions carry equal marks
                                ⋆⋆⋆⋆⋆

  1. (a) Explain data mining as a step in the process of knowledge discovery.
     (b) Differentiate operational database systems and data warehousing.          [8+8]

  2. (a) Briefly discuss about data integration.
     (b) Briefly discuss about data transformation.                                [8+8]

  3. Write the syntax for the following data mining primitives:

     (a) The kind of knowledge to be mined.
     (b) Measures of pattern interestingness.                                      [16]

  4. (a) Write the algorithm for attribute-oriented induction. Explain the steps in-
         volved in it.
     (b) How can concept description mining be performed incrementally and in a
         distributed manner?                                              [8+8]

  5. (a) Explain about iceberg queries with example.
     (b) Can we design a method that mines the complete set of frequent item sets
         without candidate generation? If yes, explain with example.        [8+8]

  6. (a) Why is tree pruning useful in decision tree induction? What is a draw back
         of using a separate set of samples to evaluate pruning?
     (b) How rough set approach and fuzzy set approaches are useful for classification?
         Explain.                                                                [8+8]

  7. (a) Give an example of how specific clustering methods may be integrated, for
         example, where one clustering algorithm is used as a preprocessing step for
         another.
     (b) Write CURE algorithm and explain.                                       [10+6]

  8. (a) Explain about multidimensional analysis and descriptive mining of complex
         data objects.
     (b) Describe similarity search in time-series analysis.
      (c) What are cases and parameters for sequential pattern mining?          [8+4+4]

                                       ⋆⋆⋆⋆⋆


                                        1 of 1
Code No: R05410503                                             Set No. 3
      IV B.Tech I Semester Regular Examinations, November 2008
             DATA WAREHOUSING AND DATA MINING
                   (Computer Science & Engineering)
Time: 3 hours                                         Max Marks: 80
                      Answer any FIVE Questions
                    All Questions carry equal marks
                                ⋆⋆⋆⋆⋆

  1. (a) Explain data mining as a step in the process of knowledge discovery.
     (b) Differentiate operational database systems and data warehousing.            [8+8]
  2. (a) Briefly discuss about data integration.
     (b) Briefly discuss about data transformation.                                  [8+8]
  3. Explain the syntax for the following data mining primitives:
     (a)   Task-relevant data
     (b)   The kind of knowledge to be mined
     (c)   Interestingness measures
     (d)   Presentation and visualization of discovered patterns.                     [16]
  4. (a) Attribute-oriented induction generates one or a set of generalized descriptions.
         How can these descriptions be visualized?
     (b) Discuss about the methods of attribute relevance analysis?                [8+8]
  5. (a) How can we mine multilevel Association rules efficiently using concept hierar-
         chies? Explain.
     (b) Can we design a method that mines the complete set of frequent item sets
         without candidate generation. If yes, explain with example.           [8+8]
  6. (a) Explain about basic decision tree induction algorithm.
     (b) Discuss about Bayesian classification.                                      [8+8]
  7. (a) Use a diagram to illustrate how, for a constant MinPts value, density-based
         clusters with respect to a higher density (i.e., a lower value for ε , the neigh-
         borhood radius) are completely contained in density- connected sets obtained
         with respect to a lower density.
     (b) Give an example of how specific clustering methods may be integrated, for
         example, where one clustering algorithm is used as a preprocessing step for
         another.                                                                    [8+8]
  8. (a)   Explain multidimensional analysis of multimedia data.
     (b)   Define Information retrieval. What are basic measures for text retrieval?
     (c)   What is keyword-based association analysis?
     (d)   Briefly discuss about mining the World Wide Web.                  [5+4+3+4]

                                        ⋆⋆⋆⋆⋆


                                         1 of 1
Code No: R05410503                                           Set No. 4
      IV B.Tech I Semester Regular Examinations, November 2008
             DATA WAREHOUSING AND DATA MINING
                   (Computer Science & Engineering)
Time: 3 hours                                         Max Marks: 80
                      Answer any FIVE Questions
                    All Questions carry equal marks
                                ⋆⋆⋆⋆⋆


  1. (a) Discuss about Concept hierarchy.
     (b) Briefly explain about - classification of database systems.              [8+8]

  2. Explain various data reduction techniques.                                  [16]

  3. The four major types of concept hierarchies are: schema hierarchies, set-grouping
     hierarchies, operation-derived hierarchies, and rule-based hierarchies.

     (a) Briefly define each type of hierarchy.
     (b) For each hierarchy type, provide an example.                             [16]

  4. (a) What is Concept description? Explain.
     (b) What are the differences between concept description in large data bases and
         OLAP?                                                                 [8+8]

  5. (a) Which algorithm is an influential algorithm for mining frequent item sets for
         Boolean association rules. Explain.
     (b) Discuss about association mining using correlation rules.              [8+8]

  6. (a) How scalable is decision tree induction? Explain.
     (b) Explain about prediction.                                              [8+8]

  7. (a) How does the k-means algorithm work? Explain with example.
     (b) Explain about grid-based methods in clustering.                        [8+8]

  8. (a) Describe latent semantic indexing technique with an example.
     (b) Discuss about mining time-series and sequence data.                   [4+12]


                                      ⋆⋆⋆⋆⋆




                                       1 of 1

Más contenido relacionado

La actualidad más candente

Investigating the 3D structure of the genome with Hi-C data analysis
Investigating the 3D structure of the genome with Hi-C data analysisInvestigating the 3D structure of the genome with Hi-C data analysis
Investigating the 3D structure of the genome with Hi-C data analysistuxette
 
Probabilistic Retrieval
Probabilistic RetrievalProbabilistic Retrieval
Probabilistic Retrievalotisg
 
AN EFFECT OF USING A STORAGE MEDIUM IN DIJKSTRA ALGORITHM PERFORMANCE FOR IDE...
AN EFFECT OF USING A STORAGE MEDIUM IN DIJKSTRA ALGORITHM PERFORMANCE FOR IDE...AN EFFECT OF USING A STORAGE MEDIUM IN DIJKSTRA ALGORITHM PERFORMANCE FOR IDE...
AN EFFECT OF USING A STORAGE MEDIUM IN DIJKSTRA ALGORITHM PERFORMANCE FOR IDE...ijcsit
 
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494Sean Golliher
 
Object detection with Tensorflow Api
Object detection with Tensorflow ApiObject detection with Tensorflow Api
Object detection with Tensorflow ApiArwinKhan1
 
Data clustering using map reduce
Data clustering using map reduceData clustering using map reduce
Data clustering using map reduceVarad Meru
 
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015rusbase
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Trends In Graph Data Management And Mining
Trends In Graph Data Management And MiningTrends In Graph Data Management And Mining
Trends In Graph Data Management And MiningSrinath Srinivasa
 
Graph Neural Network in practice
Graph Neural Network in practiceGraph Neural Network in practice
Graph Neural Network in practicetuxette
 
Document ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspaceDocument ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspacePrakash Dubey
 
A PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering AlgorithmA PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering AlgorithmIJORCS
 
Dynamics in graph analysis (PyData Carolinas 2016)
Dynamics in graph analysis (PyData Carolinas 2016)Dynamics in graph analysis (PyData Carolinas 2016)
Dynamics in graph analysis (PyData Carolinas 2016)Benjamin Bengfort
 

La actualidad más candente (16)

Investigating the 3D structure of the genome with Hi-C data analysis
Investigating the 3D structure of the genome with Hi-C data analysisInvestigating the 3D structure of the genome with Hi-C data analysis
Investigating the 3D structure of the genome with Hi-C data analysis
 
Probabilistic Retrieval
Probabilistic RetrievalProbabilistic Retrieval
Probabilistic Retrieval
 
AN EFFECT OF USING A STORAGE MEDIUM IN DIJKSTRA ALGORITHM PERFORMANCE FOR IDE...
AN EFFECT OF USING A STORAGE MEDIUM IN DIJKSTRA ALGORITHM PERFORMANCE FOR IDE...AN EFFECT OF USING A STORAGE MEDIUM IN DIJKSTRA ALGORITHM PERFORMANCE FOR IDE...
AN EFFECT OF USING A STORAGE MEDIUM IN DIJKSTRA ALGORITHM PERFORMANCE FOR IDE...
 
Fuzzy Clustering(C-means, K-means)
Fuzzy Clustering(C-means, K-means)Fuzzy Clustering(C-means, K-means)
Fuzzy Clustering(C-means, K-means)
 
Objects Clustering of Movie Using Graph Mining Technique
Objects Clustering of Movie Using Graph Mining TechniqueObjects Clustering of Movie Using Graph Mining Technique
Objects Clustering of Movie Using Graph Mining Technique
 
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
Probabilistic Retrieval Models - Sean Golliher Lecture 8 MSU CSCI 494
 
Object detection with Tensorflow Api
Object detection with Tensorflow ApiObject detection with Tensorflow Api
Object detection with Tensorflow Api
 
Data clustering using map reduce
Data clustering using map reduceData clustering using map reduce
Data clustering using map reduce
 
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Trends In Graph Data Management And Mining
Trends In Graph Data Management And MiningTrends In Graph Data Management And Mining
Trends In Graph Data Management And Mining
 
Graph Neural Network in practice
Graph Neural Network in practiceGraph Neural Network in practice
Graph Neural Network in practice
 
Document ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspaceDocument ranking using qprp with concept of multi dimensional subspace
Document ranking using qprp with concept of multi dimensional subspace
 
MATLAB and HDF-EOS
MATLAB and HDF-EOSMATLAB and HDF-EOS
MATLAB and HDF-EOS
 
A PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering AlgorithmA PSO-Based Subtractive Data Clustering Algorithm
A PSO-Based Subtractive Data Clustering Algorithm
 
Dynamics in graph analysis (PyData Carolinas 2016)
Dynamics in graph analysis (PyData Carolinas 2016)Dynamics in graph analysis (PyData Carolinas 2016)
Dynamics in graph analysis (PyData Carolinas 2016)
 

Similar a Dmdw1

D A T A B A S E M A N A G E M E N T S Y S T E M S J N T U M O D E L P A P ...
D A T A B A S E M A N A G E M E N T S Y S T E M S  J N T U  M O D E L  P A P ...D A T A B A S E M A N A G E M E N T S Y S T E M S  J N T U  M O D E L  P A P ...
D A T A B A S E M A N A G E M E N T S Y S T E M S J N T U M O D E L P A P ...guest3f9c6b
 
Databasemanagementsystems Jntu Model Paper{Www.Studentyogi.Com}
Databasemanagementsystems Jntu Model Paper{Www.Studentyogi.Com}Databasemanagementsystems Jntu Model Paper{Www.Studentyogi.Com}
Databasemanagementsystems Jntu Model Paper{Www.Studentyogi.Com}guest3f9c6b
 
distributeddatabases
distributeddatabasesdistributeddatabases
distributeddatabasesdjkshav
 
05211201 A D V A N C E D D A T A S T R U C T U R E S A N D A L G O R I...
05211201  A D V A N C E D  D A T A  S T R U C T U R E S   A N D   A L G O R I...05211201  A D V A N C E D  D A T A  S T R U C T U R E S   A N D   A L G O R I...
05211201 A D V A N C E D D A T A S T R U C T U R E S A N D A L G O R I...guestd436758
 
05211201 Advanced Data Structures And Algorithms
05211201 Advanced Data Structures  And  Algorithms05211201 Advanced Data Structures  And  Algorithms
05211201 Advanced Data Structures And Algorithmsguestac67362
 
C O M P U T E R O R G A N I Z A T I O N J N T U M O D E L P A P E R{Www
C O M P U T E R  O R G A N I Z A T I O N  J N T U  M O D E L  P A P E R{WwwC O M P U T E R  O R G A N I Z A T I O N  J N T U  M O D E L  P A P E R{Www
C O M P U T E R O R G A N I Z A T I O N J N T U M O D E L P A P E R{Wwwguest3f9c6b
 
Computer Organization Jntu Model Paper{Www.Studentyogi.Com}
Computer Organization Jntu Model Paper{Www.Studentyogi.Com}Computer Organization Jntu Model Paper{Www.Studentyogi.Com}
Computer Organization Jntu Model Paper{Www.Studentyogi.Com}guest3f9c6b
 
212101 Object Oriented Analysis Design Through Uml
212101 Object Oriented Analysis Design Through Uml212101 Object Oriented Analysis Design Through Uml
212101 Object Oriented Analysis Design Through Umlguestd436758
 
212101 Object Oriented Analysis Design Through Uml
212101 Object Oriented Analysis Design Through Uml212101 Object Oriented Analysis Design Through Uml
212101 Object Oriented Analysis Design Through Umlguestac67362
 
Decision Support Systems Jntu Model Paper{Www.Studentyogi.Com}
Decision Support Systems Jntu Model Paper{Www.Studentyogi.Com}Decision Support Systems Jntu Model Paper{Www.Studentyogi.Com}
Decision Support Systems Jntu Model Paper{Www.Studentyogi.Com}guest3f9c6b
 
D E C I S I O N S U P P O R T S Y S T E M S J N T U M O D E L P A P E R{Www
D E C I S I O N  S U P P O R T  S Y S T E M S  J N T U  M O D E L  P A P E R{WwwD E C I S I O N  S U P P O R T  S Y S T E M S  J N T U  M O D E L  P A P E R{Www
D E C I S I O N S U P P O R T S Y S T E M S J N T U M O D E L P A P E R{Wwwguest3f9c6b
 
Automata And Compiler Design
Automata And Compiler DesignAutomata And Compiler Design
Automata And Compiler Designguestac67362
 
Artificial Neural Networks
Artificial Neural NetworksArtificial Neural Networks
Artificial Neural Networksguestac67362
 
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...guest3f9c6b
 
Artificial Neural Networks Jntu Model Paper{Www.Studentyogi.Com}
Artificial Neural Networks Jntu Model Paper{Www.Studentyogi.Com}Artificial Neural Networks Jntu Model Paper{Www.Studentyogi.Com}
Artificial Neural Networks Jntu Model Paper{Www.Studentyogi.Com}guest3f9c6b
 

Similar a Dmdw1 (20)

D A T A B A S E M A N A G E M E N T S Y S T E M S J N T U M O D E L P A P ...
D A T A B A S E M A N A G E M E N T S Y S T E M S  J N T U  M O D E L  P A P ...D A T A B A S E M A N A G E M E N T S Y S T E M S  J N T U  M O D E L  P A P ...
D A T A B A S E M A N A G E M E N T S Y S T E M S J N T U M O D E L P A P ...
 
Databasemanagementsystems Jntu Model Paper{Www.Studentyogi.Com}
Databasemanagementsystems Jntu Model Paper{Www.Studentyogi.Com}Databasemanagementsystems Jntu Model Paper{Www.Studentyogi.Com}
Databasemanagementsystems Jntu Model Paper{Www.Studentyogi.Com}
 
distributeddatabases
distributeddatabasesdistributeddatabases
distributeddatabases
 
Aca
AcaAca
Aca
 
Ads
AdsAds
Ads
 
05211201 A D V A N C E D D A T A S T R U C T U R E S A N D A L G O R I...
05211201  A D V A N C E D  D A T A  S T R U C T U R E S   A N D   A L G O R I...05211201  A D V A N C E D  D A T A  S T R U C T U R E S   A N D   A L G O R I...
05211201 A D V A N C E D D A T A S T R U C T U R E S A N D A L G O R I...
 
05211201 Advanced Data Structures And Algorithms
05211201 Advanced Data Structures  And  Algorithms05211201 Advanced Data Structures  And  Algorithms
05211201 Advanced Data Structures And Algorithms
 
C O M P U T E R O R G A N I Z A T I O N J N T U M O D E L P A P E R{Www
C O M P U T E R  O R G A N I Z A T I O N  J N T U  M O D E L  P A P E R{WwwC O M P U T E R  O R G A N I Z A T I O N  J N T U  M O D E L  P A P E R{Www
C O M P U T E R O R G A N I Z A T I O N J N T U M O D E L P A P E R{Www
 
Computer Organization Jntu Model Paper{Www.Studentyogi.Com}
Computer Organization Jntu Model Paper{Www.Studentyogi.Com}Computer Organization Jntu Model Paper{Www.Studentyogi.Com}
Computer Organization Jntu Model Paper{Www.Studentyogi.Com}
 
Ooad q.papers
Ooad q.papersOoad q.papers
Ooad q.papers
 
212101 Object Oriented Analysis Design Through Uml
212101 Object Oriented Analysis Design Through Uml212101 Object Oriented Analysis Design Through Uml
212101 Object Oriented Analysis Design Through Uml
 
212101 Object Oriented Analysis Design Through Uml
212101 Object Oriented Analysis Design Through Uml212101 Object Oriented Analysis Design Through Uml
212101 Object Oriented Analysis Design Through Uml
 
Be it
Be itBe it
Be it
 
Decision Support Systems Jntu Model Paper{Www.Studentyogi.Com}
Decision Support Systems Jntu Model Paper{Www.Studentyogi.Com}Decision Support Systems Jntu Model Paper{Www.Studentyogi.Com}
Decision Support Systems Jntu Model Paper{Www.Studentyogi.Com}
 
D E C I S I O N S U P P O R T S Y S T E M S J N T U M O D E L P A P E R{Www
D E C I S I O N  S U P P O R T  S Y S T E M S  J N T U  M O D E L  P A P E R{WwwD E C I S I O N  S U P P O R T  S Y S T E M S  J N T U  M O D E L  P A P E R{Www
D E C I S I O N S U P P O R T S Y S T E M S J N T U M O D E L P A P E R{Www
 
Paper
PaperPaper
Paper
 
Automata And Compiler Design
Automata And Compiler DesignAutomata And Compiler Design
Automata And Compiler Design
 
Artificial Neural Networks
Artificial Neural NetworksArtificial Neural Networks
Artificial Neural Networks
 
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...A R T I F I C I A L  N E U R A L  N E T W O R K S  J N T U  M O D E L  P A P ...
A R T I F I C I A L N E U R A L N E T W O R K S J N T U M O D E L P A P ...
 
Artificial Neural Networks Jntu Model Paper{Www.Studentyogi.Com}
Artificial Neural Networks Jntu Model Paper{Www.Studentyogi.Com}Artificial Neural Networks Jntu Model Paper{Www.Studentyogi.Com}
Artificial Neural Networks Jntu Model Paper{Www.Studentyogi.Com}
 

Dmdw1

  • 1. Code No: R05410503 Set No. 1 IV B.Tech I Semester Regular Examinations, November 2008 DATA WAREHOUSING AND DATA MINING (Computer Science & Engineering) Time: 3 hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks ⋆⋆⋆⋆⋆ 1. (a) Explain data mining as a step in the process of knowledge discovery. (b) Differentiate operational database systems and data warehousing. [8+8] 2. (a) Discuss in detail about data transformation. (b) Explain about concept hierarchy generation for categorical data. [8+8] 3. Discuss the importance of establishing a standardized data mining query language. What are some of the potential benefits and challenges involved in such a task? List and explain a few of the recent proposals in this area. [16] 4. Write short notes for the following in detail: (a) Attribute-oriented induction. (b) Efficient implementation of Attribute-oriented induction. [8+8] 5. (a) What is an iceberg query? Give an example. (b) Explain about mining distance based association rules. (c) How are meta rules useful? Explain with example. [5+6+5] 6. (a) Why naive Bayesian classification called “naive”? Briefly outline the major ideas of naive Bayesian classification. (b) Define regression. Briefly explain about linear, non-linear and multiple regres- sions. [8+8] 7. (a) What is Cluster Analysis? What are some typical applications of clustering? What are some typical requirements of clustering in data mining? (b) Discuss about model-based clustering methods. [2+2+5+7] 8. Explain the following: (a) Mining time-series and sequence data (b) Mining text databases. [8+8] ⋆⋆⋆⋆⋆ 1 of 1
  • 2. Code No: R05410503 Set No. 2 IV B.Tech I Semester Regular Examinations, November 2008 DATA WAREHOUSING AND DATA MINING (Computer Science & Engineering) Time: 3 hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks ⋆⋆⋆⋆⋆ 1. (a) Explain data mining as a step in the process of knowledge discovery. (b) Differentiate operational database systems and data warehousing. [8+8] 2. (a) Briefly discuss about data integration. (b) Briefly discuss about data transformation. [8+8] 3. Write the syntax for the following data mining primitives: (a) The kind of knowledge to be mined. (b) Measures of pattern interestingness. [16] 4. (a) Write the algorithm for attribute-oriented induction. Explain the steps in- volved in it. (b) How can concept description mining be performed incrementally and in a distributed manner? [8+8] 5. (a) Explain about iceberg queries with example. (b) Can we design a method that mines the complete set of frequent item sets without candidate generation? If yes, explain with example. [8+8] 6. (a) Why is tree pruning useful in decision tree induction? What is a draw back of using a separate set of samples to evaluate pruning? (b) How rough set approach and fuzzy set approaches are useful for classification? Explain. [8+8] 7. (a) Give an example of how specific clustering methods may be integrated, for example, where one clustering algorithm is used as a preprocessing step for another. (b) Write CURE algorithm and explain. [10+6] 8. (a) Explain about multidimensional analysis and descriptive mining of complex data objects. (b) Describe similarity search in time-series analysis. (c) What are cases and parameters for sequential pattern mining? [8+4+4] ⋆⋆⋆⋆⋆ 1 of 1
  • 3. Code No: R05410503 Set No. 3 IV B.Tech I Semester Regular Examinations, November 2008 DATA WAREHOUSING AND DATA MINING (Computer Science & Engineering) Time: 3 hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks ⋆⋆⋆⋆⋆ 1. (a) Explain data mining as a step in the process of knowledge discovery. (b) Differentiate operational database systems and data warehousing. [8+8] 2. (a) Briefly discuss about data integration. (b) Briefly discuss about data transformation. [8+8] 3. Explain the syntax for the following data mining primitives: (a) Task-relevant data (b) The kind of knowledge to be mined (c) Interestingness measures (d) Presentation and visualization of discovered patterns. [16] 4. (a) Attribute-oriented induction generates one or a set of generalized descriptions. How can these descriptions be visualized? (b) Discuss about the methods of attribute relevance analysis? [8+8] 5. (a) How can we mine multilevel Association rules efficiently using concept hierar- chies? Explain. (b) Can we design a method that mines the complete set of frequent item sets without candidate generation. If yes, explain with example. [8+8] 6. (a) Explain about basic decision tree induction algorithm. (b) Discuss about Bayesian classification. [8+8] 7. (a) Use a diagram to illustrate how, for a constant MinPts value, density-based clusters with respect to a higher density (i.e., a lower value for ε , the neigh- borhood radius) are completely contained in density- connected sets obtained with respect to a lower density. (b) Give an example of how specific clustering methods may be integrated, for example, where one clustering algorithm is used as a preprocessing step for another. [8+8] 8. (a) Explain multidimensional analysis of multimedia data. (b) Define Information retrieval. What are basic measures for text retrieval? (c) What is keyword-based association analysis? (d) Briefly discuss about mining the World Wide Web. [5+4+3+4] ⋆⋆⋆⋆⋆ 1 of 1
  • 4. Code No: R05410503 Set No. 4 IV B.Tech I Semester Regular Examinations, November 2008 DATA WAREHOUSING AND DATA MINING (Computer Science & Engineering) Time: 3 hours Max Marks: 80 Answer any FIVE Questions All Questions carry equal marks ⋆⋆⋆⋆⋆ 1. (a) Discuss about Concept hierarchy. (b) Briefly explain about - classification of database systems. [8+8] 2. Explain various data reduction techniques. [16] 3. The four major types of concept hierarchies are: schema hierarchies, set-grouping hierarchies, operation-derived hierarchies, and rule-based hierarchies. (a) Briefly define each type of hierarchy. (b) For each hierarchy type, provide an example. [16] 4. (a) What is Concept description? Explain. (b) What are the differences between concept description in large data bases and OLAP? [8+8] 5. (a) Which algorithm is an influential algorithm for mining frequent item sets for Boolean association rules. Explain. (b) Discuss about association mining using correlation rules. [8+8] 6. (a) How scalable is decision tree induction? Explain. (b) Explain about prediction. [8+8] 7. (a) How does the k-means algorithm work? Explain with example. (b) Explain about grid-based methods in clustering. [8+8] 8. (a) Describe latent semantic indexing technique with an example. (b) Discuss about mining time-series and sequence data. [4+12] ⋆⋆⋆⋆⋆ 1 of 1