GUJARAT TECHNOLOGICAL UNIVERSITY
BE SEM-VII Examination-Nov/Dec.-2011
Subject code: 171601
Subject Name: Data Warehousing and Data Mining
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
Q.1 (a) List and describe major issues in data mining.
(b) Explain the methodologies for stream data processing and stream data systems.
Q.2 (a) Write short notes on: information gain, gain ratio, Gini index.
(b) Write the typical requirements of clustering in data mining.
OR
(b) Explain k-means and k-medoids algorithms of clustering.
Q.3 (a) What is noise? Describe the possible reasons for noisy data. Explain the
different techniques to remove the noise from data.
(b) Explain the KDD process in detail.
OR
Q.3 (a) List and describe the methods for handling missing values in data cleaning.
(b) Explain the metadata repository.
Q.4 (a) Differentiate between OLTP and OLAP systems.
(b) Explain rule-based classification and case-based reasoning in detail.
OR
Q.4 (a) Write an algorithm for finding frequent itemsets using candidate generation.
(b) Write a short note on: distributive and holistic measures.
Q.5 (a) Describe the list of techniques for improving the efficiency of
(b) Explain three-tier data warehouse architecture.
OR
Q.5 (a) What are the challenges for effective resource and knowledge discovery
in mining the World Wide Web?
(b) Explain data transformation in data mining.