What type of operation can be performed on a package that has been built and
saved for further use?
o a. Edited
o b. Password protected
o c. Scheduled for execution
o d. Retrieved by version
o e. All of the above
If w is the window size and n is the size of the data set, then the complexity of the merging
phase in the BSN method is…
o a. O(n)
o b. O(w)
o c. O(wn)
o d. O(w log n)
o e. O(n log n)
• Define de-normalization. What are the four fundamental guidelines for
de-normalization?
• List and explain three fundamental advantages of bitmap indexing.
• List any five steps for extracting data using the SQL Server DTS wizard.
• Define data warehouse. Why do we require a data warehouse? Give at least four
reasons.
• What is the difference between a data matrix and a similarity/dissimilarity matrix in
terms of rows and columns? Draw both of them. Which one is symmetric?
• List any four significant points about the architecture in the data warehouse
development lifecycle and briefly explain them.
Question No. 8 Marks : 15
How can the following quality metrics be evaluated using simple ratios? Give examples.
a. Free of error
b. Completeness
c. Consistency
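As a hint for the three metrics above, one commonly used simple-ratio form is 1 minus (number of undesirable outcomes / total outcomes). The following is only a minimal sketch in Python; the function name simple_ratio and the sample counts are made up for illustration and do not come from this paper.

```python
def simple_ratio(undesirable: int, total: int) -> float:
    """Return 1 - undesirable/total, so that 1.0 is the best possible score.

    'undesirable' counts the outcomes that violate the metric,
    'total' counts all outcomes that were checked.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= undesirable <= total:
        raise ValueError("undesirable must lie between 0 and total")
    return 1 - undesirable / total


# Hypothetical counts for a table of 1000 records (illustrative only).
free_of_error = simple_ratio(undesirable=25, total=1000)  # records found to be in error
completeness = simple_ratio(undesirable=40, total=1000)   # records with missing values
consistency = simple_ratio(undesirable=10, total=1000)    # records failing a consistency check
```

The same function serves all three metrics; only the definition of an "undesirable outcome" changes from one metric to the next.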
What sort of objective assessment metrics are used by companies? What are the
possible issues in formulating these metrics?
What are the "Five Signs of Trouble" that serve as key indicators that a data
warehousing project is under threat of failure?
List and briefly explain the three fundamental factors that affect the amount of
history stored in a DWH.
Q#2: Give two real-life examples of clustering, i.e. clustering used for market
segmentation in the telecom industry and clustering used for crop identification of
insurance fraud. (15)
Q#3: If w is the window size and n is the size of the data set, then the complexity of
the merging phase in the BSN method is ___________. (2)
Q#4:
a) Comment on the statement that “Creating indexes requires careful
consideration so as to avoid performance degradation”. [5]
b) Write the following Bit vectors in compressed form using Run Length
Encoding with # being the separator symbol. [5+5]
I. 111001010110000101000001001010010000001
II. 110101010100001111011010010010001011111
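For reference on part (b), run-length encoding replaces each run of identical bits with its length. The sketch below is only one possible convention and is an assumption: the question does not fix the exact output format, so here the first bit is written once, followed by the lengths of successive runs, all joined by #. The names rle_encode and rle_decode and the sample input are illustrative and not taken from this paper.

```python
from itertools import groupby


def rle_encode(bits: str, sep: str = "#") -> str:
    """Run-length encode a string of '0'/'1' characters.

    Assumed (hypothetical) format: <first bit><sep><run 1><sep><run 2>...
    Since runs alternate between 0 and 1, the first bit is enough to decode.
    """
    if not bits:
        return ""
    if set(bits) - {"0", "1"}:
        raise ValueError("bit vector may contain only '0' and '1'")
    # groupby yields one group per maximal run of equal characters.
    runs = [str(len(list(group))) for _, group in groupby(bits)]
    return sep.join([bits[0]] + runs)


def rle_decode(encoded: str, sep: str = "#") -> str:
    """Invert rle_encode under the same assumed format."""
    if not encoded:
        return ""
    bit, *runs = encoded.split(sep)
    out = []
    for run in runs:
        out.append(bit * int(run))
        bit = "1" if bit == "0" else "0"  # the next run uses the other bit
    return "".join(out)


# Small illustrative input (not one of the exam vectors).
example = rle_encode("1110010")
```

Decoding the encoded string and comparing it with the input is a quick way to check an answer written by hand.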
Q#5:
a) Why do we require a data warehouse? Give at least three reasons. [6]
b) What is meant by the “House of Quality”? What type of risks can be dealt
with using this technique? [4]
What are the goals of horizontal splitting and what are different methods of
horizontal splitting? [4+6]
Q#6:
In a DW project, it is assumed that the _________ environment is very similar to the
production environment. (2)
Q#7: Which is the least appropriate join operation for pipeline parallelism? (2)
1. DOLAP.
2. Explain the OLAP FASMI test.
A good example of pivoting is changing the dimensions along the axis.
True
False
Question No. 4 Marks : 2
True
False
________ splitting places a group of columns in one table and the remaining columns in
another table.
1. Horizontal
2. Vertical
3. Both 1 and 2
True
False
• Write a typical OLTP query and explain why it is different from a DWH query.
• Briefly explain the basic data transformation task of enrichment.
• What is the major transformation of Decoding of Fields?
Question No. 2 Marks : 15
• Define ETL. List three typical operating systems that are encountered while doing
ETL.
• Briefly explain the basic data transformation task of selection.
• What is meant by "grain" in the context of a data warehouse? Give at least three
examples.
o Central
o Parallel
o Vertical
o Horizontal
In online data extraction, data is extracted directly from the ------ system itself.
o Host
o Destination
o Source
o Terminal
o Deleted
o Updated
o Inserted
o True
o False