# Assignment No. 4 (Graded), Semester Fall 2015, Data Warehousing (CS614), Due Date: 8 Feb 2016

Objective:

The assignment has been designed to develop your ability to calculate the Bitmap index.

Instructions:

1. The assignment will not be accepted after the due date in any case (whether because of load shedding, emergency electric failure, internet malfunction, etc.).
2. Zero marks will be awarded if the assignment file does not open or is corrupt.
3. The assignment file must be in MS Word (.doc) format; it will not be accepted in any other format.
4. Zero marks will be awarded if the assignment is copied (from another student, from the handouts, or from the internet).
5. Zero marks will be awarded if the Student ID is not mentioned in the assignment file.

For any query about the assignment, contact only CS614@vu.edu.pk

Do not post queries related to the assignment on MDB.

GOOD LUCK

Question 1 [10 Marks]

Consider the following table:

Player_Team

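| Player_ID | Player_Name | Team_ID | Pool_ID |
|---|---|---|---|
| IND-06 | Gangoli | IND | A |
| AFG-05 | Najeeb | AFG | B |
| SA-07 | AB Devillier | SA | A |
| AU-01 | Steve Waugh | AU | B |
| IND-01 | Tandulker | IND | A |
| AU-04 | Maxwell | AU | B |
| AFG-01 | Nawroze | AFG | B |
| SA-09 | Dal Styn | SA | A |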

Consider the following query:

SELECT Count (Player_Team.Pool_ID) AS CountOfPool_ID

FROM Player_Team

GROUP BY Player_Team.Pool_ID;

1. You need to identify the number of clusters from this data.
2. Identify whether the given clustering is one-way or two-way clustering. Support your answer with valid reasons.
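To see what the query in Question 1 actually returns, here is a minimal sketch that loads the Player_Team rows into an in-memory SQLite database and runs the grouping. SQLite is an assumption here (the assignment does not name a database engine), and `Pool_ID` is added to the SELECT list only so that each count is labeled; the original query returns the counts alone. It shows the groups the query forms and does not settle the one-way versus two-way question.

```python
import sqlite3

# Rows copied from the Player_Team table in the question.
rows = [
    ("IND-06", "Gangoli", "IND", "A"),
    ("AFG-05", "Najeeb", "AFG", "B"),
    ("SA-07", "AB Devillier", "SA", "A"),
    ("AU-01", "Steve Waugh", "AU", "B"),
    ("IND-01", "Tandulker", "IND", "A"),
    ("AU-04", "Maxwell", "AU", "B"),
    ("AFG-01", "Nawroze", "AFG", "B"),
    ("SA-09", "Dal Styn", "SA", "A"),
]

# Assumption: an in-memory SQLite database stands in for the real DBMS.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Player_Team ("
    "Player_ID TEXT, Player_Name TEXT, Team_ID TEXT, Pool_ID TEXT)"
)
conn.executemany("INSERT INTO Player_Team VALUES (?, ?, ?, ?)", rows)

# Same grouping as the assignment query; Pool_ID is selected as well
# (an addition for readability) so every count carries its group label.
pool_counts = conn.execute(
    "SELECT Player_Team.Pool_ID, COUNT(Player_Team.Pool_ID) AS CountOfPool_ID "
    "FROM Player_Team "
    "GROUP BY Player_Team.Pool_ID"
).fetchall()
conn.close()

for pool_id, count in pool_counts:
    print(pool_id, count)
```

Each output row corresponds to one distinct `Pool_ID` value, so the number of rows is the number of groups the query produces.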

Question 2 [10 Marks]

Consider the following tables:

Player

| Player_ID | Player_Name | Team |
|---|---|---|
| PK-01 | Wasim | Pakistan |
| PK-02 | Misbah | Pakistan |
| SA-03 | AB Devillier | South Africa |

Award

| Award_ID | Match_ID | Player_ID |
|---|---|---|
| 01 | 01 | PK-01 |
| 01 | 02 | PK-01 |
| 02 | 03 | PK-02 |
| 01 | 04 | SA-03 |

Consider the following query:

SELECT * FROM Player P, Award A WHERE P.Team = 'Pakistan' AND A.Award_ID = '01' AND P.Player_ID = A.Player_ID

Suppose this query is executed using a naive nested-loop join (i.e. no index has been created on either the Player or the Award table). State which table should be the outer table to minimize I/O by manually calculating the cost in both cases: when "Player" is the outer table and when "Award" is the outer table.

Note: Show your calculations in your solution where required.
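As a starting point for the manual calculation, the sketch below simulates a naive nested-loop join over the two tables and counts row reads. The cost model is an assumption, not necessarily the one used in the course handouts: every row read counts as one I/O, the outer table is scanned once, and the inner table is scanned in full for each outer row that passes its own filter. The function name `naive_nested_loop_join` and its parameters are hypothetical.

```python
# Rows copied from the Player and Award tables in the question.
player = [
    {"Player_ID": "PK-01", "Player_Name": "Wasim", "Team": "Pakistan"},
    {"Player_ID": "PK-02", "Player_Name": "Misbah", "Team": "Pakistan"},
    {"Player_ID": "SA-03", "Player_Name": "AB Devillier", "Team": "South Africa"},
]
award = [
    {"Award_ID": "01", "Match_ID": "01", "Player_ID": "PK-01"},
    {"Award_ID": "01", "Match_ID": "02", "Player_ID": "PK-01"},
    {"Award_ID": "02", "Match_ID": "03", "Player_ID": "PK-02"},
    {"Award_ID": "01", "Match_ID": "04", "Player_ID": "SA-03"},
]


def is_pakistan(row):
    return row["Team"] == "Pakistan"


def is_award_01(row):
    return row["Award_ID"] == "01"


def naive_nested_loop_join(outer, inner, outer_pred, inner_pred, key):
    """Join outer and inner on `key`, counting one read per row touched.

    Assumed cost model: the outer table is read once; the inner table is
    re-read in full for every outer row that satisfies outer_pred.
    """
    reads = 0
    result = []
    for o in outer:
        reads += 1                      # read one outer row
        if not outer_pred(o):
            continue                    # filtered out: no inner scan
        for i in inner:
            reads += 1                  # read one inner row
            if inner_pred(i) and o[key] == i[key]:
                result.append({**o, **i})
    return result, reads


rows_p, reads_player_outer = naive_nested_loop_join(
    player, award, is_pakistan, is_award_01, "Player_ID"
)
rows_a, reads_award_outer = naive_nested_loop_join(
    award, player, is_award_01, is_pakistan, "Player_ID"
)

print("Player as outer:", reads_player_outer, "row reads")
print("Award as outer: ", reads_award_outer, "row reads")
```

Both join orders return the same result rows; only the read count differs. Under a different cost model (for example, counting block reads rather than row reads, or applying the filter after the join) the numbers would change, so the figures printed here should be checked against the formula taught in the course.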

### Replies to This Discussion

Our main purpose here is discussion, not just the solution.

# CS614 - Data Warehousing Assignment No. 4 Solution Fall 2015 Due Date Feb 08, 2016

