CS计算机代考程序代写 COMP9315 Sample Exam

COMP9315 Sample Exam

COMP9315 Sample Exam

The University of New South Wales

COMP9315 DBMS Implementation
Sample Exam

DBMS Implementation

[Instructions] [Notes] [PostgreSQL] [C]

[Q1] [Q2] [Q3] [Q4] [Q5] [Q6] [Q7] [Q8]

Question 3 (12 marks)

Consider the following query on the table from Question 2:

select distinct name from People;

Assume that the People table has the following characteristics:

P 4096 size of pages/buffers in bytes
r 1,234,567 total number of tuples
R 32 size of each tuple in bytes (fixed-sized records)

B 101 number of available memory buffers

Each page contains a page directory as well as the tuples.
The page directory consist of 1 presence bit for each tuple,
and the bits are packed as tightly as possible into a sequence of
bytes at the start of the page. The remaining space is used for tuples.
All pages, even those in intermediate files, have the same page
structure.
The following diagram shows what such a page might look like after
some tuples have been inserted and others removed:

For this problem, you can assume that tuples are added consectively
starting from slot one, so the first k presence bits will
be one if there are k tuples in the page, and the remaining
presence bits will be zero.

For the above scenario, you should:

Calculate the number of tuples per page

Calculate the total number of pages for the People table

Calculate the cost of duplicate removal using sorting

Calculate the cost of duplicate removal using hashing

What would be the effect on the hashing approach if only 91 buffers were available?

(you do not need to do a complete detailed cost calculation for this part;
just describe the effect in words)

The costs should give the total number of page reads/writes and
include all page reads and page writes except the page writes for
the final output stage.
Note the because we are not considering the size of the final output,
the costs do not rely on the percentage of duplicates.

You can ignore any effects from buffer management and assume that
all buffers will be used in the most effective manner possible.
For the hashing case, assume that the hash functions partition the
tuples uniformly.
Your cost calculations must show clearly the number of pages
in all intermediate files.

Show all working.

Instructions:

Type your answer to this question into the file called q3.txt

Submit via:   give cs9315 sample_q3 q3.txt

or via: Webcms3 > exams > Sample Exam > Submit Q3 > Make Submission

End of Question