当前位置:天才代写 > 作业代写,留学生作业代写-北美、澳洲、英国等靠谱代写 > 数据库系统技术作业代写 Buffer Management代写

数据库系统技术作业代写 Buffer Management代写

2021-05-26 17:46 星期三 所属: 作业代写,留学生作业代写-北美、澳洲、英国等靠谱代写 浏览:672

数据库系统技术作业代写

CSCD43 DATABASE SYSTEM TECHNOLOGY

Assigned January 22

Due Monday February 9

Weight: 10% of your ftnal grade

Buffer Management

数据库系统技术作业代写 In this assignment, you will modify the PostgreSQL 7.4.13 Buffer Manager. Before starting this assignment,you should have already downloaded

1 Introduction 数据库系统技术作业代写

In this assignment, you will modify the PostgreSQL 7.4.13 Buffer Manager. Before starting this assignment, you should have already downloaded, compiled and installed PostgreSQL 7.4.13. The goal of this assignment is to understand and evaluate various memory management strategies in a database system. The actual coding for this assignment is minimal, but you will be modifying existing source code in a very large codebase; therefore, it may be a bit challenging.

Start early!

2 Experimentalsetup 数据库系统技术作业代写

2.1 Data Generation and loading:

The table we will be working with can be created using the following SQL statement:

CREATE TABLE Data (sa

ID INTEGER PRIMARY KEY, A INTEGER,

B INTEGER, C INTEGER);

  1. You have to write a script to generate a 5000-line file from which the table will be uploaded. In addition, you will have to make surethat:
    • the ID column is unique,and
    • thenumbers inserted in the other columns are uniformly-random generated between 1-100. Each line in this file should be of the form:

<id>, <A>, <B>, <C>

An example of such file is the following:

1,56,89,23

2,45,78,45

  1. Upload this file into the table Data using the COPYcommand in psql. Your data should now be ready. 数据库系统技术作业代写

Suggestion: you can check the correctness of your data using some queries.

数据库系统技术作业代写
数据库系统技术作业代写

2.2 Queries

  1. Write 10 queries of the form:

SELECT COUNT(*)

FROM Data

WHERE <some range condition on A,B,C>;

An example of such a query is below:

SELECT COUNT(*)

FROM Data

WHERE A > 20 AND B < 40;

  1. Store these 10 queries in a file sql, asfollows:
    • each query is stored in a separate line,and
    • there is a blank line at the end of the file. 数据库系统技术作业代写   Thesequeries will be executed with sequential
  2. Write 10 queries of the form:

SELECT COUNT(*)

FROM Data

WHERE <some small range condition on ID>;

An example of such a query is below:

SELECT COUNT(*)

FROM Data

WHERE ID > 7 AND ID < 20;

  1. Store these 10 queries in a file sql, asfollows:
    • each query is stored in a separate line,and
    • thereis a blank line at the end of the

These queries should probably be executed with index scans.

  1. Before proceedingfurther,
    • try executing these queries in PostgreSQL ,and
    • check,using the EXPLAIN command in psql, that they are indeed executed using Sequential Scans or Index Scans, as

2.3 Evaluation of the queries

By now, you should have the data and the queries to run on it.

To run the queries on a backend, and extract the buffer hit rate, you should run PostgreSQL in a standalone backend mode, using the command postgres as follows. Note that this is the easiest way to run a set of queries and extract “interesting” information from the backend.

cat <queryfile.sql> | postgres -B <numbuffers> -D <datadir> -d <debug-level> -s <databasename>

The output of the postgres command contains lots of information. Of interest to you is the buffer hit rate

for each query. In order to make sure you

  • understand its output,and
  • can extract appropriate information, werecommend you to run different

The flags on the postgres command are explained below:

  • The –B flag specifies the size of the buffer cache in
  • The –D flag refers to the data
  • The -d flag sets the debug level. This flag will force the backend to print out useful

– For this assignment, setting the debug-level to 1 or 2 should be enough.数据库系统技术作业代写

  • The –sflag in conjunction with the debug flag results in buffer hit rates being printed in the output. For more information on postgres, you can use man postgres.

Note: in the evaluation section, you will have to vary the number of buffers, and to printe the hit rates for different configurations.

An example of the command above is as follows:

cat ScanQueries.sql | postgres -B 20 -D ~/data/ -d 1 -s mydatabase

Now, you may hack!

3 Implementation

The version of PostgreSQL you are using for this assignment uses the LRU buffer replacement policy. For this assignment you are required to implement the LFU (Least Frequently Used) and MRU (Most Recently Used) replacement policies. Consult your favourite textbook to find out the details of these policies.

You can find the buffer manager code in the directories

src/backend/storage/buffer src/include/storage/

The files which will most likely need the most are:

  • freelist.c

–itmanages pages which are not pinned in memory and are eligible for

  • bufmgr.c

–itdefines the interface used by the rest of PostgreSQL to access the memory  数据库系统技术作业代写

  • buf_init.c

–ithandles initializations of the buffer manager data

  • buf_internals.h

–itdefines the data structures used by the rest of the buffer

You should be able to complete the entire assignment by modifying just these files.

We recommend that, before you start modifying the files  to  implement the  new replacement policies,  you save a copy of these files elsewhere. Note that you will need to keep separate copies of the 4 files for implementing the LFU and MRU policies.

Each time you make changes to the files, you have to compile and install PostgreSQL again. This can be done using the following commands:

$gmake clean

$gmake uninstall

$gmake

$gmake install

If you haven’t modified any header files since you last compiled the code, you may avoid the gmake clean command. Also, you do not need to create the database again each time you recompile and reinstall PostgreSQL .数据库系统技术作业代写

3.1 Tips on how to deal with such code

  • Donot delete any old LRU code in the  It is much easier to just comment it out and add a marker (for instance, BEGIN OLDCODE) at the beginning of the commented section.

Similarly, you may want to clearly delimit the new code you write in the file, to keep track of your changes. Such marking should help even if you are using a CVS.

  • You may have to declare global variables to make your code work. Although this isn not the best programmingpractice, it makes life much simpler with such legacy  Be sure to use the C modifier static to limit the scope of your global variables to a single C file.
  • To debug your code, you could eitheruse

*fprint(stderr,…), or

*gdb.

  • You have to comment your code. In the event that your assignment is not completely correct, commentedcode is the only way we can understand your code, and give you partial  Also, your own code should be clearly demaracted with BEGIN NEWCODE and END NEWCODE comments. 数据库系统技术作业代写
3.1.1 Tips on using gdb

To use gdb, you have to configure and compile PostgreSQL with debugging enabled, as below:

./configure --prefix=PREFIX --enable-debug --enable-depend

This creates much bigger compiled binaries, though. For this assignment, you will probably not need to resort to gdb.

If you do decide to use gdb, you should run it as below:

$gdb postgres(gdb) run -B 20 -D ~/data/ -d 1 -s mydatabase < ScanQueries.sql

4 Evaluation of the performance 数据库系统技术作业代写

Finally, you will now compare the performance of the LRU, LFU, and MRU replacement policies. To accomplish this, you will have to run the queries (from your query files) with varying buffer cache sizes (as discussed in Section 2.3). The PostgreSQL binary should report buffer hit rates for each query in the query files.  For  each run, you should take the average over all queries in a file.

  1. You should produce 2 graphs, as described below:
    • Thefirst graph contains the buffer hit rates for all three algorithms, applied to the queries on

ScanQueries.sql, using varying buffer sizes.

  • The second graph contains the buffer hit rates for all three algorithms, applied to the querieson

IndexScanQueries.sql, using varying buffer sizes. 数据库系统技术作业代写

Remember to clearly mark the axes on the graphs and state clearly which parameters are being kept constant and what their values are. Submissions with graphs which are not clear and precise will be penalized

  1. Analyze/Interpret any “interesting” trends that you notice in the behaviour of the LRU, LFU, and  MRUreplacement  You have to explain when and why one policy is better than the other. Note that any trend you claim should be clearly visible in the graphs. If you notice anything else interesting in your results (say the behaviour over some type of queries), then add this to your report. Please be concise!
  2. Try to estimate the size of your table Data in number of blocks, based on the buffer cache hit rates, with increasing size of the table. (This should beeasy!)

5 Submission instructions 数据库系统技术作业代写

  1. You have to submit electronically the followingfiles:
    • Your report(pdfONLY):

–it should contain the graphs, your interpretation of the results, and any other interesting trends that you observed. No additional documentation is

    • A READMEfile:

–itshould contain your name, student number, what works and what doesn’t work in your assignment.

    • ScanQueries.sql
    • IndexScanQueries.sql
    • The4 C code files for LFU. Rename the files
freelist_lfu.c, bufmgr_lfu.c, buf_init_lfu.c, buf_internals_lfu.h.

You will have to submit all 4 files even if you didn’t have to modify all of them. 数据库系统技术作业代写

  • The4 C code files for MRU. Rename the files
freelist_mru.c, bufmgr_mru.c, buf_init_mru.c, buf_internals_mru.h.

As with LFU, you will have to submit all 4 files even if you didn’t have to modify all of them.

  • If you had to modify any other files, please include them as well. Note that the assignment can andshould be solved by modifying just the 4 files

Good luck!

数据库系统技术作业代写
数据库系统技术作业代写

其他代写:web代写 program代写 app代写  Programming代写 homework代写 source code代写 考试助攻 Exercise代写 C++代写 java代写 r代写 finance代写 CS代写 program代写 cs作业代写

合作平台:essay代写 论文代写 写手招聘 英国留学生代写

 

天才代写-代写联系方式