Archive for DB

Data Storage

    I'll try to explain this in English to improve my English writing skills.

    Data storage management is the lowest layer in the DBMS architecture.
    It consists of two main components: the disk space manager (storage manager) and the buffer manager.

First of all, we must clarify two concepts: page and block.
    A page is a logical concept, while a block is a physical concept: the unit of data stored on disk.

Then we'll look at the two components.

Storage manager (SM)
Responsibilities:
- allocate/deallocate a page
- read/write a page

Problem we must solve:
- how to keep track of free blocks
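
One common way to keep track of free blocks is a bitmap with one bit per block. Below is a minimal sketch of that idea; the class name FreeBlockBitmap and its methods are made up for illustration, and a real storage manager would also persist the bitmap on disk.

```python
class FreeBlockBitmap:
    """Hypothetical helper: one bit per disk block, 0 = free, 1 = allocated."""

    def __init__(self, num_blocks: int) -> None:
        self.num_blocks = num_blocks
        # Round up so every block has a bit.
        self.bits = bytearray((num_blocks + 7) // 8)

    def is_free(self, block_no: int) -> bool:
        bit = (self.bits[block_no // 8] >> (block_no % 8)) & 1
        return bit == 0

    def allocate(self) -> int:
        """Return the lowest-numbered free block and mark it as allocated."""
        for block_no in range(self.num_blocks):
            if self.is_free(block_no):
                self.bits[block_no // 8] |= 1 << (block_no % 8)
                return block_no
        raise RuntimeError("no free blocks")

    def deallocate(self, block_no: int) -> None:
        """Mark a block as free again."""
        self.bits[block_no // 8] &= ~(1 << (block_no % 8)) & 0xFF
```

The linear scan in allocate() is the simplest choice; a free list is another option when allocation speed matters more than space.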

Buffer manager (BM)
Responsibilities:
- bring the page requested by the upper layers from disk into the buffer pool

Problems we must solve:
- "Address Transformation". How do we know which frame in the buffer pool holds the requested page?
- "Replacement Policies". Which frame should we choose to replace if the buffer pool is full? We must also write the frame back to disk if it was modified after it was brought into the buffer pool.
- "Concurrency Control". What if a page is requested by several transactions?
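
To make the first two problems concrete, here is a small sketch of a buffer pool. It assumes an LRU replacement policy and uses a plain dict as a stand-in for the disk; the names BufferPool, get_page and unpin are hypothetical, not taken from any real system. The page table (page id to frame) handles address transformation, and dirty frames are written back on eviction. Real concurrency control is not covered: the pin count only keeps a page that is in use from being evicted.

```python
from collections import OrderedDict


class BufferPool:
    """Hypothetical buffer pool: page table + LRU replacement + write-back."""

    def __init__(self, num_frames, disk):
        self.num_frames = num_frames
        self.disk = disk  # stand-in for the storage manager: page_id -> bytes
        # Page table: page_id -> frame, ordered from least to most recently used.
        self.frames = OrderedDict()

    def get_page(self, page_id):
        """Return the frame holding page_id, reading it from disk if needed."""
        if page_id in self.frames:
            self.frames.move_to_end(page_id)  # now the most recently used
        else:
            if len(self.frames) >= self.num_frames:
                self._evict()
            self.frames[page_id] = {
                "data": bytearray(self.disk[page_id]),
                "dirty": False,
                "pins": 0,
            }
        frame = self.frames[page_id]
        frame["pins"] += 1
        return frame

    def unpin(self, page_id, dirty=False):
        """Release one pin; remember whether the caller modified the page."""
        frame = self.frames[page_id]
        frame["pins"] -= 1
        frame["dirty"] = frame["dirty"] or dirty

    def _evict(self):
        # Pick the least recently used frame that nobody has pinned.
        victim = next(
            (pid for pid, f in self.frames.items() if f["pins"] == 0), None
        )
        if victim is None:
            raise RuntimeError("all frames are pinned")
        frame = self.frames.pop(victim)
        if frame["dirty"]:
            # Write the modified page back before reusing the frame.
            self.disk[victim] = bytes(frame["data"])
```

A caller would call get_page(), read or modify frame["data"], and then call unpin() with dirty=True if it changed anything.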


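DBMS Architecture and the RedBase Project

        I've had a fair amount of free time lately, so I decided to go through database system implementation again; I didn't dig very deep when I studied it at dbclub. Also, this Stanford course has a RedBase Project, which can serve as a reference for implementing a small RDBMS, so it's a good chance to build one.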

General picture of where a DBMS sits:

What you’ll build in this class:

Closer look at DBMS, File System, and Disk:

Components of DBMS:

Missing from figure:

  • Query results returned to User/Application
  • Buffer Manager or Index/Record Managers call Concurrency Control
  • Separate Table Manager
  • Many, many (many!) details…

RedBase architecture:

Legend:

  • PF: Paged File
  • RM: Record Management
  • IX: Indexing
  • SM: System Management (DDL, utilities)
  • QL: Query Language
  • EX: Extension

Correspondence between RedBase and DBMS components:

  • PF = Storage Manager + Buffer Manager
  • RM = Record Manager
  • IX = Index Manager
  • SM = DDL Processor (+ misc. utilities)
  • QL = Query Checker, Query Optimizer (simple), Execution Engine
  • Command parser = Query Parser


Some Research Directions in DB

These are the Research Interests I saw on the homepage of ssos, a senior labmate. Very much to the point.

XML Database: How to manage massive data in the form of XML?
Sensor Network: How to detect and manage data from the physical world?
Graph Database: How to manage very large graphs?
Compressed Database: How to store and query massive data efficiently using compression?
Web Database: How to extract, store, query, and mine data from the web?
Information Integration: How to query and mine data from heterogeneous data sources in a distributed environment?


How to Find Papers in Computer Engineering [repost]

http://zeal.haliluya.org/blog/2006/05/26/how-to-find-computer-engineering-papers/

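After several years of studying Computer Engineering I haven't accomplished much, but I have picked up a few tips on how to find papers in the field (especially electronic copies). Here is a summary for those who come after me. It should generally apply to Computer Science as well.

1. Search engines. This is really the simplest and most effective method.

Recommended: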

Google Scholar

Citeseer

DBLP

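Google Scholar usually returns several results for each paper, which you can see by expanding the group. The default result usually links to the ACM or IEEE Digital Library, which charge for access; if your school has no subscription, you can't download from them. So it's best to look in the group for a link to the author's homepage, a lab homepage, or the like, where you can usually download directly. In Citeseer, the electronic copies of many articles are stored in Citeseer's own database and can be downloaded directly. DBLP itself doesn't offer paper downloads, but a good share of its papers have links to ACM and IEEE. The advantage of DBLP is that it classifies papers systematically: by author, by conference, by journal, and so on. Citeseer mainly strings papers together by citation.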

2. ACM Digital Library

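If your school or institution subscribes to the ACM Digital Library, it is a good place to follow ACM conferences and journals, since ACM journals and conferences are generally among the best in their fields. So I suggest graduate students get into the habit of checking here regularly for the latest conference proceedings and the latest transactions, especially conference proceedings (computer science places a lot of weight on conferences). Because ACM doesn't have that many conferences and journals (which also helps keep quality high), the classification is very clear. It also provides links for a paper's citations wherever it can.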

ACM Journals

ACM Transactions

ACM Conference Proceedings

3. IEEE Computer Society Digital Library

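If you go to ieee.org instead of computer.org, you'll find that papers aren't so easy to find there. For computer-related fields, computer.org is the easier place to find papers. It also requires a subscription; it isn't free. On the other hand, IEEE conferences are numerous and mixed: there are good ones, but the vast majority are not as good as ACM's, so be selective about which proceedings you follow here. Several of IEEE's magazines are quite well done and worth reading.

4. Finding BibTeX

If you write papers in LaTeX, it saves a lot of effort to have a few places at hand where you can find ready-made BibTeX entries (you should also build up your own collection: for example, every time you read a paper, tidy up its BibTeX entry and put it in your own bib file).

Every article in the ACM Digital Library comes with BibTeX; IEEE doesn't provide it. Citeseer also provides BibTeX for the articles it can find, and so does DBLP. There is also a site dedicated to collecting BibTeX for papers, called The Collection of Computer Science Bibliographies, which is another good place.

These are the ones I use most often; you probably all knew them without this post. Fellow researchers are welcome to add more.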


Sources About DB

Database Research and Development by Area
http://www-ccs.cs.umass.edu/db/research-areas.html

University of Washington database group
http://data.cs.washington.edu/

Stanford CS346 Database System Implementation  http://infolab.stanford.edu/~widom/cs346/

Berkeley CS262 Readings in Database Systems  http://db.cs.berkeley.edu/cs262/syllabus.php
http://redbook.cs.berkeley.edu/redbook3/lecs.html

VLDB 2007 http://www.vldb2007.org/ (click "Best Paper Awards" on the left)
