CS 525s - Resources - Fall 2006

Where to find relevant material?

Where to look up papers to supplement your understanding of a topic, to prepare your proponent presentation to prepare for your rebuttal, to get started on your research project, and so on.

One good on-line server for where to go whenever you are looking for high-quality database related literature is:

DBLP (Digital Bibliography and Library Project: http://www.informatik.uni-trier.de/~ley/db/index.html

Of course, the library is another great place, as it has interlibrary loan, search services into the IEEE digital library and the ACM digital library, citation indices, and so on, that you should familiarize yourself with. These will likely be tools you will cherish to use in future projects as well beyond this course.

Journals (typically published once a month):

ACM Transactions on Database Systems (ACM TODS)
IEEE Transactions on Knowledge and Database Systems (ACM TKDE)
Data and Knowledge Engineering Journal (DKE)
Information Systems Journal
ACM Transactions on Office Information Systems (ACM OIS)
IEEE Transactions on Software Engineering (ACM TSE)

Database Conference proceedings (typically published once a year):

ACM Special Int. Group on Data Management (ACM SIGMOD)
Int. Conf. on Very Large Databases (VLDB)
IEEE Int. Conf. on Data Engineering (ICDE)
Int. Conf on Information and Knowledge Management (CIKM)
ACM Principles on Database Systems (PODS)
WWW Conference

The above lists are just a subset and there are additional respectable sources for database material. For this course, you should "visit" the library to make you familiar with additional methods of locating material from valid sources. Exclusive use of material from "non-referred" web sites is not sufficient and at times dangerous to one's health :). Know your sources !

Selected Subset of Stream and Sensor-Related Publications

Special Workshops in Sensors/Streaming (typically published once a year):

A variety of smaller workshops are beginning to be offered, typically attached under the umbrella of some larger conference. These include:

2nd International Conference on Geosensor Networks (GSN2.0), WORKSHOP, Oct 1-3 2006, to be held in Boston, MA

Some Subset of Relevant Project Web Sites

Some Subset of Possible Sources of Data

Data is either synthetic or 'real'. Synthetic data means that you have generated, typically using some special-purpose generator with the goal to control certain parameters that you deem likely to influence the outcome of your results, such as arrival rates, selectivities, and so on. On the other hand, you are also encouraged to work with "real data" in your experiments and demonstration, if at all possible. You can often find data on the web if you do some looking. Ideas for 'real data' include:

Some Places for Getting Access to Source-Code

You may want to consider to build your implementation on top of the Berkeley TelegraphCQ prototype, which is a publicly-available data stream management system code-base.

Or, you could look at Aurora source, or more likely, its follow-on system, the Borealis source.

Or, we also have our own WPI stream query engine, called CAPE, that would be available for your use. The high-level project description can be found here. Though no user documentation is available at this time.

Other Resources

Acceptance rates of Major database Conferences.

DBLP (DataBase systems and Logic Programming) Computer Science Bibliography: (information about database system conferences, journals and research groups)

DBWORLD (home of DB researchers worldwide.)

Citeseer publications research index: (Computer and Information science papers)

DBWORLD (home of DB researchers worldwide.)

Mining Data Streams Bibliography:

Relevant Courses Elsewhere

Data Stream Processing (a course at CMU).

Data Stream Management System (a course at Brown Univ).