CS 561 Spring 2007.
CS561 "HOMEWORK" #9
Assigned: Thursday, Mar 29, 2007
Due: Thursday, April 5, 2007
To do:
-
Read the assigned readings on distributed query processing
(ACM Survey paper) and on parallel and distributed databases (book chapter).
They are long readings, so your primary focus should be first on the former
and a little less on the later.
I have handed out copies in class.
Oone is from the CS542 textbook, and the other,
the ACM Survey paper, is also linked into our course webpage.
-
Prepare a written report
on the two readings to be turned in in the next class meeting.
The goal is to tease out some of the main points from
the assigned readings.
-
Topics to consider in your critique include:
-
Describe several algorithms for processing queries in
distributed systems, with emphasis on expensive queries such as joins
across such systems. What are the key ideas compared to centralized
join algorithms?
-
Outline techniques for performance optimization in distributed
database systems, such as
the application of caching, replication, etc.
-
Characterize the difference between parallel database systems
and distributed database systems, including query processing
within such systems.
-
Briefly describe the different types of distributed architectures discussed
in your readings.
Sketch the most important
techniques proposed in these systems, include
query processing issues.
Focus on differences, if any, that are specific to these architectures.
-
Lastly, please postulate which, if any, of the
techniques that have been utilized for the processing
of queries in distributed static databases may also be applicable
for distributed processing of stream monitoring queries.
Or, vice versa, do you have any thoughts on what, if any, issues
may be different for stream processing compared to relational
query processing in large-scale networked systems.
This clearly is an open-ended question for which no one correct answer
exists.