Distributed Database Systems
© 2001 M. Tamer Özsu and Patrick Valduriez
Outline
Introduction
Background
Distributed DBMS Architecture
Distributed Database Design (Briefly)
Distributed Query Processing (Briefly)
Distributed Transaction Management (Extensive)
Building Distributed Database Systems (RAID)
Mobile Database Systems
Privacy, Trust, and Authentication
Peer to Peer Systems
Distributed DBMS © 2001 M. Tamer Özsu & Patrick Valduriez Page 0.2
Instructor Introduction
Bharat Bhargava
Professor of Computer Sciences, Purdue University
West Lafayette, IN 47907
Phone: 765-494-6013, Email: bb@cs.purdue.edu
Professor Bhargava has taught the “Distributed Database Systems” course
twenty times since 1985. He has graduated the largest number of Ph.D.
students in the Computer Sciences Department at Purdue University. He has
been inducted into the “Book of Great Teachers” at Purdue University.
Professor Bhargava's research involves both theoretical and experimental
studies in distributed systems. His research group has implemented a robust
and adaptable distributed database system called RAID and an adaptable video
conferencing system, and is involved in networking research. Prof. Bhargava
has conducted experiments in large-scale distributed systems,
communications, authentication, key management, fault tolerance, and
Quality of Service. His current interests are in secure mobile systems,
multimedia security, and QoS as a security parameter.
Instructor Introduction
Professor Bhargava was the chairman of the IEEE Symposium on Reliable and
Distributed Systems held at Purdue in October 1998. He is on the editorial
board of three international journals. At the 1988 IEEE Data Engineering
Conference, he and John Riedl received the best paper award for their work
on "A Model for Adaptable Systems for Transaction Processing."
Professor Bhargava is a Fellow of the Institute of Electrical and Electronics
Engineers and of the Institute of Electronics and Telecommunication Engineers.
He has been awarded the charter Gold Core Member distinction by the IEEE
Computer Society for his distinguished service. He received Outstanding
Instructor Awards from the Purdue chapter of the ACM in 1996 and 1998.
In 1999, he received the IEEE Technical Achievement Award for the major
impact of his decade-long contributions to the foundations of adaptability in
communication and distributed systems. Prof. Bhargava's students have
received best paper awards at international conferences and have started a
Nasdaq-listed company.
For details please see http://www.cs.purdue.edu/people/bb
Course Introduction
This course covers the fundamental issues in large
distributed systems, which are motivated by computer
networking and the distribution of processors and
control. The theory, design, implementation, and
performance of large systems will be discussed.
Concurrency, Consistency, Integrity, Reliability,
Privacy, and Security in distributed systems will be
included.
Advanced topics in the course include research
related to mobile data management, streaming
databases, and peer-to-peer systems.
Slides
Most of the slides are taken from
http://www.cs.ualberta.ca/~database/ddbook.html
based on the main textbook by Tamer Özsu and
Patrick Valduriez
Some slides have been excluded from various chapters
Some slides have been updated by Prof. Bhargava
Some slides have been prepared by Prof. Bhargava
Some slides are based on research papers that are
available at the website as reading materials
http://www.cs.purdue.edu/homes/bb/cs54206Spr
Course Introduction
Textbooks
Principles of Distributed Database Systems, Prentice Hall, Tamer
Özsu and Patrick Valduriez (Main Text)
Concurrency Control and Reliability in Distributed Systems, Van
Nostrand and Reinhold Publishers, Bharat Bhargava (Ed.), 1987
(Out of Print)
Some reading materials based on current research topics.
Supplemental Textbook
Principles of Database Systems. Computer Science Press, Rockville,
J. D. Ullman, 1982
Assignments and Grading Policy
Non-programming assignments: 5 (one every two or three weeks;
40% of grade)
Midterm and Final Exams: 2 (30% of grade each)
>87% = A, 77-87% = B, 60-77% = C, 50-60% = D, <50% = F
Distributed Database Systems
Computer network (communication system)
Database systems
Users (programs, transactions)
Examples:
Distributed INGRES (UC Berkeley)
SDD-1 (Computer Corporation of America)
DB2 and System R* (IBM)
SIRIUS-DELTA (INRIA, France)
RAID (Purdue)
Distributed Database Systems
Computer Networks:
Ethernet
ATM
FDDI
ARPANET
BITNET
Internet2
…
Communications:
UDP/IP
TCP/IP
ISO
User Interaction:
SQL
Transaction
Fundamental References
Bharat Bhargava (Ed.), Concurrency Control and Reliability in
Distributed Systems, Van Nostrand and Reinhold Publishers, 1987.
A. Helal, A. Heddaya, and B. Bhargava, Replication Techniques in
Distributed Systems, Kluwer Academic Publishers, 1996.
J. Gray and A. Reuter. Transaction Processing: Concepts and
Techniques. Morgan Kaufmann, 1993.
M.T. Özsu and P. Valduriez. Principles of Distributed Database Systems,
2nd edition. Prentice Hall, 1999.
S. Ceri and G. Pelagatti. Distributed Databases: Principles and Systems.
McGraw Hill, 1984.
D.A. Bell and J.B. Grimson. Distributed Database Systems.
Addison-Wesley, 1992.
Fundamental References (see Website)
B. Bhargava, Building Distributed Database Systems.
B. Bhargava and John Riedl, The Raid Distributed Database System, IEEE Trans on Software
Engineering, 15(6), June 1989.
B. Bhargava, Concurrency Control in Database Systems, IEEE Trans on Knowledge and Data
Engineering, 11(1), Jan.-Feb. 1999.
B. Bhargava and John Riedl, A Model for Adaptable Systems for Transaction Processing, IEEE
Transactions on Knowledge and Data Engineering, 1(4), Dec 1989.
B. Bhargava and M. Annamalai, A framework for communication software and measurements for
digital library, Journal of Multimedia Systems, 2000.
B. Bhargava and C. Hua. A Causal Model for Analyzing Distributed Concurrency Control
Algorithms, IEEE Transactions on Software Engineering, SE-9, 470-486, 1983.
E. Mafla, and B. Bhargava, Communication Facilities for Distributed Transaction Processing
Systems, IEEE Computer, 24(8), 1991.
Y. Zhang and B. Bhargava, WANCE: Wide area network communication emulation systems, IEEE
workshop on Parallel and Distributed Systems, 1993.
G. Ding and B. Bhargava, Peer-to-peer File-sharing over Mobile Ad hoc Networks, in the First
International Workshop on Mobile Peer-to-Peer Computing, Orlando, Florida, March 2004.
M. Hefeeda, A. Habib, B. Botev, D. Xu, B. Bhargava, PROMISE: Peer-to-Peer Media Streaming
Using CollectCast, in Proc. of ACM Multimedia 2003, 45-54, Berkeley, CA, November 2003.
Y. Lu, W. Wang, D. Xu, and B. Bhargava, Trust-Based Privacy Preservation for Peer-to-peer, in the
1st NSF/NSA/AFRL Workshop on Secure Knowledge Management (SKM), Buffalo, NY, Sep. 2004.
B. Bhargava, Y. Zhang, and E. Mafla, Evolution of a communication system for distributed
transaction processing in RAID, Computing Systems, 4(3), 1991.
E. Pitoura and B. Bhargava, Data Consistency in Intermittently Connected Distributed Systems,
IEEE TKDE, 11(6), 1999.
Fundamental References (cont’d)
E. Pitoura and B. Bhargava, Maintaining Consistency of Data in Mobile Distributed Environments,
ICDCS, 1995.
A. Zhang, M. Nodine, and B. Bhargava, Global scheduling for flexible transactions in heterogeneous
distributed database systems, IEEE TKDE, 13(3), 2001.
P. Bernstein and N. Goodman, Concurrency Control in Distributed Database Systems, ACM Computing
Surveys, 13(2), 1981.
P. Bernstein, D. Shipman, and J. Rothnie, Concurrency control in a system for distributed databases
(SDD-1), ACM Transactions on Database Systems, 5(1), 1980.
Jim Gray, The Transaction Concept: Virtues and Limitations, VLDB, 1981.
H.T. Kung and John T. Robinson, On Optimistic Methods for Concurrency Control, ACM Trans. Database
Systems, 6(2), 1981.
C. Papadimitriou, The serializability of concurrent database updates, Journal of the ACM, 26(4), 1979.
D. Skeen, A Decentralized Termination Protocol, IEEE Symposium on Reliability in Distributed Software
and Database Systems, July 1981.
D. Skeen, Nonblocking commit protocols, ACM SIGMOD, 1981.
D. Skeen and M. Stonebraker, A Formal Model of Crash Recovery in a Distributed System, IEEE Trans.
Software Eng. 9(3): 219-228, 1983.
W. W. Chu, Optimal File Allocation in a Multiple Computer System, IEEE Transactions on Computers,
885-889, October 1969.
B. Bhargava and L. Lilien, Private and Trusted Collaborations, in Proceedings of Secure Knowledge
Management (SKM), Amherst, NY, Sep. 2004.
S. B. Davidson, Optimism and consistency in partitioned distributed database systems, ACM Transactions
on Database Systems 9(3): 456-481, 1984.
S. B. Davidson, H. Garcia-Molina, and D. Skeen, Consistency in Partitioned Networks, ACM Computing
Surveys, 17(3): 341-370, 1985.
B. Bhargava, Resilient Concurrency Control in Distributed Database Systems, IEEE Trans. on Reliability,
R-31(5): 437-443, 1984.
D. Parker Jr., et al., Detection of Mutual Inconsistency in Distributed Systems, IEEE Trans. on Software
Engineering, SE-9, 1983.
Other References
Transaction Management:
P.A. Bernstein and E. Newcomer. Principles of Transaction
Processing for the Systems Professional, Morgan Kaufmann, 1997.
P.A. Bernstein, V. Hadzilacos, and N. Goodman. Concurrency
Control and Recovery in Database Systems. Addison-Wesley, 1987.
(Out of Print)
M. Buretta. Data Replication, Wiley, 1997.
V. Kumar (ed.). Performance of Concurrency Control Mechanisms in
Centralized Database Systems, Prentice Hall, 1996.
V. Kumar and S.H. Son. Database Recovery, Kluwer, 1998.
C.H. Papadimitriou. The Theory of Concurrency Control. Computer
Science Press, 1986.
Other References
Interoperability:
A.K. Elmagarmid, M. Rusinkiewicz, and A. Sheth (eds). Management
of Heterogeneous and Autonomous Database Systems, Morgan
Kaufmann, 1998.
A. Bouguettaya, B. Benatallah, and A. Elmagarmid (eds.).
Interconnecting Heterogeneous Information Systems, Kluwer, 1998.
J. Siegel (ed.). CORBA Fundamentals and Programming, Wiley,
1996.
K. Brockschmidt. Inside OLE, 2nd edition, Microsoft Press, 1995.
K. Geiger. Inside ODBC, Microsoft Press, 1995.
Other References
Data Warehousing
There are many books. A small sample:
W. Inmon. Building the Data Warehouse. John Wiley
and Sons, 1992.
A. Berson and S.J. Smith. Data Warehousing, Data
Mining, and OLAP. McGraw Hill, 1997.
S. Chaudhuri and U. Dayal. Overview of Data Warehousing
and OLAP Technology. ACM SIGMOD Record, March 1997,
26(1), pp. 65-74.
IEEE Q. Bull. Data Engineering, Special Issue on
Materialised Views on Data Warehousing, June 1995, 18(2).
Other References
Mobile Databases
A. Helal et al. Any Time, Anywhere Computing, Kluwer, 1999.
T. Imielinski and H. Korth. Mobile Computing. Kluwer
Publishers, 1996.
E. Pitoura and G. Samaras. Data Management for Mobile
Computing. Kluwer Publishers, 1998.
T. Imielinski and B.R. Badrinath. Data Management Issues
in Mobile Computing. Communications of the ACM, October
1994, 37(10): 18-28.
M. H. Dunham and A. Helal. Mobile Computing and
Databases: Anything New? ACM SIGMOD Record, December
1995, 24(4): 5-9.
G. H. Forman and J. Zahorjan. The Challenges of Mobile
Computing, Computer, April 1994, 27(4): 38-47.
Other References
Web Data Management
S. Abiteboul, P. Buneman, D. Suciu. Data on the Web,
Morgan Kaufmann, 2000.
D. Florescu, A. Levy, and A. Mendelzon, Database
Techniques for the World Wide Web: A Survey, ACM
SIGMOD Record, 27(3): 59-74, 1998.
S. Bhowmick, S. Madria, and W. K. Ng, Web Data
Management: A Warehouse Approach, Springer, 2003.