Introduction to Database Systems by Prof. Trummer

This is an introduction to relational and non-relational database management systems. We will learn how to query database systems via languages such as SQL (the structured query language). Then, we will see how database systems work internally, how they store and index data, how they process and optimize queries, and how they process transactions while providing guarantees such as isolation, atomicity, and durability (ACID guarantees).

We will address questions of database design and discuss novel approaches to data management via NoSQL or NewSQL systems. Finally, we will discuss about graph, stream, and spatial data, and about systems that are specialized for those data types. The following recordings are taken from the Fall 2020 online lecture at Cornell by Prof. Immanuel Trummer (www.itrummer.org).

Click on the following topics to access videos and slides:

Relational Data Processing

Intro to Query Processing; Buffer Management; Slides
Estimating Processing Costs; Simple Nested-Loops Join Algorithms; Slides
Advanced Join Algorithms: Hash Join, Sort-Merge Join; Slides
Join Wrap Up; Other Relational Operators; Connecting Operators; Slides 1; Slides 2
Query Optimization; Cost and Cardinality Estimation; Slides
Plan Enumeration; Query Optimization Demo in Postgres; Slides 1; Slides 2

Additional Reading

About the first two thirds of the course are based on the textbook "Database Management Systems" by Raghu Ramakrishnan and Johannes Gehrke. The last third of the course discusses recently proposed systems, more details about them can be found in the corresponding research papers.