Date |
Topic |
Zoom |
Assignments |
Reading |
Non-required Reading |
Tue 01/16 |
Course Intro
| [lecture] |
HW 1 out
|
1) How (and How Not) to Write a Good Systems Paper (Levin and Redell, SIGOPS OSR, 1983)
2) On Being the Right Size (Haldane, 1928)
lecture slides
|
You and Your Research(Hamming 1986)
|
Thu 01/18 |
Correctness
| [lecture] |
HW1 in-class quiz
|
1) The UNIX Timesharing System (Ritchie and Thompson, SOSP, 1974) other notes
2) Medical Devices, The Therac-25 (Levinson, Safeware: System Safety and Computers, 1995) SKIM!
3) The Rise of Worse is Better (Richard Gabriel, Lisp: Good News, Bad News, How to Win Big, 1989)
Therac lecture slides Unix lecture slides
|
The Wrong Patient (Chassin and Becher, American College of Physicians - American Society of Internal Medicine, 2002)
Radiation Offers New Cures, and Ways to Do Harm (New York Times 2010 flash animation)
The Task of the Referee (Smith)
|
Tue 01/23 |
OS Design
| [lecture] |
|
1) Exokernel: An Operating System Architecture for Application-Level Resource Management (Engler, Kaashoek, and O'Toole Jr., SOSP, 1995) other notes
2) End-to-end Arguments in System Design -- (through page 4) -- (Saltzer, Reed, and Clark, TOCS, 1984) other notes
Unix lecture slides
|
1) The Linux Edge (Linus Torvalds, Open Sources: Voices from the Open Source Revolution 1999)
Singularity: Rethinking the Software Stack (Hunt and Larus, SIGOPS OSR, 2007)
THE (Dijkstra, 1968)
Application performance and Flexibility on Exokernel Systems (Kaashoek, Engler, Ganger, Briceno, Hunt, Mazieres, Pinckney, Grimm, Jannotti, and Mackenzie, SOSP, 1997)
On Micro-Kernel Construction (Liedtke, SOSP, 1995) notes
|
Thu 01/25 |
OS Design
| [lecture] |
|
1) Threads and Input/Output in the Synthesis Kernel (Massalin and Pu, SOSP, 1989) other notes
Exokernel lecture slides
|
Chapter 6 from Massalin's Thesis (1992)
Protection and the Control of Information Sharing in Multics (Saltzer, 1974)
|
Tue 01/30 |
mmap + page faults
| [lecture] |
|
Understanding the Linux Kernel (3rd Edition) Chapter 9, Process Address Space (skim)
1) Practical, transparent operating system support for superpages (Navarro, Iyer, Druschel, and Cox, OSDI 2002)
Synthesis lecture slides
|
Coordinated and Efficient Huge Page Management with Ingens
Machine-Independent Virtual Memory Management for Paged Uniprocessor and Multiprocessor Architectures (Rashid, Tevanian, Young, Golub, Baron, Black, Bolosky, and Chew, 1987)
|
Thu 02/01 |
Virtual machines
| [lecture] |
Lab 0 due
|
1) Xen and Art of Virtualization (Barham, Dragovic, Fraser, Hand, Harris, Ho, Neugebaur, Pratt and Warfield, SOSP 2003). other notes
2) Are Virtual Machine Monitors Microkernels Done Right? (Hand, Warfield, Fraser, Kotsovinos, and Magenheimer, HotOS 2005)
3) Are Virtual Machine Monitors Microkernels Done Right? (Heiser, Uhlig, and LeVasseur, SIGOPS OSR, 2006)
Superpages lecture slides
|
Intel Virtualization Technology (Uhlig, Neiger, Rodgers, Santoni, Martins, Anderson, Bennett, Kagi, Leung, and Smith, IEEE Computer 2005)
Intel Virtualization Technology (journal version) (Neiger, Santoni, Leung, Rodgers, Uhlig, Intel Technology Journal 2006)
|
Tue 02/06 |
Virtual machines
| [lecture] |
|
1) Memory Resource Management in VMware ESX Server (Waldspurger, OSDI, 2002) other notes
Xen lecture slides
|
The Turtles Project: Design and Implementation of Nested Virtualization (Ben-Yehuda, Day, Dubitzky, Factor, Har’El, Gordon, Liguoriz, Wasserman, Yassour, OSDI 2010)
Unix as an Application Process (Golub, USENIX, 1990)
Compatibility is not transparency VMM detection myths and realities (Garfinkel, Adams, Warfield, and Franklin, HOTOS, 2007)
Disco: Running Commodity Operating Systems on Scalable Multiprocessors (Bugnion, Devine, and Rosenblum, SOSP, 1997) notes
|
Thu 02/08 |
Virtualization
| [lecture] |
|
1) Memory Resource Management in VMware ESX Server (Waldspurger, OSDI, 2002) other notes
Xen lecture slides ESX+VMMs-vs-micro-kernels lecture slides
|
Rethinking the Library OS from the Top Down (Porter, Boyd-Wickizer, Howell, Olinsky, and Hunt, ASPLOS 2011)
Containers
Dune: Safe User-level Access to Privileged CPU Features (Belay, Bittau, Mashtizadeh, Terei, Mazieres, Kozyrakis, OSDI 2012)
|
Tue 02/13 |
Lightweight Virtualization
| [lecture] |
|
1) Arrakis: The Operating System is the Control Plane (Peter, Li, Zhang, Ports, Woos, Krishnamurthy, Anderson, Roscoe. OSDI, 2014) other notes
Arrakis lecture slides JITSU lecture slides
|
|
Thu 02/15 |
Lightweight Virtualization
| [lecture] |
Lab 1 due
|
1) Jitsu: just-in-time summoning of unikernels (Madhavapeddy, Leonard, Skjegstad, Gazagnaire, Sheets, Scott, Mortier, Chaudhry, Singh, Ludlam, Crowcroft, Leslie, NSDI 2015)
2) AvA: Accelerated Virtualization of Accelerators(Yu, Akshintala, Peters, Rossbach ASPLOS 2020)
JITSU lecture slides
|
|
Tue 02/20 |
Lightweight Virtualization
| [lecture] |
|
1) X-Containers: Breaking Down Barriers to Improve Performance and Isolation of Cloud-Native Containers (Shen, Sun, Sela, Bagdasaryan, Delimitrou, Van Renesse, Weatherspoon, ASPLOS, 2019) other notes
JITSU lecture slides
|
Rethinking the Library OS from the Top Down (Porter, Boyd-Wickizer, Howell, Olinsky, and Hunt, ASPLOS 2011)
Containers
Dune: Safe User-level Access to Privileged CPU Features (Belay, Bittau, Mashtizadeh, Terei, Mazieres, Kozyrakis, OSDI 2012)
|
Thu 02/22 |
File systems
| [lecture] |
|
1) Crash Consistency: FSCK and Journaling (Chapter 42 in Operating Systems: Three Easy Pieces) (R. Arpaci-Dusseau, A. Arpaci-Dusseau)
2) A Fast File System for UNIX (McKusick, Joy, Leffler, and Fabry, TOCS, 1984) other notes
X-Containers lecture slides Arrakis lecture slides
|
Disks from the Perspective of a File System (Marshall Kirk McKusick, ACM Queue, 2012)
Improving the Performance of fsck in FreeBSD (Marshall Kirk McKusick 2013)
|
Tue 02/27 |
File systems
| [lecture] |
|
1) The Design and Implementation of a Log-Structured File System (Rosenblum and Ousterhout, TOCS, 1992) other notes
2) Ousterhout's critique of Seltzer's 1993 paper
3) Ousterhout's critique of Seltzer's 1995 paper
4) Seltzer's response to Ousterhout's critiques
5) Ousterhout's response to Seltzer
6) Log structured file systems: There's one in every SSD (Valerie Aurora (formerly Henson)) LWN 2009)
Arrakis lecture slides AvA lecture slides
|
An Introduction to Disk Drive Modeling (Ruemmler and Wilkes, 1994)
Soft Updates: A Technique for Eliminating Most Synchronous Writes in the Fast Filesystem (McKusick and Ganger, USENIX, 1999)
Generalized File System Dependences (Frost, Mammarella, Kohler, de los Reyes, Hovsepian, Matsuoka, and Zhang, SOSP 2007)
|
Thu 02/29 |
Networked file systems
| [lecture] |
|
1) Design and Implementation of the SUN Network Filesystem (Sandberg, Goldberg, Kleiman, Walsh, and Lyon, USENIX, 1985) other notes
LFS lecture slides FFS lecture slides
|
XDR: External Data Representation Standard (M. Eisler, Ed., RFC 4506, 2006) pdf txt
Remote Procedure Calls (Birrell and Nelson, TOCS, 1984) notes
|
Tue 03/05 |
Data center file systems
| [lecture] |
Lab 2 due
|
1) The Google File System (Ghemawat, Gobioff, and Leung, SOSP, 2003) other notes
lecture slides
|
1) GFS: Evolution on Fast-forward (McKusick and Quinlan, ACM Queue, 2009)
|
Thu 03/07 |
Exam
| |
|
Exam in class. faux-quiz compilation
|
|
Tue 03/12 |
no class
| |
|
Spring Break!
|
|
Thu 03/14 |
no class
| |
|
Spring Break!
|
|
Tue 03/19 |
NFS catch up
| |
|
catch up!
|
|
Thu 03/21 |
GFS catch up
| |
Project Plan DUE
|
lecture slides
|
|
Tue 03/26 |
Events and Threads
| [lecture] |
|
1) Experiences with Processes and Monitors in Mesa (Lampson and Redell, Communications of the ACM 23, 2, 1980) notes
2) The "Double-Checked Locking is Broken" Declaration (Bacon, Bloch, Bogda, Click, Haahr, Lea, May, Maessen, Mitchell, Nilsen, Pugh, Sirer, 2004)
lecture slides
|
JSR 133 (Java Memory Model FAQ by Manson and Goetz, 2004)
|
Thu 03/28 |
Events and Threads
| [lecture] |
Lab 3 due
|
1) Scheduler Activations (Anderson, Bershad, Lazowska, and Levy, TOCS, 1992)
2) Introduction to Non-blocking IO (Kegel 1999)
3) Exchange between Linus Torvalds and David Patterson about parallelism Part 1
4) Part 2
5) Part 3
lecture slides
|
Event-driven Programming for Robust Software (Dabek, Zeldovich, Kaashoek, Mazieres, and Morris, SIGOPS, 2002)
Why Events Are A Bad Idea (for high-concurrency servers) (von Behren, Condit, and Brewer, HOTOS, 2003)
Cooperative Task Management without Manual Stack Management Adya, Howell, Theimer, Bolosky, Douceur, USENIX 2002)
|
Tue 04/02 |
Concurrency and Transactions
| [lecture] |
Revised Project Plan DUE
|
Ordering concurrent events and transactions in Principles of Transaction Processing (second edition by Philip A. Bernstein and Eric Newcomer 2009) book notes
The Art of Multiprocessor Programming (Maurice Herlihy and Nir Shavit, 2008) 3-3.6 book notes notes
lecture slides
|
Transaction Processing: Concepts and Techniques (Jim Gray and Andreas Reuter 1993) 1.1 - 1.2.5 4.2, 4.7 and 4.7.1, 4.9 and 4.9.1 7.1-7.6
The Transaction Concept (Gray, 1981) notes
|
Thu 04/04 |
Concurrency and Transactions
| [lecture] |
|
1) Operating System Transactions (Porter, Hofmann, Rossbach, Benn, and Witchel, SOSP, 2009)
lecture slides
|
Experiences with transactions in QuickSilver (Shmuck and Wyllie, SOSP 1991)
|
Tue 04/09 |
Concurrency & Synchronization
| [lecture] |
|
1) RaceTrack: Efficient Detection of Data Race Conditions via Adaptive Tracking (Yu, Rodeheffer, and Chen, SOSP, 2005) notes
lecture slides
|
Eraser: A Dynamic Data Race Detector for Multithreaded Programs (Savage, Burrows, Nelson, Sobalvarro, Anderson, TOCS, 1997)
|
Thu 04/11 |
Concurrency Continued
| [lecture] |
|
1) The Scalable Commutativity Rule: Designing Scalable Software for Multicore Processors (Clements, Kaashoek, Zeldovich, Morris, Kohler, SOSP 2013)
lecture slides
|
A Comparison of Approaches to Large-Scale Data Analysis (Pavlo, Paulson, Rasin, Abadi, DeWitt, Madden, Stonebreaker, SIGMOD 2009)
BigTable: A system for Distributed Structured Storage (Chang, Dean, Ghemawat, Hsieh, Wallach, Burrows, Chandra, Fikes, and Gruber, OSDI, 2006) webnotes video
Large-scale Incremental Processing Using Distributed Transactions and Notifications (Peng and Dabek, OSDI 2010)
Brewer's CAP Theorem (Browne 2009)
Spanner: Google's Globally-Distributed Database (Corbett, Dean, Epstein, Fikes, Frost, Furman, Ghemawat, Gubarev, Heiser, Hochschild, Hsieh, Kanthak, Kogan, Li, Lloyd, Melnik, Mwaura, Nagle, Quinlan, Rao, Rolig, Saito, Szymaniak, Taylor, Wang and Woodford, OSDI, 2012)
Dynamo: Amazon’s Highly Avaiable Key-value Store
PNUTS: Yahoo!'s Hosted Data Serving Platform (Cooper, Ramakrishnan, Srivastava, Silberstein, Bohannon, Jacobsen, Puz, Weaver, Yereni, VLDB, 2008)
MapReduce: Simplified Data Processing on Large Clusters (Dean and Ghemawat, OSDI, 2004)
MapReduce: A Major Step Backwards (DeWitt and Stonebreaker, 2008)
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing (Zaharia, Chowdhury, Das, Dave, Ma, McCauley, Franklin, Shenker, and Stoica, NSDI 2012)
|
Tue 04/16 |
Networking
| [lecture] |
|
1) Snap: a microkernel approach to host networking (Marty et al., SOSP 2019) lecture slides
|
Dandelion: a compiler and runtime for heterogeneous systems (Rossbach, Yu, Currey, Martin, Fetterly, SOSP 2013)
|
Thu 04/18 |
Heterogeneity
| [lecture] |
|
1) GPUfs: Integrating a file system with GPUs (Silberstein, Ford, Keidar, Witchel, ASPLOS 2013)
2) PTask: operating system abstractions to manage GPUs as compute devices (Rossbach, Currey, Silberstein, Ray, Witchel, SOSP 2011)
PTask lecture slides GPUfs lecture slides
|
Helios: heterogeneous multiprocessing with satellite kernels (Nightingale, Hodson, McIlroy, Hawblitzel, Hunt, SOSP 2009)
Tapping into the fountain of CPUs: on operating system support for programmable devices (Weinsberg, Dolev, Anker, Ben-Yehuda, Wyckoff, ASPLOS 2008)
Pegasus: coordinated scheduling for virtualized accelerator-based systems (Gupta, Schwan, Tolia, Talwar, Ranganathan, ATC 2011)
TimeGraph: GPU scheduling for real-time multi-tasking environments
TensorFlow: A system for Large-scale Machine Learning (Abadi et al., OSDI 2016)
|
Tue 04/23 |
Heterogeneity
| [lecture] |
|
1) Sharing, protection, and compatibility for reconfigurable fabric with AmorphOS (Khawaja, Landgraf, Prakash, Wei, Schkufza, Rossbach, OSDI 2018)
lecture slides
|
Helios: heterogeneous multiprocessing with satellite kernels (Nightingale, Hodson, McIlroy, Hawblitzel, Hunt, SOSP 2009)
Tapping into the fountain of CPUs: on operating system support for programmable devices (Weinsberg, Dolev, Anker, Ben-Yehuda, Wyckoff, ASPLOS 2008)
Pegasus: coordinated scheduling for virtualized accelerator-based systems (Gupta, Schwan, Tolia, Talwar, Ranganathan, ATC 2011)
TimeGraph: GPU scheduling for real-time multi-tasking environments
TensorFlow: A system for Large-scale Machine Learning (Abadi et al., OSDI 2016)
|
Thu 04/25 |
Exam 2
| [lecture] |
|
Exam in class.
|
Hints for Computer System Design (Lampson, SIGOPS OSR, 1983)
A Note on the Confinement Problem (Lampson, Communications of ACM, 1973)
Reflections on Trusting Trust <(Thompson, Turing Award Lecture, 1984)
|
Mon 04/29 |
Project due
| |
Presentations
Project DUE
|
|
|
Wed 05/01 |
Additional reading
| |
|
|
The Many Faces of Systems Research - And How To Evaluate Them (Brown, Chanda, Farrow, Fedorova, Maniatis, Scott, HotOS 2005)
The Confused Deputy (Hardy, SIGOPS OSR, 1988) lecture notes
A Decentralized Model for Information Flow Control (Myers and Liskov, SOSP, 1997)
Laminar: Practical Fine-Grained Decentralized Information Flow Control (Roy, Setty, Kilzer, Shmatikov, Witchel, PLDI 2009)
Improving Application Security with Data Flow Assertions (Yip, Wang, Zeldovich, and Kaashoek, SOSP, 2009)
Why Cryptosystems Fail (Anderson, 1994)
Linearizability: A Correctness Condition for Concurrent Objects (Herlihy and Wing, TOPLAS, 1990)
Klee: Unassisted and Automatic Generation of High-Coverage Tests for Complex Systems Programs (Cadar, Dunbar, and Engler, OSDI, 2008)
A few billion lines of code later: using static analysis to find bugs in the real world (Bessey, Block, Chelf, Chou, Fulton, Hallem, Henri-Gros, Kamsky, McPeak, Engler, CACM v53 no 2, 2010)
Weird things that surprise academics trying to commercialize a static checking tool (Chou, Chelf, Hallem, Hanri-Gros, Fulton, Unangst, Zak, and Engler, SPIN, 2005)
EROS: A Fast Capability System (Shapiro, Smith, and Farber, SOSP, 1999)
Capsicum: practical capabilities for UNIX (Watson, Anderson, Laurie, Kennaway, USENIX security 2010) < a href=lectures/capsicum.pdf>notes
|