Course: | Technology Driven Computer Architecture CS395T Unique Number: #50259 |
Instructor: | Stephen W. Keckler Taylor Hall 5.110 phone: 471-9763 skeckler@cs.utexas.edu |
Office Hours: | Monday 1-3, Tuesday 10-12, or by appointment |
Class Meetings: | MW 10-11:15, Taylor 3.144 |
Class Information: | http://www.cs.utexas.edu/users/skeckler/CS395T cs395t-tdca@cs.utexas.edu |
Objectives: | Students will develop an understanding of the relationship between technology and computer system design, as well as practice critical research and communication skills. |
Prerequisites: | Prospective students should have graduate standing and proficiency in computer architecture (CS352 or equivalent). Enrollment may be limited to 20 students. |
Grading: | 20%: Critiques of papers 20%: Class discussion participation 30%: Discussion leading 30%: Project |
Course Overview:
Since the invention of the transistor in 1947, advances in silicon
processing technology has provided not just incremental, but quantum
leaps in computer functionality and performance. We are fast
approaching the capability to fabricate billion transistor chips that
can run at gigahertz clock rates. However, with this advancing
technology comes new challenges for the computer designer. Wires and
communication, rather than transistors and computation, have now
become the most critical resource. While technological constraints
will limit the scalability of modern-day implementations, the changing
balance between communication and computation will provide
opportunities for innovative architectures to match the demands of
emerging applications with the capabilities of the underlying
technology.
This course will examine computer architectures for billion transistor
chips from the perspective of technological capabilities and
constraints. Readings will focus both on momentous technological
breakthroughs and their implications, including high density
solid-state memory, interconnection networks, and notable milestones
of integration measured in transistors per chip. The topics of
discussion will include VLSI scaling, integrated DRAM and processors,
on and off-chip communication bandwidth, power consumption, tradeoffs
between hardware and software, and novel techniques for using billions
of transistors on a single chip.
The format of the class is lecture/discussion with a project during
the second half of the course. The quality of this class and the
benefit you gain from it is determined primarily by you and your
classmates, so it is important that everyone read the papers and
come prepared to participate in discussions. The assignments for this
class will include the following:
If more than one paper is assigned, your critique should provide some
comparison of the papers.
Collaboration and discussion outside of class is strongly
encouraged, but each student will turn in her/his own
critique. Grading will be based on a satisfactory(S)/not
satisfactory(NS) scale, and not satisfactory critiques receive no
credit. Critiques are due at the
beginning of the class period, and no late critiques will
be accepted.
You may skip up to 6 critiques during the term without
penalty. A not satisfactory critique is equivalent to a
skipped critique. The critiques should reflect insight
that you have gained from the paper, so don't bother
turning in the critique if you have not read the paper.
In researching for your discussion, a facilitation group
should read the assigned paper thoroughly and find 3
papers of related or relevant work that are not already on
the primary reading list (they may however be on the
supplementary reading list). If you come across a cool
paper that you think is extremely relevant, bring a copy
to the instructor and it may be added to the supplemental
reading list or be substituted into the assigned reading list.
At the beginning of the
class period, the facilitation group will turn in the 3
related work papers and a discussion plan. The discussion
plan is an outline that describes how you plan on spending
the 75 minutes of class time. For example, if the
assigned paper was entitled, "Widgets++: Taking Widget
Technology to the Next Century", a discussion plan might
include:
Each facilitation group is free to choose how to spend the 75 minutes
in class productively and may use a variety of techniques
for presentation/discussion including: tag team
presentation by facilitation group, small group
discussions, mini-projects, etc. The expectation is that
the classes will be lively, fun, and technically
interesting.
After leading the discussion, each member of the facilitation group
will write a brief self-evaluation of the group's performance.
These self evaluations will be due to the instructor on the same day
of the presentation and will include the following:
Part of the exercise of finding background material is learning to use
the on-campus library and information resources. I highly recommend
the INSPEC database (accessible through the on-line library web page:
http://www.lib.utexas.edu) as
a starting point. The introductory seminar "Electronic Sources for
Engineering Information" is likely to be valuable and will be
conducted by the library staff at the following times:
To summarize, the facilitation group will submit the following to the
instructor:
Handouts (postscript format):
Computation Engines:
Paper Critiques:
Discussion Facilitation:
Project:
Date | Discussion Topic | Discussion Leader(s) |
---|---|---|
Aug. 26 | Administrative details, Course overview | Keckler |
Development of the Microprocessor | ||
Aug. 31 | "The History of the 4004", Federico Faggin, Marcian
E. Hoff, Stanley Mazor, and Masatoshi Shima, IEEE Micro,
vol. 16:6, December, 1996, pp. 10-20. 4004 Instruction Set (8.7MB) Related Reading:
|
Keckler |
Sep. 2 | "MicroVAX 78032 Chip, a 32-Bit Microprocessor", Dan
Dobberpuhl, Robert Supnik, and Richard Witek, Proceedings of
the IEEE International Conference on Computer Design, October
1986, pp. 414-419. Related Reading:
|
Srinivasan, Shingal, Venkatachalam |
Sep. 7 | Labor day - no class | |
Sep. 9 | "Introducing the Intel i860 64-Bit Microprocessor",
Les Kohn and Neal Margulis, IEEE Micro, August 1989,
pp. 15-30. Related Reading:
|
Ferris, Husain |
Sep. 14 | "The Alpha 21264: A 500 MHz Out-of-Order Execution
Microprocessor", Daniel Leibholz and Rahul Razdan, Proceedings of
IEEE Compcon 97, February 1997, pp. 28-36. "UltraSPARC-IIi: Expanding the Boundaries of a System on a Chip", Kevin B. Normoyle, Michael A. Csoppenszky, Allan Tzeng, Timothy P. Johnson, Christopher D. Furman, and Jamshid Mostoufi, IEEE Micro, March/April 1998, pp. 14-24. Related Reading:
|
Kothari, Sawada |
The Future of Wires | ||
Sep. 16 | "Silicon Trends and Limits for Advanced
Microprocessors", Mark Bohr, Communications of the ACM,
March 1998, vol. 41:3, pp. 80-87. Related Reading:
|
Bhargava, Srinivasan |
Sep. 21 | "Interconnect Fabrication Processes and the
Development of Low-Cost Wiring for CMOS Products",
T.J. Licata, E.G. Colgan, J.M.E. Harper, and S.E. Luce, IBM
Journal of Research and Development, July 1995, vol. 39:4,
pp. 419-435. Related Reading:
|
Keckler |
Directions for Microprocessors | ||
Sep. 23 | "A VLIW Architecture for a Trace Scheduling
Compiler", Robert P. Colwell, Robert P. Nix, John
J. O'Donnell, David B. Papworth, and Paul K. Rodman, IEEE
Transactions on Computers, August 1988, vol. 37:8, pp. 967-979. Related Reading:
|
Ferris, Sawada |
Sep. 28 |
"Complexity Effective Superscalar Processors",
Subbarao Palacharla, Norman P. Jouppi, and J.E. Smith,
Proceedings of the International Symposium on Computer
Architecture, May 1997, pp. 206-218. Related Reading:
|
Srinivasan, Shinghal |
Sep. 30 |
"Evaluation of Design Alternatives for a
Multiprocessor Microprocessor", Basem A. Nayfeh, Lance
Hammond, and Kunle Olukotun, Proceedings of the
International Symposium on Computer Architecture, May 1996,
pp. 67-77. Project assignment distributed Related Reading:
|
Kothari, Husain |
Oct. 5 |
"Exploiting Fine-Grain Thread Level Parallelism on the MIT
Multi-ALU Processor", Stephen W. Keckler, William J. Dally,
Daniel Maskit, Nicholas P. Carter, Andrew Chang, and Whay S. Lee,
Proceedings of the International Symposium on Computer
Architecture, June 1998, pp. 306-317. Related Reading:
|
Keckler |
Data Storage:
Primary Storage - RAM | ||
---|---|---|
Oct. 7 | "One-Level Storage System", T. Kilburn,
D.B.G. Edwards, M.J. Lanigan, and F.H. Sumner, IRE
Transactions, April 1962, pp. 223-235. Related Reading:
|
Husain, Shinghal |
Oct. 12 | "Trends in Semiconductor Memories", Yasunao Katayama,
IEEE Micro, November/December 1997, pp. 10-17. "Direct Rambus Technology: The New Main Memory Standard", Richard Crisp, IEEE Micro, November/December 1997, pp. 18-28. Related Reading:
|
Bhargava, Kothari |
Oct. 14 |
"A Case for Intelligent RAM", David Patterson, Tom
Anderson, Neal Cardwell, Richard Fromm, Kimberly Keaton,
Christoforos Kozyrakis, Randi Thomas, and Katherine Yelick,
IEEE Micro, March/April 1997, pp. 34-44. Related Reading:
|
Sawada |
Oct. 19 |
"Active Pages: A Computation Model for Intelligent
Memory", Mark Oskin, Frederic T. Chong, and Timothy
Sherwood, Proceedings of the
International Symposium on Computer Architecture, July 1998,
pp. 192-203. Related Reading:
|
Bhargava |
Oct. 21 |
"eNVy: A Non-Volatile, Main Memory Storage System",
Michael Wu and Willy Zwaenepoel, Proceedings of the the
Sixth International Conference on Architectural Support for
Programming Languages and Operating Systems, October 1994,
pp. 86-97. Related Reading:
|
Ferris |
Secondary Storage - Disks | ||
Oct. 26 | "A Quarter Century of Disk File Innovation", J.M
Harker, D.W. Brede, R.E. Pattison, G.R. Santana, and
L.G. Taft, IBM Journal of Research and Development,
September 1981, vol. 25:5, pp. 677-689. Related Reading:
|
Keckler |
Oct. 28 | "A Case for Redundant Arrays of Inexpensive Disks
(RAID)", David A. Patterson, Garth Gibson, and Randy
H. Katz, SIGMOD International Conference on Management of
Data, SIGMOD Record, vol. 17:3, pp. 109-116. Related Reading:
|
Husain |
Nov. 2 | "Flash Memory Goes Mainstream", Brian Dipert and
Lou Hebert, IEEE Spectrum, October 1993,
pp. 48-52. "Filing in a Flash", James Eldridge, IEEE Spectrum, October 1993, pp. 53-54. Related Reading:
|
Keckler |
Physical and Electrical Considerations:
Packaging | ||
---|---|---|
Nov. 4 | "Large Chip vs. MCM for a High-Performance System",
Evan E. Davidson, IEEE Micro, July/August 1998, pp. 33-41. Related Reading:
|
Shinghal |
Power | ||
Nov. 9 | "Low Power Design Issues", Mark Horowitz, Slides
presented at the 1998 Power Driven Microarchitecture
Workshop, pp. 1-48. "Low-Power Digital Design", M. Horowitz, T. Indermaur, and R. Gonzalez, Proceedings of the 1994 IEEE Symposium on Low Power Electronics, October 1994, pp. 10-12. "Energy Dissipation in General Purpose Processors", Ricardo Gonzalez and Mark Horowitz, Proceedings of the 1994 IEEE Symposium on Low Power Electronics, October 1994, pp. 9-11. Related Reading:
|
Bhargava, Shen |
Nov. 11 | "Dynamic Power Management for Microprocessors: A Case
Study", Vivek Tiwari, Ryan Donnelly, Sharad Malik, and Ricardo
Gonzalez, Proceedings fo the 10th International Conference on
VLSI Design, Jnauary 1997, pp. 185-192. Related Reading:
|
Kothari |
Nov. 16 | "An Architectural Level Power Estimator", Rita Yu
Chen, Mary Jane Irwin, and Raminder S. Bajwa, Proceedings of
the 1998 Power Driven Microarchitecture Workshop,
pp. 87-91. "Instruction Scheduling for Low Power Dissipation in High Performance Microprocessors", Mark C. Toburen, Thomas M. Conte, and Matt Reilly,Proceedings of the 1998 Power Driven Microarchitecture Workshop, pp. 14-19. Related Reading:
|
Ferris, Shen |
Off-Chip Communication | ||
Nov. 18 | "Starfire: Extending the SMP Envelope", Alan
Charlesworth, IEEE Micro, January/February 1998, pp. 39-49. Related Reading:
|
Keckler |
Nov. 23 | "High-Performance Bidirectional Signalling in VLSI
Systems", Larry R. Dennison, Whay S. Lee, and William
J. Dally, Proceedings of the Symposium on Integrated
Systems, March 1993, pp. 300-319. Related Reading:
|
Srinivasan |
Nov. 25 | Project Presentations Project Reports Due |
|
Nov. 30 | Class Cancelled | |
Dec. 2 | TBA | Keckler |