Theory of Parallel Systems (SMA 5509)
Lecture Notes
|
LEC # |
TOPICS |
|
Week 1 |
|
|
1 |
Dynamic Multithreading (PDF) (Courtesy of Ben Adida and Abhi Shelat. Used with permission.) |
|
Week 2 |
|
|
2 |
Cilk, Matrix Multiplication, and Sorting (PDF) |
|
3 |
Serial Performance and Caching (PDF) (Courtesy of Kenneth Barr and Zardosht Kasheff. Used with permission.) |
|
Week 3 |
|
|
4 |
Determinacy Detection and Race Detection (PDF) (Courtesy of Siddhartha Sen and Jim Sukha. Used with permission.) |
|
5 |
Consistency of the Memory Sub-System |
|
Week 4 |
|
|
6 |
Analyzing Space Bounds (PDF) (Courtesy of Jeremy Fineman and Siddhartha Sen. Used with permission.) |
|
Week 5 |
|
|
7 |
Memory Contention (PDF) (Courtesy of Barbara Mack and C. Scott Ananian. Used with permission.) |
|
8 |
Cilk Scheduler (PDF) (Courtesy of Barbara Mack and Kevin Matulef. Used with permission.) |
|
Week 6 |
|
|
9 |
Analysis of Cilk Scheduler (PDF) (Courtesy of Alexandru Caracas and C. Scott Ananian. Used with permission.) |
|
10 |
Cilk Implementation (PDF) |
|
Week 7 |
|
|
11 |
|
|
Week 8 |
|
|
12 |
|
|
13 |
Implementation of Memory Consistency (PDF) (Courtesy of Seth Gilbert and Xie Yong. Used with permission.) |
|
Week 9 |
|
|
14 |
Competitive Snoopy Caching |
|
15 |
Snoopy Caching and Spin-Block Problem |
|
Week 10 |
|
|
16 |
Hypercubic Networks 1 |
|
17 |
Hypercubic Networks 2 (PDF) |
|
Week 11 |
|
|
18 |
Hypercubic Networks 3 (PDF) (Courtesy of Sriram Saroop and Wang Junqing. Used with permission.) |
|
Week 12 |
|
|
19 |
Squish Routing |
|
20 |
Permuting Data on Parallel Disks |
|
Week 13 |
|
|
21 |
Sorting and Permuting |
|
22 |
Pick a Winner |
|
Week 14 |
|
|
23 |
|
|
24 |
Final Project Presentations (cont.) |
|
Week 15 |
|
|
25 |
Final Project Presentations (cont.) |
|
26 |
|
Assignments
Problem Set 1 (PDF)
Problem Set 2 (PDF)
Problem Set 3 (PDF)
Problem Set 4 (PDF)
Projects
Term projects will be presented in class for the last four class sessions. Your group is also responsible for a project report of 5-10 pages due on the last day of class. About one-third of your term project grade will be from your presentation and two-thirds will be from your project report.
For the final presentation, each group will have about 10-12 minutes times the number of students in your group to make your presentation, plus a few minutes for questions. All students in the group should participate actively in the presentation. Prepare PowerPoint® or other online presentation materials.
The Fall 2003 student projects are provided below.
|
STUDENTS |
PROPOSALS |
FINAL PRESENTATIONS |
FINAL PAPERS |
|
Improving Cilk |
|||
|
Kunal Agrawal and Siddhartha Sen, "Adaptively Parallel Processor Allocation for Cilk Jobs" |
(PDF) |
(PDF) |
(PDF) |
|
Alexandru Caracas, "Fast Serial-Append File I/O Mode Support for Cilk" |
(PDF) (Courtesy of Alexandru Caracas. Used with permission.) |
(PDF) (Courtesy of Alexandru Caracas. Used with permission.) |
(PDF) (Courtesy of Alexandru Caracas. Used with permission.) |
|
Jason Hickey and Tyeler Quentmeyer, "A Space-Efficient Global Scheduler for Cilk" |
(PDF) |
(PDF) |
(PDF) |
|
Sajindra Jayasena and Sharad Ganesh, "Automatic Conversion of Non Series-Parallel DAGs to Series Parallel DAGs" |
(PDF) (Courtesy of Sharad Ganesh and Sajindra Jayasena. Used with permission.) |
(PDF) (Courtesy of Sharad Ganesh and Sajindra Jayasena. Used with permission.) |
|
|
Transactional Cilk |
|||
|
C. Scott Ananian, "Language-Level Complex Transactions" |
(PDF) (Courtesy of C. Scott Ananian. Used with permission.) |
(PDF) (Courtesy of C. Scott Ananian. Used with permission.) |
|
|
Sean Lie, "An Evaluation of Nested Concurrent Transactions" |
(PDF) (Courtesy of Sean Lie. Used with permission.) |
(PDF) (Courtesy of Sean Lie. Used with permission.) |
(PDF) (Courtesy of Sean Lie. Used with permission.) |
|
Jim Sukha, "Atomic Transactions in Cilk" |
(PDF) (Courtesy of Jim Sukha. Used with permission.) |
(PDF) |
(PDF) (Courtesy of Jim Sukha. Used with permission.) |
|
Xie Yong, "Transactions in Cilk" |
(PDF)(Courtesy of Xie Yong. Used with permission.) |
(PDF) (Courtesy of Xie Yong. Used with permission.) |
(PDF) (Courtesy of Xie Yong. Used with permission.) |
|
Non-Determinacy Detection |
|||
|
Jeremy Fineman, "Linear Time Detection of Determinacy Races" |
(PDF) (Courtesy of Jeremy Fineman. Used with permission.) |
(PDF) (Courtesy of Jeremy Fineman. Used with permission.) |
(PDF) (Courtesy of Jeremy Fineman. Used with permission.) |
|
He Yuxiong, "Parallel Nondeterminator" |
|||
|
Wang Junqing, "Parallel Nondeterminator" |
(PDF) (Courtesy of Wang Junqing. Used with permission.) |
(PDF) |
|
|
Using Cilk |
|||
|
Kenneth C. Barr, "Accelerating Multiprocessor Simulation" |
|||
|
Zardosht Kesheff, "Parallelizing METIS" |
(PDF) (Courtesy of Zardosht Kasheff. Used with permission.) |
(PDF) (Courtesy of Zardosht Kasheff. Used with permission.) |
(PDF) (Courtesy of Zardosht Kasheff. Used with permission.) |
|
Paul Youn, "Parallelizing Sorting" |
|||
|
Pham Duc Minh, "Implement FIR Filter in Parallel Using Cilk" |
|||
|
Cache-Oblivious Algorithms |
|||
|
Advait D.Karande and Sriram Saroop, "Cache-Oblivious Sorting for Burrows-Wheeler Transform" |
(PDF) (Courtesy of Advait Karande and Sriram Saroop. Used with permission.) |
(PDF) (Courtesy of Advait Karande and Sriram Saroop. Used with permission.) |
(PDF) (Courtesy of Advait Karande and Sriram Saroop. Used with permission.) |
|
Zhang Jiahui and Neel Kamal, "New Cache-Oblivious Algorithms" |
(PDF) (Courtesy of Zhang Jiahu. Used with permission.) |
||
|
Seth Gilbert, "Cache-Oblivious, Lock-Free Algorithms" |
(PDF) (Courtesy of Seth Gilbert. Used with permission.) |
(PDF) (Courtesy of Jeremy Fineman and Seth Gilbert. Used with permission.) |
(PDF) (Courtesy of Seth Gilbert. Used with permission.) |

