Welcome to CS 159!
The goal of the class is to bring students up to speed in two topics in modern machine learning research through a series of lectures. Students will then go on to conduct a mini research project at the end of the class. The two topics are:
- Predictive control & model-based reinforcement learning;
- Neural network theory: learning & generalisation.
Caltech listing: CS/CNS/EE/IDS 159
(3-0-6) TTh 2:30-4:00.
Office hours: MW 3:00-4:00
.
Instructors
- Yisong Yue (yyue@caltech.edu);
- Ugo Rosolia (urosolia@caltech.edu);
- Jeremy Bernstein (bernstein@caltech.edu).
Teaching assistants
- Natalie Bernat (nbernat@caltech.edu);
- Joe Marino (jmarino@caltech.edu);
- Alex Farhang (afarhang@caltech.edu).
Structure
- Weeks 1–3: lectures on reinforcement learning & control;
- Weeks 4–6: lectures on neural network theory;
- Weeks 7–10: research project and special guest lectures.
Discord
Office hours will be held on the class Discord server. A signup link will be sent to students during the first week of class. Students are encouraged to use the Discord freely at other times to discuss homeworks, projects and any other class-related matters. If you do not have access to the Discord by the end of the first week of class, email an instructor.
It goes without saying, but please remember to treat other students with respect and don’t say anything you wouldn’t in a Caltech classroom.
Grading
The final grade will be based 20% on homeworks and 80% on the final project.
Homeworks
Students should complete homeworks in groups of two to three—submitting one solution document per group. All students in a group should understand all parts of the completed homework. If you don’t have a group, use the #matchmaking
channel on Discord to find one.
Completed homeworks should be submitted on Gradescope. If you do not have access to the Gradescope by the end of the first week of class, email an instructor.
Homework schedule
Homeworks will be set on Thursday and due the Thursday after.
# | Date set | Date due | Resources |
---|---|---|---|
1 | 4/01 | 4/08 | hw1.zip |
2 | 4/08 | 4/15 | hw2.zip |
3 | 4/22 | 4/29 | hw3.zip |
4 | 4/29 | 5/06 | hw4.pdf |
Late policy
A group has a total of 48 late hours for the term. This means that if the first homework is submitted 47 hours late and the second homework is submitted two hours late, the group would incur no penalty on the first homework but score zero on the second.
Final project
The final project will be conducted during the last four weeks of class. Students should work in teams of two to three. Students may either pick one of the special topics to build their project on, or alternatively they may attempt to mix the two topics. The poster session is on 6/03 and project reports are due 6/04.
Lecture schedule
# | Date | Subject | Resources |
---|---|---|---|
0 | 3/30 | Introduction | pdf / vid |
Topic 1—RL & Control | |||
1 | 3/30 | Discrete MDPs | pdf / vid |
2 | 4/01 | Optimal Control | pdf / vid |
3 | 4/06 | Model Predictive Control | pdf / vid |
4 | 4/08 | Learning MPC | pdf / vid / supp |
5 | 4/13 | Model Learning in MPC | pdf / vid |
6 | 4/15 | Planning Under Uncertainty and Project Ideas | pdf / vid |
Topic 2—Neural Network Theory | |||
7 | 4/20 | Neural Architecture Design | pdf / vid |
8 | 4/22 | Network Function Spaces | pdf / vid / ipynb |
9 | 4/27 | Network Optimisation | pdf / vid |
10 | 4/29 | Statistical Learning Theory | pdf / vid |
11 | 5/04 | PAC-Bayesian Theory | pdf / vid |
12 | 5/06 | Project Ideas | pdf / vid |
Final Project & Guest Lectures | |||
13 | 5/11 | Joe Marino | pdf / vid |
14 | 5/13 | Yasaman Bahri | pdf / vid |
15 | 5/18 | — | |
16 | 5/20 | Guanya Shi | pdf / vid |
17 | 5/25 | Roberto Calandra | pdf / vid |
18 | 5/27 | Guillermo Valle-Pérez & Ard Louis | pdf / vid |
19 | 6/01 | SueYeon Chung | pdf / vid |
20 | 6/03 | Poster session |
Reading material
Additional references are listed on the control and learning subpages.
Acknowledgements
- The Jekyll theme is modified from Jekyll Now.
- The brain, controller and robot images are from Pixabay.
- Some CSS is borrowed from Berkeley’s EE227C.