This class is part of the Master 2 MVA. It will last 18 hours (3x6 lectures) + 2h for the exam.
Final grade: approximately 70% final exam, 30% homeworks (to implement some of the algorithms seen in class).
It will be on March 20th 2026, 10h-12h. A single two-sided sheet of handwritten notes (with any content) will be allowed for the exam. The exam will be divided into two separate parts (online convex optimization and stochastic bandits), each of which should be returned on a separate sheet. Please bring your own sheet of paper for the exam.
The homework is due by Friday, March 20, 2026. More information will be posted here. It can be done alone or in groups of two students. The report can be written in English or in French.
Friday mornings from 9h00 to 12h00 at ENS Paris-Saclay. Information will be provided on the website before each class.
| # | Date | Teacher | Where | Title |
|---|---|---|---|---|
| 1 | 16/01/2026 | PG | 1Z28 | Introduction. Online convex optimization (slides 1-57). |
| 2 | 23/01/2026 | PG | 1Z28 | Online convex optimization (slides 58-79). |
| 3 | 31/01/2026 | PG | 1Z28 | Adversarial Bandits (slides 80-108). |
| 4 | 06/02/2026 | RD | 1Z28 | Stochastic Bandits 1. Basic algorithms: Explore-Then-Commit, Upper Confidence Bound, ε-greedy. |
| 5 | 13/02/2026 | RD | 1Z28 | Stochastic Bandits 2. Linear and continuous bandits. |
| 6 | 20/02/2026 | RD | 1Z28 | Stochastic Bandits 3. Lower bounds. Best arm identification. |
| 20/03/2026 | 1Z31 | Exam (from 10:00 to 12:00). In person at ENS Paris-Saclay. Previous exams: 2020, 2021, 2022 (corr), 2023, 2024 (corr), 2025 (corr) |