SL_2024.html

<!DOCTYPE HTML>
<!--
	Read Only by HTML5 UP
	html5up.net | @n33co
	Free for personal and commercial use under the CCA 3.0 license (html5up.net/license)
-->
<html>
	<head>
		<title>Sequential Learning</title>
		<meta charset="utf-8" />
		<meta name="viewport" content="width=device-width, initial-scale=1" />
		<!--[if lte IE 8]><script src="assets/js/ie/html5shiv.js"></script><![endif]-->
		<link rel="stylesheet" href="assets/css/main.css" />
		<!--[if lte IE 8]><link rel="stylesheet" href="assets/css/ie8.css" /><![endif]-->
	</head>
	<body>

		<!-- Header -->
			<section id="header">
				<header>
					<span class="image avatar"><img src="images/avatar2.jpg" alt="" /></span>
					<h1 id="logo"><a href="#">Sequential Learning (2024-2025)</a></h1>
					<p>Rémy Degenne</p>
				</header>
				<footer>
					<ul class="icons">
						<li><a href="#" class="icon fa-github"><span class="label">Github</span></a></li>
					</ul>
				</footer>
			</section>

		<!-- Wrapper -->
			<div id="wrapper">

				<!-- Main -->
					<div id="main">

						<!-- Three-->
							<section id="one">
								<div class="container">
									<h3>Sequential learning</h3>
									A course about reinforcement learning and bandits.
									<ul>
										<li><b>Lecture 1</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours1.pdf">Reinforcement learning</a></li>
										<li><b>Lecture 2</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours2.pdf">Dynamic Programming</a></li>
										<li><b>Lecture 3</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours3.pdf">Reinforcement Learning Algorithms</a></li>
										<li><b>Practical Session 1</b> - <a href="https://remydegenne.github.io/docs/SL_2024/readme.txt">installation readme</a> and <a href="https://remydegenne.github.io/docs/SL_2024/StoreManagement.ipynb">notebook</a>. (You can also use Google colab: <a target="_blank" href="https://colab.research.google.com/github/RemyDegenne/remydegenne.github.io/blob/master/docs/SL_2024/StoreManagement.ipynb"> <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/> </a>) Deadline: Friday, December 6, 11:59 AM CET.</li>
										<li><b>Lecture 4</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours4.pdf">Reinforcement Learning with Function Approximation</a></li>
										<li><b>Lecture 4.5</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours4demi.pdf">Summary of the first 4 courses</a></li>
										<li><b>Lecture 5</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours5.pdf">Beyond Value-Based Methods</a></li>
										<li><b>Practical Session 2</b> - <a href="https://remydegenne.github.io/docs/SL_2024/readme.txt">installation readme</a> and <a href="https://remydegenne.github.io/docs/SL_2024/TD0_QLearning.ipynb">notebook</a>. (You can also use Google colab: <a target="_blank" href="https://colab.research.google.com/github/RemyDegenne/remydegenne.github.io/blob/master/docs/SL_2024/TD0_QLearning.ipynb"> <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/> </a>) Deadline: Friday, January 10, 15:00 CET.</li>
										<li><b>Lecture 6</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours6.pdf">Multi-armed bandits</a></li>
										<li><b>Practical Session 3</b> - <a href="https://remydegenne.github.io/docs/SL_2024/readme.txt">installation readme</a> and <a href="https://remydegenne.github.io/docs/SL_2024/Bandit.ipynb">notebook</a>. (You can also use Google colab: <a target="_blank" href="https://colab.research.google.com/github/RemyDegenne/remydegenne.github.io/blob/master/docs/SL_2024/Bandit.ipynb"> <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/> </a>) Deadline: Friday, January 31, 15:00 CET.</li>
										<li><b>Lecture 7</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours8.pdf">Best arm identification in bandits</a></li>
										<li><b>Practical Session 4</b> - <a href="https://remydegenne.github.io/docs/SL_2024/readme_FittedQ.txt">installation readme (not the same as the previous ones!)</a> and <a href="https://remydegenne.github.io/docs/SL_2024/FittedQ.ipynb">notebook</a> (You can also use Google colab: <a target="_blank" href="https://colab.research.google.com/github/RemyDegenne/remydegenne.github.io/blob/master/docs/SL_2024/FittedQ.ipynb"> <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/> </a>). Deadline: Friday, February 7, 15:00 CET.</li>
										<li><b>Lecture 8</b> - <a href="https://remydegenne.github.io/docs/SL_2024/Cours9.pdf">Bandit tools for reinforcement Learning</a></li>
									</ul>
								</div>
								<div class="container">
									<h4>References</h4>
									<ul>
	<li><a href="https://tor-lattimore.com/downloads/book/book.pdf">Bandit Algorithms</a>. Tor Lattimore and Csaba Szepesvari (2019).</li>
    <li><a href="http://www.incompleteideas.net/book/the-book.html">Reinforcement Learning</a>. Richard Sutton and Andrew Barto (2018 edition).</li>
    <li><a href="https://sites.ualberta.ca/~szepesva/papers/RLAlgsInMDPs.pdf">Reinforcement Learning Algorithms</a>. Csaba Szepesvari (2009).</li>
    <li>Markov Decision Processes. Martin Puterman (1994).</li>
    <li>Lecture notes of similar courses written by several other researchers: <a href="https://emiliekaufmann.github.io/RL.html">Emilie Kaufmann</a>, <a href="http://researchers.lille.inria.fr/munos/master-mva/index.html">Rémi Munos</a>, <a href="http://chercheurs.lille.inria.fr/~lazaric/Webpage/MVA-RL_Course15.html">Alessandro Lazaric</a> and <a href="https://perso.ens-lyon.fr/aurelien.garivier/www.math.univ-toulouse.fr/_agarivie/sites/default/files/MLDM_advancedML_StEtienne.pdf">Aurélien Garivier</a>.</li>

									</ul>
								</div>
							</section>

					</div>

				<!-- Footer -->
					<section id="footer">
						<div class="container">
							<ul class="copyright">
								<li>&copy; Untitled. All rights reserved.</li><li>Design: <a href="http://html5up.net">HTML5 UP</a></li>
							</ul>
						</div>
					</section>

			</div>

		<!-- Scripts -->
			<script src="assets/js/jquery.min.js"></script>
			<script src="assets/js/jquery.scrollzer.min.js"></script>
			<script src="assets/js/jquery.scrolly.min.js"></script>
			<script src="assets/js/skel.min.js"></script>
			<script src="assets/js/util.js"></script>
			<!--[if lte IE 8]><script src="assets/js/ie/respond.min.js"></script><![endif]-->
			<script src="assets/js/main.js"></script>

	</body>
</html>