Classes (from 2026–27)

New five-year accreditation

This page describes the new curriculum starting in fall 2026 under the new five-year (quinquennial) accreditation. For the previous program (up to 2025–26), see Classes up to 2025–26.

You may use this guide to select your UE (Unité d'Enseignement), whether you are enrolled in the [AI] track or another track of the Informatics Master's program.

All [AI] courses are open to students from other tracks; however, please ensure that you meet the required prerequisites.

If in doubt, contact the course instructors or the AI Master's program secretary or coordinators.

You can also catch up during the summer by taking one of our online crash courses.

M1 Classes

T1 — First period

Applied Statistics

This course introduces the formalism needed to solve computational problems with statistical reasoning. The goal is to equip students with the skills to employ statistical modeling in a computational framework to tackle real-world problems.

Prior knowledge: basic linear algebra.

Acquired skills:

  1. Give a statistical description of a dataset.
  2. Leverage probabilistic modeling to perform statistical inference.
  3. Analyze the mathematical properties of a statistical estimator.
  4. Employ bootstrapping.
  5. Perform hypothesis testing.

References: All of Statistics: A Concise Course in Statistical Inference — Larry Wasserman (PDF)
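
Skills 4 and 5 lend themselves to a compact illustration. The sketch below computes a percentile-bootstrap confidence interval using only the standard library; the toy sample, resample count, and confidence level are invented for the example:

```python
import random
import statistics

def bootstrap_ci(data, stat=statistics.mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap: resample with replacement, take empirical quantiles."""
    rng = random.Random(seed)
    estimates = sorted(
        stat([rng.choice(data) for _ in range(len(data))])
        for _ in range(n_resamples)
    )
    return (estimates[int(alpha / 2 * n_resamples)],
            estimates[int((1 - alpha / 2) * n_resamples) - 1])

sample = [2.1, 2.4, 1.9, 2.8, 2.2, 2.5, 2.0, 2.6, 2.3, 2.7]
low, high = bootstrap_ci(sample)   # 95% CI for the mean
```

Rejecting a null value that falls outside `(low, high)` is the duality between confidence intervals and hypothesis tests covered by skills 4 and 5.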

Hands-on Machine Learning with Scikit-learn

A practice-oriented class where students apply ML techniques first to simple illustrative examples and then to competitive challenges. It starts with an introduction that presents (or refreshes) the ML landscape; subsequent classes then focus successively on the major concepts of practical ML.

Outline:

  1. Introduction/refresher on ML
  2. Working with real data
  3. Discover and visualize the data to gain insights
  4. Prepare the data for processing
  5. Select and train models
  6. Fine-tune models

Recommended reading:

  • Géron (2019) Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow
  • VanderPlas (2017) Python Data Science Handbook

Mathematics for Data Science — class from other track

This class teaches (or reviews) the mathematical foundations useful in data science:

  1. Vector spaces, linear transformations
  2. Matrices, linear systems
  3. Trace, determinant
  4. Norms, orthogonality
  5. Eigenvalues, singular value decomposition
  6. Tensors (notions), multivariable calculus

The course is evaluated by a written exam.

Advanced Databases — class from other track

This course enables students to learn the fundamentals of relational Database Management Systems (DBMS) and to apply these concepts in practice.

  1. Basics of relational databases
  2. Design theory for relational databases
  3. Relational algebra and SQL
  4. High-level database models

Recommended reading:

  • Database Systems: The Complete Book — Garcia-Molina, Ullman, Widom (2nd ed.)
  • Database Management Systems — Ramakrishnan & Gehrke, McGraw-Hill
  • A First Course in Database Systems — Ullman & Widom

Exploring Data Declaratively: Constraints and Patterns — class from other track

T2 — Second period

Machine Learning Basics 1

This course is algorithms-oriented: we first sketch the main principles of ML, then focus on how the algorithms work in practice, including all the necessary mathematical aspects. These algorithms are the basic building blocks of more advanced ones.

  1. Gradient Descent, Linear Regression from scratch
  2. Classification with a single layer Perceptron, from scratch. Geometrical interpretation, SGD/mini-batch learning. Discussion on the choice of the loss function or activation functions. OVR multi-class scheme.
  3. Overfitting, train/validation/test split, K-fold CV, regularization (L2, L1)
  4. MAP, Bayesian interpretation of Ridge Regression or Lasso
  5. Feature maps ("Kernel trick"), PCA (from scratch, seen as variance maximization), PCA as pre-processing (dimensional reduction)
  6. Kernels, Kernelized perceptron, SVM in the separable case
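
Item 2's single-layer perceptron fits in a few lines of plain Python; the separable toy data, learning rate, and epoch count below are invented for the illustration:

```python
# Minimal single-layer perceptron trained from scratch (illustrative sketch).
def train_perceptron(X, y, lr=0.1, epochs=20):
    """Learn weights w and bias b with the classic perceptron update rule.

    X: list of feature vectors, y: labels in {-1, +1}.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            activation = sum(wj * xj for wj, xj in zip(w, xi)) + b
            if yi * activation <= 0:          # misclassified: update
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b

def predict(w, b, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b > 0 else -1

# Linearly separable toy data: the class is the sign of x0 - x1.
X = [[2, 1], [1, 3], [3, 0], [0, 2], [4, 1], [1, 4]]
y = [1, -1, 1, -1, 1, -1]
w, b = train_perceptron(X, y)
```

On linearly separable data such as this, the update rule provably converges; geometrically, each update tilts the separating hyperplane toward the misclassified point.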

Acquired skills:

  • Know the basics of ML vocabulary
  • Make good habits, understand the standard pipeline
  • Know standard supervised, shallow-ML algorithms (be able to write their pseudo-code, explain their functioning)
  • Be able to code an algorithm by reading its documentation

Recommended reading:

  • Bishop (2006) Pattern Recognition and Machine Learning (available online for free)
  • Géron Hands-on Machine Learning with Scikit-Learn and TensorFlow — also in French: Introduction au Machine Learning

Optimization

This Optimization for Machine Learning course studies the mathematical and computational constructions and properties of key optimization algorithms in different settings (continuous/discrete, constrained/unconstrained, convex/non-convex, deterministic and stochastic problems), with use-case illustrations from machine learning.

  1. Introductions and background (convexity, differentiability, optimality conditions, convergence rates…)
  2. Continuous optimization (first-order methods: gradient methods, line search, acceleration)
  3. Continuous optimization (second-order methods: Newton methods including quasi-Newton, secant, IRLS)
  4. Constrained optimization (equality and inequality constraints, duality/Lagrangian, KKT optimality conditions, linear programming, GD for a constrained problem)
  5. Non-convex, stochastic optimization (the EM algorithm, stochastic gradient, stochastic EM)
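
The first-order machinery of item 2 can be previewed with plain gradient descent on a one-dimensional convex quadratic; the objective f(x) = (x − 3)² and the step size are chosen for the example:

```python
# Gradient descent: repeatedly step against the gradient direction.
def gradient_descent(grad, x0, lr=0.1, steps=100):
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)
    return x

# f(x) = (x - 3)^2 has gradient 2(x - 3) and minimizer x* = 3.
minimizer = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
```

With lr = 0.1 the error contracts by a factor 0.8 per step, a concrete instance of the linear convergence rates discussed in item 1.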

Readings:

  • Convex Optimization — Boyd and Vandenberghe
  • Numerical Optimization — Nocedal and Wright
  • Optimization — Kenneth Lange

Course page

Datacamp

This course teaches the practical tools of data science and how to frame and solve data science problems.

  1. Data wrangling
  2. The scikit-learn API and missing values
  3. Metrics and unbalanced data
  4. Dealing with complex data
  5. Ensemble methods and hyperparameter optimization

Full syllabus on GitHub

T3 — Third period

Machine Learning Basics 2

This course aims at mastering the core concepts of algorithmic design in ML, from an optimization or a probabilistic point of view, using supervised and unsupervised algorithms.

  1. Regression/classification seen in optimization and probabilistic frameworks, implication on batch and stochastic gradient descent
  2. Learning theory and Vapnik-Chervonenkis dimension
  3. Evaluating performances of ML algorithms in different contexts (imbalanced, small-sized, etc.)
  4. Probabilistic framework for machine learning: Discriminative vs Generative learning, Empirical Risk Minimization, Risk Decomposition, Bias-Variance Tradeoff; MLE, MLE and OLS in regression, MLE and IRLS in softmax classification
  5. Unsupervised Learning and Clustering: K-means, Mixture Models, EM algorithms
  6. Unsupervised Learning and Dimensionality reduction: PCA, Probabilistic PCA & EM, ICA
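
The K-means clustering of item 5 reduces to alternating assignment and update steps (Lloyd's algorithm). Below is a sketch on invented one-dimensional data:

```python
# K-means from scratch (Lloyd's algorithm); toy data and initial centers
# are invented for the example.
def kmeans(points, centers, iters=10):
    for _ in range(iters):
        # Assignment step: attach each point to its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            j = min(range(len(centers)), key=lambda c: (p - centers[c]) ** 2)
            clusters[j].append(p)
        # Update step: move each center to the mean of its cluster
        # (keeping the old center if a cluster went empty).
        centers = [sum(c) / len(c) if c else centers[j]
                   for j, c in enumerate(clusters)]
    return centers

data = [1.0, 1.2, 0.8, 5.0, 5.2, 4.8]
centers = kmeans(data, centers=[0.0, 6.0])
```

Replacing the hard assignment with posterior responsibilities turns this loop into the EM algorithm for Gaussian mixtures, the next topic in item 5.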

Recommended reading:

  • James, Witten, Hastie & Tibshirani (2013) An Introduction to Statistical Learning
  • Duda, Hart & Stork (2000) Pattern Classification
  • Cornuéjols & Miclet (2011) Apprentissage artificiel: concepts et algorithmes

Hands-on Natural Language Processing (2 groups)
  1. Introduction to NLP
  2. Basic concepts in NLP: tokenization, lemmatization, POS tagging, ...
  3. Lexical semantics, word sense disambiguation
  4. Syntax and interpretations
  5. Parsing
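
The basic text-processing concepts of item 2 can be previewed without any library; the regular expression and example sentence below are invented for the sketch (coursework would typically rely on NLTK, per the recommended reading):

```python
# A minimal tokenizer plus a word-frequency count.
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())

sentence = "The cat sat on the mat; the mat was flat."
tokens = tokenize(sentence)
freq = Counter(tokens)
```

Real tokenizers must also handle clitics, hyphenation, and multi-word units, which is why the course moves quickly from this naive baseline to proper NLP tooling.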

Recommended reading: Natural Language Processing with Python

Foundations of Agent-based Systems — class from other track

T4 — Fourth period

Deep Learning

The aim of this course is to introduce the deep learning framework. It covers models from the fundamental multilayer perceptron through to recent deep learning architectures (CNN, VAE, ...). In addition, the course covers approaches to training these neural networks, with lectures dedicated to the backpropagation algorithm and to optimization methods based on gradient descent.

  1. Introduction to Neural Networks and the MLP model
  2. MLP and Gradient Descent algorithm
  3. Backpropagation algorithm and optimization methods
  4. Create your neural network with PyTorch
  5. Neural Networks architectures (CNN, AE, ...)
  6. Generative approaches (VAE, GAN, Denoising Diffusion models)
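
The course builds networks with PyTorch (item 4), but the backpropagation mechanics of items 2–3 can be made explicit in plain NumPy. The architecture, toy data, and step size below are invented for the sketch:

```python
# A one-hidden-layer MLP with manual backpropagation.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(32, 2))                        # toy inputs
y = (X[:, 0] * X[:, 1] > 0).astype(float)[:, None]  # toy targets

W1, b1 = rng.normal(scale=0.5, size=(2, 8)), np.zeros(8)
W2, b2 = rng.normal(scale=0.5, size=(8, 1)), np.zeros(1)

def forward(X):
    h = np.tanh(X @ W1 + b1)                        # hidden layer
    out = 1 / (1 + np.exp(-(h @ W2 + b2)))          # sigmoid output
    return h, out

losses, lr = [], 0.5
for _ in range(200):
    h, out = forward(X)
    losses.append(float(np.mean((out - y) ** 2)))   # MSE loss
    # Backward pass: chain rule through the sigmoid, then the tanh.
    d_out = 2 * (out - y) / len(X) * out * (1 - out)
    dW2, db2 = h.T @ d_out, d_out.sum(0)
    d_h = (d_out @ W2.T) * (1 - h ** 2)
    dW1, db1 = X.T @ d_h, d_h.sum(0)
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2
```

PyTorch's autograd performs exactly these chain-rule products automatically; writing them once by hand is the point of item 3.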

Acquired skills:

  • Broad view of NN architectures
  • Training and evaluating NN
  • Use of PyTorch
  • Implementing backpropagation algorithm

Recommended reading: Deep Learning Book

Speech and Language Processing

Various talks on the following subjects:

  1. From linguistics to NLP
  2. Treebanks and oral syntax
  3. NLP, semantics, multi-word expressions
  4. Speech-Audio processing + ASR + practical work
  5. Emotion detection
  6. Speech interaction

Keywords: ongoing NLP research at LISN

Creation of a Data Challenge

This course is designed to bridge theory (lectures) and practice (TPs) by guiding students through the end-to-end creation and resolution of an AI challenge, using pre-formatted real datasets and the open-source platform Codabench.

  1. Designing AI Challenges
  2. Evaluation and Metrics
  3. Baselines: Core ML algorithms
  4. Baselines: Image Classification
  5. Optimizing ML Solutions
  6. Interpreting Results & Presenting Impact
  7. Project presentations

Full syllabus

Deliverables:

  • A Codabench challenge (including: website, GitHub repo, starting kit with a Python notebook)
  • A challenge/benchmark leaderboard with submissions from classmates
  • A written report (with conference publication potential if high quality)
  • An oral project presentation

Projects, internships, and other M1 modules

  • TER and/or internship (Stage) and/or schools (Écoles) (12 ECTS):
    • TER (3 ECTS) + 3-month internship (9 ECTS)
    • TER (6 ECTS) + 2-month internship (6 ECTS)
    • 4-month internship (12 ECTS)
  • Free-choice UE (UE libre) (3 ECTS)
  • UE Sustainable Development (3 ECTS)
  • 3 UE from other tracks (3 × 3 = 9 ECTS): 2 mandatory + 1 elective

For more details on internships and TER, see M1 Internships & TER.


M2 Classes (8 track-specific UE = 24 ECTS)

T5 — Fifth period

Frugal AI

14 hours of lectures and evaluation + 7 hours of lab work.

Artificial intelligence algorithms are often built under the assumption that vast amounts of (labeled) data are available and that computational resources and memory are not constrained. These assumptions do not hold in a large number of practical applications, and this course will address situations in which constraints apply.

  1. Discussion of different types of frugality (data, model parameters, training/inference constraints, memory), illustration with practical examples, and the relation between frugal learning and large models
  2. Label frugality: supervised, weakly-supervised, semi-supervised, and self-supervised learning
  3. Data frugality: transfer, few-shot, zero-shot, and continual learning
  4. Frugal vs. large deep models and applications

The three labs will focus on self-supervised, few-shot, and continual learning, respectively.

Prerequisites: at least basic knowledge of ML and preferably at least one introductory course in deep learning.

Keywords: pretraining, large models, transfer learning, data scarcity

Deep Learning for NLP
  1. Introduction, language models
  2. Convolutional neural nets
  3. Recurrent neural networks
  4. Attention mechanisms
  5. Word representations, Transformers
  6. Generative models
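
The attention mechanism of item 4 is a short computation; here is a NumPy sketch of scaled dot-product attention, with shapes and inputs invented for the example:

```python
# Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V.
import numpy as np

def attention(Q, K, V):
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)
    # Numerically stable softmax over each query's scores.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights

rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 8))   # 4 query vectors
K = rng.normal(size=(6, 8))   # 6 key vectors
V = rng.normal(size=(6, 8))   # 6 value vectors
out, weights = attention(Q, K, V)
```

Each output row is a convex combination of the value vectors, which is why the attention weights of each query sum to one.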

Keywords: machine learning, statistics, probability theory, Python

NLP Today
  1. Introduction (Cyril Grouin)
  2. Text Mining in Open and Medical Domain (Aurélie Névéol)
  3. Text Mining in Open and Medical Domain (Aurélie Névéol)
  4. Semantics and Word Embeddings (Sahar Ghannay)
  5. Chatbots and Evaluation (Thomas Gerald)
  6. Chatbots and Evaluation (Thomas Gerald)

Keywords: text processing, word representations, neural networks for texts, dialogue systems

Course page

From Symbolic to Neurosymbolic AI — class from other track

T6 — Sixth period

Reinforcement Learning
  1. Introduction to Reinforcement Learning
  2. Markov Decision Processes
  3. Planning by Dynamic Programming
  4. Model-Free Prediction
  5. Model-Free Control
  6. Value Function Approximation
  7. Policy Gradient Methods
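
Item 3 (planning by dynamic programming) can be previewed with value iteration on a tiny invented MDP: four states on a line, deterministic left/right actions, and a reward of 1 for reaching the terminal rightmost state:

```python
# Value iteration on a deterministic chain MDP.
def value_iteration(n_states=4, gamma=0.9, iters=100):
    terminal = n_states - 1
    V = [0.0] * n_states
    for _ in range(iters):
        new_V = V[:]
        for s in range(terminal):                # terminal state keeps V = 0
            candidates = []
            for s2 in (max(s - 1, 0), s + 1):    # actions: left, right
                r = 1.0 if s2 == terminal else 0.0
                candidates.append(r + gamma * V[s2])
            new_V[s] = max(candidates)           # Bellman optimality backup
        V = new_V
    return V

V = value_iteration()
```

With γ = 0.9 the values converge to [0.81, 0.9, 1.0, 0.0]: each extra step to the goal discounts the reward once more.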

Keywords: machine learning, statistics, probability theory, Python

Information Retrieval

This course gives a basic introduction to Information Retrieval.

  1. Introduction to Information Retrieval: key terms and domains; tutorial on new textual dataset indexing and basic counting techniques
  2. Handling Large Datasets: exploration of big datasets; binary evaluation methods; introduction to TF-IDF
  3. Improving Retrieval Methods: introduction to sparse embeddings; overview of BM25 and Sense2Vec
  4. Advanced Embedding Techniques: study of dense embeddings; use case with a patent dataset and its citations; introduction to doc2vec and sentenceBERT
  5. Challenge Presentation: class challenge related to information retrieval concepts
  6. Project Work and Presentation: collaborative project work and discussions; final project presentations
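
The TF-IDF weighting introduced in item 2 can be computed from scratch; the three-document corpus below is invented for the example:

```python
# TF-IDF: term frequency weighted by inverse document frequency.
import math
from collections import Counter

docs = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "patents cite other patents",
]
tokenized = [d.split() for d in docs]
df = Counter(t for doc in tokenized for t in set(doc))  # document frequency
N = len(docs)

def tfidf(doc_tokens):
    tf = Counter(doc_tokens)
    return {t: (tf[t] / len(doc_tokens)) * math.log(N / df[t]) for t in tf}

vectors = [tfidf(d) for d in tokenized]
```

Terms that occur in every document get an IDF of zero, which is the down-weighting of common words that BM25 (item 3) later refines with saturation and length normalization.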

Recommended reading:

  • Manning et al. Introduction to Information Retrieval
  • Grainger et al. AI Powered Search

Keywords: Hands-On Knowledge of the Foundations of AI-Powered Search

Scientific Machine Learning

ML is increasingly adopted as a useful tool in the exact sciences (Physics, Chemistry, etc.), where abundant precise data are available or can be generated with simulators. Applications include drug design, material design, genetics, applied quantum mechanics, fluid mechanics, and many others.

  1. Introduction to Scientific Machine Learning — applications in Physics, Chemistry, and Biology
  2. Graph Neural Networks and Geometric Deep Learning
  3. NeuralODEs and PINNs
  4. Steerable and Equivariant Neural Networks
  5. Attention Mechanisms across Architectures (GAT, EquiFormer, ViT)

This course introduces paradigms relevant to Scientific ML such as GNNs, NeuralODEs, PINNs, and the general framework of Geometric Deep Learning, including steerable neural networks (e.g. rotation-equivariant nets). It also dives deeper into the attention mechanism (e.g. for Vision Transformers), since the attention block is now of widespread use across other architectures.

Keywords: Graph Neural Networks (GNNs), NeuralODEs, Physics-Informed Neural Networks (PINNs), Geometric Deep Learning, Transformers

T7 — Seventh period

Signal Processing
  1. Introduction & Fourier analysis
  2. Filtering
  3. Random signals
  4. Time-Frequency analysis
  5. Time-Scale analysis
  6. Introduction to linear inverse problems
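
Items 1–2 (Fourier analysis and filtering) combine naturally in a frequency-domain low-pass filter; the test signal and cutoff below are invented for the example:

```python
# Low-pass filtering by zeroing Fourier coefficients above a cutoff.
import numpy as np

fs = 1000                                   # sampling rate (Hz)
t = np.arange(fs) / fs                      # 1 second of samples
# A 5 Hz tone plus an unwanted 120 Hz component.
signal = np.sin(2 * np.pi * 5 * t) + 0.5 * np.sin(2 * np.pi * 120 * t)

spectrum = np.fft.rfft(signal)
freqs = np.fft.rfftfreq(len(signal), d=1 / fs)
spectrum[freqs > 50] = 0                    # zero components above 50 Hz
filtered = np.fft.irfft(spectrum, n=len(signal))
```

Because both tones sit exactly on FFT bins here, the 120 Hz component is removed exactly; on real signals, spectral leakage makes windowed or tapered filters preferable.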

Keywords: spectral analysis of time signals and images, convolution and filtering, time-frequency and wavelets analysis, denoising, sparse coding for inverse problem resolution

Course page

Advanced Optimization and Automated Machine Learning
  1. Introduction to Optimization
  2. Unconstrained Continuous Optimization
  3. Constrained Optimization
  4. Black Box Optimization
  5. Hyper-parameter Optimization
  6. Neural Architecture Search + Learning to Optimize & Meta-Learning
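
A minimal baseline for item 5, hyper-parameter optimization, is random search over a black-box objective; the surrogate "validation loss" below stands in for a real training run and is invented for the example:

```python
# Random search: sample configurations uniformly, keep the best.
import random

def validation_loss(lr, reg):
    """Invented surrogate objective with optimum at lr=0.1, reg=0.01."""
    return (lr - 0.1) ** 2 + (reg - 0.01) ** 2

def random_search(objective, n_trials=200, seed=0):
    rng = random.Random(seed)
    best_cfg, best_loss = None, float("inf")
    for _ in range(n_trials):
        cfg = {"lr": rng.uniform(0.0, 1.0), "reg": rng.uniform(0.0, 0.1)}
        loss = objective(**cfg)
        if loss < best_loss:
            best_cfg, best_loss = cfg, loss
    return best_cfg, best_loss

best_cfg, best_loss = random_search(validation_loss)
```

Bayesian optimization, as covered in the course, replaces the uniform sampler with a surrogate model that trades off exploration against exploitation.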

Keywords: optimization, black-box optimization, Bayesian optimization, neural architecture search, hyperparameter optimization, meta-learning, AutoML

Probabilistic Generative Models

This module covers probabilistic and neural generative models, from theoretical foundations to modern applications in vision, language, and multimodal AI.

  1. Foundations of Generative Models — definitions, probability basics, explicit vs. implicit models, and first applications
  2. Graphical Models & HMMs — Bayesian and Markov networks, exact/approximate inference, and sequential models
  3. Neural Generative Models (Images) — variational autoencoders (VAEs), GANs, and diffusion models for image synthesis
  4. Text Generation with Transformers — Transformer architecture, attention, pre-training (BERT, GPT)
  5. Optimization & Advanced LLM — efficient adaptation of large models with techniques like LoRA
  6. Emerging Models — Liquid Neural Nets, HRM, state-space & quantum-inspired approaches
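
Item 2's sequential models can be previewed with the forward algorithm for a hidden Markov model; the two-state model and binary observations below are invented for the example:

```python
# HMM forward algorithm: P(observations) by dynamic programming
# over hidden state paths.
def forward(obs, pi, A, B):
    alpha = [pi[s] * B[s][obs[0]] for s in range(len(pi))]
    for o in obs[1:]:
        alpha = [
            sum(alpha[s] * A[s][s2] for s in range(len(pi))) * B[s2][o]
            for s2 in range(len(pi))
        ]
    return sum(alpha)

pi = [0.6, 0.4]                      # initial state distribution
A = [[0.7, 0.3], [0.4, 0.6]]         # transition matrix
B = [[0.9, 0.1], [0.2, 0.8]]         # emission probabilities
p = forward([0, 1, 0], pi, A, B)
```

Summing the forward probability over every possible length-3 observation sequence returns 1, a quick sanity check on the recursion.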

Keywords: notions of statistics and AI

Soft skills & internship

  • Soft Skills (6 ECTS) — includes Trust and professional development
  • Stage (30 ECTS) — 5- to 6-month internship in a research lab or a company