PhD defense of Jean VASSOYAN

Title: Learning to teach humans and large language models
Supervision: N. Vayatis, Jill-Jenn VIE
Defended April 16, 2026

Add to the calendar

Jean VASSOYAN

Learning to teach humans and large language models

Abstract

In classical machine learning, training data is typically assumed to stem from random sampling over which the experimenter has no control. Nevertheless, certain paradigms, such as active learning, specifically address situations where one can intervene in data selection to optimize model training. More broadly, the data selection problem can be formalized through the "teacher-student" paradigm, where the teacher is the data selection mechanism and the student is the agent to be trained. This thesis focuses on the task of the "teacher", framed as an optimization problem. We study two distinct special cases: one where the student is a human learner, and another where it is a "black-box" statistical learning model.

The first axis, centered on the human learner, falls within the field of adaptive learning, a highly active area in educational technology. We propose a new educational resource recommendation engine which, unlike classical approaches, is data-efficient, scalable, free of any a priori model of human learning, and trained end-to-end to maximize the learner's "progress". It achieves this through a keyword-based modeling of the learner's knowledge.

The second axis, focusing on the "machine" learner, relates to a family of statistical learning fields, including active, curriculum, and reinforcement learning. We introduce a unifying formal framework that allows us to bridge and compare all these domains. Finally, we study the application of this paradigm to large language models, which have become natural candidates for this type of approach. Indeed, as high-quality human textual data is nearing exhaustion, imitation learning alone is starting to show its limits. This suggests that to further improve these models, it is necessary to shift toward paradigms unconstrained by human data, based on open-ended "self-teaching" dynamics.

Key-words

Machine learning, Reinforcement learning, Adaptive learning, Teacher-student, Large language model, Graph neural networks,

PhD supervisors

Nicolas VAYATIS, Professor, Centre Borelli, ENS Paris-Saclay (Supervisor)
Jill-Jênn VIE

Defense committee

Pierre-Yves OUDEYER, Directeur de recherche, Centre Inria de l'université de Bordeaux, Rapporteur
Min CHI, Full professor, North Carolina State University, Rapporteur
Olivier SIGAUD, Professeur des universités, Sorbonne Université, Examinateur
Claire VERNADE, Full professor, University of Technology Nuremberg, Examinateur
Guillaume CHARPIAT, Chargé de recherche, Centre Inria de l'Université Paris-Saclay, Examinateur