Machine learning and High-dimensional statistics
Scientific Referents
Alumni:
- Ioannis Bargiotas
- Alain Durmus
- Vianney Perchet
Thematic overview
The research developed in this area draws on our mastery of the theoretical and algorithmic foundations of machine learning and our knowledge of high-dimensional statistical methods. But it also draws on our familiarity with a wide range of applications, mainly in the industrial and biomedical fields (health, human factor).
The scientific approach adopted leads to the introduction of a priori knowledge (from underlying physics or resource constraints, for example). into learning techniques. Scientific contributions are also largely materialized through interaction with other research themes developed at the Centre Borelli.
For illustration purposes, here are some examples of specific research topics:
- Transfer Learning – Learning problems in an industrial context are constrained by the small number of samples and by poorly represented stationarity regimes. Several transfer learning techniques have been proposed to provide usable methods and tools. In addition, operational applications require fine quantification of uncertainties for risk control, which is an important direction of research work.
- Physically motivated learning – The hybridization of knowledge-based models (in the form of partial differential equations, for example) and empirical models is one of the major scientific challenges in the engineering and life sciences. Their development presents a challenge for the simulation and for the monitoring of complex systems. In particular, the Centre Borelli's work explores PINNS (Physically-Informed Neural Networks) models and uses them to specify experimental designs that exploit the physical structure of solutions.
- High-dimensional statistical methods - The use of machine learning methods to address high dimensional statistical problems raises a number of issues that are being studied at the Centre Borelli. These concern information representations (e.g. identifying local stationarity regimes for time series, possibly on graphs), but also the synthesis of dependency patterns for high-dimensional observations (e.g. comparing such samples, or sequentially optimizing a function).
- Process modeling on networks – Spatio-temporal processes on discrete structures modeled by graphs cover many concrete modeling applications, for example in epidemiology, economics, physics, information dissemination and telecommunications. An example of a studied model is the epidemic competition, where two infections spread across a given graph, while mutually excluding each other when they occupy a node. Other issues can then be added onto the simple characterization of a phenomenon: around these models, we can also consider operational research issues such as process control from a set of limited resources.
Key facts
- The Python ruptures library implements and unifies numerous state-of-the-art and the Centre Borelli algorithms for detecting change-point in time series. Launched in 2019, it has been downloaded more than 20 million times by 2024, and is infusing many fields (industry, advertising, medicine, biology, etc.)
- Mathilde Mougeot is involved in the management of various institutions: she is deputy director of the FMJH and a member of the INSMI Scientific Council, and has been in charge of the INSMI's “valorisation” project. She is also Deputy Director of the Graduate School of Mathematics at Université Paris Saclay. She was a member of the HCERES committee for the CNRS AMIES unit. Finally, she was involved in the white paper drawn up for the Assises des Mathématiques 2022.
- Nicolas Vayatis has been elected ELLIS Fellow. He was an expert member of two interministerial commissions for the evaluation of COVID-19 epidemiological models in 2020. He is also member of the scientific council for the committee of directors of SNCF group and he is a scientific advisor for the Fondation MAIF for research.
- Vianney Perchet is an associate editor of the Journal of dynamic games and applications and of Operations Research Letters. He is regurlaly member of scientific commitee of major conferences in Machine Learning (COLT, NeurIPS, ICML).
Applications
- Industry (manufacturing, transport, energy),
- Digital health,
- Human factors.
Porfolio
Publications
-
A.D. de la Concha Duarte, N. Vayatis, A. Kalogeratos. Online non-parametric likelihood-ratio estimation by Pearson-divergence functional minimization. Int. Conf. on Artificial Intelligence and Statistics 2024.
-
G. Serré, A. Kalogeratos, N. Vayatis. Stein Boltzmann Sampling: A Variational Approach for Global Optimization, Int. Conf. on Artificial Intelligence and Statistics 2024.
-
T.N.K. Nguyen, C. Millet, T. Dairay, R. Meunier, M. Mougeot Adaptive Physics-Informed Learning for Downscaling Fluid Flows over Irregular Geometries. Bulletin of the American Physical Society, 2024
-
C. Millet, T.N.K. Nguyen, M. Mougeot. Physics-informed learning for modeling infrasound propagation in extreme weather conditions. Acoustical Society of America, 2023.
-
A. de Mathelin, F. Deheeger, M. Mougeot, N. Vayatis. From theoretical to practical transfer learning : The adapt library. In Federated and Transfer Learning, Springer, 2022
-
A de la Concha, N Vayatis, A Kalogeratos. Offline detection of change-points in the mean for stationary graph signals. Int. Conf. on Artificial Intelligence and Statistics, 2021
-
I. Bargiotas, I., Kalogeratos, A., Limnios, M., Vidal, P. P., Ricard, D., & Vayatis, N. Revealing posturographic profile of patients with Parkinsonian syndromes through a novel hypothesis testing framework based on machine learning. Plos one, 2021.
-
T. N. K. Nguyen, T. Dairay, R. Meunier, and M. Mougeot. Physics-informed neural networks for non-newtonian fluid thermo-mechanical problems : An application to rubber calendering process. Engineering Applications of Artificial Intelligence, 114, 2022
-
S. Clémençon, M. Limnios, N. Vayatis. Concentration Inequalities for Two-Sample Rank Processes with Application to Bipartite Ranking. Electronic Journal of Statistics, Institute of Mathematical Statistics and Bernoulli Society, vol 15(2), 2021.
-
A. de Mathelin, F. Deheeger, M. Mougeot, N. Vayatis, (2021) Discrepancy-based active learning for domain adaptation. In Proceedings of the International Conference on Learning Representations (ICLR), 2021.