Patricia Crifo will be taking part in COP 28 UAE, the UN’s 28th Conference of the Parties on Climate, to be held from November 30 to December 12, 2023 in Dubai, United Arab Emirates.
From Lab Desks to Bookshelves: ‘Machine Learning pour l’Économétrie’ Authored by Lab Alumni
We are pleased to announce the publication by Editions Economica (ENSAE Collection, Advanced Economics and Statistics) of Machine Learning for Econometrics by Christophe Gaillac and Jeremy L’hour.
Two PhD alumni from CREST
Christophe Gaillac is a Postdoctoral Prize Research Fellow in Economics at Nuffield College, University of Oxford and was previously a PhD student at CREST. His research focuses on Econometrics, Statistics, and Machine Learning. Christophe is also an alumni from École polytechnique and ENSAE Paris where he was, later, the coordinator of statistical and mathematical teachings from 2017 to 2019 at ENSAE Paris.
Jeremy L’hour works on econometric methods and applications of machine learning to causal inference. He currently works as a quantitative researcher in the statistical arbitrage alpha team at Capital Fund Management (CFM) and he holds a PhD in Economics from Université Paris-Saclay prepared at CREST-ENSAE. Jeremy is an alumni of ENSAE Paris and he was the coordinator for econometrics teachings between 2016 and 2019.
Machine Learning For Econometrics
“Machine Learning pour l’économétrie” is a work intended for economists who wish to grasp modern machine learning techniques – from their predictive performance to the revolutionary processing of unstructured data – in order to establish causality relationships from the data.
It addresses the automatic selection of variables in various high-dimensional contexts, the estimation of treatment effects heterogeneity, natural language processing (NLP) techniques, as well as synthetic control and macroeconomic forecasting.
The foundations of machine learning methods are introduced to provide both a thorough theoretical treatment of how they can be used in econometrics and numerous economic applications. Each chapter contains a series of empirical examples, programs, and exercises to facilitate the adoption and implementation of techniques by the reader.
This book is aimed at master’s or graduate students, researchers, and practitioners eager to understand and enhance their knowledge of machine learning to apply it in a context traditionally reserved for econometrics.
Discover the 2024 CREST Job Market Candidates
CREST is pleased to introduce its 2023-2024 EconJobMarket candidates.
Discover the 2024 CREST Job Market Candidates
CREST is pleased to introduce its 2023-2024 EconJobMarket candidates.
RAPPORTS IPP n° 46 & 47
“Nouvelle évaluation des réformes de la fiscalité du capital” par Clément MALGOUYRES, Laurent BACH, Antoine BOZIO et Arthur GUILLOUZOUIC: Trois ans après ses premiers travaux, l’IPP publie mardi 17 octobre deux nouvelles évaluations des réformes de 2017 ayant conduit à la mise en place du prélèvement forfaitaire unique (PFU), au remplacement de l’impôt de solidarité sur la fortune (ISF) par l’impôt sur la fortune immobilière (IFI) et à la baisse de l’impôt sur les sociétés – Octobre 2023
« La France est dans une situation inflammable »
Une interview de l’économiste Pierre Boyer par Grégoire Normand pour le journal La Tribune – 13 Oct 2023
Emmanuelle Taugourdeau, membre du nouveau Conseil d’évaluation des fraudes
“Le gouvernement veut chiffrer l’ampleur des fraudes fiscales, sociales et douanières”. Dans cet article, le journal Les Echos, s’intéresse au nouveau Conseil d’évaluation des fraudes, dont Emmanuelle Taugourdeau, directrice adjointe du CREST, est une des membres – 10 oct. 2023
Économie du développement et démographie
“Économie du développement et démographie : les cas extrêmes de l’Afrique et de la Chine”: Une interview de Pauline Rossi par Pierre Rousseaux pour le journal Oeconomicus – 4 octobre 2023
“Black-Scholes: the formula at the origin of Wall Street”
An article by Peter Tankov in Polytechnique Insights – September 6th, 2023
Workshop on Statistics in Metric Spaces
The workshop on Statistics in Metric Spaces was held at ENSAE, on October 11, 12 and 13, 2023. It brought together international experts in the joint fields of statistics, optimization, probability theory and geometry. Each participant gave a 45-60 min talk and the range of topics that were covered was broad, tackling modern questions concerning statistical analysis on non-standard spaces.

Victor-Emmanuel Brunel (CREST-ENSAE), Christopher Criscitiello (EPFL), Stephan Huckemann (Georg-August-Universität Göttingen), Alexey Kroshnin (Weierstrass-Institut Für Angewandte Analysis und Stochastik), Kazuhiro Kuwae (Fukuoka University), Tom Nye (Newcastle University), Shin-ichi Ohta (Osaka University), Miklós Pálfia (Corvinus University of Budapest), Pierre Pansu (Université Paris-Saclay), Quentin Paris (HSE University), Xavier Pennec (INRIA), Gabriel Romon (CREST-ENSAE), Jordan Serres (CREST-ENSAE), Austin Stromme (CREST-ENSAE)
While available data become more and more rich and complex, it is essential to understand their intrinsic geometry, for instance as a tool of dimensionality reduction or, sometimes, in order to produce interpretable statistical procedures. This, however, also comes at a cost, since these geometries may be non-standard (e.g., non-linear and/or non-smooth geometries), yielding new challenges from the points of view of both statistical and algorithmic analysis.
For instance, directional data lie on spheres or projective spaces. In shape statistics, data are encoded as landmarks on three-dimensional objects, which should be invariant under rigid transformations: Hence, data lie in the quotient of a Euclidean space by a class of rigid transformations. In fact, such quotient spaces are also useful to understand statistical models that arise in econometrics, when a parameter is only identifiable up to some known transformations. Optimal transport theory is based on Wasserstein spaces, which are metric spaces with Riemannian/Finsler-like geometries. In various fields, in particular physics and economics, the geometry provided by optimal transport on sets of probability measures has been shown to be very well adapted to understand general phenomena, such as transportation of goods, or distribution of tasks, capital, etc. In the machine learning community, it has also been recently pointed out that metric trees and hyperbolic spaces, which exhibit negative curvature, are well adapted to encode data with hierarchical structures.
While probability theory is now fairly well understood in smooth, finite dimensional spaces (such as Euclidean spaces and Riemannian manifolds), much less is known in more general metric spaces, exhibiting possible infinite dimension (such as functional spaces), inhomogeneous structure (such as stratified spaces), etc. From a more algorithmic prospective, gradient flows and their discretization in non-smooth spaces are challenging because they require brand new approaches (e.g., new definitions of (sub)-gradients), yet they are essential in order to extend fundamental tools such as gradient descent algorithms to non-standard setups. Even in smooth spaces, the impact of curvature on gradient descent algorithms is still not clearly understood. More generally, the notion of convexity, which is pervasive to probability theory, statistical learning and optimization, and its interplays with curvature, still raise challenging questions.
To summarize, the impact of curvature (or generalized notions of curvature) on measure concentration, on the statistical behavior of learning algorithms and on their computational aspects is a flourishing topic of research that brings together experts in smooth/non-smooth geometry, statistics, probability theory and optimization.
The workshop brought these challenges onto the stage, and yielded fruitful discussions among the participants and the audience, with the goal of entailing future collaborations. We hope that this workshop, on Statistics in Metric Spaces, was the first edition of a long series, that will also spread interest in these rich topics into a broader audience.