Lorenzo Pacchiardi

prof_pic.jpg

Research Associate, University of Cambridge

I am a Research Associate at the Leverhulme Centre for the Future of Intelligence at the University of Cambridge. I lead a research project (funded by Open Philanthropy) on developing a benchmark for measuring the ability of LLMs to perform data science tasks. I am more broadly interested in predictability and cognitive-oriented evaluation of AI systems, and I closely collaborate with Prof José Hernández-Orallo and Dr Lucy Cheke. I contribute to the AI evaluation newsletter.

I previously worked on detecting lying in large language models with Dr Owain Evans and on technical standards for AI for the EU AI Act at the Future of Life Institute. I am deeply interested in AI policy (particularly at the EU level).

I obtained a PhD in Statistics and Machine Learning at Oxford, during which I worked on Bayesian simulation-based inference, generative models and probabilistic forecasting (with applications to meteorology). My supervisors were Prof. Ritabrata Dutta (Uni. Warwick) and Prof. Geoff Nicholls (Uni. Oxford).

Before my PhD studies, I obtained a Bachelor’s degree in Physical Engineering from Politecnico di Torino (Italy) and an MSc in Physics of Complex Systems from Politecnico di Torino and Université Paris-Sud, France. I carried out my MSc thesis at LightOn, a machine learning startup in Paris.

news

Feb 21, 2025 We have two (more) new preprints on arXiv! One surveying AI evaluation and identifying six main paradigms, the other one introducing a benchmark for jointly evaluating the performance of LLMs and its predictability on individual instances.
Oct 15, 2024 We have two new preprints on arXiv! One on predicting the performance of LLMs on individual instances, the other one on predicting the answers of LLM benchmarks from simple features.
Oct 01, 2024 I have obtained a grant from Open Philanthropy on building a benchmark for measuring the ability of LLMs to perform data science tasks! 🤓 📊
Sep 21, 2024 Our paper Generalised Bayesian Likelihood-Free Inference (on which I worked during my PhD studies) is now published at the Electronic Journal of Statistics! :tada:
Apr 17, 2024 We’ve launched academicjobsitaly.com, a portal to search academic jobs in Italy, boasting automated notifications for new openings! 🇮🇹

selected publications

  1. arXiv
    PredictaBoard: Benchmarking LLM Score Predictability
    Lorenzo Pacchiardi ,  Konstantinos Voudouris ,  Ben Slater ,  Fernando Martínez-Plumed ,  José Hernández-Orallo ,  Lexin Zhou ,  and  Wout Schellaert
    arXiv preprint arXiv:2502.14445, 2025
  2. arXiv
    Paradigms of AI Evaluation: Mapping Goals, Methodologies and Culture
    John Burden ,  Marko Tešić ,  Lorenzo Pacchiardi ,  and  José Hernández-Orallo
    arXiv preprint arXiv:2502.15620, 2025
  3. ICLR 2024
    How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
    Lorenzo Pacchiardi ,  Alex J Chan ,  Sören Mindermann ,  Ilan Moscovitz ,  Alexa Y Pan ,  Yarin Gal ,  Owain Evans ,  and  Jan Brauner
    The Twelfth International Conference on Learning Representations, 2024
  4. JMLR
    Probabilistic Forecasting with Generative Networks via Scoring Rule Minimization
    Lorenzo Pacchiardi ,  Rilwan Adewoyin ,  Peter Dueben ,  and  Ritabrata Dutta
    Journal of Machine Learning Research, 2024
  5. JMLR
    Score Matched Neural Exponential Families for Likelihood-Free Inference
    Lorenzo Pacchiardi ,  and  Ritabrata Dutta
    Journal of Machine Learning Research, 2022