Sustainability and Machine Learning Group

Research group

University College London

We are a research group at UCL’s Centre for Artificial Intelligence.

Our research expertise includes:

data-efficient machine learning and probabilistic modeling
autonomous decision making and recommender systems
responsible AI and AI safety

We also work on applications related to social/environmental sustainability, climate and nuclear fusion.

If you are interested in joining the team, please check out our openings.

Meet the Team

Principal Investigators

Marc Deisenroth

Google DeepMind Chair of Machine Learning and Artificial Intelligence

Machine learning, Gaussian processes, AI for sustainability, Environmental modeling

Maria Perez-Ortiz

Associate Professor

Responsible AI, AI for sustainability, Recommender systems, Hybrid intelligence, Simulation intelligence

Administrators

PhD Students

Vignesh Gopakumar

PhD Student

Machine learning, Nuclear fusion, Bayesian optimization, Neural operators

Jake Cunningham

PhD Student

Machine learning, Gaussian processes, Earth systems modelling

Mathieu Alain

PhD Student

Machine learning, Graph neural networks, Diffusion models, PAC-Bayes

James Rudd-Jones

PhD Student

Sustainable policies, socio-environmental AI

Alumni

Daniel Giles

Senior Research Fellow (11/2023-03/2025)

Tsunami modeling, HPC, Gaussian processes

Sicelukwanda Zwane

PhD (10/2020-09/2025)

Machine learning, Robotics, Transfer Learning, Reinforcement Learning

Yicheng Luo

PhD (09/2020-12/2024)

Meta-learning, Probabilistic Programming, Reinforcement Learning, Deep Generative Models

So Takao

Senior Research Fellow (11/2020 - 07/2023)

Machine learning, Climate science, Fluid mechanics, Geometric mechanics

Alexander Terenin

PhD (10/2018-11/2021)

Machine learning, Bayesian theory, Geometric machine learning

Samuel Cohen

PhD (09/2019-09/2024)

Machine learning, Optimal transport, Gaussian processes

Sanket Kamthe

PhD (10/2016-03/2021)

Machine learning, Reinforcement learning, Optimal control, Copulas

Mihaela Rosca

PhD (03/2020-05/2023)

Generative models, Optimization in deep learning, Reinforcement learning

Jacob Menick

Researcher (01/2020-10/2024)

Machine learning, Generative models, Large-scale deep learning, Variational inference, Information theory, Sparsity

Hugh Salimbeni

PhD (10/2015-10/2019)

Machine learning, Deep probabilistic models, Approximate inference

Steindór Sæmundsson

PhD (11/2016-11/2021)

Machine learning, Gaussian processes, Meta learning, Structural priors, Variational inference

K. S. Sesh Kumar

Research Associate

Machine learning, Discrete optimization, Differential privacy, Submodularity

Riccardo Moriconi

PhD (10/2016-02/2021)

Machine learning, Gaussian processes, Bayesian optimization

Janith Petangoda

PhD (10/2017-07/2022)

Machine learning, Meta learning, Differential geometry, Reinforcement learning

James Wilson

PhD (10/2017-08/2022)

Machine learning, Gaussian processes, Bayesian optimization, Practical approximate inference

Simon Olofsson

PhD (06/2016-03/2020)

Machine learning, Bayesian optimization, Mechanistic models, Model discrimination

Benjamin Chamberlain

PhD (10/2014-08/2018)

Machine learning, Community detection, Representation of graphs, Hyperbolic embeddings

Recent Blog Posts

Thin and Deep Gaussian processes

Selecting an appropriate kernel for Gaussian processes (GPs) can be challenging. Deep GPs avoid manual kernel engineering by low-dimensional embeddings of the inputs that explain the output data, but lose all the interpretability of shallow GPs. Alternatively one successively parameterize the lengthscale of a kernel, improving the interpretability but ultimately giving away the notion of learning lower-dimensional embeddings. Both methods are susceptible to particular pathologies which may hinder fitting and limit their interpretability. We propose a novel synthesis of both previous approaches. Each TDGP layer is a local linear transformation generating latent embeddings while also being the lengthscales of a kernel. This model is, unlike previous models, tailored to specifically discover lower-dimensional manifolds in the input data and behaves well when increasing the number of layers.

Daniel Augusto de Souza

Feb 12, 2024

Safe Trajectory Sampling in Model-based Reinforcement Learning

Background Model-based reinforcement learning (MBRL) approaches learn a dynamics model from system interaction data and use it as a proxy of the physical system. Instead of executing actions directly on the target system, the agent queries the dynamics model, using it to generate forward trajectories of how the system will evolve given a sequence of actions.

Sicelukwanda Zwane

Sep 13, 2023

Actually Sparse Variational Gaussian Processes

Gaussian processes infamously suffer from an $\mathcal{O}(N^3)$ computational complexity and $\mathcal{O}(N^2)$ memory requirements, rendering them intractable for even medium sized datasets where $N\gtrsim 10,000$. Sparse variational Gaussian processes have been developed to alleviate some of the pains of scaling GPs to large datasets by approximating the exact GP posterior with a variational distirbution conditioned on a small set of inducing variables designed to summarise the dataset.

Jake Cunningham

May 26, 2023

Optimal Transport for Offline Imitation Learning

With the advent of large datasets, offline reinforcement learning (ORL) is a promising framework for learning good decision-making policies without interacting with the real environment. However, offline RL requires the dataset to be reward-annotated, which presents practical challenges when reward engineering is difficult or when obtaining reward annotations is labor-intensive.

Yicheng Luo

Last updated on Jun 10, 2023

Iterative State Estimation in Non-linear Dynamical Systems Using Approximate Expectation Propagation

State estimation in nonlinear systems is difficult due to the non-Gaussianity of posterior state distributions. For linear systems, an exact solution is attained by running the Kalman filter/smoother. However for nonlinear systems, one typically relies on either crude Gaussian approximations by linearising the system (e.

So Takao

Jun 27, 2022

See all

Recent News

Papers at NeurIPS Conference

Papers accepted at NeurIPS Conference

Oct 2, 2023

Best Paper Award at FAccT 2023

Marc Deisenroth

Jun 15, 2023

Paper on Safe Trajectory Sampling Accepted at CASE

May 25, 2023

Dr. Rosca

Dr. Mihaela Rosca passed her PhD viva

Marc Deisenroth

May 3, 2023

Senior Research Fellowship in Machine Learning for Weather and Climate Science

Feb 19, 2023

See all

Recent Publications

Infinite Neural Operators: Gaussian Processes on Functions

A variety of infinitely wide neural architectures (e.g., dense NNs, CNNs, and transformers) induce Gaussian process (GP) priors over …

Daniel Augusto De Souza, Yuchen Zhu, Harry Jake Cunningham, Yuri Saporito, Diego Mesquita, Marc P. Deisenroth

Parameter Efficient Fine-tuning via Explained Variance Adaptation

Foundation models (FMs) are pre-trained on large-scale datasets and then fine-tuned for a specific downstream task. The most common …

Fabian Paischer, Lukas Hauzenberger, Thomas Schmied, Benedikt Alkin, Marc P. Deisenroth, Sepp Hochreiter

Guaranteed Prediction Sets for Functional Surrogate models

We propose a method for obtaining statistically guaranteed prediction sets for functional machine learning methods: surrogate models …

Ander Gray, Vignesh Gopakumar, Sylvain Rousseau, Sebastien Destercke

Calibrated Physics-Informed Uncertainty Quantification

Neural PDEs have emerged as inexpensive surrogate models for numerical PDE solvers. While they offer efficient approximations, they …

Vignesh Gopakumar, Ander Gray, Lorenzo Zanisi, Timothy Nunn, Daniel Giles, Matt Kusner, Stanislas Pamela, Marc P. Deisenroth

See all publications

Recent & Upcoming Talks

Maud Lemercier: Non-adversarial training of Neural SDEs with signature kernel scores

Neural SDEs are continuous-time generative models for sequential data. State-of-the-art performance for irregular time series …

Oct 19, 2023

Thomas Baldwin-McDonald: Bayesian Deep Learning with Physics-informed Gaussian Processes

Dynamical systems are ubiquitous across the natural sciences, with many physical and biological processes being driven on a fundamental …

Oct 5, 2023

Viacheslav Borovitskiy: Geometric Gaussian Processes

Gaussian processes (GPs) are often considered to be the gold standard in settings where well-calibrated predictive uncertainty is of …

Feb 8, 2023

Michel Tsamados: AI for polar remote sensing: making sense or making it up?

My background is in physics and my early work since being a postdoc and academic has been on model parameterizations of sea ice and …

Feb 3, 2023

Marc Killpack: Modelling and Optimal Control for Uncertain Robotic Systems

Over the last several years, our research group has worked to develop control and modeling methods for large-scale, deformable, …

Feb 2, 2023

See all

Featured Publications

Daniel Augusto De Souza, Yuchen Zhu, Harry Jake Cunningham, Yuri Saporito, Diego Mesquita, Marc P. Deisenroth

2025-12-04 Advances in Neural Information Processing Systems (NeurIPS)

Infinite Neural Operators: Gaussian Processes on Functions

A variety of infinitely wide neural architectures (e.g., dense NNs, CNNs, and transformers) induce Gaussian process (GP) priors over their outputs. These relationships provide both an accurate characterization of the prior predictive distribution and enable the use of GP machinery to improve the uncertainty quantification of deep neural networks. In this work, we extend this connection to neural operators (NOs), a class of models designed to learn mappings between function spaces. Specifically, we show conditions for when arbitrary-depth NOs with Gaussian-distributed convolution kernels converge to function-valued GPs. Based on this result, we show how to compute the covariance functions of these NO-GPs for two NO parametrizations, including the popular Fourier neural operator (FNO). With this, we compute the posteriors of these GPs in realistic scenarios. This work is an important step towards uncovering the inductive biases of current FNO architectures and opens a path to incorporate novel inductive biases for use in kernel-based operator learning methods.

Fabian Paischer, Lukas Hauzenberger, Thomas Schmied, Benedikt Alkin, Marc P. Deisenroth, Sepp Hochreiter

2025-12-04 Advances in Neural Information Processing Systems (NeurIPS)

Parameter Efficient Fine-tuning via Explained Variance Adaptation

Foundation models (FMs) are pre-trained on large-scale datasets and then fine-tuned for a specific downstream task. The most common fine-tuning method is to update pretrained weights via low-rank adaptation (LoRA). Existing initialization strategies for LoRA often rely on singular value decompositions (SVD) of gradients or weight matrices. However, they do not provably maximize the expected gradient signal, which is critical for fast adaptation. To this end, we introduce Explained Variance Adaptation (EVA), an initialization scheme that uses the directions capturing the most activation variance, provably maximizing the expected gradient signal and accelerating fine-tuning. EVA performs incremental SVD on minibatches of activation vectors and selects the right-singular vectors for initialization once they converged. Further, by selecting the directions that capture the most activation-variance for a given rank budget, EVA accommodates adaptive ranks that reduce the number of trainable parameters, while maintaining or improving downstream performance. We apply EVA to a variety of fine-tuning tasks as language generation and understanding, image classification, and reinforcement learning. EVA exhibits faster convergence than competitors and achieves the highest average score across a multitude of tasks per domain while reducing the number of trainable parameters through rank redistribution.

Ander Gray, Vignesh Gopakumar, Sylvain Rousseau, Sebastien Destercke

2025-07-24 Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI)

Guaranteed Prediction Sets for Functional Surrogate models

We propose a method for obtaining statistically guaranteed prediction sets for functional machine learning methods: surrogate models which map between function spaces, motivated by the need to build reliable PDE emulators. The method constructs nested prediction sets on a low-dimensional representation (an SVD) of the surrogate model’s error, and then maps these sets to the prediction space using set-propagation techniques. This results in prediction sets for functional surrogate models with conformal prediction coverage guarantees. We use zonotopes as basis of the set construction, which allow an exact linear propagation and are closed under Cartesian products, making them well-suited to this high-dimensional problem. The method is model agnostic and can thus be applied to complex Sci-ML models, including Neural Operators, but also in simpler settings. We also introduce a technique to capture the truncation error of the SVD, preserving the guarantees of the method.

Vignesh Gopakumar, Ander Gray, Lorenzo Zanisi, Timothy Nunn, Daniel Giles, Matt Kusner, Stanislas Pamela, Marc P. Deisenroth

2025-07-13 Proceedings of the International Conference on Machine Learning (ICML)

Calibrated Physics-Informed Uncertainty Quantification

Neural PDEs have emerged as inexpensive surrogate models for numerical PDE solvers. While they offer efficient approximations, they often lack robust uncertainty quantification (UQ), limiting their practical utility. Existing UQ methods for these models typically have high computational demands and lack guarantees. We introduce a novel framework for calibrated physics-informed uncertainty quantification to address these limitations. Our approach leverages physics residual errors as a nonconformity score within a conformal prediction (CP) framework. This enables data-free, model-agnostic, and statistically guaranteed uncertainty estimates. Our framework utilises convolutional layers as finite difference stencils for gradient estimation, our framework provides inexpensive coverage bounds for the violation of conservation laws within model predictions. In our experiments, we utilise CP to obtain marginal coverage for each cell and joint coverage over the entire prediction domain of various PDEs.

Denis Hadjivelichkov, Sicelukwanda N. T. Zwane, Marc P. Deisenroth, Lourdes Agapito, Dimitrios Kanoulas

2025-05-19 Proceedings of the International Conference on Robotics and Automation (ICRA)

Semantic Cross-Pose Correspondence from a Single Example

This article focuses on predicting how an object can be transformed to a semantically meaningful pose relative to another object, given only one or few examples. Current pose correspondence methods rely on vast 3D object datasets and do not actively consider semantic information, which limits the objects to which they can be applied. We present a novel method for learning cross-object pose correspondence. The proposed method detects interacting object parts, performs one-shot part correspondence, and uses geometric and visual-semantic features. Given one example of two objects posed relative to each other, the model can learn how to transfer the demonstrated relations to unseen object instances.

See all publications

Sustainability and Machine Learning Group

Research group

Meet the Team

Principal Investigators

Google DeepMind Chair of Machine Learning and Artificial Intelligence

Associate Professor

Administrators

Administrator

PhD Students

PhD Student

PhD Student

PhD Student

PhD Student

PhD Student

PhD Student

PhD Student

Alumni

Senior Research Fellow (11/2023-03/2025)

PhD (10/2020-12/2024)

PhD (10/2020-09/2025)

Senior Research Fellow (08/2021 - 07/2023)

PhD (09/2020-12/2024)

Senior Research Fellow (11/2020 - 07/2023)

PhD (10/2018-11/2021)

PhD (09/2019-09/2024)

PhD (10/2016-03/2021)

PhD (03/2020-05/2023)

Researcher (01/2020-10/2024)

PhD (10/2015-10/2019)

PhD (11/2016-11/2021)

Research Associate

PhD (10/2016-02/2021)

PhD (10/2017-07/2022)

PhD (10/2017-08/2022)

PhD (06/2016-03/2020)

PhD (10/2014-08/2018)

Recent Blog Posts

Recent News

Recent Publications

Recent & Upcoming Talks

Featured Publications