Elias Ramzi - AI Research scientist
I am a Research Scientist at valeo.ai, where I work on deep learning for autonomous driving. My research focuses on end-to-end driving — learning to plan directly from sensor data — together with the world models that predict how a scene will unfold and the vision-language models and LLMs used for reasoning and explainability. Recent work includes VaViM & VaVAM, a video world model and action model for driving. I also co-supervise two PhD students, one on LLMs/VLMs and one on world models and reinforcement learning.
Before joining valeo.ai, I earned a PhD in computer vision at Cnam, supervised by Nicolas Thome (Sorbonne Université), Nicolas Audebert (IGN) and Clément Rambour (Cnam), with Xavier Bitot (Coexya) as industrial advisor. My thesis — awarded the AFRIF Prix de Thèse — focused on ranking-loss optimization and hierarchical learning for image retrieval (ROADMAP, HAPPIER, SupRank).
News
======
- We have released a tech report and fully open-sourced code and weights for VaViM & VaVAM. This project builds a world model composed of a next frame predictor (VaVIM) and an action model (VaVAM); [paper] [code].
- LLM-wrapper, which allows black-box fine-tuning of VLMs, has been accepted at ICLR 2025, congrats Amaia; [paper] [code].
- SupRank is accepted at TPAMI, it is the first of its kind hierarchical landmark retrieval dataset; [paper] [code] [dataset].
- I started at valeo.ai as a research scientist.
- Our paper for local prompt learning, GalLoP, has been accepted to ECCV 2024; [paper] [code].
- Our paper ITEM on improving the learning and evaluation of Message-Passing GNNs models in the recommendation task has been accepted at TMLR; [paper].
- I defended my PhD thesis on the 20th of March. I now officialy hold a PhD 🎉 [manuscript].
- Our paper on OOD detection, HEAT, using energy-based models has been accepted to ICML 2023; [paper] [code].
- Our paper on hierarchical image retrieval, HAPPIER, has been accepted to ECCV 2022; [paper] [code].
- Our paper on ranking metric optimization for image retrieval, ROADMAP, has been accepted to NeurIPS 2021; [paper] [code].
Publications
======
Test-Time Trajectory Optimization for Autonomous Driving
Yihong Xu, Éloi Zablocki, Yuan Yin, Elias Ramzi, Ellington Kirby, Alexandre Boulch, Matthieu Cord: Test-Time Trajectory Optimization for Autonomous Driving. arXiv preprint (2026).
DRIV-EX: Counterfactual Explanations for Driving LLMs
Amaia Cardiel, Éloi Zablocki, Elias Ramzi, Eric Gaussier: DRIV-EX: Counterfactual Explanations for Driving LLMs. ACL findings (2026).
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
Shashanka Venkataramanan, Valentinos Pariza, Mohammadreza Salehi, Lukas Knobel, Spyros Gidaris, Elias Ramzi, Andrei Bursuc, Yuki M. Asano: Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning. CVPR (2026).
LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension
Amaia Cardiel, Éloi Zablocki, Elias Ramzi, Oriane Siméoni, Matthieu Cord: LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Models for Referring Expression Comprehension. Internation Conference on Learning Representations (ICLR 2025).
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling
Florent Bartoccioni, Elias Ramzi, Victor Besnier, Shashanka Venkataramanan, Tuan-Hung Vu, Yihong Xu, Loick Chambon, Spyros Gidaris, Serkan Odabas, David Hurych, Renaud Marlet, Alexandre Boulch, Mickael Chen, Éloi Zablocki, Andrei Bursuc, Eduardo Valle, Matthieu Cord: VaViM and VaVAM: Autonomous Driving through Video Generative Modeling. ArXiv Preprint 2025.
Optimization of Rank Losses for Image Retrieval
Elias Ramzi, Nicolas Audebert, Clément Rambour, André Araujo, Xavier Bitot, Nicolas Thome: Optimization of Rank Losses for Image Retrieval. In IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, 2025).
GalLoP: Learning Global and Local Prompts for Vision-Language Models
Marc Lafon, Elias Ramzi, Clément Rambour, Nicolas Audebert, Nicolas Thome: GalLoP: Learning Global and Local Prompts for Vision-Language Models. European Conference on Computer Vision (ECCV 2024).
ITEM: Improving Training and Evaluation of Message-Passing based GNNs for top-k recommendation
Yannis Karmim, Elias Ramzi, Raphaël Fournier-S 'Niehotta, Nicolas Thome: ITEM: Improving Training and Evaluation of Message-Passing based GNNs for top-k recommendation. TMLR (2024).
Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection
Marc Lafon, Elias Ramzi, Clément Rambour, Nicolas Thome: Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection. International Conference on Machine Learning (ICML 2023).
Hierarchical Average Precision Training for Pertinent Image Retrieval
Elias Ramzi, Nicolas Audebert, Nicolas Thome, Clément Rambour, Xavier Bitot: RHierarchical Average Precision Training for Pertinent Image Retrieval. In: European Conference on Computer Vision. Springer (ECCV, 2022).
Robust and Decomposable Average Precision for Image Retrieval
Elias Ramzi, Nicolas Thome, Clément Rambour, Nicolas Audebert, Xavier Bitot: Robust and decomposable average precision for image retrieval. Advances in Neural Information Processing Systems 34 (NeurIPS, 2021).
