Autonomous Embodied Agents: When Robotics Meets Deep Learning Reasoning
Roberto Bigazzi Ph.D. Thesis, 2023
pdf / cover / bibtex
In this thesis, I present the research carried out during my Ph.D., which focused on the challenges defined by the field of Embodied Artificial Intelligence. I would like to thank my family, friends, and supervisor once again for their support during the last three years.
We propose a new task, Personalized Instance-based Navigation (PIN), in which an embodied agent is tasked with locating and reaching a specific personal object by distinguishing it among multiple instances of the same category. The task is accompanied by PInNED, a new dedicated dataset composed of photo-realistic scenes augmented with additional 3D objects.
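For concreteness, the sketch below shows what a PIN episode record could look like as a data structure; every field name here is hypothetical and does not reflect the released PInNED format.

```python
from dataclasses import dataclass, field

# Hypothetical sketch of a PIN episode record; field names are
# illustrative and not taken from the released PInNED format.
@dataclass
class PINEpisode:
    scene_id: str                 # photo-realistic scene the episode runs in
    target_object_id: str         # the specific personal instance to reach
    target_category: str          # e.g. "backpack"
    reference_images: list = field(default_factory=list)   # visual refs of the target
    reference_captions: list = field(default_factory=list) # textual refs of the target
    distractor_ids: list = field(default_factory=list)     # same-category instances to reject
    start_position: tuple = (0.0, 0.0, 0.0)
    goal_position: tuple = (0.0, 0.0, 0.0)

episode = PINEpisode(
    scene_id="scene_0042",
    target_object_id="backpack_red_03",
    target_category="backpack",
    reference_images=["ref_0.png", "ref_1.png"],
    reference_captions=["a red backpack with a white logo"],
    distractor_ids=["backpack_blue_01", "backpack_black_07"],
)
print(episode.target_object_id)
```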
We propose a novel architecture inspired by Generative Adversarial Networks (GANs) that produces meaningful and well-formed synthetic instructions to improve navigation agents' performance. Validation on REVERIE and R2R highlights the promise of our approach, which achieves state-of-the-art results.
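The toy sketch below illustrates the general adversarial recipe behind this idea: a generator maps a path encoding to instruction tokens while a discriminator learns to tell them apart from human-written ones. All modules, dimensions, and the soft-token relaxation are illustrative assumptions, not the thesis architecture.

```python
import torch
import torch.nn as nn

# Toy GAN-style instruction generation: a generator maps a path encoding
# to instruction-token logits; a discriminator scores whether an
# instruction embedding looks human-written. All sizes are illustrative.
PATH_DIM, VOCAB, SEQ_LEN, EMB = 64, 1000, 12, 32

generator = nn.Sequential(nn.Linear(PATH_DIM, 256), nn.ReLU(),
                          nn.Linear(256, SEQ_LEN * VOCAB))
discriminator = nn.Sequential(nn.Linear(SEQ_LEN * EMB, 128), nn.ReLU(),
                              nn.Linear(128, 1))
token_emb = nn.Embedding(VOCAB, EMB)

path = torch.randn(8, PATH_DIM)                      # batch of path encodings
logits = generator(path).view(8, SEQ_LEN, VOCAB)
# Soft token embeddings keep the pipeline differentiable (Gumbel-style relaxation).
soft_tokens = torch.softmax(logits, dim=-1) @ token_emb.weight
fake_score = discriminator(soft_tokens.flatten(1))

real_tokens = torch.randint(0, VOCAB, (8, SEQ_LEN))  # human instructions (placeholder)
real_score = discriminator(token_emb(real_tokens).flatten(1))

bce = nn.BCEWithLogitsLoss()
d_loss = bce(real_score, torch.ones_like(real_score)) + \
         bce(fake_score.detach(), torch.zeros_like(fake_score))
g_loss = bce(fake_score, torch.ones_like(fake_score))  # fool the discriminator
print(d_loss.item(), g_loss.item())
```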
We propose a novel approach that combines visual and fine-tuned CLIP features to generate grounded
language-visual features for mapping. These region classification features are seamlessly integrated
into a global grid map using an exploration-based navigation policy.
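As a rough illustration of the mapping step, the sketch below accumulates per-region features into a global grid with a running mean and then queries the map with a text embedding. The grid size, feature dimension, and update rule are assumptions for illustration, not the exact procedure of the thesis.

```python
import torch

# Minimal sketch of accumulating grounded region features into a global
# grid map; resolution, feature dim, and running-mean update are assumed.
GRID, FEAT = 128, 512                      # map resolution, CLIP feature dim
feature_map = torch.zeros(GRID, GRID, FEAT)
counts = torch.zeros(GRID, GRID, 1)

def integrate(cells: torch.Tensor, feats: torch.Tensor) -> None:
    """cells: (N, 2) grid coordinates of observed regions (e.g. from depth
    and agent pose); feats: (N, FEAT) CLIP region features for those cells."""
    for (r, c), f in zip(cells.tolist(), feats):
        counts[r, c] += 1
        # Running mean keeps the map stable as cells are revisited.
        feature_map[r, c] += (f - feature_map[r, c]) / counts[r, c]

# Fake observation: 5 regions with random CLIP-like features.
integrate(torch.randint(0, GRID, (5, 2)), torch.randn(5, FEAT))

# The map can then be queried with a text embedding by cosine similarity.
text = torch.randn(FEAT)
sim = torch.nn.functional.cosine_similarity(feature_map.view(-1, FEAT),
                                            text.unsqueeze(0), dim=-1)
print(sim.view(GRID, GRID).max().item())
```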
We propose and evaluate an approach that combines recent advances in visual robotic exploration
and image captioning on images generated through agent-environment interaction. Our approach can
generate smart scene descriptions that maximize semantic knowledge of the environment and avoid
repetitions.
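A minimal sketch of the repetition-avoidance idea follows: emit a caption only if it is sufficiently novel with respect to the captions produced so far. The Jaccard word-overlap measure and the 0.5 threshold are illustrative stand-ins for the learned criteria of the actual system.

```python
# Emit a caption only if it is sufficiently novel w.r.t. past captions.
def jaccard(a: str, b: str) -> float:
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / len(wa | wb) if wa | wb else 0.0

def maybe_describe(caption: str, history: list[str], thr: float = 0.5) -> bool:
    if all(jaccard(caption, past) < thr for past in history):
        history.append(caption)
        return True
    return False

history: list[str] = []
for cap in ["a kitchen with a wooden table",
            "a kitchen with a table and chairs",   # near-duplicate, dropped
            "a hallway leading to a bathroom"]:
    if maybe_describe(cap, history):
        print("described:", cap)
```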
We use explainable maps to visualize model predictions and highlight the correlation between
observed entities and words generated by a captioning model during the embodied exploration.
We propose Spot the Difference: a novel task for Embodied AI in which the agent has access to an outdated map of the environment and needs to recover the correct layout within a fixed time budget.
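To make the evaluation intuition concrete, the sketch below scores a hypothetical episode by the fraction of changed map cells the agent managed to correct; the binary occupancy encoding and the metric itself are assumptions for illustration.

```python
import numpy as np

# Illustrative scoring for a Spot the Difference-style episode: compare the
# agent's corrected map against the true layout, restricted to the cells
# where the outdated map was wrong. Encoding and metric are assumed.
rng = np.random.default_rng(0)
true_map = rng.integers(0, 2, (64, 64))    # current occupancy layout
outdated = true_map.copy()
changed = rng.random((64, 64)) < 0.1       # 10% of cells changed over time
outdated[changed] = 1 - outdated[changed]

agent_map = outdated.copy()
# Suppose the agent, within its time budget, re-observed half of the scene.
seen = rng.random((64, 64)) < 0.5
agent_map[seen] = true_map[seen]

recovered = (agent_map == true_map) & changed
print("fraction of changes recovered:", recovered.sum() / changed.sum())
```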
We propose to train a navigation model with a purely intrinsic reward signal to guide exploration, which is based on the impact of the robot's actions on its internal representation of the environment.
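The sketch below captures the core of an impact-style intrinsic reward: reward the agent in proportion to how much its latest action changed an encoding of its observation. The encoder and the L2 distance are illustrative choices, a minimal sketch rather than the trained model.

```python
import torch
import torch.nn as nn

# Minimal impact-style intrinsic reward: the agent is rewarded in
# proportion to how much its latest action changed the encoding of what
# it observes. Encoder architecture and L2 distance are assumed here.
encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 128))

def impact_reward(obs_t: torch.Tensor, obs_next: torch.Tensor) -> torch.Tensor:
    with torch.no_grad():
        phi_t, phi_next = encoder(obs_t), encoder(obs_next)
    # Large change in the internal representation -> large intrinsic reward.
    return torch.norm(phi_next - phi_t, p=2, dim=-1)

obs_t = torch.rand(1, 3, 64, 64)      # RGB observation before the action
obs_next = torch.rand(1, 3, 64, 64)   # observation after the action
print(impact_reward(obs_t, obs_next).item())
```

In practice, rewards of this kind are often additionally modulated (e.g., by visitation counts) so that exploration does not vanish as the representation stabilizes; the sketch omits that for brevity.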
We build and release a new 3D space for embodied navigation with a unique characteristic: it reproduces a complete art museum. We name this environment ArtGallery3D (AG3D).
We detail how to transfer the knowledge acquired in simulation to the real world, describing the architectural discrepancies that hinder the Sim2Real adaptation of models trained on the Habitat simulator, and we propose a novel solution tailored to deployment in real-world scenarios.
We devise an embodied setting in which an agent needs to explore a previously unknown environment while recounting what it sees along the path. The agent needs to navigate the environment driven by an exploration goal, select proper moments for description, and output natural language descriptions of relevant objects and scenes.
Reviewing Committees
Journals:
IEEE Robotics and Automation Letters (RA‑L)
IEEE Geoscience and Remote Sensing Letters (GRSL)
Pattern Recognition Letters (PRL)
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)
Conferences:
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
European Conference on Computer Vision (ECCV) [Outstanding Reviewer @ ECCV 2024]
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
IEEE International Conference on Robotics and Automation (ICRA)
IAPR International Conference on Pattern Recognition (ICPR)
ACM International Conference on Multimedia (ACMMM)
International Conference on Image Analysis and Processing (ICIAP)