Our Research

We investigate robots that can understand their environment semantically and geometrically, in order to perform manipulation and other safety critical tasks in proximity to humans. This encompasses semantic understanding under open-set conditions, map representations of the environment, active perception and planning, as well as adaptation and continual self-supervised learning.
osmAG-LLM: Zero-Shot Open-Vocabulary Object Navigation via Semantic Maps and Large Language Models Reasoning

F Xie, S Schwertfeger, H Blum

IEEE Robotics and Automation Letters 2026

doiarxivPDF
SNI-SLAM++: Tightly-Coupled Semantic Neural Implicit SLAM

S Zhu, G Wang, H Blum, Z Wang, G Zhang, D Cremers, M Pollefeys, H Wang

T-PAMI 2025

doi
CroCoDL: Cross-device Collaborative Dataset for Localization

H Blum, A Mercurio, J O'Reilly, T Engelbracht, M Dusmanu, M Pollefeys, ...

CVPR 2025

Hoi! - A Multimodal Dataset for Force-Grounded, Cross-View Articulated Manipulation

T Engelbracht, R Zurbrügg, M Wohlrapp, M Büchner, A Valada, M Pollefeys, H Blum, Z Bauer

2025

arxivPDF
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

Y Yang, L Piccinelli, M Segu, S Li, R Huang, Y Fu, M Pollefeys, H Blum, Z Bauer

ICCV 2025

arxivPDFwebsitecode
ActLoc: Learning to Localize on the Move via Active Viewpoint Selection

B Sun, L Di Giammarino, H Blum, M Pollefeys

CoRL 2025

arxivPDFwebsite
FunGraph: Functionality Aware 3D Scene Graphs for Language-Prompted Scene Interaction

D Rotondi, F Scaparro, H Blum, K O. Arras

IROS 2025

arxivPDFwebsite
SpotLight: Robotic Scene Understanding through Interaction and Affordance Detection

T Engelbracht, R Zurbrügg, M Pollefeys, H Blum, Z Bauer

HUMANOIDS 2025

arxivPDFwebsite
FrontierNet: Learning Visual Cues to Explore

B Sun, H Chen, S Leutenegger, C Cadena, M Pollefeys, H Blum

IEEE Robotics and Automation Letters 2025

doiwebsitearxivPDFcodedoi
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding

G Ji, S Weder, F Engelmann, M Pollefeys, H Blum

CVPR 2025

arxivPDFlabelmaker.orgcode
DepthSplat: Connecting Gaussian Splatting and Depth

H Xu, S Peng, F Wang, H Blum, D Barath, A Geiger, M Pollefeys

CVPR 2025

arxivPDFwebsitecodeopenaccess.thecvf.com
Spot-On: A Mixed Reality Interface for Multi-Robot Cooperation

T Engelbracht, P Lukovic, T Behrens, K Lascheit, R Zurbrügg, M Pollefeys, ...

2025

arxivPDF
Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors

TY Lim, B Sun, M Pollefeys, H Blum

2025

arxivPDF
Lost & Found: Tracking Changes from Egocentric Observations in 3D Dynamic Scene Graphs

T Behrens, R Zurbrügg, M Pollefeys, Z Bauer, H Blum

Robotics and Automation Letters (RA-L) 2025

arxivPDFwebsitedoi
NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models

F Milano, JJ Chung, H Blum, R Siegwart, L Ott

IROS 2024

HoloSpot: Intuitive Object Manipulation via Mixed Reality Drag-and-Drop

P Soler Garcia, P Lukovic, L Reynaud, A Sgobbi, F Bruni, M Brun, M Zünd, R Bollati, M Pollefeys, H Blum, Z Bauer

2024

arxivPDFwebsite
Learning Where to Look: Self-supervised Viewpoint Selection for Active Localization Using Geometrical Information

L Di Giammarino, B Sun, G Grisetti, M Pollefeys, H Blum, D Barath

ECCV 2024

doiarxivPDFcode
“Where am I?” Scene Retrieval with Language

J Chen, D Barath, I Armeni, M Pollefeys, H Blum

ECCV 2024

doiarxivPDFwebsite
OpenDAS: Open-Vocabulary Domain Adaptation for Segmentation

G Yilmaz, S Peng, M Pollefeys, F Engelmann, H Blum

2024

arxivPDFwebsite
SNI-SLAM: Semantic Neural Implicit SLAM

S Zhu, G Wang, H Blum, J Liu, L Song, M Pollefeys, H Wang

CVPR 2024

arxivPDFcodeopenaccess.thecvf.com
A 3D Mixed Reality Interface for Human-Robot Teaming

J Chen, B Sun, M Pollefeys, H Blum

ICRA 2024

arxivPDFcode
Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach

M Hanlon, B Sun, M Pollefeys, H Blum

ICRA 2024

arxivPDF
Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds

O Lemke, Z Bauer, R Zurbrügg, M Pollefeys, F Engelmann, H Blum

ICRA 2024

arxivPDFwebsitecode
LabelMaker: Automatic Semantic Label Generation from RGB-D Trajectories

S Weder, H Blum, F Engelmann, M Pollefeys

3DV 2024

arxivPDFlabelmaker.orgcode