Institute for Visual Computing

Acronym
Homepage	http://www.ivc.ethz.ch/
Country	Switzerland
ZIP, City
Address
Phone
Type	Academy
Top-level organization	ETH Zurich
Parent organization	Department of Computer Science
Current organization	Institute for Visual Computing
Child organizations	Computer Graphics Laboratory Computer Vision and Geometry Group Computer Vision and Learning Group Interactive Geometry Lab IVIA Lab Media Technology Center

Open Opportunities

Virtual Reality Avatars to support Learning and Education

ETH Zurich
Sensing, Interaction & Perception Lab Other organizations: Computer Graphics Laboratory

We will explore the design space of avatars in Virtual Reality to support learning and creativity. The project will leverage the concept of "embodied cognition", a set of theories that imply that our bodies and their interaction with the environment can impact how we learn. We will develop a Unity3D-based VR environment for embodied learning that can be deployed on everyday VR headsets.

Computer Graphics, Computer-Human Interaction
Master Thesis, Semester Project

Scene Exploration and Object Search for Robotic System

ETH Zurich
Computer Vision and Geometry Group

Object search is the problem of letting a robot find an object of interest. For this, the robot has to explore the environment it is placed into until the object is found. To explore an environment, current robotic methods use geometrical sensing, i.e. stereo cameras, LiDAR sensors or similar, such that they can create a 3D reconstruction of the environment which also has a clear distinction of 'known & occupied', 'known & unoccupied' and 'unknown' regions of space. The problem of the classic geometric sensing approach is that it has no knowledge of e.g. doors, drawers, or other functional and dynamic elements. These however are easy to detect from images. We therefore want to extend prior object search methods such as https://naoki.io/portfolio/vlfm with an algorithm that can also search through drawers and cabinets. The project will require you to train your own detector network to detect possible locations of an object, and then implement a robot planning algorithm that explores all the detected locations.

Intelligent Robotics, Robotics and Mechatronics
Master Thesis

Multimodal Floorplan Encoding

ETH Zurich
Computer Vision and Geometry Group

The objective of the project is to train a neural network taking any floorplan modality as input and outputting an embedding in a latent space shared by all the floorplan modalities. This is beneficial for downstream applications such as visual localization and model alignment. Check the attached the documents for more details. The thesis will be co-supervised between CVG, ETH Zurich and Microsoft Spatial AI lab, Zurich.

Computer Vision
ETH Zurich (ETHZ), Master Thesis

Reconstructing liquids from multiple views with 3D Gaussian Splatting

ETH Zurich
Computer Vision and Geometry Group Other organizations: Advanced Interactive Technologies

This project reconstructs liquids from multi-view imagery, segmenting fluid regions using methods like Mask2Former and reconstructing static scenes with 3D Gaussian Splatting or Mast3r. The identified fluid clusters initialize a particle-based simulation, refined for temporal consistency and enhanced by optional thermal data and visual language models for fluid properties.

Computer Vision
Master Thesis, Semester Project

Human-Robot Communication with Text Prompts and 3D Scene Graphs

ETH Zurich
Computer Vision and Geometry Group

This project extends previous work [a] on calculating similarity scores between text prompts and 3D scene graphs representing environments. The current method identifies potential locations based on user descriptions, aiding human-agent communication, but is limited by its coarse localization and inability to refine estimates incrementally. This project aims to enhance the method by enabling it to return potential locations within a 3D map and incorporate additional user information to improve localization accuracy incrementally until a confident estimate is achieved. [a] Chen, J., Barath, D., Armeni, I., Pollefeys, M., & Blum, H. (2024). "Where am I?" Scene Retrieval with Language. ECCV 2024.

Computer Vision
Master Thesis, Semester Project

Uncertainty-aware 3D Mapping

ETH Zurich
Computer Vision and Geometry Group

The goal of this project is to enhance the 3D mapping capabilities of a robotic agent by incorporating uncertainty measures into MAP-ADAPT, an incremental mapping pipeline that constructs an adaptive voxel grid from RGB-D input.

Computer Vision, Intelligent Robotics
Master Thesis, Semester Project

OpenSet Semantic SLAM

ETH Zurich
Computer Vision and Geometry Group

The goal of the project is to create a Simultaneous Localization and Mapping algorithm that, besides estimating the camera trajectory and the geometry of the scene, also obtains object instances. These object instances should not be restricted to a fixed set of classes (e.g., chair, table). Hence, the problem is open set segmentation.

Computer Vision, Intelligent Robotics
Master Thesis, Semester Project