Computer Vision: 3D Reconstruction

This course covers areas of computer vision which deal with reconstruction of 3D scene. This means, for instance, to recover a 3D scene from a video, or to detect and track a dynamically changing object in the scene. We introduce the underlying principles and methods to solve such tasks, where the best methods are often a combination of deep learning and traditional approaches. To do so, we cover the necessary background knowledge, such as image processing, camera models, deep learning, image formation model, and Kalmann Filters.

Date and Place:
Lecture: Tuesdays from 11-13;
Exercise: Thursdays from 11-13
Mathematikon B, Berliner Str. 43, SR B128

Content:
- Brief introduction to required Machine Learning concepts (Neural Networks, Convolution NN, etc)
- Basic Image processing (Filtering, Bilateral Filter)
- Sparse feature detection and description (points, edges, LIFT) w/ and w/o Neural networks
- Projective Geometry, Epipolar Geometry
- Sparse reconstruction (image matching, image descriptors) w/ and w/o Neural Networks
- Robust matching w/ Neural networks (Differentiable RASANC and related pipelines)
- Camera Localization and SLAM
- Stereo Vision, Dense 3D Reconstruction
- 3D Object detection (End-to-End Trainable Pipelines) w/ and w/o Neural Networks
- 3D Object tracking (6D Pose estimation, Kalman Filter, Particle Filter)

Formalities:
Teaching assistant (main point of contact): Philip Grassal: philip-william.grassal@iwr.uni-heidelberg.de

Registration: Via Moodle (now open: Computer Vision: 3D WiSe 2021/22). Important: We only have 40 slots for participants. Due to size of lecture room, we only have 20 slots for coming in person to the lecture, and the remaining 20 people can participate via life zoom. Please indicate your preference via moodle. If too many people want top participate in person we will random select. If you have problems please contact teaching assistant.
Prerequisite: no prerequisites, but it is recommended to have Machine Learning Background, e.g. Fundamentals of Machine Learning or equivalent
Exam: TBA
Leistungspunkte: 6 LP
Usability: Physics, MSc., Angewandte Informatik, MSc. Scientific Computing

Teaching goals:
The students
- Understand the principles behind estimating 3D Point Clouds and Motion from two or more images. They are able to apply this knowledge to new tasks in the field of 3D reconstruction.
- Understanding the principles of an image formation process and corresponding Geometry. This can be utilized to design new algorithms, for e.g. 3D motion estimation for autonomous driving.
- Understand and implement methods that combine machine learning based methods with classical computer vision based techniques.
- Have studied various state-of-the-art computer vision systems and approaches, and are then able to evaluate and classify new systems and approaches.
- Understand and implement different approaches for object tracking .

top