RelativePoseCam

The goal of this task is to provide space displacement vector and euler displacement vector (difference of camera directions) for given pair of two neighbour frames in video.

This task is neural based, so we have to collect the data, and we will use colmap in console mode for this purpose. All actions are performed in nvidia-docker container, which will install colmap and useful apps.

Then you can find sparse_model.sh bash script in storage/ folder, this script will prepare and preprocess the data. Then in code/model path you can run train_eval.py script and provide paths with colmap sparse models to the dataset class.

Solution. For this task I used resnet-34 architecture for extract features for each image in current pair. Then I concatenate two feature tensors and provide it as input for 3-layer fully convolutional NN, consists of convolutional layers. The final layer produces 6-dimanesional vector (first 3 components - for angles, activation function - tanh, second 3 components - for displacement). The loss function is MAE, because I used videos from smartphone, and quality of frames in sample may be relatively bad. If I use MSE, the NN will adjust to relatively big number of outliers.

Example of inference and training / validation loss you can see below.

angle difference (radians) [-0.2180, 0.2343, 0.2135] displacement [-0.1910, -0.1233, 0.2215]

Learning, losses

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
code		code
storage		storage
Dockerfile		Dockerfile
README.md		README.md
build.sh		build.sh
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RelativePoseCam

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RelativePoseCam

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages