Liulihan Kuang
AU-ID: AU636049
Student number: 201906612
Department of Computer Engineering, Aarhus University
September 2024
Welcome to the GitHub repository for Liulihan Kuang's master’s thesis project. This repository contains the source code and resources for the project, structured into two main stages of training and inference.
This project focuses on transforming 2D sketches into 3D shapes using the Sketch-A-Shape architecture. The implementation includes two key training stages, as well as inference capabilities.
**Training Stage 1 (VQ-VAE Training)**

This directory contains the implementation of the VQ-VAE (Vector Quantized Variational AutoEncoder) architecture (the core quantization idea is sketched after the list below), including:

- Network architecture files
- Training scripts
- Testing scripts
- Plotting scripts for visualizing results
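For orientation, here is a minimal, self-contained sketch of the vector-quantization bottleneck that gives the VQ-VAE its name. All class names, parameters, and sizes below are illustrative assumptions and are not taken from this repository's code.

```python
# Hedged sketch of a VQ-VAE quantization bottleneck (illustrative only;
# names and sizes are assumptions, not this repository's actual code).
import torch
import torch.nn as nn

class VectorQuantizer(nn.Module):
    def __init__(self, num_codes=512, code_dim=64, beta=0.25):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, code_dim)
        self.codebook.weight.data.uniform_(-1.0 / num_codes, 1.0 / num_codes)
        self.beta = beta  # weight of the commitment loss term

    def forward(self, z_e):
        # z_e: encoder output of shape (batch, num_latents, code_dim)
        flat = z_e.reshape(-1, z_e.shape[-1])
        # Squared L2 distance from each latent vector to every codebook entry
        dists = (flat.pow(2).sum(1, keepdim=True)
                 - 2 * flat @ self.codebook.weight.t()
                 + self.codebook.weight.pow(2).sum(1))
        indices = dists.argmin(dim=1)
        z_q = self.codebook(indices).view_as(z_e)
        # Codebook loss + commitment loss, as in the original VQ-VAE objective
        vq_loss = ((z_q - z_e.detach()).pow(2).mean()
                   + self.beta * (z_e - z_q.detach()).pow(2).mean())
        # Straight-through estimator: gradients flow past the argmin
        z_q = z_e + (z_q - z_e).detach()
        return z_q, indices.view(z_e.shape[:-1]), vq_loss
```

The discrete `indices` a layer like this produces are exactly what Stage 2 learns to predict.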
**Training Stage 2 and Inference (Transformer-based Model)**

This directory contains files for the transformer-based model used in the second training stage and for inference (a sketch of the Stage 2 idea follows the list), including:

- Transformer architecture
- Training scripts for Stage 2
- Pretrained VQ-VAE architecture from Stage 1
- Inference scripts
- Input sketch samples and output 3D shapes
- Plotting scripts for evaluating model performance
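As a rough illustration of how Stage 2 can work, the sketch below shows a decoder-only transformer that autoregressively predicts the Stage 1 codebook indices of a shape, conditioned on a sketch embedding prepended as a prefix token. Every name, dimension, and the conditioning scheme here are assumptions for illustration, not the repository's actual implementation.

```python
# Hedged sketch of the second-stage idea: a transformer that predicts the
# Stage 1 codebook indices of a 3D shape, conditioned on a sketch embedding.
# All names, dimensions, and the conditioning scheme are assumptions.
import torch
import torch.nn as nn

class ShapeCodeTransformer(nn.Module):
    def __init__(self, num_codes=512, seq_len=512, d_model=256,
                 nhead=8, num_layers=6, cond_dim=512):
        super().__init__()
        self.tok_emb = nn.Embedding(num_codes, d_model)
        self.pos_emb = nn.Parameter(torch.zeros(seq_len + 1, d_model))
        self.cond_proj = nn.Linear(cond_dim, d_model)  # sketch feature -> prefix token
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers)
        self.head = nn.Linear(d_model, num_codes)

    def forward(self, code_indices, sketch_emb):
        # code_indices: (batch, seq_len) discrete codes from the frozen Stage 1 VQ-VAE
        # sketch_emb:   (batch, cond_dim) embedding of the input sketch
        t = code_indices.shape[1]
        x = self.tok_emb(code_indices) + self.pos_emb[1:t + 1]
        cond = self.cond_proj(sketch_emb).unsqueeze(1) + self.pos_emb[:1]
        x = torch.cat([cond, x], dim=1)  # prepend the conditioning token
        # Causal mask so each position only attends to earlier positions
        mask = nn.Transformer.generate_square_subsequent_mask(t + 1).to(x.device)
        h = self.blocks(x, mask=mask)
        return self.head(h[:, :-1])  # logits predicting each next code index
```

Training would then minimize cross-entropy between these logits and the Stage 1 indices; at inference, codes are sampled one at a time and decoded into a 3D shape by the frozen, pretrained VQ-VAE decoder.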
To replicate the training and inference processes, follow these steps:
**Dataset Download**

Download the dataset required for both training stages:

```bash
wget https://clip-forge-pretrained.s3.us-west-2.amazonaws.com/exps.zip
unzip exps.zip
```
**Environment Setup**

Create and activate the conda environment for this project:

```bash
conda env create -f environment.yml
conda activate master_project
```
You can execute the training and inference stages with the following commands:
**Stage 1 (VQ-VAE Training)**

```bash
python train.py --dataset_path /path/to/dataset/
```
**Stage 2 (Transformer-based Model Training)**

```bash
python training_stage2.py --dataset_path /path/to/dataset/
```
**Inference (Generate 3D Shape from Sketch)**

```bash
python inference.py --save_path /path/to/output/
```
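If the inference output is a voxel occupancy grid (as in the original Sketch-A-Shape setup), it can be converted to a viewable mesh with marching cubes. The file names below, including `voxels.npy`, are hypothetical placeholders, not this repository's actual output format.

```python
# Hypothetical post-processing sketch: convert an assumed (N, N, N) voxel
# occupancy grid into a mesh. File names are placeholders.
import numpy as np
import trimesh
from skimage import measure

voxels = np.load("voxels.npy")  # assumed output; not the repo's real filename
verts, faces, normals, _ = measure.marching_cubes(voxels, level=0.5)
mesh = trimesh.Trimesh(vertices=verts, faces=faces, vertex_normals=normals)
mesh.export("shape.obj")  # openable in any standard mesh viewer
```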
**Note:** Both training and inference require a significant amount of GPU VRAM; make sure your system meets the hardware requirements. I do not know the exact minimum, since I simply adjusted the transformer architecture until it could run on the hardware available to me. Training and inference were both performed on an NVIDIA A40 with 48 GB of VRAM, and each used up to 46 GB.
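To check how much VRAM a run actually needs on your own GPU, PyTorch's built-in memory statistics can be used; a minimal sketch:

```python
# Measure peak GPU memory for one training step or inference pass.
import torch

torch.cuda.reset_peak_memory_stats()
# ... run a training step or an inference pass here ...
peak_gb = torch.cuda.max_memory_allocated() / 1024**3
print(f"Peak GPU memory allocated: {peak_gb:.1f} GB")
```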