GitHub - AIR-DISCOVER/FreeAD

Introduction

FreeAD is an end-to-end (E2E) robot navigation system designed to operate in unstructured environments such as auxiliary roads, campus paths, and indoor settings. Unlike traditional E2E autonomous driving models that focus on structured roads, FreeAD aims to improve navigation capabilities in these challenging scenarios.

This project introduces the FreeWorld Dataset, a comprehensive dataset combining real-world robot data and synthetic data generated using the Isaac Sim simulator. This dataset is tailored for training and evaluating E2E autonomous driving models in unstructured environments.

To validate the dataset's effectiveness, we fine-tuned the efficient E2E driving model VAD using FreeWorld. Our results demonstrate that this fine-tuning significantly enhances navigation performance in unstructured environments.

This repository provides the first dataset specifically for E2E robot navigation in unstructured scenarios and offers a benchmark for vision-based E2E navigation technology, supporting the development of logistics and service robots.

Paper

Our research paper detailing the FreeAD system, dataset, and experimental results is available on arXiv.

Dataset

We modified some APIs from the nuScenes dataset to enhance flexibility and support a wider variety of data and map scenarios. The modified code has been localized and named FreeWorld. The FreeWorld Dataset is available for access.

Model

The FT-VAD model (FT_VAD_s1e6_s2e3.pth) was trained for 6 epochs in stage 1 and 3 epochs in stage 2, based on the pre-trained VAD-Base model.
The FT-VAD model is available on Hugging Face.

Results

Map Modeling Results on FreeWorld

This table presents a comparison between VAD-Tiny and VAD-Base using the Boundary + Divider map modeling strategy on the Full Warehouse map. This map structure closely aligns with the nuScenes map definition, providing a comprehensive evaluation in an open-loop scenario.

Method	L2 (m) 1s ↓	L2 (m) 2s ↓	L2 (m) 3s ↓	L2 (m) Avg. ↓	AP Divider ↑	AP Boundary ↑	FPS	Collision (%) ↓
VAD-Tiny	1.772	3.291	5.008	3.357	0.004	0.000	7.6	0.00
VAD-Base	3.296	5.779	8.429	5.835	0.001	0.000	4.6	0.00

Note: AP Divider and AP Boundary are computed with a threshold of 1.5.

Open-loop Results on FreeWorld

Open-loop Results on FreeWorld: Comparison of VAD-Tiny, VAD-Base, and FT-VAD under the "Divider Only" map modeling strategy. ADE and FDE represent pedestrian motion prediction errors. ADE and FDE represent pedestrian motion prediction errors.

Open-loop planning results on FreeWorld.

Method	L2 (m) 1s ↓	L2 (m) 2s ↓	L2 (m) 3s ↓	Avg. ↓	AP divider ↑	FPS ↑	ADE ↓	FDE ↓
VAD-Tiny	0.891	1.600	2.449	1.647	0.000	8.7	2.848	3.294
VAD-Base	0.499	0.759	1.040	0.766	0.001	5.0	2.089	2.917
FT-VAD	0.421	0.596	0.760	0.592	0.567	5.0	1.432	2.319

Note: AP divider is computed with a threshold of 1.5.

Open-loop Results on nuScenes

Open-loop Planning Results on nuScenes

Method	L2 (m) 1s ↓	L2 (m) 2s ↓	L2 (m) 3s ↓	Col. (%) 1s ↓	Col. (%) 2s ↓	Col. (%) 3s ↓
VAD-Tiny	0.46	0.76	1.12	0.21	0.35	0.58
VAD-Base	0.41	0.70	1.05	0.07	0.17	0.41
FT-VAD	2.02	3.24	4.38	0.00	0.00019	0.00016

Getting Started

Qualitative Results

FreeAD results

VAD results

Note: We found that the VAD exaggerated the predicted distance of map objects, and the 3D box detection performance was average.

Contact

If you have any questions or suggestions about this repo, please feel free to contact us ([email protected]).

Citation

If you find FreeAD useful in your research or applications, please consider giving us a star 🌟 and citing it by the following BibTeX entry.

@article{peng2025bench2freead,
  title={Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic Environments},
  author={Peng, Yuhang and Wang, Sidong and Yang, Jihaoyu and Li, Shilong and Wang, Han and Gong, Jiangtao},
  journal={arXiv preprint arXiv:2503.12180},
  year={2025}
}

License

All code in this repository is under the Apache License 2.0.

Acknowledgement

FreeAD is based on the following projects:VAD, mmdet3d, detr3d, BEVFormer and MapTR. Many thanks for their excellent contributions to the community.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
FreeWorld		FreeWorld
assets		assets
docs		docs
projects		projects
tools		tools
.gitignore		.gitignore
CoordinateSystem.md		CoordinateSystem.md
LICENSE		LICENSE
README.md		README.md
VAD_qualitative_results.jpg		VAD_qualitative_results.jpg
fine_tune.sh		fine_tune.sh
infer_eval.sh		infer_eval.sh
output_image.jpg		output_image.jpg
prepare_data.sh		prepare_data.sh
qualitative_analysis.jpg		qualitative_analysis.jpg
requirements.txt		requirements.txt
train_stage_1.sh		train_stage_1.sh
train_stage_2.sh		train_stage_2.sh
uncon1x1_0.01.txt		uncon1x1_0.01.txt
vis.sh		vis.sh
zip_project.sh		zip_project.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Introduction

Paper

Dataset

Model

Results

Map Modeling Results on FreeWorld

Open-loop Results on FreeWorld

Open-loop Results on nuScenes

Getting Started

Qualitative Results

Contact

Citation

License

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

AIR-DISCOVER/FreeAD

Folders and files

Latest commit

History

Repository files navigation

Introduction

Paper

Dataset

Model

Results

Map Modeling Results on FreeWorld

Open-loop Results on FreeWorld

Open-loop Results on nuScenes

Getting Started

Qualitative Results

Contact

Citation

License

Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages