FreeAD is an end-to-end (E2E) robot navigation system designed to operate in unstructured environments such as auxiliary roads, campus paths, and indoor settings. Unlike traditional E2E autonomous driving models that focus on structured roads, FreeAD aims to improve navigation capabilities in these challenging scenarios.
This project introduces the FreeWorld Dataset, a comprehensive dataset combining real-world robot data and synthetic data generated using the Isaac Sim simulator. This dataset is tailored for training and evaluating E2E autonomous driving models in unstructured environments.
To validate the dataset's effectiveness, we fine-tuned the efficient E2E driving model VAD using FreeWorld. Our results demonstrate that this fine-tuning significantly enhances navigation performance in unstructured environments.
This repository provides the first dataset specifically for E2E robot navigation in unstructured scenarios and offers a benchmark for vision-based E2E navigation technology, supporting the development of logistics and service robots.
Our research paper detailing the FreeAD system, dataset, and experimental results is available on arXiv.
We modified some APIs from the nuScenes dataset to enhance flexibility and support a wider variety of data and map scenarios. The modified code has been localized and named FreeWorld. The FreeWorld Dataset is available for access.
The FT-VAD model (FT_VAD_s1e6_s2e3.pth
) was trained for 6 epochs in stage 1 and 3 epochs in stage 2, based on the pre-trained VAD-Base model.
The FT-VAD model is available on Hugging Face.
This table presents a comparison between VAD-Tiny and VAD-Base using the Boundary + Divider map modeling strategy on the Full Warehouse map. This map structure closely aligns with the nuScenes map definition, providing a comprehensive evaluation in an open-loop scenario.
Method | L2 (m) 1s ↓ | L2 (m) 2s ↓ | L2 (m) 3s ↓ | L2 (m) Avg. ↓ | AP Divider ↑ | AP Boundary ↑ | FPS | Collision (%) ↓ |
---|---|---|---|---|---|---|---|---|
VAD-Tiny | 1.772 | 3.291 | 5.008 | 3.357 | 0.004 | 0.000 | 7.6 | 0.00 |
VAD-Base | 3.296 | 5.779 | 8.429 | 5.835 | 0.001 | 0.000 | 4.6 | 0.00 |
Note: AP Divider and AP Boundary are computed with a threshold of 1.5.
Open-loop Results on FreeWorld: Comparison of VAD-Tiny, VAD-Base, and FT-VAD under the "Divider Only" map modeling strategy. ADE and FDE represent pedestrian motion prediction errors. ADE and FDE represent pedestrian motion prediction errors.
- Open-loop planning results on FreeWorld.
Method | L2 (m) 1s ↓ | L2 (m) 2s ↓ | L2 (m) 3s ↓ | Avg. ↓ | AP divider ↑ | FPS ↑ | Collision (Avg. %) ↓ | ADE ↓ | FDE ↓ |
---|---|---|---|---|---|---|---|---|---|
VAD-Tiny | 0.891 | 1.600 | 2.449 | 1.647 | 0.000 | 8.7 | 0.00 | 2.848 | 3.294 |
VAD-Base | 0.499 | 0.759 | 1.040 | 0.766 | 0.001 | 5.0 | 0.00 | 2.089 | 2.917 |
FT-VAD | 0.421 | 0.596 | 0.760 | 0.592 | 0.567 | 5.0 | 0.00 | 1.432 | 2.319 |
Note: AP divider is computed with a threshold of 1.5.
- Open-loop Planning Results on nuScenes
Method | L2 (m) 1s ↓ | L2 (m) 2s ↓ | L2 (m) 3s ↓ | Col. (%) 1s ↓ | Col. (%) 2s ↓ | Col. (%) 3s ↓ |
---|---|---|---|---|---|---|
VAD-Tiny | 0.46 | 0.76 | 1.12 | 0.21 | 0.35 | 0.58 |
VAD-Base | 0.41 | 0.70 | 1.05 | 0.07 | 0.17 | 0.41 |
FT-VAD | 2.02 | 3.24 | 4.38 | 0.00 | 0.00019 | 0.00016 |
Note: We found that the VAD exaggerated the predicted distance of map objects, and the 3D box detection performance was average.
If you have any questions or suggestions about this repo, please feel free to contact us ([email protected]).
If you find FreeAD useful in your research or applications, please consider giving us a star 🌟 and citing it by the following BibTeX entry.
@article{peng2025bench2freead,
title={Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic Environments},
author={Peng, Yuhang and Wang, Sidong and Yang, Jihaoyu and Li, Shilong and Wang, Han and Gong, Jiangtao},
journal={arXiv preprint arXiv:2503.12180},
year={2025}
}
All code in this repository is under the Apache License 2.0.
FreeAD is based on the following projects:VAD, mmdet3d, detr3d, BEVFormer and MapTR. Many thanks for their excellent contributions to the community.