This repository contains all computer vision publications related to the NEXUS (Netherlands Ecosystem for Cross-specialty Unified Surgical intelligence) consortium.
This repository aims to gather publications, code and datasets produced by our group related to surgical foundation models, anatomy recognition/segmentation, and surgical phase recognition.
We have opensourced multiple in-domain-pretrained DINOv1-3 models across various architectures including Vision Transformers (ViT), ConvNeXts, and more. These models are pretrained on SurgeNet, the largest reported surgical dataset to date, comprising over 4.7 million video frames from more than 23 different procedures. Available in different sizes, ranging from lightweight to large-scale architectures, they can be deployed for diverse surgical computer vision applications with varying computational requirements—from real-time intraoperative systems to high-accuracy offline analysis.
Explore our complete model collection and implementation at SurgeNet.
ATLAS-Interactive is an open-source framework for clip-level anatomical segmentation in minimally invasive surgery. It combines keyframe annotation with temporal mask propagation, enabling fast, temporally consistent labeling. The tool was trained on over 100 videos, 500 clips, and 120k surgical video frames including 14 surgical procedures and 42 distinct classes.
Explore the tool and implementation at ATLAS-Interactive.
| Journal / Conference | Title | Paper | Code | Citation |
|---|---|---|---|---|
| MIDL 2026 | Towards Effective Surgical Representation Learning with DINO models | Paper | Code | BibTeX |
| MedIA 2025 | Scaling up self-supervised learning for improved surgical foundation models | Paper | Code | BibTeX |
| MICCAI-DEMI 2024 | Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision | Paper | Code | BibTeX |
| Journal / Conference | Title | Paper | Code | Citation |
|---|---|---|---|---|
| Surg Endosc 2025 | Do segmentation metrics reflect clinical reality? A surgeon-centered evaluation in robot-assisted minimally invasive esophagectomy | Paper | BibTeX | |
| SPIE-Medical imaging 2024 | Benchmarking pretrained attention-based models for real-time recognition in RAMIE | Paper | BibTeX | |
| Journal of Endourology 2024 | Estimating surgical urethral length on intraoperative robot-assisted prostatectomy images using artificial intelligence anatomy recognition | Paper | BibTeX | |
| Surg Endosc 2023 | Deep learning-based recognition of key anatomical structures during robot-assisted minimally invasive esophagectomy | Paper | BibTeX | |
| Surg Endosc 2022 | Computer-aided anatomy recognition in intrathoracic and-abdominal surgery: a systematic review | Paper | BibTeX |
| Journal / Conference | Title | Paper | Code | Citation |
|---|---|---|---|---|
| MICCAI 2025 | SemiVT-Surge: Semi-Supervised Video Transformer for Surgical Phase Recognition | Paper | Code | BibTeX |
| SPIE-Medical imaging 2024 | Benchmarking and Enhancing Surgical Phase Recognition Models for Robot-Assisted Esophagectomy | Paper | Code (Causal-Transformer) | BibTeX |
Our work includes several subprojects supported by different funding sources:
- INTRA-SURGE: INTelligent computeR-Aided Surgical gUidance for Robot-assisted surGEry
- IMAGINERS: IMAGINg-based augmentation for RobotIc-assisted lung Surgery: the IMAGINERS project for automated and intelligent surgical guidance
E-mail:

