Intellindust-AI-Lab/FT-FSOD
A Closer Look at Cross-Domain Few-Shot Object Detection:
Fine-Tuning Matters and Parallel Decoder Helps


Xuanlong Yu1, Youyang Sha1, Longfei Liu1, Xi Shen† 1, Di Yang† 2

1Intellindust AI Lab    2Suzhou Institute for Advanced Research, USTC


🔍 TL;DR

  • We introduce a Hybrid Ensemble Decoder (HED) to improve query diversity and model robustness.
  • We propose a simple fine-tuning recipe for cross-domain few-shot object detection, with plateau-aware scheduling and progressive optimization.
  • This repository includes:
    • Training/evaluation code and configs for CD-FSOD, ODinW-13, and RF100-VL benchmarks.
    • A challenge subproject: NTIRE 2026 CDFSOD Challenge, including pseudo-label annotation strategy and challenge-oriented pipeline details.

HED figure


📦 Installation

# Create conda environment
conda create -n ft-fsod python=3.10 -y
conda activate ft-fsod

# Install PyTorch first (pick the command matching your CUDA)
# Example for CUDA 12.1
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121

# Install OpenMMLab dependencies
pip install -U openmim
mim install "mmengine"
mim install "mmcv==2.1.0"
mim install mmdet

# Common dependencies
pip install -r requirements.txt
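After installation, a quick sanity check can confirm that the core packages are importable (a minimal sketch; it only checks importability, not version compatibility):

```python
# Environment sanity check: report which core packages are importable.
import importlib.util

for pkg in ("torch", "torchvision", "mmengine", "mmcv", "mmdet"):
    status = "OK" if importlib.util.find_spec(pkg) else "MISSING"
    print(f"{pkg}: {status}")
```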

Deterministic hack (MMEngine): to reduce few-shot training instability, we set deterministic=True in our configs, which requires patching MMEngine as follows:
1) In mmengine/runner/runner.py (set_randomness): add a warn_only: bool = True argument and pass warn_only=warn_only to set_random_seed.
2) In mmengine/runner/utils.py (set_random_seed): add a warn_only: bool = True argument and change the call to torch.use_deterministic_algorithms(True, warn_only=warn_only).
Refs: runner.py#L698, runner.py#L719, utils.py#L48, utils.py#L90.
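Concretely, the change to mmengine/runner/utils.py looks roughly like the following (an illustrative diff only; the real function body and argument order may differ across MMEngine versions):

```diff
-def set_random_seed(seed=None, deterministic=False, diff_rank_seed=False):
+def set_random_seed(seed=None, deterministic=False, diff_rank_seed=False,
+                    warn_only=True):
     ...
     if deterministic:
-        torch.use_deterministic_algorithms(True)
+        torch.use_deterministic_algorithms(True, warn_only=warn_only)
```

With warn_only=True, PyTorch warns instead of raising when an op has no deterministic implementation, so training proceeds while staying deterministic wherever possible.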


📍 Fine-tuning on Benchmarks

1) Download bert-base-uncased and the pre-trained weights

  • Download bert-base-uncased and nltk_data following these instructions

  • Download the pre-trained weights from: MMGDINO-B and MMGDINO-L

  • Adjust the dataset paths and the pre-trained weight paths in src_path.py
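src_path.py centralizes these paths. The exact variable names are repo-specific, but a hypothetical sketch of the kind of entries to adjust looks like:

```python
# Hypothetical sketch of src_path.py entries; the actual variable names
# in the repository may differ -- adjust the real file, not this sketch.
DATA_ROOT = "/data/cdfsod_benchmarks"       # root of the downloaded datasets
BERT_PATH = "/models/bert-base-uncased"     # local bert-base-uncased checkpoint
MMGDINO_B_WEIGHT = "/models/mmgdino_b.pth"  # pre-trained MMGDINO-B weights
MMGDINO_L_WEIGHT = "/models/mmgdino_l.pth"  # pre-trained MMGDINO-L weights
```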

2) Run the following scripts to fine-tune the models on various benchmarks

  • CD-FSOD (1/5/10-shot train+eval): bash run_mmgdinob_traineval_cdfsod.sh
  • ODinW-13 (1/3/5/10-shot, multi-seed): bash run_mmgdinol_traineval_odwin.sh
  • RF100-VL (10-shot): bash run_mmgdinol_traineval_rf100vl.sh
  • CD-Mixed OOD evaluation (1/5/10-shot): bash run_mmgdinob_eval_cdmixed.sh
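The scripts can also be queued sequentially with per-script logs, e.g. for an overnight run (a minimal sketch; the `echo` below is a dry-run stand-in — replace it with `bash "$script" 2>&1 | tee "logs/${script%.sh}.log"` for a real run):

```shell
# Sketch: queue benchmark scripts with one log file per script (dry run).
mkdir -p logs
for script in run_mmgdinob_traineval_cdfsod.sh run_mmgdinol_traineval_odwin.sh; do
  echo "queued: $script -> logs/${script%.sh}.log"
done
```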

Due to the instability of few-shot fine-tuning (even with a fixed random seed), results may differ slightly from those reported in the paper.

🏁 NTIRE 2026 CDFSOD Challenge

Challenge materials and instructions live in this repository under NTIRE 2026 CDFSOD Challenge.
It covers our challenge-oriented pipeline, including a pseudo-label annotation strategy that uses FSOD-mAP as the evaluation metric for annotation selection, model fine-tuning, and test-time augmentations (TTAs).


📊 Result Aggregation

CD-FSOD:

python analyze_results_cdfsod.py <experiment_directory_path>
# e.g.
python analyze_results_cdfsod.py exp_cdfosd_results

ODinW-13:

python analyze_results_odinw.py <experiment_directory_path>
# e.g.
python analyze_results_odinw.py exp_odinwfsod_results

RF100-VL:

python analyze_results_rf100.py <experiment_directory_path>
# e.g.
python analyze_results_rf100.py exp_rf100vlfsod_results
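The analyze scripts above parse the per-run results; if you need a custom aggregation, e.g. mean and standard deviation of mAP across seeds, a minimal standalone sketch is straightforward (the mAP values below are placeholders, not paper numbers):

```python
# Aggregate per-seed mAP values into mean/std per (dataset, shot) setting.
from statistics import mean, stdev

# Placeholder results: {(dataset, shot): [mAP per seed]} -- not real numbers.
results = {
    ("dataset_a", 1): [10.2, 11.0, 10.5],
    ("dataset_a", 5): [18.4, 19.1, 18.8],
}

for (dataset, shot), maps in sorted(results.items()):
    m, s = mean(maps), stdev(maps)
    print(f"{dataset} {shot}-shot: mAP {m:.2f} +/- {s:.2f} over {len(maps)} seeds")
```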

🙏 Acknowledgements

This project is built on top of the OpenMMLab ecosystem, the RF100-VL benchmark, the CD-FSOD benchmark, and the ODinW-13 benchmark.


📚 Citation

If you find this project useful in your research, please consider citing:

@inproceedings{yu2026acloser,
  title={A Closer Look at Cross-Domain Few-Shot Object Detection: Fine-Tuning Matters and Parallel Decoder Helps},
  author={Yu, Xuanlong and Sha, Youyang and Liu, Longfei and Shen, Xi and Yang, Di},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2026}
}

📄 License

This project is released under the Apache 2.0 License.
