Skip to content
@VisionXLab

VisionXLab

VisionXLab at Shanghai Jiao Tong University, led by Prof. Xue Yang.

Pinned Loading

  1. h2rbox-mmrotate h2rbox-mmrotate Public

    [ICLR'23] PyTorch Implementation for H2RBox

    Python 106 11

  2. mllm-mmrotate mllm-mmrotate Public

    [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

    Jupyter Notebook 90 6

  3. point2rbox-v2 point2rbox-v2 Public

    [CVPR'25] Official repo of "Point2RBox-v2:Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances"

    Python 40 3

  4. whollywood whollywood Public

    [TPAMI] Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection

    Jupyter Notebook 11

  5. LRS-VQA LRS-VQA Public

    [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

    Python 46 1

  6. CrossEarth CrossEarth Public

    [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation

    Python 175 9

Repositories

Showing 10 of 29 repositories
  • SpaCE-10 Public

    [ICLR 2026] SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence

    VisionXLab/SpaCE-10’s past year of commit activity
    Python 16 2 1 0 Updated Jan 26, 2026
  • Awesome-RS-VL-Data Public

    Awesome Remote Sensing Vision-Language Datasets

    VisionXLab/Awesome-RS-VL-Data’s past year of commit activity
    34 MIT 1 126 0 Updated Jan 24, 2026
  • mllm-mmrotate Public

    [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

    VisionXLab/mllm-mmrotate’s past year of commit activity
    Jupyter Notebook 90 6 0 1 Updated Jan 21, 2026
  • RSCoVLM Public

    [Remote Sensing 2026] Co-Training Vision Language Models for Remote Sensing Multi-task Learning

    VisionXLab/RSCoVLM’s past year of commit activity
    Python 18 1 0 0 Updated Jan 21, 2026
  • DVGBench Public

    [ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models

    VisionXLab/DVGBench’s past year of commit activity
    8 0 1 0 Updated Jan 14, 2026
  • VisionXLab/VisionXLab_LaTeX_Template’s past year of commit activity
    TeX 7 0 0 0 Updated Jan 13, 2026
  • AirSpatialBot Public

    [TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval

    VisionXLab/AirSpatialBot’s past year of commit activity
    Python 29 1 1 0 Updated Jan 6, 2026
  • avi-math Public

    [ISPRS'25] Multimodal Mathematical Reasoning Embedded in Aerial Vehicle Imagery: Benchmarking, Analysis, and Exploration

    VisionXLab/avi-math’s past year of commit activity
    Python 13 1 0 0 Updated Jan 4, 2026
  • CastDet Public

    [ECCV'24/IJCV'26] Code repo for "Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning"

    VisionXLab/CastDet’s past year of commit activity
    Python 69 Apache-2.0 5 6 0 Updated Jan 1, 2026
  • CrossEarth Public

    [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation

    VisionXLab/CrossEarth’s past year of commit activity
    Python 175 MIT 9 7 0 Updated Dec 21, 2025