- 👋 Hi, I’m @CSfufu
- I am currently focus on VLM Agentic reasoning and Reinforcement Learning.
Highlights
- Pro
Pinned Loading
-
Revisual-R1
Revisual-R1 Public[ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement l…
-
hiyouga/EasyR1
hiyouga/EasyR1 PublicEasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
-
verl-project/verl
verl-project/verl Publicverl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
-
shawn0728/Unify-Agent
shawn0728/Unify-Agent Public🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.
-
shawn0728/OpenSearch-VL
shawn0728/OpenSearch-VL Public🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diverse visual/search tools, and fatal-aware agentic reinforcement…
-
Osilly/Vision-DeepResearch
Osilly/Vision-DeepResearch PublicMultimodal deep-research MLLM and benchmark. The first long-horizon multimodal deep-research MLLM, extending the number of reasoning turns to dozens and the number of search-engine interactions to …
If the problem persists, check the GitHub status page or contact support.

