You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that Sora-2 surpasses GPT5 by 10% on eyeballing puzzles and reaches 69% accuracy on MMMU.
🌴 ARES is an open-source framework for adaptive multimodal reasoning, featuring a two-stage pipeline—Adaptive Cold-Start and Entropy-Shaped Policy Optimization—to balance reasoning depth and efficiency.
Welcome to the 🤖 Generative AI 🤖 Papers Repository! This repository is dedicated to compiling and sharing research papers that are trending ✨/ impactful 💥in the domain of Generative AI. This compilation in part motivated by the course CSE 598: Topics in Generative AI by Dr. Yezhou Yang, Arizona State University.