VRBench: a benchmark for multi-step reasoning in long narrative videos Aug 12, 2025· Jiashuo Yu , Yue Wu , Meng Chu , Zhifei Ren , Zizheng Huang , Pei Chu , Ruijie Zhang , Yinan He , Qirui Li , Songze Li , Zhenxiang Li , Zhongying Tu , Conghui He , Yu Qiao , Yali Wang , Yi Wang Limin Wang · 0 min read Cite URL Type Conference paper Publication Proceedings of the IEEE/CVF International Conference on Computer Vision Last updated on Aug 12, 2025 Authors Limin Wang Nanjing University ← Scalable image tokenization with index backpropagation quantization Aug 12, 2025 Differentiable solver search for fast diffusion sampling Jul 18, 2025 →