MobileViCLIP: an efficient video-text model for mobile devices
Aug 12, 2025 · Min Yang, Zihan Jia, Zhilin Dai, Sheng Guo, Limin Wang
Type: Conference paper
Publication: Proceedings of the IEEE/CVF International Conference on Computer Vision
Last updated on Aug 12, 2025
Authors: Limin Wang (Nanjing University)