Multi-Granular Spatio-Temporal Token Merging for

Training-Free Acceleration of Video LLMs

Jeongseok Hyun1    Sukjun Hwang2    Su Ho Han1    Taeoh Kim3    Inwoong Lee3

Dongyoon Wee3    Joon-Young Lee4    Seon Joo Kim1    Minho Shim3

1Yonsei University    2Carnegie Mellon University    3NAVER Cloud    4Adobe Research

ICCV 2025

Paper | Code