Cancel Create saved search Sign in Sign up {{ message }} huangb23 / VTimeLLM Public Notifications You must be signed in to change notification settings Fork 6 Star 146 Code Issues 4 Pull requests Actions Projects Security Insights ...
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments". - VTimeLLM/vtimellm/inference.py at main · huangb23/VTimeLLM
Dec-4: VTimeLLM: demo released. VTimeLLM Overview 💡 VTimeLLM is a novel Video LLM designed for fine-grained video moment understanding and reasoning with respect to time boundary. VTimeLLM adopts a boundary-aware three-stage training strategy, which respectively utilizes image-text pairs for...
VTimeLLM [Paper] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments". 📢 Latest Updates Jan-2: Thanks toXiao Xia,Shengbo TongandBeining Wang, we have refactored the code to now support both the LLAMA and ChatGLM3 architectures. We translated the tr...
VTimeLLM [Paper] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments". 📢 Latest Updates Jan-2: Thanks to Xiao Xia , Shengbo Tong and Beining Wang, we have refactored the code to now support both the LLAMA and ChatGLM3 architectures. We translated...