Kevin Qinghong Lin

Ph.D. Student

Show Lab
National University of Singapore

Email: kevin.qh.lin [at] gmail.com


Biography

I am a Ph.D. student in Show Lab @ NUS, working with Prof. Mike Shou.

I work on building multi-modal assistants from and for humans. This involves abilities like:

I am open to discussion and collaboration. Feel free to reach out.

News

Selected Publications [Google Scholar]

† indicates equal contribution. Denotes student I mentored.
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
Wei Pang†, Kevin QH. Lin†, Xiangru Jian†, Xi He, Philip Torr

Preprint, 2025
[paper] [code] [project] [datasets] [twitter]
2K github stars.

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
Jiaqi Wang†, Kevin QH. Lin†, James Cheng, Mike Z. Shou.

Preprint, 2025
[paper] [code] [huggingface]

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning
Ye Liu†, Kevin QH. Lin†, Chang Wen Chen, Mike Z. Shou.

Preprint, 2025
[paper] [code] [dataset] [project] [demo]

ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Kevin QH. Lin, Linjie Li, Difei Gao, Zhengyuan Yang, Shiwei Wu, Zechen Bai, Stan WX. Lei, Lijuan Wang, Mike Z. Shou.

CVPR, 2025
NeurIPS OWA workshop, 2024. Oral
[paper] [code] [huggingface] [dataset] [demo]
Outstanding Paper Award, NeurIPS Open-World Agents Workshop 2024.

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary
Kevin QH. Lin, Mike Z. Shou.

CVPR, 2025
[paper] [code]

VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Kevin QH. Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Z. Shou.

NeurIPS, 2024. Spotlight
[paper] [code] [project]

Learning Video Context as Interleaved Multimodal Sequences
Kevin QH. Lin, Pengchuan Zhang, Difei Gao, Xide Xia, Joya Chen, Ziteng Gao, Jinheng Xie, Xuhong Xiao, Mike Z. Shou.

ECCV, 2024
[paper] [code]

UniVTG: Towards Unified Video-Language Temporal Grounding
Kevin QH. Lin, Pengchuan Zhang, Joya Chen, Shraman Pramanick, Difei Gao, Alex JP. Wang, Rui Yan, Mike Z. Shou.

ICCV, 2023
[paper] [code] [demo]

Egocentric Video-Language Pretraining
Kevin QH. Lin, Alex JP. Wang, M. Soldan, M. Wray, R. Yan, Eric ZC. Xu, D. Gao, R. Tu, W. Zhao, W. Kong, C. Cai, H. Wang, D. Damen, B. Ghanem, W. Liu, Mike Z. Shou.

NeurIPS, 2022. Spotlight (1.7%)
[paper] [code] [project] [poster] [media]
EgoVis Distinguished Paper Award & PREMIA Best Student Paper Award 2023.
Double champions in Ego4D & Epic-Kitchens CVPR 2022 challenges.

Honors

Service


Flag Counter

© Kevin