Kevin Qinghong LinPh.D. Student
Show Lab |
![]() |
I work on Vision + Language and Video Understanding, including:
I am open to discussion and collaboration. Feel free to reach out.
![]() |
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Kevin QH. Lin, Linjie Li, Difei Gao, Zhengyuan Yang, Shiwei Wu, Zechen Bai, Stan WX. Lei, Lijuan Wang, Mike Z. Shou.
CVPR, 2025 |
![]() |
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary Kevin QH. Lin, Mike Z. Shou. |
![]() |
VideoGUI: A Benchmark for GUI Automation from Instructional Videos Kevin QH. Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Z. Shou. |
![]() |
Learning Video Context as Interleaved Multimodal Sequences Kevin QH. Lin, Pengchuan Zhang, Difei Gao, Xide Xia, Joya Chen, Ziteng Gao, Jinheng Xie, Xuhong Xiao, Mike Z. Shou. |
![]() |
UniVTG: Towards Unified Video-Language Temporal Grounding Kevin QH. Lin, Pengchuan Zhang, Joya Chen, Shraman Pramanick, Difei Gao, Alex JP. Wang, Rui Yan, Mike Z. Shou. |
![]() |
Egocentric Video-Language Pretraining
Kevin QH. Lin, Alex JP. Wang, M. Soldan, M. Wray, R. Yan, Eric ZC. Xu, D. Gao, R. Tu, W. Zhao, W. Kong, C. Cai, H. Wang, D. Damen, B. Ghanem, W. Liu, Mike Z. Shou.
NeurIPS, 2022. Spotlight (1.7%) |
Workshop Organizer: LOVEU @ CVPR 25.
Conference Reviewer: CVPR (2024 Outstanding Reviewers), ICCV, ECCV, NeurIPS (2024 Top Reviewers), ICML, ICLR, etc.
Journal Reviewer: TPAMI, IJCV, TMLR, TNNLS, TMM, etc.
Co-organizer of The AI Talks.