Kevin Qinghong Lin
Ph.D. Student
Show Lab
I work on Vision + Language and Video Understanding.
I am open to discussion and collaboration. Feel free to reach out.
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Kevin QH. Lin, Linjie Li, Difei Gao, Zhengyuan Yang, Shiwei Wu, Zechen Bai, Stan WX. Lei, Lijuan Wang, Mike Z. Shou.
NeurIPS OWA workshop, 2024. Oral
VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Kevin QH. Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Z. Shou.
Learning Video Context as Interleaved Multimodal Sequences
Kevin QH. Lin, Pengchuan Zhang, Difei Gao, Xide Xia, Joya Chen, Ziteng Gao, Jinheng Xie, Xuhong Xiao, Mike Z. Shou.
UniVTG: Towards Unified Video-Language Temporal Grounding
Kevin QH. Lin, Pengchuan Zhang, Joya Chen, Shraman Pramanick, Difei Gao, Alex JP. Wang, Rui Yan, Mike Z. Shou.
Egocentric Video-Language Pretraining
Kevin QH. Lin, Alex JP. Wang, M. Soldan, M. Wray, R. Yan, Eric ZC. Xu, D. Gao, R. Tu, W. Zhao, W. Kong, C. Cai, H. Wang, D. Damen, B. Ghanem, W. Liu, Mike Z. Shou.
NeurIPS, 2022. Spotlight (1.7%)
Conference Reviewer: CVPR (2024 Outstanding Reviewers), ICCV, ECCV, NeurIPS (2024 Top Reviewers), ICML, ICLR, etc.
Journal Reviewer: TPAMI, IJCV, TMLR, TNNLS, TMM, etc.
Teaching Assistant: EE6934 Deep Learning, EE6733 Advanced Topics on Vision and Machine Learning, EE4212 Computer Vision
Co-organizer of The AI Talks.