|
Junfeng Wu
I am currently a Research Scientist at ByteDance, US. My primary research interests lie in the field of Computer Vision, with a focus on visual generation, multimodal large language models. Specifically, I am exploring the exciting domains of multi-modal large models (MLLMs) and unified vision understanding and generation tasks.
Previously, I received my Ph.D. degree from the VLR Group at Huazhong University of Science and Technology under the supervision of Prof. Xiang Bai. During my Ph.D., I also worked as a research intern at the ByteDance AI Lab from 2021 to 2025.
Email  / 
Google Scholar  / 
Github
|
|
|
|
UniTok: A Unified Tokenizer for Visual Generation and Understanding
Chuofan Ma, Yi Jiang, Junfeng Wu, Jihan Yang, Xin Yu, Zehuan Yuan, Bingyue Peng, Xiaojuan Qi
Arxiv, 2025  
arXiv
/
code
|
|
|
Liquid: Language Models are Scalable and Unified Multi-modal Generators
Junfeng Wu,
Yi Jiang, Chuofan Ma, Yuliang Liu, Hengshuang Zhao, Zehuan Yuan, Song Bai, Xiang Bai
Arxiv, 2024  
arXiv
/
code
|
|
|
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li,
Junfeng Wu,
Weizhi Zhao,
Song Bai,
Xiang Bai
ECCV, 2024  
arXiv
/
code
|
|
|
General Object Foundation Model for Images and Videos at Scale
Junfeng Wu,
Yi Jiang,
QiHao Liu,
Zehuan Yuan,
Xiang Bai,
Song Bai
CVPR, 2024   (Highlight)
arXiv
/
code
/
video
|
|
|
InstMove: Instance Motion for Object-centric Video Segmentation
QiHao Liu,
Junfeng Wu,
Yi Jiang,
Xiang Bai,
Alan Yuille,
Song Bai
CVPR, 2023  
arXiv
/
code
|
|
|
In Defense of Online Models for Video Instance Segmentation
Junfeng Wu,
QiHao Liu,
Yi Jiang,
Song Bai,
Alan Yuille,
Xiang Bai
ECCV, 2022   (Oral Presentation)
arXiv
/
code
/
video
|
|
|
SeqFormer: Sequential Transformer for Video Instance Segmentation
Junfeng Wu,
Yi Jiang,
Song Bai,
Wenqing Zhang,
Xiang Bai
ECCV, 2022   (Oral Presentation)
arXiv /
code /
|
|
|
1st Place Solution for YouTubeVOS Challenge 2022: Video Instance Segmentation
Junfeng Wu,
Xiang Bai,
Yi Jiang,
Qihao Liu,
Zehuan Yuan,
Song Bai
CVPR, 2022 workshop
code
|
|
Academic Services
I actively serve as a reviewer for several leading conferences and journals in the field of computer vision and machine learning.
Conference Reviewer:
CVPR 2023, ICCV 2023, CVPR 2024, ECCV 2024, NeurIPS 2024, AAAI 2024, CVPR 2025, ICML 2025, ICCV 2025
Journal Reviewer:
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI),
Pattern Recognition (PR),
SCIENCE CHINA Information Sciences (SCIS)
|
|