Junfeng Wu

I am currently a Research Scientist at ByteDance, US. My primary research interests lie in the field of Computer Vision, with a focus on visual generation, multimodal large language models. Specifically, I am exploring the exciting domains of multi-modal large models (MLLMs) and unified vision understanding and generation tasks.

Previously, I received my Ph.D. degree from the VLR Group at Huazhong University of Science and Technology under the supervision of Prof. Xiang Bai. During my Ph.D., I also worked as a research intern at the ByteDance AI Lab from 2021 to 2025.

Email / Google Scholar / Github

	UniTok: A Unified Tokenizer for Visual Generation and Understanding Chuofan Ma, Yi Jiang, Junfeng Wu, Jihan Yang, Xin Yu, Zehuan Yuan, Bingyue Peng, Xiaojuan Qi Arxiv, 2025 arXiv / code
	Liquid: Language Models are Scalable and Unified Multi-modal Generators Junfeng Wu, Yi Jiang, Chuofan Ma, Yuliang Liu, Hengshuang Zhao, Zehuan Yuan, Song Bai, Xiang Bai Arxiv, 2024 arXiv / code
	PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects Junyi Li, Junfeng Wu, Weizhi Zhao, Song Bai, Xiang Bai ECCV, 2024 arXiv / code
	General Object Foundation Model for Images and Videos at Scale Junfeng Wu, Yi Jiang, QiHao Liu, Zehuan Yuan, Xiang Bai, Song Bai CVPR, 2024 (Highlight) arXiv / code / video
	InstMove: Instance Motion for Object-centric Video Segmentation QiHao Liu, Junfeng Wu, Yi Jiang, Xiang Bai, Alan Yuille, Song Bai CVPR, 2023 arXiv / code
	In Defense of Online Models for Video Instance Segmentation Junfeng Wu, QiHao Liu, Yi Jiang, Song Bai, Alan Yuille, Xiang Bai ECCV, 2022 (Oral Presentation) arXiv / code / video
	SeqFormer: Sequential Transformer for Video Instance Segmentation Junfeng Wu, Yi Jiang, Song Bai, Wenqing Zhang, Xiang Bai ECCV, 2022 (Oral Presentation) arXiv / code /
	1st Place Solution for YouTubeVOS Challenge 2022: Video Instance Segmentation Junfeng Wu, Xiang Bai, Yi Jiang, Qihao Liu, Zehuan Yuan, Song Bai CVPR, 2022 workshop code

Academic Services

I actively serve as a reviewer for several leading conferences and journals in the field of computer vision and machine learning.

Conference Reviewer:
CVPR 2023, ICCV 2023, CVPR 2024, ECCV 2024, NeurIPS 2024, AAAI 2024, CVPR 2025, ICML 2025, ICCV 2025

Journal Reviewer:
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI),
Pattern Recognition (PR),
SCIENCE CHINA Information Sciences (SCIS)

Design and source code from Jon Barron's website.