Junke Wang 「王君可」

I'm a final year Ph.D. student at Fudan University, supervised by Prof. Zuxuan Wu and Prof. Yu-Gang Jiang. Before this, I received my bachelor's degree from Fudan University in 2021.

My research interest lies in computer vision, with the emphasis on multimodal general intelligence, e.g., semantic tokenizers, unified MLLMs, and world models.

Feel free to reach out if you are interested in working with me.

Email: wangjk21[at]m.fudan.edu.cn

Google Scholar   /   Github

profile photo

(* denotes equal contribution)
Publication
OmniGen-AR: AutoRegressive Any-to-Image Generation.
Junke Wang, Xun Wang, Qiushan Guo, Peize Sun, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang.
NeurIPS, 2025.
Perception Encoder: The best visual embeddings are not at the output of the network. [Code]
FAIR Perception, Meta.
NeurIPS, 2025 (Oral).
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis.
Peng Zheng, Junke Wang, Yi Chang, Yizhou Yu, Rui Ma, Zuxuan Wu.
ICCV, 2025.
Fighting Malicious Media Data: A Survey on Tampering Detection and Deepfake Detection.
Junke Wang, Zhenxin Li, Chao Zhang, Jingjing Chen, Zuxuan Wu, Larry S. Davis, Yu-Gang Jiang.
Proceedings of IEEE, 2025.
OmniTracker: Unifying Object Tracking by Tracking-with-Detection.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang.
TPAMI, 2025.
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation. [Code]
Junke Wang, Yi Jiang, Zehuan Yuan, Binyue Peng, Zuxuan Wu, Yu-Gang Jiang.
NeurIPS, 2024.
OmniVid: A Generative Framework for Universal Video Understanding. [Code]
Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang.
CVPR, 2024.
Look Before You Match: Instance Understanding Matters in Video Object Segmentation.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao,
Yujia Xie, Lu Yuan, Yu-Gang Jiang.
CVPR, 2023.
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao,
Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan.
NeurIPS, 2022.
Efficient Video Transformers with Spatial-Temporal Token Selection. [Code]
Junke Wang*, Xitong Yang*, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang.
ECCV, 2022.
ObjectFormer for Image Manipulation Detection and Localization. [Code]
Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Yu-Gang Jiang, Ser-Nam Li.
CVPR, 2022.
M2TR: Multi-modal Multi-scale Transformer for Deepfake Detection. [Code]
Junke Wang, Zuxuan Wu, Wenhao Ouyang, Xintong Han, Jingjing Chen, Ser-Nam Lim, Yu-Gang Jiang
ICMR, 2022.

Preprints
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL. [Code]
Junke Wang, Zhi Tian, Xun Wang, Xinyu Zhang, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang
Arxiv, 2025.
Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning. [Code]
Zuyao You*, Junke Wang*, Lingyu Kong, Bo He, Zuxuan Wu
Arxiv, 2025.

Projects
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning. [Dataset] [Project page]
Junke Wang*, Lingchen Meng*, Zejia Weng, Bo He, Zuxuan Wu, Yu-Gang Jiang.

  • We introduce a fine-grained visual instruction dataset, LVIS-INSTRUCT4V, which contains 220K visually aligned and context-aware instructions produced by prompting the powerful GPT-4V with images from LVIS.
  • ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System. [Code]
    Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang.

  • We present our vision for multimodal and versatile video understanding and propose a prototype system, ChatVideo.

  • Academic Services

    Conference Reviewer for CVPR, ICCV, ICML, NeurIPS, ICLR, ECCV, etal.

    Journal Reviewer for TPAMI, TIP, IJCV, etal.


    Selected Awards

    Bytedance Scholarship (20 people in China and Singapore). 2025.

    CCF-CV Academic Rising Star Award (3 people/year). 2025.

    Academic Star in Fudan University (10 PhD students). 2025.

    Fundamental Research Program for PhD students, sponsored by NSFC. 2024.

    Young Elite Scientists Sponsorship Program for PhD students, sponsored by CAAI. 2024.

    National Scholarship (Top 1%). 2022, 2025.

    Outstanding graduates in Shanghai (undergrads). 2021.



    Updated at Nov. 2025.