Junke Wang 「王君可」

I'm a final-year Ph.D. student at Fudan University, supervised by Prof. Zuxuan Wu and Prof. Yu-Gang Jiang. I have interned at ByteDance Seed, Meta FAIR, and Microsoft AI. I'm the recipient of Bytedance Fellowship.

My research interest lies in multimodal general intelligence. Recently, I work on visual tokenizers, action tokenizers, and world models. Feel free to reach out if you are interested in working with me.

Email: wangjk21[at]m[dot]fudan[dot]edu[dot]cn

Google Scholar   /   Github

Publication
FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding.
Yiweng Xie, Bo He, Junke Wang, Xiangyu Zheng, Ziyi Ye, Zuxuan Wu.
CVPR, 2026.
TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction.
Yukuo Ma, Cong Liu, Junke Wang, Junqi Liu, Haibin Huang, Zuxuan Wu, Chi Zhang, Xuelong Li.
CVPR, 2026.
OmniGen-AR: AutoRegressive Any-to-Image Generation.
Junke Wang, Xun Wang, Qiushan Guo, Peize Sun, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang.
NeurIPS, 2025.
Perception Encoder: The best visual embeddings are not at the output of the network. [Code]
FAIR Perception, Meta.
NeurIPS, 2025 (Oral).
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis.
Peng Zheng, Junke Wang, Yi Chang, Yizhou Yu, Rui Ma, Zuxuan Wu.
ICCV, 2025.
Fighting Malicious Media Data: A Survey on Tampering Detection and Deepfake Detection.
Junke Wang, Zhenxin Li, Chao Zhang, Jingjing Chen, Zuxuan Wu, Larry S. Davis, Yu-Gang Jiang.
Proceedings of IEEE, 2025.
OmniTracker: Unifying Object Tracking by Tracking-with-Detection.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang.
TPAMI, 2025.
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation. [Code]
Junke Wang, Yi Jiang, Zehuan Yuan, Binyue Peng, Zuxuan Wu, Yu-Gang Jiang.
NeurIPS, 2024.
OmniVid: A Generative Framework for Universal Video Understanding. [Code]
Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang.
CVPR, 2024.
Look Before You Match: Instance Understanding Matters in Video Object Segmentation.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao,
Yujia Xie, Lu Yuan, Yu-Gang Jiang.
CVPR, 2023.
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao,
Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan.
NeurIPS, 2022.
Efficient Video Transformers with Spatial-Temporal Token Selection. [Code]
Junke Wang*, Xitong Yang*, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang.
ECCV, 2022.
ObjectFormer for Image Manipulation Detection and Localization. [Code]
Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Yu-Gang Jiang, Ser-Nam Li.
CVPR, 2022.
M2TR: Multi-modal Multi-scale Transformer for Deepfake Detection. [Code]
Junke Wang, Zuxuan Wu, Wenhao Ouyang, Xintong Han, Jingjing Chen, Ser-Nam Lim, Yu-Gang Jiang
ICMR, 2022.


Projects
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL. [Code]
Junke Wang, Zhi Tian, Xun Wang, Xinyu Zhang, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning. [Dataset] [Project page]
Junke Wang*, Lingchen Meng*, Zejia Weng, Bo He, Zuxuan Wu, Yu-Gang Jiang.
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System. [Code]
Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang.

Academic Services

Conference Reviewer for CVPR, ICCV, ICML, NeurIPS, ICLR, ECCV, etal.

Journal Reviewer for TPAMI, TIP, IJCV, etal.


Selected Awards

Bytedance PhD Fellowship (20 people in China and Singapore). 2025.

CCF-CV Academic Rising Star Award (3 people in China). 2025.

Fundamental Research Program for PhD students, sponsored by NSFC. 2024.

Young Elite Scientists Sponsorship Program for PhD students, sponsored by CAAI. 2024.

National Scholarship (Top 1%). 2022, 2025.



Updated at Feb. 2026.