|
Junke Wang 「王君可」
I'm a final-year Ph.D. student at Fudan University, supervised by Prof. Zuxuan Wu and Prof. Yu-Gang Jiang. I have interned at ByteDance Seed, Meta FAIR, ByteDance Monetization GenAI, and Microsoft Cloud AI. I'm the recipient of 2025 Bytedance Fellowship.
My research interest lies in multimodal general intelligence. Recently, I work on action models, world models, and representation learning. Feel free to reach out if you are interested in working with me.
Email: wangjk21 [at] m [dot] fudan [dot] edu [dot] cn
Google Scholar [Full publications]   /  
Github
|
|
Publication
|
|
* denotes equal contribution, † denotes project leader.
|
FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding.
[Code]
Yiweng Xie, Bo He, Junke Wang †, Xiangyu Zheng, Ziyi Ye, Zuxuan Wu.
CVPR, 2026.
|
OmniGen-AR: AutoRegressive Any-to-Image Generation.
[Code]
Junke Wang, Xun Wang, Qiushan Guo, Peize Sun, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang.
NeurIPS, 2025.
|
Perception Encoder: The best visual embeddings are not at the output of the network.
[Code]
FAIR Perception, Meta.
NeurIPS, 2025 (Oral).
|
OmniTracker: Unifying Object Tracking by Tracking-with-Detection.
Junke Wang*, Zuxuan Wu*, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang.
TPAMI, 2025.
|
Fighting Malicious Media Data: A Survey on Tampering Detection and Deepfake Detection.
Junke Wang, Zhenxin Li, Chao Zhang, Jingjing Chen, Zuxuan Wu, Larry S. Davis, Yu-Gang
Jiang.
PIEEE, 2025.
|
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation.
[Code]
Junke Wang, Yi Jiang, Zehuan Yuan, Binyue Peng, Zuxuan Wu, Yu-Gang Jiang.
NeurIPS, 2024.
|
OmniVid: A Generative Framework for Universal Video Understanding.
[Code]
Junke Wang, Dongdong Chen, Chong Luo, Bo He, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang.
CVPR, 2024.
|
Look Before You Match: Instance Understanding Matters in Video Object Segmentation.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang.
CVPR, 2023.
|
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce
Liu, Yu-Gang Jiang, Lu Yuan.
NeurIPS, 2022.
|
Efficient Video Transformers with Spatial-Temporal Token Selection.
[Code]
Junke Wang*, Xitong Yang*, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang.
ECCV, 2022.
|
ObjectFormer for Image Manipulation Detection and Localization.
[Code]
Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Yu-Gang Jiang,
Ser-Nam Li.
CVPR, 2022.
|
M2TR: Multi-modal Multi-scale Transformer for Deepfake Detection.
[Code]
Junke Wang, Zuxuan Wu, Wenhao Ouyang, Xintong Han, Jingjing Chen, Ser-Nam Lim, Yu-Gang
Jiang.
ICMR, 2022.
|
FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting.
Junke Wang, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang.
TMM, 2022.
|
|
Projects
|
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL.
[Code]
Junke Wang, Zhi Tian, Xun Wang, Xinyu Zhang, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang.
|
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning.
[Dataset]
[Project page]
Junke Wang*, Lingchen Meng*, Zejia Weng, Bo He, Zuxuan Wu, Yu-Gang Jiang.
|
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System.
[Code]
Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang.
|
|
Academic Services
Conference Reviewer for CVPR, ICCV, ICML, NeurIPS, ICLR, ECCV, etal.
Journal Reviewer for TPAMI, TIP, IJCV, etal.
|
|
Selected Awards
Bytedance PhD Fellowship (20 people in China and Singapore). 2025.
CCF-CV Academic Rising Star Award (3 people in China). 2025.
Academic Star of Fudan University. 2025.
Fundamental Research Program for PhD students, sponsored by NSFC. 2024.
Young Elite Scientists Sponsorship Program for PhD students, sponsored by CAAI. 2024.
National Scholarship (Top 1%). 2022, 2025.
|
|