👋 About Me
I am a Ph.D. student jointly affiliated with the State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS) at the Institute of Automation, Chinese Academy of Sciences (CASIA) and Zhongguancun Academy, supervised by Prof. Cheng-Lin Liu.
🎓 Education Background
- 2024.09 - Present: Ph.D. Candidate in Pattern Recognition and Intelligent Systems, CASIA-MAIS & ZGCA
- 2021.09 - 2024.06: M.S. in Electronic Information, NLPR, CASIA
- 2017.09 - 2021.06: B.E. in Detection Guidance and Control Technology, School of Space Science and Technology, Xidian University
🔬 Research Focus
My Ph.D. research centers on:
- 🧬 AI for Science: AI-driven vaccine adjuvant discovery and development
- 🤖 Multimodal Large Language Models: Reliable reasoning, inference acceleration, and vision token optimization
- ✍️ Handwritten Text Recognition & Generation: Online Chinese text recognition and synthesis
📊 Academic Impact
You can find my publications on Google Scholar and connect with me through various academic platforms listed in the sidebar.
🔥 News
- 2026.03: 🎉🎉 Our paper “An Efficient Strategy for Data-constrained Machine Learning in Materials Science” was accepted to Materials Genome Engineering Advances.
- 2026.02: 🎉🎉 Three papers accepted to CVPR 2026: “MeteorPred” (meteorological multimodal model), “ChartAgent” (chart understanding framework), and “Fine-Grained Post-Training Quantization” (VLM optimization).
- 2026.01: 🎉🎉 Three papers accepted to top-tier conferences! Two papers to ICLR 2026: “An Open-Ended Benchmark for Adjuvant Research with MLLM” and “One Patch Doesn’t Fit All” (adaptive patching for MLLMs). One paper to ICRA 2026: “RANGER” (monocular zero-shot semantic navigation).
- 2025.11: 🎉🎉 One paper accepted to AAAI 2026! “VAGU & GtS: LLM-Based Benchmark and Framework for Joint Video Anomaly Grounding and Understanding” - a comprehensive framework for video anomaly detection and understanding.
📝 Publications

An Open-Ended Benchmark and Formal Framework for Adjuvant Research with MLLM
Yi Chen*, Yu Zhang*, Jian Xu, Xu-Yao Zhang, Hua Yue, Xinming Wang, Zequan Lyu, Wei Wei, Cheng-Lin Liu
ICLR 2026
- First benchmark dedicated to adjuvant research using multimodal large language models
- Formal framework for representing adjuvant design principles and immune mechanisms

Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information
Yi Chen, Jian Xu, Xu-Yao Zhang, Wen-Zhuo Liu, Yang-Yang Liu, Cheng-Lin Liu
AAAI 2025
- Text-guided dynamic visual token recovery mechanism for multimodal models
- Achieves comparable performance while compressing visual tokens to 10% of original quantity

VisTopo: Dynamic Spatial Topology Modeling for Fine-Grained Visual Prompting in Multimodal Reasoning
Yi Chen, MingMing Yu, Jie Gu, Chu Tang, Jingmin Chen, Rui-Qi Wang
arXiv 2026
- Dual-stream prompting mechanism for modeling visual scene structure
- Achieves state-of-the-art 97.6% precision in mitigating visual hallucination
🧬 AI for Science & Scientific Computing
- An Open-Ended Benchmark and Formal Framework for Adjuvant Research with MLLM, Yi Chen*, Yu Zhang*, Jian Xu, et al. ICLR 2026
- An Efficient Strategy for Data-constrained Machine Learning in Materials Science, ChunTing Shao*, Yi Chen*, ShanMan Song, et al. Materials Genome Engineering Advances
- MeteorPred: A Meteorological Multimodal Large Model and Dataset for Severe Weather Event Prediction, Shuo Tang, Jian Xu, Jiadong Zhang, Yi Chen, et al. CVPR 2026
- The Hitchhiker’s Guide to Scientific Agents: A Journey Through the Cosmos of Research Automation, Xinming Wang, Aslan Feng, Jian Xu, Yi Chen, et al. TechRxiv 2024
🤖 Multimodal Large Language Models
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information, Yi Chen, Jian Xu, Xu-Yao Zhang, et al. AAAI 2025
- VisTopo: Dynamic Spatial Topology Modeling for Fine-Grained Visual Prompting in Multimodal Reasoning, Yi Chen, MingMing Yu, Jie Gu, et al. arXiv 2026
- Sparsity Meets Similarity: Leveraging Long-Tail Distribution for Dynamic Optimized Token Representation in Multimodal Large Language Models, Yi Chen*, Gao-Tong Yu*, Jian Xu. arXiv 2024
- One Patch Doesn’t Fit All: Adaptive Patching for Native-Resolution Multimodal Large Language Models, Wenzhuo Liu, Weijie Yin, Fei Zhu, Shijie Ma, Haiyang Guo, Yi Chen, et al. ICLR 2026
- Fine-Grained Post-Training Quantization for Large Vision Language Models with Integrated Gradients, Ziwen Xiang, Fanhu Zeng, Hongjian Fang, Rui-Qi Wang, Renxing Chen, Yi Chen, et al. CVPR 2026
🧠 Machine Learning
- ManiNet: Manifold Network for Few-Shot Learning, Ruiqi Wang, Hengcan Shi, Yi Chen, YaoNan Wang. AIHCIR 2025 🏆 Best Paper Award
🦿 Embodied Intelligence & Robotics
- RANGER: A Monocular Zero-Shot Semantic Navigation Framework through Contextual Adaptation, Ming-Ming Yu, Yi Chen, Börje F. Karlsson, Wenjun Wu. ICRA 2026
🛠️ Intelligent Agents
- ChartAgent: A Chart Understanding Framework with Tool Integrated Reasoning, Boran Wang, Xinming Wang, Yi Chen, et al. CVPR Findings 2026
🔍 Video Analysis & Anomaly Detection
- VAGU & GtS: LLM-Based Benchmark and Framework for Joint Video Anomaly Grounding and Understanding, Shibo Gao, Peipei Yang, Yi Chen, et al. AAAI 2026
- The Evolution of Video Anomaly Detection: A Unified Framework from DNN to MLLM, Shibo Gao, Peipei Yang, Haiyang Guo, Yangyang Liu, Yi Chen, et al. arXiv 2024
📊 NLP & Information Processing
- ElementCheck: Long-Form Text Factuality Evaluation via Sentence-Level Fact Elements, Xinming Wang, Haoran Du, Yi Chen, et al. arXiv 2026
- MR-ALIGN: Meta-Reasoning Informed Factuality Alignment for Large Reasoning Models, Xinming Wang, Jian Xu, Bin Yu, Sheng Lian, Hongzhu Yi, Yi Chen, et al. arXiv 2025
✍️ Handwritten Text Recognition & Generation
- Recognition of Online Handwritten Chinese Texts in Any Writing Direction via Stroke Classification Based Over-Segmentation, Yi Chen, Heng Zhang, Min-Si Ren, Cheng-Lin Liu. ICPR 2024
- Improved Learning for Online Handwritten Chinese Text Recognition with Convolutional Prototype Network, Yi Chen, Heng Zhang, Cheng-Lin Liu. ICDAR 2023
- Context-Aware Confidence Estimation for Rejection in Handwritten Chinese Text Recognition, Yang-Yang Liu, Yi Chen, Fei Yin, Cheng-Lin Liu. ICDAR 2024
- Decoupling Layout from Glyph in Online Chinese Handwriting Generation, Min-Si Ren, Yan-Ming Zhang, Yi Chen. ICLR 2025
🎖 Honors and Awards
- 2025 Academic Research Star, National AI Academy Beijing Zhongguancun Academy
- 2025 Best Paper Award, AIHCIR 2025 (for “ManiNet: Manifold Network for Few-Shot Learning”)
- 2024 3rd Place, ICDAR2024 Competition on Multi Font Group Recognition and OCR
📖 Education
- 2024.09 - Present, Ph.D. Candidate in Pattern Recognition and Intelligent Systems
  State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), Institute of Automation, Chinese Academy of Sciences & Zhongguancun Academy
  Supervisor: Prof. Cheng-Lin Liu
- 2021.09 - 2024.06, M.S. in Electronic Information
  National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences
  Supervisor: Prof. Cheng-Lin Liu
- 2017.09 - 2021.06, B.E. in Detection Guidance and Control Technology
  School of Space Science and Technology, Xidian University
🔬 Research Interests
- AI for Science: Applying artificial intelligence to scientific discovery, particularly in adjuvant research and materials science
- Multimodal Large Language Models (MLLMs): Developing robust and efficient multimodal AI systems
- Online Handwritten Text Recognition: Recognition and generation of handwritten Chinese text
- Computer Vision: Image understanding, visual reasoning, and multimodal perception
🤝 Academic Service
- Journal Reviewer: IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT)
- Program Committee Member: AAAI 2026
- Conference Reviewer: ICLR 2026, CVPR 2026, ICML 2026, ECCV 2026
🌟 Open Source Contributions
- Implemented full pipeline for crystal structure data processing and graph neural network training
- Code merged into official repository and featured as an official case study
📧 Contact
- Email: yi.chen@nlpr.ia.ac.cn
- Office: State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences
- Address: Beijing 100190, China
Open to collaboration and academic exchange. Please feel free to contact me via email.