I am a second-year Master’s student in Artificial Intelligence at the Software College of Zhejiang University, with a research focus on multimodal.
📝 Publications
* indicates equal contribution.

Achieving Cross Modal Generalization with Multimodal Unified Representation
Yan Xia*, Hai Huang*, Jieming Zhu, Zhou Zhao

Semantic Residual for Multimodal Unified Discrete Representation
Hai Huang, Shulei Wang, Yan Xia

Overcoming both Domain Shift and Label Shift for Referring Video Segmentation
Hai Huang, Sashuai Zhou, Yan Xia
NAACL 2025
Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding, Shulei Wang, Shuai Yang, Wang Lin, Zirun Guo, Sihang Cai, Hai Huang, Ye Wang, Jingyuan Chen, Tao Jin
CVPR 2025
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance, Shulei Wang, Wang Lin, Hai Huang, Hanting Wang, Sihang Cai, WenKang Han, Tao Jin, Jingyuan Chen, Jiacheng Sun, Jieming Zhu, Zhou Zhao
ICME 2025
Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning, Sashuai Zhou, Hai Huang, Yan Xia
🎖 Honors and Awards
- 2021.10 First Prize in the RoboMaster Robotics Competition
📖 Educations
- 2023.09 - (now), pursuing a master’s in Artificial Intelligence at Zhejiang University
- 2019.09 - 2023.06, earned bachelor’s in Computer Science and Technology from Northeastern University(China)
💻 Internships
- I am currently looking for an internship.