I am a second-year Master’s student in Artificial Intelligence at the Software College of Zhejiang University, with a research focus on multimodal learning.
📝 Publications
* indicates equal contribution.

Achieving Cross Modal Generalization with Multimodal Unified Representation
Yan Xia*, Hai Huang*, Jieming Zhu, Zhou Zhao

Enhancing Multimodal Unified Representations for Cross Modal Generalization
Hai Huang*, Yan Xia*, Shengpeng Ji, Shulei Wang, Hanting Wang, Minghui Fang, Jieming Zhu, Zhenhua Dong, Sashuai Zhou, Zhou Zhao

Semantic Residual for Multimodal Unified Discrete Representation
Hai Huang, Shulei Wang, Yan Xia

Overcoming both Domain Shift and Label Shift for Referring Video Segmentation
Hai Huang, Sashuai Zhou, Yan Xia

ACL 2025
Towards Simultaneous and Independent Zero-shot Speaker Cloning and Zero-shot Language Style Control
Shengpeng Ji, Qian Chen, Wen Wang, Jialong Zuo, Minghui Fang, Ziyue Jiang, Hai Huang, Zehan Wang, Xize Cheng, Siqi Zheng, Zhou Zhao

ACL 2025
Bridging Discrete Codec Representations and Speech Language Models
Shengpeng Ji, Minghui Fang, Jialong Zuo, Ziyue Jiang, Dingdong WANG, Hanting Wang, Hai Huang, Zhou Zhao

ACL 2025
ACE: A Generative Cross-Modal Retrieval Framework With Coarse-To-Fine Semantic Modeling
Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao

ICML 2025
IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models
Hanting Wang, Tao Jin, Wang Lin, Shulei Wang, Hai Huang, Shengpeng Ji, Zhou Zhao

NAACL 2025
Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding
Shulei Wang, Shuai Yang, Wang Lin, Zirun Guo, Sihang Cai, Hai Huang, Ye Wang, Jingyuan Chen, Tao Jin

CVPR 2025
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
Shulei Wang, Wang Lin, Hai Huang, Hanting Wang, Sihang Cai, WenKang Han, Tao Jin, Jingyuan Chen, Jiacheng Sun, Jieming Zhu, Zhou Zhao

ICME 2025
Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning
Sashuai Zhou, Hai Huang, Yan Xia

🎖 Honors and Awards
- 2021.10 First Prize in the RoboMaster Robotics Competition
📖 Education
- 2023.09 - present, Master’s student in Artificial Intelligence, Zhejiang University
- 2019.09 - 2023.06, Bachelor’s degree in Computer Science and Technology, Northeastern University (China)
💻 Internships
- I am currently looking for an internship.