About me
Greetings, I am Le Zhang, a Ph.D. student at Mila - Quebec AI Institute. I am currently working with Prof. Aishwarya Agrawal on vision-language learning. Previously, I obtained a Bachelor's degree from Fudan University.
Research Interests
My research centers on computer vision and multimodal learning. I am currently exploring representation learning from generative modeling and building multimodal large language models (MLLMs) that unify generation and understanding.
News and Updates 🎉
Publications
Assessing and Learning Alignment of Unimodal Vision and Language Models (SAIL)
Le Zhang, Qian Yang, Aishwarya Agrawal
CVPR 2025 (Highlight)

Enhancing the Protein Tertiary Structure Prediction by Multiple Sequence Alignment Generation
Le Zhang, Jiayang Chen, Tao Shen, Yu Li, Siqi Sun
NeurIPS 2024

VisMin: Visual Minimal-Change Understanding
Rabiul Awal*, Saba Ahmadi*, Le Zhang*, Aishwarya Agrawal
NeurIPS 2024

Exploring the Best Practices of Query Expansion with Large Language Models
Le Zhang, Qian Yang, Yihong Wu
EMNLP 2024 Findings

Unifying Graph Convolution and Contrastive Learning in Collaborative Filtering
Yihong Wu, Le Zhang, Fengran Mo, Tianyu Zhu, Weizhi Ma, Jian-Yun Nie
KDD 2024

Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding
Le Zhang, Rabiul Awal, Aishwarya Agrawal
CVPR 2024

MoqaGPT: Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model
Le Zhang, Yihong Wu, Fengran Mo, Jian-Yun Nie, Aishwarya Agrawal
EMNLP 2023 Findings

Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering
Rabiul Awal, Le Zhang, Aishwarya Agrawal
CVPR 2024 Workshop (Oral)

SUBS: Subtree Substitution for Compositional Semantic Parsing
Jingfeng Yang*, Le Zhang*, Diyi Yang
NAACL 2022

TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
Le Zhang, Zichao Yang, Diyi Yang
NAACL 2022
Services
- Conference Reviewer: ACL 2023, EMNLP 2023, CVPR 2024, ARR 2024 (Feb, Apr, June), ECCV 2024, NeurIPS 2024, CVPR 2025 (Outstanding Reviewer), ARR 2025, ICML 2025, NeurIPS 2025
- Teaching Assistant: IFT 6765 - Links between Computer Vision and Language