About me

Greetings, I am Le Zhang, a Ph.D. student at Mila - Quebec AI Institute. I currently work with Prof. Aishwarya Agrawal on vision-language learning. Previously, I obtained my Bachelor's degree from Fudan University.

Research Interests

My research centers on computer vision and multimodal learning. I am currently exploring representation learning from generative modeling and building unified multimodal large language models (MLLMs) for both generation and understanding.

News and Updates 🎉

[May 2025] Honored to be selected as an Outstanding Reviewer for CVPR 2025
[Mar 2025] One paper, SAIL, accepted to CVPR 2025 as a Highlight [project]
[Sep 2024] Two papers accepted at NeurIPS 2024
[Sep 2024] One paper on Information Retrieval with Large Language Models accepted to EMNLP 2024 Findings
[May 2024] One paper on Graph Convolution and Contrastive Learning accepted to KDD 2024
[Feb 2024] One paper on vision-language compositional understanding accepted to CVPR 2024
[Dec 2023] One paper on zero-shot multimodal question answering accepted to EMNLP 2023 Findings
[Sep 2023] Started graduate study at Mila!
[Feb-Sep 2022] Research Intern at Shanghai AI Lab
[May 2022] Two papers accepted at NAACL 2022

Publications


  • Assessing and Learning Alignment of Unimodal Vision and Language Models (SAIL)
    Le Zhang, Qian Yang, Aishwarya Agrawal
    CVPR 2025 Highlight arXiv Project GitHub stars

  • Enhancing the Protein Tertiary Structure Prediction by Multiple Sequence Alignment Generation
    Le Zhang, Jiayang Chen, Tao Shen, Yu Li, Siqi Sun.
    NeurIPS 2024 arXiv GitHub stars

  • VisMin: Visual Minimal-Change Understanding
    Rabiul Awal*, Saba Ahmadi*, Le Zhang*, Aishwarya Agrawal
    NeurIPS 2024 Project

  • Exploring the Best Practices of Query Expansion with Large Language Models
    Le Zhang, Qian Yang, Yihong Wu.
    EMNLP 2024 Findings arXiv GitHub stars

  • Unifying Graph Convolution and Contrastive Learning in Collaborative Filtering
    Yihong Wu, Le Zhang, Fengran Mo, Tianyu Zhu, Weizhi Ma, Jian-Yun Nie
    KDD 2024

  • Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding
    Le Zhang, Rabiul Awal, Aishwarya Agrawal
    CVPR 2024 arXiv GitHub stars

  • MoqaGPT: Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model
    Le Zhang, Yihong Wu, Fengran Mo, Jian-Yun Nie, Aishwarya Agrawal.
    EMNLP 2023 Findings arXiv GitHub stars

  • Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering
    Rabiul Awal, Le Zhang, Aishwarya Agrawal.
    CVPR 2024 Workshop (Oral) arXiv

  • SUBS: Subtree Substitution for Compositional Semantic Parsing
    Jingfeng Yang*, Le Zhang*, Diyi Yang.
    NAACL 2022 arXiv GitHub stars

  • TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
    Le Zhang, Zichao Yang, Diyi Yang.
    NAACL 2022 arXiv GitHub stars

Services