Homepage

About Me

Hello! I’m Mohammad Qazim Bhat, a graduate student at University of Colorado Boulder. My research focuses on Computer Vision, Foundation Models, and Multi-Modal Large Language Models (LLMs). I work on advancing vision-language understanding, real-time robotic applications, and creating efficient data curation methods for large AI models.


Research Interests

  • Vision-Language Models: Developing AI systems for robotic manipulation and autonomous applications.
  • Foundation Models: Building scalable, multi-modal large language models.
  • Data Curation: Designing data pipelines and training techniques to improve model efficiency.

Experience & Contributions

I’ve worked on key projects at MBZUAI (vision-language models) and Correll Lab at CU Boulder (robotic multi-modal models). In industry, I gained hands-on experience in data science with ITC Limited and machine learning with Samsung R&D.


Publications & Open Source

I’ve published research in top venues, including NeurIPS, IJCV, and IJCAI. My open-source contributions, such as AutoVideo and TODS (with over 25,000 GitHub stars), help make advanced AI tools accessible to everyone.


Goals

I aim to bring AI research into real-world applications, focusing on robotics and scalable multi-modal models. I’m always open to collaboration in computer vision, robotics, and AI.