Homepage
About Me
Hello! I’m Mohammad Qazim Bhat, a graduate student at the University of Colorado Boulder. My research focuses on Computer Vision, Foundation Models, and Multi-Modal Large Language Models (LLMs). I work on advancing vision-language understanding, building real-time robotic applications, and creating efficient data curation methods for large AI models.
Research Interests
- Vision-Language Models: Developing AI systems for robotic manipulation and autonomous applications.
- Foundation Models: Building scalable, multi-modal large language models.
- Data Curation: Designing data pipelines and training techniques to improve model efficiency.
Experience & Contributions
I’ve worked on research projects at MBZUAI (vision-language models) and the Correll Lab at CU Boulder (robotic multi-modal models). In industry, I gained hands-on experience in data science at ITC Limited and in machine learning at Samsung R&D.
Publications & Open Source
I’ve published research in top venues, including NeurIPS, IJCV, and IJCAI. My open-source contributions, such as AutoVideo and TODS (with over 25,000 GitHub stars), help make advanced AI tools accessible to everyone.
Goals
I aim to bring AI research into real-world applications, focusing on robotics and scalable multi-modal models. I’m always open to collaboration in computer vision, robotics, and AI.