Hello there! 👋 I'm Mahesh, some endearingly often call me Maahi, much like the mahi-mahi fish but thankfully, I'm better at coding than swimming! I am a PhD student in Computer Science at Artificial Intelligence Innovation Laboratory (A2IL) in the University at Buffalo.
Much like our brain relies on a perceptual system to interpret visual stimuli, my work revolves around developing a similar perceptual framework for neural networks, enabling them to 'see' and understand the world around them. I am working on the following areas of research at the moment -
- Problems relating to multimodal generative models including Diffusion, Flow and Consistency Models, MultiModal Large Language Models for images and videos.
- Problems I have worked on before : Unified image generation from unpaired conditions, Fairness issues of MLLMs, Hallucinations in Difussion Models.
I love to share my knowledge and insights about AI, machine learning, and my PhD journey. Check out my blog at bhosalems.github.io.
Please find my CV here. I'm usually open to collaborating on projects, discussing research ideas, or just chatting about the latest in tech and/or how your day has been. Feel free to connect with me on X or Linkedin

