Resource-constrained image generation and visual understanding: an interview with Aniket Roy - Robohub

In the latest in our series of interviews meeting the AAAI/SIGAI Doctoral Consortium participants, we caught up with Aniket Roy to find out more about his research on generative models for computer vision tasks.

Tell us a bit about your PhD – where did you study, and what was the topic of your research?

I recently completed my PhD in Computer Science at Johns Hopkins University, where I worked under the supervision of Bloomberg Distinguished Professor Rama Chellappa. My research primarily focused on developing methods for resource-constrained image generation and visual understanding. In particular, I explored how modern generative models can be adapted to operate efficiently while maintaining strong performance.

During my PhD, I worked broadly at the intersection of generative AI, multimodal learning, and few-shot learning. Much of my work involved designing techniques that enable models to learn new concepts or perform complex visual tasks with limited data or computational resources. This included research on diffusion models, personalized image generation, and multimodal representation learning. Overall, my work aims to make advanced vision and generative AI systems more adaptable, efficient, and practical for real-world applications.

Resource-constrained image generation and visual understanding: an interview with Aniket Roy - Robohub

Other newsrooms on this story

Related reading

Generative AI improves a wireless vision system that sees through obstructions…

Does Creativity Still Stand Out In The AI Era? Study Seeks Answers

Anthropic’s Claude Mythos Problem, Dark DNA Unveiled, Pitfalls for Assistive…

The best AI image generators of 2026: There's only one clear winner now

Replicate Intelligence #2 – Replicate blog

Constraining CLIPDraw – Replicate blog