Om Khangaonkar
I'm an third-year undergrad at UC Davis advised by Hamed Pirsiavash. My research studies computer vision and machine learning.
Specifically, I am interested in the intersection of representation learning, generative modeling, and scene understanding. Large generative models (i.e. Stable Diffusion, FLUX.1) have learned to model a large amount of our visual world. How can we utilize the rich representations learned by these models to build generalizable models of perception from limited supervision, similar to humans?
Email /
Twitter /
Github /
LinkedIn
|