01 About
I'm a PhD student in the Paul G. Allen School of Computer Science & Engineering at the University of Washington, where I'm advised by Ali Farhadi. Alongside my PhD, I'm a Student Researcher at the Allen Institute for AI (AI2). Before coming to Seattle, I earned my B.Sc. in Computer Engineering from the University of Tehran, where my interest in computer vision first took shape.
My research is in computer vision and machine learning. These days I'm focused on object dynamics and world models, 3D perception, and vision-language models. At the core, I want to help machines interpret the visual world and interact with it, building a grounded, three-dimensional understanding of their surroundings from ordinary video.
02 Research
Locating and understanding objects in 3D from images and video, including promptable, open-world detection.
Predicting how objects move and interact over time, learned from large-scale human video.
Multimodal models and multi-agent systems that perceive, describe, and reason.
04 Publications
For the full and current list, see my Google Scholar profile.