Rustin Soraki

01 About

I'm a PhD student in the Paul G. Allen School of Computer Science & Engineering at the University of Washington, where I'm advised by Ali Farhadi. Alongside my PhD, I'm a Student Researcher at the Allen Institute for AI (AI2). Before coming to Seattle, I earned my B.Sc. in Computer Engineering from the University of Tehran, where my interest in computer vision first took shape.

My research is in computer vision and machine learning. These days I'm focused on object dynamics and world models, 3D perception, and vision-language models. At the core, I want to help machines interpret the visual world and interact with it, building a grounded, three-dimensional understanding of their surroundings from ordinary video.

02 Research

3D Perception & Detection

Locating and understanding objects in 3D from images and video, including promptable, open-world detection.

Object Dynamics & World Models

Predicting how objects move and interact over time, learned from large-scale human video.

Vision-Language & Agents

Multimodal models and multi-agent systems that perceive, describe, and reason.

03 News

Jun 2026 ObjectForesight accepted to ECCV 2026. Project
Apr 2026 CrossFusion accepted to MIDL 2026. arXiv
Jun 2025 PathFinder accepted to ICCV 2025. Project

04 Publications

For the full and current list, see my Google Scholar.

MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction

arXiv 2026

Jianing Zhang, Chenhao Zheng, …, Rustin Soraki, …, Jieyu Zhang, Ranjay Krishna

Project arXiv Code Model Dataset Benchmark

WildDet3D: Scaling Promptable 3D Detection in the Wild

arXiv 2026

Weikai Huang, Jieyu Zhang, …, Rustin Soraki, …, Ali Farhadi, Ranjay Krishna

Project arXiv Code Model Data Demo

ObjectForesight: Predicting Future 3D Object Trajectories from Human Videos

ECCV 2026

Rustin Soraki, Homanga Bharadhwaj, Ali Farhadi, Roozbeh Mottaghi

Project arXiv Code Data Pipeline Model Data

PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology

ICCV 2025

Fatemeh Ghezloo*, Mehmet Saygin Seyfioglu*, Rustin Soraki*, Wisdom O. Ikezogwo*, Beibin Li*, Tejoram Vivekanandan, Joann G. Elmore, Ranjay Krishna, Linda Shapiro (* equal contribution)

Project arXiv Paper

CrossFusion: A Multi-Scale Cross-Attention Convolutional Fusion Model for Cancer Survival Prediction

MIDL 2026

Rustin Soraki, Huayu Wang, Joann G. Elmore, Linda Shapiro

arXiv Code