I am a computer vision researcher interested in real-time vision, unsupervised visual representation learning, geometry-based computer vision, and vision for robotics. Currently I'm working as a Research Scientist at Meta Reality Labs.
Project | Paper | Code | Video
A modification of the Segment Anything Model (SAM), increasing efficiency through foveated tokenization.
Dense visual descriptors are training using a contrastive loss and similarity labels provided by a 3D reconstruction system.
A depth-based tracker provides real-time estimates of the pose of robot hands and target objects for tele-operated grasping.
A CUDA-accelerated model-based tracker using a signed distance function representation of target objects.
A dataset of RGB-D videos of static configurations of YCB objects. Annotations of object poses are provided.
A dataset of RGB-D videos capturing the same scene in a variety of lighting conditions and furniture configurations.