Projects

CS185 — Deep RL

with Sergey Levine

Imitation Learning

Action-chunking behavioral cloning with MSE and flow matching policies for the Push-T environment.

Feb 2026

CS280 — Computer Vision

with Angjoo Kanazawa and Alexei Efros

Poor Man's AR & Homographies

Keypoint tracking, DLT camera calibration with RANSAC and bundle adjustment, 3D cube projection, and affine/homography transforms from scratch.

Feb 2026

Facial Keypoint Detection

Direct CNN regression, ResNet-18 and DINOv3 transfer learning, and U-Net heatmap prediction with soft-argmax for 68-point facial keypoint detection.

Feb 2026

Academic

VISTA: Vision Intersectional Sparse Trait Analysis

Probed Vision Encoders with linear classifiers and trained patch-level SAEs to discover interpretable sparse dictionary features (SDFs) defining demographic traits, revealing the heavy influence of reconstruction error in SAE-based debiasing.

May 2025

Personal

VizDoom DQN

Double DQN with frozen pretrained vision encoders (AIMv2, V-JEPA 2), PCA whitening, and optional dueling architecture for VizDoom FPS environments.

Feb 2026