Devan Shah
I am a rising senior in Computer Science at Princeton University pursuing a minor in mathematics, and I am grateful to be advised by Prof. Elad Hazan. Through the Hazan Lab, I work with Google DeepMind Princeton on language modeling and dynamical systems. Previously I worked with Prof. Tingting Chen on ML for authentication.
Broadly, I am interested in better understanding language models, such as improving how models reason, designing post-transformer architectures, and thinking more about post-training. At Princeton, I help host a few reading groups and chair our ACM chapter.
Some recent work I am especially proud of is [1] and [2].
Email /
GitHub /
Google Scholar /
LinkedIn
|
Image Snippet from PRD presentation
|
|
Quantitative Research Intern at Jane Street
May 2025 — Present
I am interning as a (Quantitative) Research intern at Jane Street.
|
|
Hazan Lab w/ Google DeepMind Princeton
September 2024 — Present
I work on language modeling and dynamical systems in the Hazan Lab and collaborate with Google DeepMind Princeton. I've worked on projects aiming to accelerate the Google Spectral Transformer and better model linear dynamical systems, such as such as SpectraLDS, FutureFill, and Google Deluca.
|
|
Recommendation Systems at TikTok
May 2024 — August 2024
I improved the recommendation system for ecommerce videos for a projected $33,000,000+ increase in annual sales through projects focusing on improving multi-interest modeling.
|
|
EEG Vision Embeddings at CareYaya
November 2023 — May 2024
I studied the reconstruction of viewed images from brain activity, improving performance by conditioning on a user's camera roll. We aimed to better align EEG embeddings with CLIP space.
|
|
Price Prediction at Ticket Wallet (YC X25)
June 2023 — August 2023
I worked at an early-stage startup to advise and design data pipelines, pricing algorithms, and refine product pitches. I also worked on algorithms to determine if a ticket was likely to be sold.
|
|
SpectraLDS: Provable Distillation for Linear Dynamical Systems
Devan Shah, Shlomo Fortgang, Sofiia Druchyna, Elad Hazan
Preprint
paper
/ code
/ slides
|
|
FutureFill: Fast Generation from Convolutional Sequence Models
Naman Agarwal, Xinyi Chen, Evan Dogariu, Devan Shah, Hubert Strauss, Vladimir Feinberg, Daniel Suo, Peter Bartlett, Elad Hazan
Preprint
paper
/ code
/ slides
|
|
Augmenting Large Reasoning Models with Contrastive Goal-Conditioned Reinforcement Learning
Devan Shah*, Kevin Wang*, David Yan* (* indicates equal contribution)
paper
/ code
/ poster
|
|
Parallel Scaling with Entropic Reasoners
Devan Shah*, Owen Yang*, Daniel Yang*
paper
/ code
/ poster
|
|
ScooterId: Posture-Based Continuous User Identification From Mobility Scooter Rides
Devan Shah , Ruoqi Huang, Nisha Vinayaga-Sureshkanth, Tingting Chen, Murtuza Jadliwala
IEEE TMC 2025
website /
paper
|
|
Wave Filtering for General Linear Dynamical Systems
Devan Shah* , Brandon Cho*
Idea
paper
/ code
|
|
Truly Adaptive Bloom Filters
Devan Shah* , David Yan*
paper
/ poster
|
|
Rider Posture-Based Continuous Authentication with Few-Shot Learning for Mobility Scooters (Student Abstract)
Devan Shah , Ruoqi Huang, Tingting Chen, Murtuza Jadliwala
AAAI 2025
paper
/ poster
/ slides
|
|
DreamScape: Denoising CLIP Embeddings with User Images for Improved Visualization Reconstruction
Devan Shah
paper
/ code
|
|
Discussion on "Why Does Deep and Cheap Learning Work So Well?"
Devan Shah
paper
|
|
A Survey of State Space Models: From Linear Systems to Language
Devan Shah* , Brandon Cho* (Equal Contribution)
paper
|
|
Understanding Dynamic Algorithms for Packing-Covering LPs via Multiplicative Weight Updates
Devan Shah* , Owen Yang*, Sunil Vittal*
paper
|
Computer Science
|
COS217 |
Introduction to Programming Systems |
COS226 |
Algorithms and Data Structures |
COS418 |
Distributed Systems |
ECE435 |
Machine Learning and Pattern Recognition |
COS435 |
Introduction to Reinforcement Learning |
COS484 |
Natural Language Processing |
COS597 |
Long Term Memory in AI - Vector Search and Databases |
COS597 |
Systems and Machine Learning |
|
Math
|
MAT216 |
Multivariable Analysis and Linear Algebra I |
MAT218 |
Multivariable Analysis and Linear Algebra II |
ECO310 |
Microeconomic Theory: A Mathematical Approach |
MAT377 |
Combinatorial Mathematics |
MAT385 |
Probability Theory |
MAT478 |
The Probabilistic Method |
|
Theory
|
ECE434 |
Theoretical Machine Learning |
COS445 |
Economics and Computing |
COS521 |
Advanced Algorithm Design |
COS522 |
Computational Complexity |
COS585 |
Information Theory and Applications |
ORF543 |
Deep Learning Theory |
COS598 |
Theory of Natural Algorithms |
|
Creative Writing
|
FRS116 |
Evolution of Human Language |
CWR202 |
Creative Writing (Poetry) |
POL316 |
Civil Liberties |
ATL494 |
Creating Comedy for Television |
ATL497 |
How to Write a Monologue |
|
2025 |
Anthropic Alignment Hack #2/13 |
2025 |
Stanford Treehacks Codegen: Best Code Generation Application #1/24 |
2024 |
Columbia DevFest Overall Winners #1/54 |
2023 |
MIT Energy and Climate Hackathon #3/80 |
2023 |
HackHarvard CareYaya Track #1/24 |
2023 |
Princeton HackaTron Web3 Hack #2 |
2022 |
HackPrinceton Best AI Hack |
|
|