Devan Shah

I am a rising senior in Computer Science at Princeton University pursuing a minor in mathematics, and I am grateful to be advised by Prof. Elad Hazan. Through the Hazan Lab, I work with Google DeepMind Princeton on language modeling and dynamical systems. Previously I worked with Prof. Tingting Chen on ML for authentication.

Broadly, I am interested in better understanding language models, such as improving how models reason, designing post-transformer architectures, and thinking more about post-training. At Princeton, I help host a few reading groups and chair our ACM chapter.

Some recent work I am especially proud of is [1] and [2].

Email / GitHub / Google Scholar / LinkedIn

Image Snippet from PRD presentation

Work Experience

	Quantitative Research Intern at Jane Street May 2025 — Present I am interning as a (Quantitative) Research intern at Jane Street.
	Hazan Lab w/ Google DeepMind Princeton September 2024 — Present I work on language modeling and dynamical systems in the Hazan Lab and collaborate with Google DeepMind Princeton. I've worked on projects aiming to accelerate the Google Spectral Transformer and better model linear dynamical systems, such as such as SpectraLDS, FutureFill, and Google Deluca.
	Recommendation Systems at TikTok May 2024 — August 2024 I improved the recommendation system for ecommerce videos for a projected $33,000,000+ increase in annual sales through projects focusing on improving multi-interest modeling.
	EEG Vision Embeddings at CareYaya November 2023 — May 2024 I studied the reconstruction of viewed images from brain activity, improving performance by conditioning on a user's camera roll. We aimed to better align EEG embeddings with CLIP space.
	Price Prediction at Ticket Wallet (YC X25) June 2023 — August 2023 I worked at an early-stage startup to advise and design data pipelines, pricing algorithms, and refine product pitches. I also worked on algorithms to determine if a ticket was likely to be sold.

Papers & Projects

SpectraLDS: Provable Distillation for Linear Dynamical Systems

Devan Shah, Shlomo Fortgang, Sofiia Druchyna, Elad Hazan
Preprint
paper / code / slides

FutureFill: Fast Generation from Convolutional Sequence Models

Naman Agarwal, Xinyi Chen, Evan Dogariu, Devan Shah, Hubert Strauss, Vladimir Feinberg, Daniel Suo, Peter Bartlett, Elad Hazan
Preprint
paper / code / slides

Augmenting Large Reasoning Models with Contrastive Goal-Conditioned Reinforcement Learning

Devan Shah*, Kevin Wang*, David Yan* (* indicates equal contribution)

paper / code / poster

Parallel Scaling with Entropic Reasoners

Devan Shah*, Owen Yang*, Daniel Yang*

paper / code / poster

ScooterId: Posture-Based Continuous User Identification From Mobility Scooter Rides

Devan Shah , Ruoqi Huang, Nisha Vinayaga-Sureshkanth, Tingting Chen, Murtuza Jadliwala
IEEE TMC 2025
website / paper

Wave Filtering for General Linear Dynamical Systems

Devan Shah* , Brandon Cho*
Idea
paper / code

Truly Adaptive Bloom Filters

Devan Shah* , David Yan*

paper / poster

Rider Posture-Based Continuous Authentication with Few-Shot Learning for Mobility Scooters (Student Abstract)

Devan Shah , Ruoqi Huang, Tingting Chen, Murtuza Jadliwala
AAAI 2025
paper / poster / slides

DreamScape: Denoising CLIP Embeddings with User Images for Improved Visualization Reconstruction

Devan Shah

paper / code

Surveys

Discussion on "Why Does Deep and Cheap Learning Work So Well?"

Devan Shah

paper

A Survey of State Space Models: From Linear Systems to Language

Devan Shah* , Brandon Cho* (Equal Contribution)

paper

Understanding Dynamic Algorithms for Packing-Covering LPs via Multiplicative Weight Updates

Devan Shah* , Owen Yang*, Sunil Vittal*

paper

Coursework

Computer Science

COS217	Introduction to Programming Systems
COS226	Algorithms and Data Structures
COS418	Distributed Systems
ECE435	Machine Learning and Pattern Recognition
COS435	Introduction to Reinforcement Learning
COS484	Natural Language Processing
COS597	Long Term Memory in AI - Vector Search and Databases
COS597	Systems and Machine Learning

Math

MAT216	Multivariable Analysis and Linear Algebra I
MAT218	Multivariable Analysis and Linear Algebra II
ECO310	Microeconomic Theory: A Mathematical Approach
MAT377	Combinatorial Mathematics
MAT385	Probability Theory
MAT478	The Probabilistic Method

Theory

ECE434	Theoretical Machine Learning
COS445	Economics and Computing
COS521	Advanced Algorithm Design
COS522	Computational Complexity
COS585	Information Theory and Applications
ORF543	Deep Learning Theory
COS598	Theory of Natural Algorithms

Creative Writing

FRS116	Evolution of Human Language
CWR202	Creative Writing (Poetry)
POL316	Civil Liberties
ATL494	Creating Comedy for Television
ATL497	How to Write a Monologue

Hackathons

2025	Anthropic Alignment Hack #2/13
2025	Stanford Treehacks Codegen: Best Code Generation Application #1/24
2024	Columbia DevFest Overall Winners #1/54
2023	MIT Energy and Climate Hackathon #3/80
2023	HackHarvard CareYaya Track #1/24
2023	Princeton HackaTron Web3 Hack #2
2022	HackPrinceton Best AI Hack

Devan Shah

Work Experience

Quantitative Research Intern at Jane Street

May 2025 — Present

Hazan Lab w/ Google DeepMind Princeton

September 2024 — Present

Recommendation Systems at TikTok

May 2024 — August 2024

EEG Vision Embeddings at CareYaya

November 2023 — May 2024

Price Prediction at Ticket Wallet (YC X25)

June 2023 — August 2023