Devan Shah

I am a rising senior in Computer Science at Princeton University pursuing a minor in mathematics, and I am grateful to be advised by Prof. Elad Hazan. Through the Hazan Lab, I work with Google DeepMind Princeton on language modeling and dynamical systems. Previously I worked with Prof. Tingting Chen on ML for authentication.

Broadly, I am interested in better understanding language models, such as improving how models reason, designing post-transformer architectures, and thinking more about post-training. At Princeton, I help host a few reading groups and chair our ACM chapter.

Some recent work I am especially proud of is [1] and [2].

Email  /  GitHub  /  Google Scholar  /  LinkedIn

profile photo

Image Snippet from PRD presentation

Work Experience

project image

Quantitative Research Intern at Jane Street


May 2025 — Present

I am interning as a (Quantitative) Research intern at Jane Street.
project image

Hazan Lab w/ Google DeepMind Princeton


September 2024 — Present

I work on language modeling and dynamical systems in the Hazan Lab and collaborate with Google DeepMind Princeton. I've worked on projects aiming to accelerate the Google Spectral Transformer and better model linear dynamical systems, such as such as SpectraLDS, FutureFill, and Google Deluca.
project image

Recommendation Systems at TikTok


May 2024 — August 2024

I improved the recommendation system for ecommerce videos for a projected $33,000,000+ increase in annual sales through projects focusing on improving multi-interest modeling.
project image

EEG Vision Embeddings at CareYaya


November 2023 — May 2024

I studied the reconstruction of viewed images from brain activity, improving performance by conditioning on a user's camera roll. We aimed to better align EEG embeddings with CLIP space.
project image

Price Prediction at Ticket Wallet (YC X25)


June 2023 — August 2023

I worked at an early-stage startup to advise and design data pipelines, pricing algorithms, and refine product pitches. I also worked on algorithms to determine if a ticket was likely to be sold.



Papers & Projects

project image

SpectraLDS: Provable Distillation for Linear Dynamical Systems


Devan Shah, Shlomo Fortgang, Sofiia Druchyna, Elad Hazan
Preprint
paper / code / slides

project image

FutureFill: Fast Generation from Convolutional Sequence Models


Naman Agarwal, Xinyi Chen, Evan Dogariu, Devan Shah, Hubert Strauss, Vladimir Feinberg, Daniel Suo, Peter Bartlett, Elad Hazan
Preprint
paper / code / slides

project image

Augmenting Large Reasoning Models with Contrastive Goal-Conditioned Reinforcement Learning


Devan Shah*, Kevin Wang*, David Yan* (* indicates equal contribution)

paper / code / poster

project image

Parallel Scaling with Entropic Reasoners


Devan Shah*, Owen Yang*, Daniel Yang*

paper / code / poster

project image

ScooterId: Posture-Based Continuous User Identification From Mobility Scooter Rides


Devan Shah , Ruoqi Huang, Nisha Vinayaga-Sureshkanth, Tingting Chen, Murtuza Jadliwala
IEEE TMC 2025
website / paper

project image

Wave Filtering for General Linear Dynamical Systems


Devan Shah* , Brandon Cho*
Idea
paper / code

project image

Truly Adaptive Bloom Filters


Devan Shah* , David Yan*

paper / poster

project image

Rider Posture-Based Continuous Authentication with Few-Shot Learning for Mobility Scooters (Student Abstract)


Devan Shah , Ruoqi Huang, Tingting Chen, Murtuza Jadliwala
AAAI 2025
paper / poster / slides

project image

DreamScape: Denoising CLIP Embeddings with User Images for Improved Visualization Reconstruction


Devan Shah

paper / code


Surveys

project image

Discussion on "Why Does Deep and Cheap Learning Work So Well?"


Devan Shah

paper

project image

A Survey of State Space Models: From Linear Systems to Language


Devan Shah* , Brandon Cho* (Equal Contribution)

paper

project image

Understanding Dynamic Algorithms for Packing-Covering LPs via Multiplicative Weight Updates


Devan Shah* , Owen Yang*, Sunil Vittal*

paper

Coursework

Computer Science

COS217 Introduction to Programming Systems
COS226 Algorithms and Data Structures
COS418 Distributed Systems
ECE435 Machine Learning and Pattern Recognition
COS435 Introduction to Reinforcement Learning
COS484 Natural Language Processing
COS597 Long Term Memory in AI - Vector Search and Databases
COS597 Systems and Machine Learning

Math

MAT216 Multivariable Analysis and Linear Algebra I
MAT218 Multivariable Analysis and Linear Algebra II
ECO310 Microeconomic Theory: A Mathematical Approach
MAT377 Combinatorial Mathematics
MAT385 Probability Theory
MAT478 The Probabilistic Method

Theory

ECE434 Theoretical Machine Learning
COS445 Economics and Computing
COS521 Advanced Algorithm Design
COS522 Computational Complexity
COS585 Information Theory and Applications
ORF543 Deep Learning Theory
COS598 Theory of Natural Algorithms

Creative Writing

FRS116 Evolution of Human Language
CWR202 Creative Writing (Poetry)
POL316 Civil Liberties
ATL494 Creating Comedy for Television
ATL497 How to Write a Monologue


Hackathons

2025 Anthropic Alignment Hack #2/13
2025 Stanford Treehacks Codegen: Best Code Generation Application #1/24
2024 Columbia DevFest Overall Winners #1/54
2023 MIT Energy and Climate Hackathon #3/80
2023 HackHarvard CareYaya Track #1/24
2023 Princeton HackaTron Web3 Hack #2
2022 HackPrinceton Best AI Hack


Design and source code from David Yan's website