Publications
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
Tianyu Guo, Druv Pai, Yu Bai, Jiantao Jiao, Michael Jordan, Song Mei
In review at ICLR 2025
Attention-Only Transformers via Unrolled Subspace Denoising
Peng Wang, Yifu Lu, Yaodong Yu, Druv Pai, Qing Qu, Yi Ma
In review at ICLR 2025
Token Statistics Transformer: Linear Time Attention via Variational Rate Reduction
Ziyang Wu, Tianjiao Ding, Druv Pai, Jingyuan Zhang, Weida Wang, Yaodong Yu, Yi Ma, Benjamin Haeffele
In review at ICLR 2025
Scaling White-Box Transformers for Vision
Jinrui Yang, Xianhang Li, Druv Pai, Yuyin Zhou, Yi Ma, Yaodong Yu, Cihang Xie
Accepted at NeurIPS 2024
project website - code
A Geometric Analysis of Maximal Coding Rate Reduction
Peng Wang, Huikang Liu, Druv Pai, Yaodong Yu, Zhihui Zhu, Qing Qu, Yi Ma
Accepted at ICML 2024
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
Yaodong Yu*, Sam Buchanan*, Druv Pai*, Tianzhe Chu, Ziyang Wu, Shengbang Tong, Hao Bai, Yuexiang Zhai, Benjamin Haeffele, Yi Ma
Accepted at JMLR
project website - code
Masked Completion via Structured Diffusion with White-Box Transformers
Druv Pai, Sam Buchanan, Ziyang Wu, Tianzhe Chu, Yaodong Yu, Yi Ma
Accepted at ICLR 2024, accepted at CPAL 2024 (non-archival track)
project website - code
Congestion Pricing for Efficiency and Equity: Theory and Applications to the San Francisco Bay Area
Chinmay Maheshwari, Kshitij Kulkarni, Druv Pai, Jiarui Yang, Manxi Wu, Shankar Sastry
Emergence of Segmentation with Minimalistic White-Box Transformers
Yaodong Yu, Tianzhe Chu, Shengbang Tong, Ziyang Wu, Druv Pai, Sam Buchanan, Yi Ma
Accepted (oral) at CPAL 2024
project website - code
White-Box Transformers via Sparse Rate Reduction
Yaodong Yu, Sam Buchanan, Druv Pai, Tianzhe Chu, Ziyang Wu, Shengbang Tong, Benjamin Haeffele, Yi Ma
Accepted (poster) at NeurIPS 2023
project website - code
Representation Learning via Manifold Flattening and Reconstruction
Michael Psenka, Druv Pai, Vishal Raman, Shankar Sastry, Yi Ma
Accepted (poster) at SLowDNN 2023, accepted at JMLR
project website - code
Closed-Loop Transcription via Convolutional Sparse Coding
Xili Dai, Ke Chen, Shengbang Tong, Jingyuan Zhang, Xingjian Gao, Mingyang Li, Druv Pai, Yuexiang Zhai, Xiaojun Yuan, Heung-Yeung Shum, Lionel Ni, Yi Ma
Accepted (poster) at SLowDNN 2023, accepted (oral) at CPAL 2024
Pursuit of a Discriminative Representation for Multiple Subspaces via Sequential Games
Druv Pai, Michael Psenka, Chih-Yuan Chiu, Manxi Wu, Edgar Dobriban, Yi Ma
Accepted (poster) at SLowDNN 2023, accepted at Journal of the Franklin Institute
code
Independent and Decentralized Learning in Markov Potential Games
Chinmay Maheshwari, Manxi Wu, Druv Pai, Shankar Sastry
In review at IEEE Transactions on Automatic Control
code