publications

* Indicates equal contribution

2026

  1. reward_errs_categorization.png
    When Errors Can Be Beneficial: A Categorization of Imperfect Rewards for Policy Gradient
    Shuning Shang*, Hubert Strauss*, Stanley Wei, Sanjeev Arora, and Noam Razin
    arXiv preprint arXiv:2604.25872, 2026

2025

  1. hardware_eff.png
    Hardware-Efficient Attention for Fast Decoding
    Ted Zadouri, Hubert Strauss, and Tri Dao
    In Conference on Language Modeling (COLM 2025), 2025
  2. what_makes_good_rm.png
    What Makes a Reward Model a Good Teacher? An Optimization Perspective
    Noam Razin, Zixuan Wang, Hubert Strauss, Stanley Wei, Jason D Lee, and Sanjeev Arora
    In Advances in Neural Information Processing Systems (NeurIPS 2025), 2025

2024

  1. futurefill.png
    FutureFill: Fast Generation from Convolutional Sequence Models
    Naman Agarwal, Xinyi Chen, Evan Dogariu, Devan Shah, Hubert Strauss, Vlad Feinberg, Daniel Suo, Peter Bartlett, and Elad Hazan
    In International Conference on Learning Representations (ICLR 2026), 2024