about
publications

Announcement_2

Created on August 01, 2025

2025

Our work which provides an optimization perspective on what makes a good reward model for RLHF has been accepted to NeurIPS 2025. See you in San Jose !

© Copyright 2026 Hubert Strauss. Powered by Jekyll with al-folio theme. Last updated: June 14, 2026.