Announcement_2
Our work which provides an optimization perspective on what makes a good reward model for RLHF has been accepted to NeurIPS 2025. See you in San Jose !
Our work which provides an optimization perspective on what makes a good reward model for RLHF has been accepted to NeurIPS 2025. See you in San Jose !