Value Augmented Sampling: Predict Your Rewards to Align
Idan Shenfeld, Seungwook Han, Akash Srivastava, Yoon Kim, Pulkit Agrawal
Under review at ICML, 2024
Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal
Published in ICLR, 2024
Idan Shenfeld, Zhang-Wei Hong, Aviv Tamar, and Pulkit Agrawal
Published in ICML, 2023
Selected for Oral Presentation at 2023 ICLR RRL Workshop.
Ron Dorfman, Idan Shenfeld, and Aviv Tamar
Published in NeurIPS, 2021