Rishabh Agarwal
Senior Research Scientist, Google DeepMind
Verified email at google.com
Title
Cited by
Year
An Optimistic Perspective on Offline Reinforcement Learning
R Agarwal, D Schuurmans, M Norouzi
International Conference on Machine Learning (ICML), 2020
Cited by: 635*; Year: 2020
Deep Reinforcement Learning at the Edge of the Statistical Precipice
R Agarwal, M Schwarzer, PS Castro, A Courville, MG Bellemare
Neural Information Processing Systems (NeurIPS), Outstanding Paper Award, 2021
Cited by: 570; Year: 2021
Neural additive models: Interpretable machine learning with neural nets
R Agarwal, L Melnick, N Frosst, X Zhang, B Lengerich, R Caruana, GE Hinton
Neural Information Processing Systems (NeurIPS), Spotlight, 2021
Cited by: 426; Year: 2021
Revisiting Fundamentals of Experience Replay
W Fedus*, P Ramachandran*, R Agarwal, Y Bengio, H Larochelle, ...
International Conference on Machine Learning (ICML), 2020
Cited by: 280; Year: 2020
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, *Core Contributor, 2024
Cited by: 188; Year: 2024
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
R Agarwal, MC Machado, PS Castro, MG Bellemare
International Conference on Learning Representations (ICLR), Spotlight, 2021
Cited by: 188; Year: 2021
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning
C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...
Advances in Neural Information Processing Systems (NeurIPS), 2020
Cited by: 185*; Year: 2020
Learning to Generalize from Sparse and Underspecified Rewards
R Agarwal, C Liang, D Schuurmans, M Norouzi
International Conference on Machine Learning (ICML), 2019
Cited by: 104; Year: 2019
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
A Kumar*, R Agarwal*, D Ghosh, S Levine
International Conference on Learning Representations (ICLR), *equal contribution, 2021
Cited by: 88; Year: 2021
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
R Agarwal, M Schwarzer, PS Castro, A Courville, MG Bellemare
Neural Information Processing Systems (NeurIPS), 2022
Cited by: 56*; Year: 2022
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
R Agarwal*, N Vieillard*, Y Zhou, P Stanczyk, S Ramos, M Geist, ...
International Conference on Learning Representations (ICLR), 2024
Cited by: 52*; Year: 2024
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
A Kumar, R Agarwal, T Ma, A Courville, G Tucker, S Levine
International Conference on Learning Representations (ICLR), Spotlight, 2022
Cited by: 52; Year: 2022
Bigger, Better, Faster: Human-level Atari with human-level efficiency
M Schwarzer, JO Ceron, A Courville, MG Bellemare, PS Castro*, R Agarwal*
International Conference on Machine Learning (ICML), 2023
Cited by: 50; Year: 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
G Sokar, R Agarwal, PS Castro, U Evci
International Conference on Machine Learning (ICML), Oral, 2023
Cited by: 46; Year: 2023
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
A Kumar, R Agarwal, X Geng, G Tucker, S Levine
International Conference on Learning Representations (ICLR), Notable-top-5%, 2023
Cited by: 39; Year: 2023
Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
C Gulino, J Fu, W Luo, G Tucker, E Bronstein, Y Lu, J Harb, X Pan, ...
Neural Information Processing Systems (NeurIPS), 2023
Cited by: 37; Year: 2023
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
A Singh*, JD Co-Reyes*, R Agarwal*, A Anand, P Patil, PJ Liu, J Harrison, ...
Transactions on Machine Learning Research (TMLR), 2024
Cited by: 35; Year: 2024
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
E Nikishin, R Abachi, R Agarwal, PL Bacon
AAAI Conference on Artificial Intelligence, 2022
Cited by: 34; Year: 2022
DistillSpec: Improving speculative decoding via knowledge distillation
Y Zhou, K Lyu, AS Rawat, AK Menon, ..., S Kumar, JF Kagy, R Agarwal
International Conference on Learning Representations (ICLR), 2024
Cited by: 29; Year: 2024
On the Generalization of Representations in Reinforcement Learning
CL Lan, S Tu, A Oberman, R Agarwal, MG Bellemare
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Cited by: 29; Year: 2022