Applying reinforcement learning to environments with sparse and underspecified rewards is an ongoing challenge, requiring generalization from limited feedback. See how we address this with a novel method that provides more refined feedback to the agent.
Learning to Generalize from Sparse and Underspecified Rewards
Posted by Rishabh Agarwal, Google AI Resident and Mohammad Norouzi, Research Scientist Reinfo... more
Tidak ada komentar:
Posting Komentar