您的瀏覽器不支援JavaScript語法,網站的部份功能在JavaScript沒有啟用的狀態下無法正常使用。

中央研究院 資訊科學研究所

活動訊息

友善列印

列印可使用瀏覽器提供的(Ctrl+P)功能

學術演講

:::

Relay Generalization of Reinforcement Learning

  • 講者藍立呈 先生 (Computer Science, University of California, Los Angeles)
    邀請人:吳廸融
  • 時間2022-12-15 (Thu.) 14:00 ~ 16:00
  • 地點資訊所舊館108演講廳
摘要
Generalization is critical for deploying reinforcement learning (RL) agents into real-world applications. A well-trained RL agent that can achieve high rewards under restricted settings may not be able to handle the enormous state space and complex environment variations in the real world. We propose, study, and improve the "relay-generalization" a well-generalized agent that masters a task should be able to start from any controllable state in the same environment and still reach the goal. For example, a self-driving system may need to take over the control from humans in the middle of driving and must continue to drive the car safely. We show that the current RL methods are not general enough to pass our "relay-evaluation". We also propose a novel method called Self-Trajectory Augmentation (STA) which can reduce the failure rate dramatically.