[Summer Short Course] A Tutorial on Reinforcement Learning - Prof. Benjamin Van Roy

  • 2019.07.15
  • Events

Topic: A Tutorial on Reinforcement Learning

Speaker: Prof. Benjamin Van Roy, Stanford University

Time: 10:00 am - 11:30 am, July 15 and July 17, 2019

Venue: Room 201, Teaching Building B (July 15)

       Room 208, Cheng Dao Building (July 17)

Abstract:

There is sometimes confusion about what reinforcement learning is about. This is partly because the term alternately refers to a problem, a community who work on the problem, and methods developed by this community, some of which have been useful in addressing other problems. The reinforcement learning problem is that faced by an agent interacting with an uncertain environment aiming to maximize rewards it accumulates over time. This tutorial will introduce the problem and basic policy and value function learning algorithms that aim to address it. We will also discuss data efficiency and the role of exploration, generalized value functions, and hierarchical reinforcement learning.

Biography:

Benjamin Van Roy is a Professor at Stanford University, where he has served on the faculty since 1998. His research focuses on understanding how an agent interacting with a poorly understood environment can learn over time to make effective decisions. He is interested in the design of efficient reinforcement learning algorithms, understanding what is possible or impossible in this domain, and applying the technology toward the benefit of society. Beyond academia, he leads a DeepMind Research team in Mountain View, and has also led research programs at Unica (acquired by IBM), Enuvis (acquired by SiRF), and Morgan Stanley.

He is a Fellow of INFORMS and IEEE and has served on the editorial boards of Machine Learning; Mathematics of Operations Research, for which he co-edits the Learning Theory Area; Operations Research, for which he edited the Financial Engineering Area; and the INFORMS Journal on Optimization.

He received the SB in Computer Science and Engineering and the SM and PhD in Electrical Engineering and Computer Science, all from MIT. He has been a recipient of the MIT George C. Newton Undergraduate Laboratory Project Award, the MIT Morris J. Levin Memorial Master's Thesis Award, the MIT George M. Sprowls Doctoral Dissertation Award, the National Science Foundation CAREER Award, the Stanford Tau Beta Pi Award for Excellence in Undergraduate Teaching, and the Management Science and Engineering Department's Graduate Teaching Award. He has held visiting positions as the Wolfgang and Helga Gaul Visiting Professor at the University of Karlsruhe, the Chin Sophonpanich Foundation Professor and the InTouch Professor at Chulalongkorn University, a Visiting Professor at the National University of Singapore, and a Visiting Professor at the Chinese University of Hong Kong, Shenzhen.