Bei Peng


I am a postdoctoral researcher with a focus on reinforcement learning. My PhD research mainly focuses on interactive machine learning, reinforcement learning, and curriculum learning.

I have done some work on exploring how non-expert humans want to teach the agent to solve new complex sequential decision tasks and how to incorporate these insights into the development of new machine learning algorithms. I'm also interested in developing more sample efficient reinforcement learning algorithms.

Prior to joining the University of Oxford, I did a post-graduate internship in the Reinforcement Learning Group at Microsoft Research AI, working on developing hierarchical RL algorithms for text-based games. I received my doctorate from the School of Electrical Engineering and Computer Science at Washington State University in 2018, under the supervision of Matthew E. Taylor.

Before that, I worked as a front-end web developer in Tencent after receiving my bachelor's degree in Computer Science from Huazhong University of Science and Technology in China in 2012.