Lecture image placeholder

Premium content

Access to this content requires a subscription. You must be a premium user to view this content.

Monthly subscription - $9.99Pay per view - $4.99Access through your institutionLogin with Underline account
Need help?
Contact us
Lecture placeholder background

ACL-IJCNLP 2021

August 02, 2021

Thailand

Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.

keywords:

meta-reinforcement learning

text-based games

generalization

Text-based games can be used to develop task-oriented text agents for accomplishing tasks with high-level language instructions, which has potential applications in domains such as human-robot interaction. Given a text instruction, reinforcement learning is commonly used to train agents to complete the intended task owing to its convenience of learning policies automatically. However, because of the large space of combinatorial text actions, learning a policy network that generates an action word by word with reinforcement learning is challenging. Recent research works show that imitation learning provides an effective way of training a generation-based policy network. However, trained agents with imitation learning are hard to master a wide spectrum of task types or skills, and it is also difficult for them to generalize to new environments. In this paper, we propose a meta-reinforcement learning based method to train text agents through learning-to-explore. In particular, the text agent first explores the environment to gather task-specific information and then adapts the execution policy for solving the task with this information. On the publicly available testbed ALFWorld, we conducted a comparison study with imitation learning and show the superiority of our method.

Next from ACL-IJCNLP 2021

Embracing Ambiguity: Shifting the Training Target of NLI Models
poster

Embracing Ambiguity: Shifting the Training Target of NLI Models

ACL-IJCNLP 2021

+1Saku SugawaraAkiko AizawaJohannes Mario Meissner
Johannes Mario Meissner and 3 other authors

02 August 2021

Similar lecture

Meta-learning for Task-oriented Household Text Games
workshop paper

Meta-learning for Task-oriented Household Text Games

ACL-IJCNLP 2021

Zhenjie Zhao
Zhenjie Zhao and 1 other author

02 August 2021