Object-oriented state editing for HRL

Bapst, Victor; Sanchez-Gonzalez, Alvaro; Shams, Omar; Stachenfeld, Kimberly; Battaglia, Peter W.; Singh, Satinder; Hamrick, Jessica B.

arXiv:1910.14361 (cs)

[Submitted on 31 Oct 2019]

Authors: Victor Bapst, Alvaro Sanchez-Gonzalez, Omar Shams, Kimberly Stachenfeld, Peter W. Battaglia, Satinder Singh, Jessica B. Hamrick

Abstract: We introduce agents that use object-oriented reasoning to consider alternate states of the world in order to more quickly find solutions to problems. Specifically, a hierarchical controller directs a low-level agent to behave as if objects in the scene were added, deleted, or modified. The actions taken by the controller are defined over a graph-based representation of the scene, with actions corresponding to adding, deleting, or editing the nodes of a graph. We present preliminary results on three environments, demonstrating that our approach can achieve similar levels of reward as non-hierarchical agents, but with better data efficiency.
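To make the controller's action space concrete, here is a minimal Python sketch of add, delete, and edit actions over a node-only scene graph. It is an illustration under stated assumptions, not the authors' implementation: `SceneGraph`, `EditAction`, and `apply_edit` are hypothetical names, node attributes are plain floats, and edges are omitted for brevity. The edit returns a copy, so the low-level agent could be conditioned on the imagined scene while the true scene stays unchanged.

```python
import copy
from dataclasses import dataclass, field
from typing import Dict, Optional

# All names below are hypothetical; the abstract does not specify an API.
Attrs = Dict[str, float]


@dataclass
class SceneGraph:
    """Node-only scene representation: node id -> attribute dict."""
    nodes: Dict[int, Attrs] = field(default_factory=dict)


@dataclass(frozen=True)
class EditAction:
    """One controller action: 'add', 'delete', or 'edit' a single node."""
    kind: str
    node_id: int
    attrs: Optional[Attrs] = None


def apply_edit(graph: SceneGraph, action: EditAction) -> SceneGraph:
    """Return an edited copy of the graph; the input graph is not modified."""
    edited = copy.deepcopy(graph)
    if action.kind == "add":
        if action.node_id in edited.nodes:
            raise ValueError(f"node {action.node_id} already exists")
        edited.nodes[action.node_id] = dict(action.attrs or {})
    elif action.kind == "delete":
        del edited.nodes[action.node_id]  # raises KeyError if the node is absent
    elif action.kind == "edit":
        edited.nodes[action.node_id].update(action.attrs or {})
    else:
        raise ValueError(f"unknown edit kind: {action.kind!r}")
    return edited


# The true scene has two objects; the controller imagines one of them removed.
scene = SceneGraph({0: {"x": 0.0, "y": 1.0}, 1: {"x": 2.0, "y": 1.0}})
imagined = apply_edit(scene, EditAction("delete", 1))
```

In this sketch the low-level policy would receive `imagined` rather than `scene` as its observation; how the controller chooses which edit to issue is outside the scope of the example.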

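Comments:	8 pages; accepted to the Perception as Generative Reasoning workshop of the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1910.14361 [cs.LG]
	(or arXiv:1910.14361v1 [cs.LG] for this version)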
	https://doi.org/10.48550/arXiv.1910.14361

Submission history

From: Victor Bapst
[v1] Thu, 31 Oct 2019 10:48:45 UTC (409 KB)
