Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

Chen, Bryan; Sax, Alexander; Lewis, Gene; Armeni, Iro; Savarese, Silvio; Zamir, Amir; Malik, Jitendra; Pinto, Lerrel

Computer Science > Robotics

arXiv:2011.06698 (cs)

[Submitted on 13 Nov 2020]

Title:Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

Authors:Bryan Chen, Alexander Sax, Gene Lewis, Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik, Lerrel Pinto

View PDF

Abstract:Vision-based robotics often separates the control loop into one module for perception and a separate module for control. It is possible to train the whole system end-to-end (e.g. with deep RL), but doing it "from scratch" comes with a high sample complexity cost and the final result is often brittle, failing unexpectedly if the test environment differs from that of training.
We study the effects of using mid-level visual representations (features learned asynchronously for traditional computer vision objectives), as a generic and easy-to-decode perceptual state in an end-to-end RL framework. Mid-level representations encode invariances about the world, and we show that they aid generalization, improve sample complexity, and lead to a higher final performance. Compared to other approaches for incorporating invariances, such as domain randomization, asynchronously trained mid-level representations scale better: both to harder problems and to larger domain shifts. In practice, this means that mid-level representations could be used to successfully train policies for tasks where domain randomization and learning-from-scratch failed. We report results on both manipulation and navigation tasks, and for navigation include zero-shot sim-to-real experiments on real robots.

Comments:	Extended version of CoRL 2020 camera ready. Supplementary released separately
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2011.06698 [cs.RO]
	(or arXiv:2011.06698v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2011.06698

Submission history

From: Bryan Chen [view email]
[v1] Fri, 13 Nov 2020 00:16:05 UTC (39,022 KB)

Computer Science > Robotics

Title:Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators