Safe and efficient imitation learning by clarification of experienced latent space. (18th August 2021)

Record Type:: Journal Article
Title:: Safe and efficient imitation learning by clarification of experienced latent space. (18th August 2021)
Main Title:: Safe and efficient imitation learning by clarification of experienced latent space
Authors:: Fujiishi, Hidehito
Kobayashi, Taisuke
Sugimoto, Kenji
Abstract:: Behavioral cloning from observation (BCO) allows the robot to learn the policy without the expert's action information. However, it requires a few interactions with the environment to infer the expert's actions, which carries a risk of robot failures. In addition, BCO assumes that the inferred action is accurate, causing wrong and inefficient updates of the policy. Both problems can be resolved by outlier detection that determines whether the faced state has been experienced or not. This paper addresses such outlier detection mechanisms using a variational autoencoder (VAE) to improve the safety and efficiency of the standard BCO. For the first problem, safety, we suppose that the expert's demonstrations visited only safe states; a VAE is then trained on the expert's state data to detect inexperienced and dangerous scenes. For the second problem, efficiency, another VAE is trained on the state data safely collected by the imitator's policy to detect the scenes where the inferred actions are not accurate. In handwriting robot experiments, the proposed mechanisms succeeded in improving the standard BCO in terms of both safety (roughly 64%) and efficiency (roughly 44%). The high versatility of the proposed mechanisms is verified by learning various alphabets.
Is Part Of:: Advanced robotics. Volume 35:Number 16(2021)
Journal:: Advanced robotics
Issue:: Volume 35:Number 16(2021)
Issue Display:: Volume 35, Issue 16 (2021)
Year:: 2021
Volume:: 35
Issue:: 16
Issue Sort Value:: 2021-0035-0016-0000
Page Start:: 1012
Page End:: 1027
Publication Date:: 2021-08-18
Subjects:: Imitation learning -- outlier detection -- regularized latent space -- handwriting robot
Robotics -- Periodicals
Robotics -- Japan -- Periodicals
Robotics
Japan
Periodicals
629.89205
Journal URLs:: http://www.catchword.com/rpsv/cw/vsp/01691864/contp1.htm
http://catalog.hathitrust.org/api/volumes/oclc/14883000.html
http://www.tandfonline.com/toc/tadr20/current
http://www.tandfonline.com/
http://firstsearch.oclc.org
http://firstsearch.oclc.org/journal=0169-1864;screen=info;ECOIP
http://www.ingentaselect.com/vl=16659242/cl=11/nw=1/rpsv/cw/vsp/01691864/contp1.htm
DOI:: 10.1080/01691864.2021.1959397
Languages:: English
ISSNs:: 0169-1864
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 0696.926500
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
Ingest File:: 18524.xml