Safe and efficient imitation learning by clarification of experienced latent space. (18th August 2021)
- Record Type:
- Journal Article
- Title:
- Safe and efficient imitation learning by clarification of experienced latent space. (18th August 2021)
- Main Title:
- Safe and efficient imitation learning by clarification of experienced latent space
- Authors:
- Fujiishi, Hidehito
Kobayashi, Taisuke
Sugimoto, Kenji
- Abstract:
- Behavioral cloning from observation (BCO) allows a robot to learn a policy without the expert's action information. However, it requires a few interactions with the environment to infer the expert's actions, which carries a risk of robot failures. In addition, BCO assumes that the inferred actions are accurate, which causes wrong and inefficient policy updates when they are not. Both problems can be resolved by outlier detection that determines whether the faced state has been experienced or not. This paper addresses such outlier detection mechanisms using a variational autoencoder (VAE) to improve the safety and efficiency of standard BCO. For the first problem, safety, we suppose that the expert's demonstrations visited only safe states, and a VAE is then trained on the expert's state data to detect inexperienced and dangerous scenes. For the second problem, efficiency, another VAE is trained on the state data safely collected by the imitator's policy to detect scenes where the inferred actions are not accurate. In handwriting robot experiments, the proposed mechanisms improved on standard BCO in terms of both safety (roughly 64%) and efficiency (roughly 44%). The high versatility of the proposed mechanisms is verified by learning various alphabets.
- Is Part Of:
- Advanced robotics. Volume 35:Number 16(2021)
- Journal:
- Advanced robotics
- Issue:
- Volume 35:Number 16(2021)
- Issue Display:
- Volume 35, Issue 16 (2021)
- Year:
- 2021
- Volume:
- 35
- Issue:
- 16
- Issue Sort Value:
- 2021-0035-0016-0000
- Page Start:
- 1012
- Page End:
- 1027
- Publication Date:
- 2021-08-18
- Subjects:
- Imitation learning -- outlier detection -- regularized latent space -- handwriting robot
Robotics -- Periodicals
Robotics -- Japan -- Periodicals
Robotics
Japan
Periodicals
629.89205
- Journal URLs:
- http://www.catchword.com/rpsv/cw/vsp/01691864/contp1.htm
http://catalog.hathitrust.org/api/volumes/oclc/14883000.html
http://www.tandfonline.com/toc/tadr20/current
http://www.tandfonline.com/
http://firstsearch.oclc.org
http://firstsearch.oclc.org/journal=0169-1864;screen=info;ECOIP
http://www.ingentaselect.com/vl=16659242/cl=11/nw=1/rpsv/cw/vsp/01691864/contp1.htm
- DOI:
- 10.1080/01691864.2021.1959397
- Languages:
- English
- ISSNs:
- 0169-1864
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 0696.926500
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 18524.xml