Adaptive tensegrity locomotion: Controlling a compliant icosahedron with symmetry-reduced reinforcement learning. (January 2021)
- Record Type:
- Journal Article
- Title:
- Adaptive tensegrity locomotion: Controlling a compliant icosahedron with symmetry-reduced reinforcement learning. (January 2021)
- Main Title:
- Adaptive tensegrity locomotion: Controlling a compliant icosahedron with symmetry-reduced reinforcement learning
- Authors:
- Surovik, David
Wang, Kun
Vespignani, Massimo
Bruce, Jonathan
Bekris, Kostas E - Other Names:
- Della Santina Cosimo guest-editor.
Katzschmann Robert K. guest-editor.
Bicchi Antonio guest-editor.
Rus Daniela guest-editor. - Abstract:
- Tensegrity robots, which are prototypical examples of hybrid soft–rigid robots, exhibit dynamical properties that provide ruggedness and adaptability. They also bring about, however, major challenges for locomotion control. Owing to high dimensionality and the complex evolution of contact states, data-driven approaches are appropriate for producing viable feedback policies for tensegrities. Guided policy search (GPS), a sample-efficient hybrid framework for optimization and reinforcement learning, has previously been applied to generate periodic, axis-constrained locomotion by an icosahedral tensegrity on flat ground. Varying environments and tasks, however, create a need for more adaptive and general locomotion control that actively utilizes an expanded space of robot states. This implies significantly higher needs in terms of sample data and setup effort. This work mitigates such requirements by proposing a new GPS -based reinforcement learning pipeline, which exploits the vehicle's high degree of symmetry and appropriately learns contextual behaviors that are sustainable without periodicity. Newly achieved capabilities include axially unconstrained rolling, rough terrain traversal, and rough incline ascent. These tasks are evaluated for a small variety of key model parameters in simulation and tested on the NASA hardware prototype, SUPERball. Results confirm the utility of symmetry exploitation and the adaptability of the vehicle. They also shed light on numerousTensegrity robots, which are prototypical examples of hybrid soft–rigid robots, exhibit dynamical properties that provide ruggedness and adaptability. They also bring about, however, major challenges for locomotion control. Owing to high dimensionality and the complex evolution of contact states, data-driven approaches are appropriate for producing viable feedback policies for tensegrities. Guided policy search (GPS), a sample-efficient hybrid framework for optimization and reinforcement learning, has previously been applied to generate periodic, axis-constrained locomotion by an icosahedral tensegrity on flat ground. Varying environments and tasks, however, create a need for more adaptive and general locomotion control that actively utilizes an expanded space of robot states. This implies significantly higher needs in terms of sample data and setup effort. This work mitigates such requirements by proposing a new GPS -based reinforcement learning pipeline, which exploits the vehicle's high degree of symmetry and appropriately learns contextual behaviors that are sustainable without periodicity. Newly achieved capabilities include axially unconstrained rolling, rough terrain traversal, and rough incline ascent. These tasks are evaluated for a small variety of key model parameters in simulation and tested on the NASA hardware prototype, SUPERball. Results confirm the utility of symmetry exploitation and the adaptability of the vehicle. They also shed light on numerous strengths and limitations of the GPS framework for policy design and transfer to real hybrid soft–rigid robots. … (more)
- Is Part Of:
- International journal of robotics research. Volume 40:Number 1(2021)
- Journal:
- International journal of robotics research
- Issue:
- Volume 40:Number 1(2021)
- Issue Display:
- Volume 40, Issue 1 (2021)
- Year:
- 2021
- Volume:
- 40
- Issue:
- 1
- Issue Sort Value:
- 2021-0040-0001-0000
- Page Start:
- 375
- Page End:
- 396
- Publication Date:
- 2021-01
- Subjects:
- Adaptive control -- learning techniques in soft robotics -- locomotion -- mobile robotics -- tensegrity
Robots -- Periodicals
Robots, Industrial -- Periodicals
629.89205 - Journal URLs:
- http://ijr.sagepub.com/ ↗
http://www.uk.sagepub.com/home.nav ↗ - DOI:
- 10.1177/0278364919859443 ↗
- Languages:
- English
- ISSNs:
- 0278-3649
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15290.xml