Personalized vital signs control based on continuous action-space reinforcement learning with supervised experience. (August 2021)