A Dynamic Penalty Function Approach for Constraint-Handling in Reinforcement Learning. Issue 3 (2021)