A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy. (May 2023)