Multiple Sequence Alignment Based on Deep Q Network with Negative Feedback Policy (December 2022)