An efficient actor‐critic reinforcement learning for device‐to‐device communication underlaying sectored cellular network. (21st January 2020)