A bin and hash method for analyzing reference data and descriptors in machine learning potentials. Issue 3 (22nd April 2021)