Prevention of data duplication for high throughput sequencing repositories. (27th February 2018)