An efficient framework for semantically-correlated term detection and sanitization in clinical documents. (May 2022)