Audio-visual word prominence detection from clean and noisy speech. (March 2018)