Taris: An online speech recognition framework with sequence to sequence neural networks for both audio-only and audio-visual speech. (July 2022)