Open-source TTS models Kokoro, Orpheus, and Piper are tested on symbols, abbreviations, and prosody, with character error rate (CER) and mean opinion score (MOS) results.
Abstract: In the era of free speech and rapid internet expansion, curbing the dissemination of offensive content on social media has become a pressing concern for linguists and regulatory bodies. Hate ...
Abstract: Emotion recognition plays a key role in human-computer interaction (HCI) and intelligent systems. This study proposes a multimodal approach that combines facial expressions and speech ...