AI-powered noise suppression for real-time audio processing with LiveKit. Based on the DeepFilterNet paper and implementation by Rikorose.
Abstract: A high-quality enrollment speech is crucial to target speaker extraction (TSE), since it provides essential cues for identifying the target speaker in the mixture. However, real applications ...
Abstract: Recent advances in deep learning technology have enabled high-quality speech synthesis, and text-to-speech models are widely used in a variety of applications. However, even state-of-the-art ...