Abstract: We introduce a new convolutional autoencoder architecture for user modeling and recommendation tasks with several improvements over the state of the art. First, our model has the flexibility ...
A PyTorch implementation of METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer.
Abstract: Explainable artificial intelligence (XAI) approaches started to be studied in the last period to improve the interpretability of increasingly complex deep learning (DL) methods for remote ...
Implementation of a Vision-Mamba network, integrating State Space Models (SSM) with a patch-based encoder–decoder for image inpainting, colorization, and denoising. Trained with L1, SSIM, and VGG ...