The overall objective of IA-Speak is to design, develop and validate an intelligent system providing comprehensive, personalised assistance for people with speech disorders resulting from brain damage. This system seeks to reduce the communication barriers faced by these patients, facilitating their daily interaction, promoting their social integration and contributing significantly to improving their quality of life.
With this strategy, IA-SPEAK addresses both the improvement of speech ability and effective, immediate communication in everyday life.
Within this framework, the following specific objectives are proposed:
Nair Center plays a key role in leading the development of advanced components for audio processing and the implementation of deep learning models aimed at recognising and translating non-standard speech.
Its main contribution is the design of the voice processing architecture, which interprets variable and complex patterns using automated pre-processing pipelines, applying filtering, normalisation and segmentation to optimise signal quality. Within this framework, it develops advanced diarisation algorithms, spectrogram extraction and MFCC coefficients, which allow the identification of each user's particularities and the generation of personalised voice profiles that underpin the entire system.
A notable contribution is the non-standard speech translation module, structured in four interconnected blocks: data input, transcription, cloning and optimised output. Transcription uses models such as Whisper and WhisperX, fine-tuned to learn the pronunciation characteristics of each patient, similar to adapting to a new accent or dialect. The voice cloning module, based on models such as F5-TTS adapted to Spanish, generates synthetic audio that preserves the user's vocal identity with greater clarity and comprehensibility, ensuring authenticity in communication.
Finally, Nair Center applies its expertise in multimodal recommendation systems, integrating voice analysis, facial recognition, user profile, and historical progress. Using deep reinforcement learning techniques, the system suggests personalised exercises in real time, adapted to each patient's progress.






