2026-04-27T01:38:43Z https://uvadoc.uva.es/oai/request

oai:uvadoc.uva.es:10324/71501 2024-11-15T20:02:14Z com_10324_38 col_10324_787

Quintana Angulo, Gino Jesús
2024-11-15T08:26:12Z 2024-11-15T08:26:12Z 2024
https://uvadoc.uva.es/handle/10324/71501
The objective of this project is to develop a web service that transcribes speech to text accurately and with acceptable performance. To achieve this, ASR technologies such as Whisper and Wav2Vec will be used. The service will also establish a reference framework for building similar systems in this constantly evolving field. Development will follow the agile Scrum methodology, dividing the process into incremental iterations (sprints). Key milestones include the implementation of microservices using Docker and Docker Compose, the creation of a functional prototype, and the continuous improvement of the service and of the quality and speed of transcription. The service will be designed to be scalable, modular, and accessible.
spa
info:eu-repo/semantics/openAccess
http://creativecommons.org/licenses/by-nc-nd/4.0/ Attribution-NonCommercial-NoDerivatives 4.0 International
TuVozATexto: Servicio web para la conversión de voz a texto (TuVozATexto: Web service for speech-to-text conversion)
info:eu-repo/semantics/masterThesis