Content area

Abstract

Text-to-speech systems (TTS) have come a long way in the last decade and are now a popular research topic for creating various human-computer interaction systems. Although, a range of speech synthesis models for various languages with several motive applications is available based on domain requirements. However, recent developments in speech synthesis have primarily attributed to deep learning-based techniques that have improved a variety of application scenarios, including intelligent speech interaction, chatbots, and conversational artificial intelligence (AI). Text-to-speech systems are discussed in this survey article as an active topic of study that has achieved significant progress in the recent decade, particularly for Indian and non-Indian languages. Furthermore, the study also covers the lifecycle of text-to-speech systems as well as developed platforms in it. We performed an efficient search for published survey articles up to May 2021 in the web of science, PubMed, Scopus, EBSCO(Elton B. Stephens CO (company)) and Google Scholar for Text-to-speech Systems (TTS) in various languages based on different approaches. This survey article offers a study of the contributions made by various researchers in Indian and non-Indian language text-to-speech systems and the techniques used to implement it with associated challenges in designing TTS systems. The work also compared different language text-to-speech systems based on the quality metrics such as recognition rate, accuracy, TTS score, precision, recall, and F1-score. Further, the study summarizes existing ideas and their shortcomings, emphasizing the scope of future research in Indian and non-Indian languages TTS, which may assist beginners in designing robust TTS systems.

Details

Title
A deep learning approaches in text-to-speech system: a systematic review and recent research perspective
Author
Kumar, Yogesh 1   VIAFID ORCID Logo  ; Koul, Apeksha 2 ; Singh, Chamkaur 3 

 Pandit Deendayal Energy University, Department of Computer Science and Engineering, School of Technology, Gandhinagar, India (GRID:grid.449189.9) (ISNI:0000 0004 1756 5243) 
 Punjabi University, Department of Computer Science and Engineering, Patiala, India (GRID:grid.412580.a) (ISNI:0000 0001 2151 1270) 
 Chandigarh Group of Colleges, Department of Computer Applications, Mohali, India (GRID:grid.412580.a) 
Pages
15171-15197
Publication year
2023
Publication date
Apr 2023
Publisher
Springer Nature B.V.
ISSN
13807501
e-ISSN
15737721
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2790202057
Copyright
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.