Please use this identifier to cite or link to this item: https://dl.ucsc.cmb.ac.lk/jspui/handle/123456789/3696
Title: “EYES FOR THE BLIND” - GENERATING HIGH QUALITY TEXT-TO-SPEECH VOICES FOR SINHALA
Authors: JAYAMANNE, YASORA THEWUNI
Issue Date: 9-Sep-2016
Abstract: This research compares different text-to-speech (TTS) modelling techniques used in developing Sinhala TTS systems. Speech research offers several modelling techniques; of these, existing Sinhala TTS systems are based on formant synthesis, diphone concatenation and unit selection. HMM-based modelling is another popular technique, which has not been attempted before for the Sinhala language. The main objective of this study is to find the best modelling technique for the Sinhala language. Therefore, formant synthesis, diphone concatenation, unit selection and HMM-based modelling are evaluated in order to answer the research question. As the first step of the study, an HMM-based TTS system is developed. The four modelling techniques are then analyzed using user evaluations. Three quality attributes (speech quality, naturalness and intelligibility) are considered in the analysis, together with user preference. The results show that unit selection is the best technique across the three quality attributes measured, scoring 75.20% for speech quality, 78.20% for naturalness and 86.20% for intelligibility. In terms of user preference, 50% of the users preferred the unit selection approach. Overall, the diphone concatenation approach had the second-best values.
URI: http://hdl.handle.net/123456789/3696
Appears in Collections: SCS Individual Project - Final Thesis (2015)

Files in This Item:
File: 11001259_YTJayamanne.pdf (Restricted Access)
Size: 2.77 MB
Format: Adobe PDF


Items in UCSC Digital Library are protected by copyright, with all rights reserved, unless otherwise indicated.