Quality of Synthetic Speech

Hinterleitner, Florian.

Quality of Synthetic Speech: Perceptual Dimensions, Influencing Factors, and Instrumental Assessment [electronic resource]. - 1st ed. 2017. - XVI, 157 p., 29 illus. - Binding: Card Paper. - T-Labs Series in Telecommunication Services, 2192-2810.

This book reviews research on the perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. It introduces a test protocol for the efficient identification of those dimensions in a listening test and examines several factors that influence them. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed, and tested. Finally, the requirements for integrating an instrumental quality measure into a concatenative TTS system are examined.

9789811037344


EXTC Engineering

Signal, Image and Speech Processing. User Interfaces and Human Computer Interaction.

621.382