Perturbation Measurements on the Degree of Naturalness of Synthesized Vowels

Objective. To determine the impact of jitter and shimmer on the degree of naturalness perception of synthesized vowels produced by acoustical simulation with glottal pulses (GP) and with solid model of the vocal tract (SMVT). Study Design. Prospective study. Methods. Synthesized vowels were produced in three steps: 1. Eighty GP were developed (20 with jitter, 20 with shimmer, 20 with jitter+shimmer, 20 without perturbation); 2. A SMVT was produced based on magnetic resonance imaging (MRI) from a woman during phonation-/epsilon/ and using rapid prototyping technology; 3. Acoustic simulations were performed to obtain eighty synthesized vowels-/epsilon/. Two experiments were performed. First Experiment: three judges rated 120 vowels (20 humans+80 synthesized+20% repetition) as ""human"" or ""synthesized"". Second Experiment: twenty PowerPoint slide sequences were created. Each slide had 4 synthesized vowels produced with the four perturbation condition. Evaluators were asked to rate the vowels from the most natural to the most artificial. Results. First Experiment: all the human vowels were classified as human; 27 out of eighty synthesized vowels were rated as human, 15 of those were produced with jitter+shimmer, 10 with jitter, 2 without perturbation and none with shimmer. Second Experiment: Vowels produced with jitter+shimmer were considered as the most natural. Vowels with shimmer and without perturbation were considered as the most artificial. Conclusions. The association of jitter and shimmer increased the degree of naturalness of synthesized vowels. Acoustic simulations performed with GP and using SMVT demonstrated a possible method to test the effect of the perturbation measurements on synthesized voices.

Palavras-chave

Synthesized voices, Acoustical measurements, Auditory, perceptual evaluation, Naturalness perception, Vocal tract model

URI

https://observatorio.fm.usp.br/handle/OPI/21363

Referências

Amorim P, 2015, ADV VISUAL COMPUTING, P14
Behlau M., 2001, VOICE BOOK SPECIALIS, P1
Brockmann M, 2008, J SPEECH LANG HEAR R, V51, P1152, DOI 10.1044/1092-4388(2008/06-0208)
Brockmann M, 2011, J VOICE, V25, P44, DOI 10.1016/j.jvoice.2009.07.002
Dang JW, 1997, J ACOUST SOC AM, V101, P456, DOI 10.1121/1.417990
Englert M, 2015, J VOICE, V31
Fraj S, 2012, J ACOUST SOC AM, V132, P2603, DOI 10.1121/1.4751536
Fujita S, 2005, ACOUST SCI TECH, V26
Gerratt BR, 2001, J ACOUST SOC AM, V110, P2560, DOI 10.1121/1.1409969
Gonçalves Maria Inês Rebelo, 2009, Braz. j. otorhinolaryngol., V75, P680, DOI 10.1590/S1808-86942009000500012
HILLENBRAND J, 1988, J ACOUST SOC AM, V83, P2361, DOI 10.1121/1.396367
Honda K, 2008, J ACOUST SOC AM, V123, P3731, DOI 10.1121/1.2935225
KERSTA LG, 1960, J ACOUST SOC AM, V32, P1502, DOI 10.1121/1.1935196
Kisenwether JS, 2015, J VOICE, V29, P548, DOI 10.1016/j.jvoice.2014.11.006
Kreiman J, 2005, J ACOUST SOC AM, V117, P2201, DOI 10.1121/1.1858351
Kreiman J, 2015, J ACOUST SOC AM, V138, P1, DOI 10.1121/1.4922174
Lopes Leonardo Wanderley, 2014, CoDAS, V26, P382, DOI 10.1590/2317-1782/20142013033
Mattioli F, 2015, J VOICE, V29, P455, DOI 10.1016/j.jvoice.2014.09.027
Montagnoli AN., 2015, VOICE ANAL PROGRAM 1
Murphy PJ, 2000, J ACOUST SOC AM, V107, P978, DOI 10.1121/1.428272
Nusbaum H. C., 1995, International Journal of Speech Technology, V1, P7, DOI 10.1007/BF02277176
Oppenheim A. V., 2010, DISCRETE TIME SIGNAL
Petrovic-Lazic M, 2015, J VOICE, V29, P241, DOI 10.1016/j.jvoice.2014.07.009
ROSENBERG AE, 1971, J ACOUST SOC AM, V49, P583, DOI 10.1121/1.1912389
ROZSYPAL AJ, 1975, J ACOUST SOC AM, V58, pS23, DOI 10.1121/1.2002025
Rozsypal AJ, 1979, J PHON, P343
Smruti S, 2015, P 3 INT C FRONT INT, V2, P367
Sofranko JL, 2014, J VOICE, V28, P24, DOI 10.1016/j.jvoice.2013.06.001
Sorensen MK, 2015, J VOICE, V30
Titze IR, 2000, PRINCIPLES VOICE PRO, P313
Topaloglu I, 2014, OTOLARYNG HEAD NECK, V151, P1003, DOI 10.1177/0194599814554763
Yiu EML, 2002, J ACOUST SOC AM, V112, P1091, DOI 10.1121/1.1500753
Zhang Y, 2005, J VOICE, V19, P519, DOI 10.1016/j.jvoice.2004.11.005
Ziwei Yu, 2014, J Voice, V28, P770, DOI 10.1016/j.jvoice.2014.03.014

Coleções

Artigos e Materiais de Revistas Científicas - FM/MOF
Artigos e Materiais de Revistas Científicas - HC/ICHC
Artigos e Materiais de Revistas Científicas - HC/InRad
Artigos e Materiais de Revistas Científicas - LIM/32
Artigos e Materiais de Revistas Científicas - ODS/09

Página do item completo