Perturbation Measurements on the Degree of Naturalness of Synthesized Vowels
Carregando...
Citações na Scopus
3
Tipo de produção
article
Data de publicação
2017
Editora
MOSBY-ELSEVIER
Indexadores
Título da Revista
ISSN da Revista
Título do Volume
Autores
MONTAGNOLI, Arlindo
SILVA, Jorge Vicente Lopes da
BEHLAU, Mara
Autor de Grupo de pesquisa
Editores
Coordenadores
Organizadores
Citação
JOURNAL OF VOICE, v.31, n.3, article ID 389.e1, 8p, 2017
Resumo
Objective. To determine the impact of jitter and shimmer on the degree of naturalness perception of synthesized vowels produced by acoustical simulation with glottal pulses (GP) and with solid model of the vocal tract (SMVT). Study Design. Prospective study. Methods. Synthesized vowels were produced in three steps: 1. Eighty GP were developed (20 with jitter, 20 with shimmer, 20 with jitter+shimmer, 20 without perturbation); 2. A SMVT was produced based on magnetic resonance imaging (MRI) from a woman during phonation-/epsilon/ and using rapid prototyping technology; 3. Acoustic simulations were performed to obtain eighty synthesized vowels-/epsilon/. Two experiments were performed. First Experiment: three judges rated 120 vowels (20 humans+80 synthesized+20% repetition) as ""human"" or ""synthesized"". Second Experiment: twenty PowerPoint slide sequences were created. Each slide had 4 synthesized vowels produced with the four perturbation condition. Evaluators were asked to rate the vowels from the most natural to the most artificial. Results. First Experiment: all the human vowels were classified as human; 27 out of eighty synthesized vowels were rated as human, 15 of those were produced with jitter+shimmer, 10 with jitter, 2 without perturbation and none with shimmer. Second Experiment: Vowels produced with jitter+shimmer were considered as the most natural. Vowels with shimmer and without perturbation were considered as the most artificial. Conclusions. The association of jitter and shimmer increased the degree of naturalness of synthesized vowels. Acoustic simulations performed with GP and using SMVT demonstrated a possible method to test the effect of the perturbation measurements on synthesized voices.
Palavras-chave
Synthesized voices, Acoustical measurements, Auditory, perceptual evaluation, Naturalness perception, Vocal tract model
Referências
- Amorim P, 2015, ADV VISUAL COMPUTING, P14
- Behlau M., 2001, VOICE BOOK SPECIALIS, P1
- Brockmann M, 2008, J SPEECH LANG HEAR R, V51, P1152, DOI 10.1044/1092-4388(2008/06-0208)
- Brockmann M, 2011, J VOICE, V25, P44, DOI 10.1016/j.jvoice.2009.07.002
- Dang JW, 1997, J ACOUST SOC AM, V101, P456, DOI 10.1121/1.417990
- Englert M, 2015, J VOICE, V31
- Fraj S, 2012, J ACOUST SOC AM, V132, P2603, DOI 10.1121/1.4751536
- Fujita S, 2005, ACOUST SCI TECH, V26
- Gerratt BR, 2001, J ACOUST SOC AM, V110, P2560, DOI 10.1121/1.1409969
- Gonçalves Maria Inês Rebelo, 2009, Braz. j. otorhinolaryngol., V75, P680, DOI 10.1590/S1808-86942009000500012
- HILLENBRAND J, 1988, J ACOUST SOC AM, V83, P2361, DOI 10.1121/1.396367
- Honda K, 2008, J ACOUST SOC AM, V123, P3731, DOI 10.1121/1.2935225
- KERSTA LG, 1960, J ACOUST SOC AM, V32, P1502, DOI 10.1121/1.1935196
- Kisenwether JS, 2015, J VOICE, V29, P548, DOI 10.1016/j.jvoice.2014.11.006
- Kreiman J, 2005, J ACOUST SOC AM, V117, P2201, DOI 10.1121/1.1858351
- Kreiman J, 2015, J ACOUST SOC AM, V138, P1, DOI 10.1121/1.4922174
- Lopes Leonardo Wanderley, 2014, CoDAS, V26, P382, DOI 10.1590/2317-1782/20142013033
- Mattioli F, 2015, J VOICE, V29, P455, DOI 10.1016/j.jvoice.2014.09.027
- Montagnoli AN., 2015, VOICE ANAL PROGRAM 1
- Murphy PJ, 2000, J ACOUST SOC AM, V107, P978, DOI 10.1121/1.428272
- Nusbaum H. C., 1995, International Journal of Speech Technology, V1, P7, DOI 10.1007/BF02277176
- Oppenheim A. V., 2010, DISCRETE TIME SIGNAL
- Petrovic-Lazic M, 2015, J VOICE, V29, P241, DOI 10.1016/j.jvoice.2014.07.009
- ROSENBERG AE, 1971, J ACOUST SOC AM, V49, P583, DOI 10.1121/1.1912389
- ROZSYPAL AJ, 1975, J ACOUST SOC AM, V58, pS23, DOI 10.1121/1.2002025
- Rozsypal AJ, 1979, J PHON, P343
- Smruti S, 2015, P 3 INT C FRONT INT, V2, P367
- Sofranko JL, 2014, J VOICE, V28, P24, DOI 10.1016/j.jvoice.2013.06.001
- Sorensen MK, 2015, J VOICE, V30
- Titze IR, 2000, PRINCIPLES VOICE PRO, P313
- Topaloglu I, 2014, OTOLARYNG HEAD NECK, V151, P1003, DOI 10.1177/0194599814554763
- Yiu EML, 2002, J ACOUST SOC AM, V112, P1091, DOI 10.1121/1.1500753
- Zhang Y, 2005, J VOICE, V19, P519, DOI 10.1016/j.jvoice.2004.11.005
- Ziwei Yu, 2014, J Voice, V28, P770, DOI 10.1016/j.jvoice.2014.03.014