Perturbation Measurements on the Degree of Naturalness of Synthesized Vowels

Scopus citations
3
Publication type
article
Publication date
2017
Publisher
MOSBY-ELSEVIER
Authors
MONTAGNOLI, Arlindo
SILVA, Jorge Vicente Lopes da
BEHLAU, Mara
Citation
JOURNAL OF VOICE, v.31, n.3, article ID 389.e1, 8p, 2017
Abstract
Objective. To determine the impact of jitter and shimmer on the perceived degree of naturalness of synthesized vowels produced by acoustic simulation with glottal pulses (GP) and a solid model of the vocal tract (SMVT).
Study Design. Prospective study.
Methods. Synthesized vowels were produced in three steps: (1) eighty GP were developed (20 with jitter, 20 with shimmer, 20 with jitter+shimmer, and 20 without perturbation); (2) an SMVT was produced with rapid prototyping technology from magnetic resonance imaging (MRI) of a woman during phonation of the vowel /ɛ/; (3) acoustic simulations were performed to obtain eighty synthesized /ɛ/ vowels. Two experiments were performed. First Experiment: three judges rated 120 vowels (20 human + 80 synthesized + 20% repetition) as "human" or "synthesized". Second Experiment: twenty PowerPoint slide sequences were created. Each slide had four synthesized vowels produced with the four perturbation conditions. Evaluators were asked to rank the vowels from the most natural to the most artificial.
Results. First Experiment: all the human vowels were classified as human; 27 of the eighty synthesized vowels were rated as human, of which 15 were produced with jitter+shimmer, 10 with jitter, 2 without perturbation, and none with shimmer. Second Experiment: vowels produced with jitter+shimmer were considered the most natural. Vowels with shimmer and without perturbation were considered the most artificial.
Conclusions. The combination of jitter and shimmer increased the degree of naturalness of synthesized vowels. Acoustic simulations performed with GP and an SMVT proved to be a possible method for testing the effect of perturbation measurements on synthesized voices.
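The four perturbation conditions named in the Methods can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not state the pulse model, the fundamental frequency, or the perturbation magnitudes, so the raised-cosine pulse, the 200 Hz base frequency, the 1% jitter and 5% shimmer ranges, the 0.6 open fraction, and every function name below are assumptions made only for illustration. The sketch builds per-cycle periods and amplitudes for each condition and measures them with the common "local" definition (mean absolute difference between consecutive cycles divided by the mean).

```python
import math
import random


def perturbed_sequence(n, base, rel_perturbation, rng):
    """Return n values around `base`, each scaled by a random factor
    drawn uniformly from [1 - rel_perturbation, 1 + rel_perturbation]."""
    return [base * (1.0 + rng.uniform(-rel_perturbation, rel_perturbation))
            for _ in range(n)]


def local_perturbation(values):
    """Mean absolute difference between consecutive values, divided by the mean.
    On cycle periods this is local jitter; on peak amplitudes, local shimmer."""
    diffs = [abs(a - b) for a, b in zip(values[1:], values[:-1])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))


def make_condition(name, n_cycles=50, f0=200.0, jitter=0.01, shimmer=0.05, seed=0):
    """Per-cycle periods (s) and amplitudes for one of the four conditions:
    'none', 'jitter', 'shimmer', 'jitter+shimmer'. All defaults are assumed."""
    rng = random.Random(seed)
    use_jitter = name in ("jitter", "jitter+shimmer")
    use_shimmer = name in ("shimmer", "jitter+shimmer")
    periods = perturbed_sequence(n_cycles, 1.0 / f0, jitter if use_jitter else 0.0, rng)
    amplitudes = perturbed_sequence(n_cycles, 1.0, shimmer if use_shimmer else 0.0, rng)
    return periods, amplitudes


def pulse_train(periods, amplitudes, sample_rate=16000):
    """Concatenate one raised-cosine pulse per cycle.
    This is a simple stand-in for a glottal pulse model, not the one used in the study."""
    samples = []
    for period, amp in zip(periods, amplitudes):
        n = max(2, round(period * sample_rate))
        open_len = max(1, int(0.6 * n))  # assumed open fraction of the cycle
        for i in range(n):
            if i < open_len:
                samples.append(amp * 0.5 * (1.0 - math.cos(2.0 * math.pi * i / open_len)))
            else:
                samples.append(0.0)  # closed phase
    return samples


if __name__ == "__main__":
    for name in ("none", "jitter", "shimmer", "jitter+shimmer"):
        periods, amplitudes = make_condition(name)
        print(name,
              "jitter=%.4f" % local_perturbation(periods),
              "shimmer=%.4f" % local_perturbation(amplitudes),
              "samples=%d" % len(pulse_train(periods, amplitudes)))
```

In the study the pulses were then passed through the physical vocal tract model by acoustic simulation. The sketch stops at the source signal and does not attempt to model that stage.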
Keywords
Synthesized voices, Acoustical measurements, Auditory-perceptual evaluation, Naturalness perception, Vocal tract model