{"id":546369,"date":"2017-06-15T00:00:38","date_gmt":"2017-06-15T07:00:38","guid":{"rendered":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/?post_type=msr-research-item&#038;p=546369"},"modified":"2018-10-29T15:44:02","modified_gmt":"2018-10-29T22:44:02","slug":"towards-real-time-two-dimensional-wave-propagation-for-articulatory-speech-synthesis","status":"publish","type":"msr-research-item","link":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/publication\/towards-real-time-two-dimensional-wave-propagation-for-articulatory-speech-synthesis\/","title":{"rendered":"Towards real-time two-dimensional wave propagation for articulatory speech synthesis"},"content":{"rendered":"<p>The precise simulation of voice production is a challenging task, often characterized by a tradeoff between quality and speed. The usage of 3D acoustic models of realistic vocal tracts produces extremely precise results, at the cost of running simulations that may take several minutes to synthesize a few milliseconds of\u00a0audio. In contrast, 1D articulatory vocal synthesizers rely on highly simplified acoustic and anatomical models\u00a0to achieve real-time performances, but can only partially match the spectra of realistic vocal tracts. In\u00a0this work, we present a novel articulatory vocal synthesizer, based on a fast 2D propagation model running on a graphics card (GPU). The system can run in real-time under specific conditions and, differently from 1D synthesizers, allows for simulating airflow propagation through asymmetric and curved geometries. This paper covers details on the GPU implementation of the different components of the system, including the 2D Finite-Difference Time-Domain wave solver and the excitation mechanism. A preliminary evaluation is presented,<br \/>\nusing area functions to simulate static vowels. Three different resolutions are tested, combined with<br \/>\ntwo alternative ways of discretizing the 2D geometries. The computed formants are overall characterized by small positional errors while computational times are comparable with those from 1D systems.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The precise simulation of voice production is a challenging task, often characterized by a tradeoff between quality and speed. The usage of 3D acoustic models of realistic vocal tracts produces extremely precise results, at the cost of running simulations that may take several minutes to synthesize a few milliseconds of\u00a0audio. In contrast, 1D articulatory vocal [&hellip;]<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":null,"msr_publishername":"Acoustical Society of America","msr_publisher_other":"","msr_booktitle":"","msr_chapter":"","msr_edition":"","msr_editors":"","msr_how_published":"","msr_isbn":"","msr_issue":"","msr_journal":"","msr_number":"","msr_organization":"","msr_pages_string":"","msr_page_range_start":"","msr_page_range_end":"","msr_series":"","msr_volume":"","msr_copyright":"","msr_conference_name":"171st Meeting of the Acoustical Society of America","msr_doi":"","msr_arxiv_id":"","msr_s2_paper_id":"","msr_mag_id":"","msr_pubmed_id":"","msr_other_authors":"","msr_other_contributors":"","msr_speaker":"","msr_award":"","msr_affiliation":"","msr_institution":"","msr_host":"","msr_version":"","msr_duration":"","msr_original_fields_of_study":"","msr_release_tracker_id":"","msr_s2_match_type":"","msr_citation_count_updated":"","msr_published_date":"2017-6-15","msr_highlight_text":"","msr_notes":"","msr_longbiography":"","msr_publicationurl":"","msr_external_url":"","msr_secondary_video_url":"","msr_conference_url":"","msr_journal_url":"","msr_s2_pdf_url":"","msr_year":0,"msr_citation_count":0,"msr_influential_citations":0,"msr_reference_count":0,"msr_s2_match_confidence":0,"msr_microsoftintellectualproperty":true,"msr_s2_open_access":false,"msr_s2_author_ids":[],"msr_pub_ids":[],"msr_hide_image_in_river":0,"footnotes":""},"msr-research-highlight":[],"research-area":[13551,13545],"msr-publication-type":[193716],"msr-publisher":[],"msr-focus-area":[],"msr-locale":[268875],"msr-post-option":[],"msr-field-of-study":[],"msr-conference":[],"msr-journal":[],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-546369","msr-research-item","type-msr-research-item","status-publish","hentry","msr-research-area-graphics-and-multimedia","msr-research-area-human-language-technologies","msr-locale-en_us"],"msr_publishername":"Acoustical Society of America","msr_edition":"","msr_affiliation":"","msr_published_date":"2017-6-15","msr_host":"","msr_duration":"","msr_version":"","msr_speaker":"","msr_other_contributors":"","msr_booktitle":"","msr_pages_string":"","msr_chapter":"","msr_isbn":"","msr_journal":"","msr_volume":"","msr_number":"","msr_editors":"","msr_series":"","msr_issue":"","msr_organization":"","msr_how_published":"","msr_notes":"","msr_highlight_text":"","msr_release_tracker_id":"","msr_original_fields_of_study":"","msr_download_urls":"","msr_external_url":"","msr_secondary_video_url":"","msr_longbiography":"","msr_microsoftintellectualproperty":1,"msr_main_download":"","msr_publicationurl":"","msr_doi":"","msr_publication_uploader":[{"type":"doi","viewUrl":"false","id":"false","title":"10.1121\/2.0000395","label_id":"243106","label":0},{"type":"url","viewUrl":"false","id":"false","title":"https:\/\/asa.scitation.org\/doi\/abs\/10.1121\/2.0000395","label_id":"243109","label":0}],"msr_related_uploader":[{"type":"file","viewUrl":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/10\/ArticulatorySpeechSynth2D.pdf","id":"546390","title":"articulatoryspeechsynth2d","label_id":"243112","label":0}],"msr_citation_count":0,"msr_citation_count_updated":"","msr_s2_paper_id":"","msr_influential_citations":0,"msr_reference_count":0,"msr_arxiv_id":"","msr_s2_author_ids":[],"msr_s2_open_access":false,"msr_s2_pdf_url":null,"msr_attachments":[{"id":546390,"url":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-content\/uploads\/2018\/10\/ArticulatorySpeechSynth2D.pdf"}],"msr-author-ordering":[{"type":"text","value":"Victor Zappi","user_id":0,"rest_url":false},{"type":"text","value":"Arvind Vasuvedan","user_id":0,"rest_url":false},{"type":"text","value":"Andrew Allen","user_id":0,"rest_url":false},{"type":"user_nicename","value":"Nikunj Raghuvanshi","user_id":33106,"rest_url":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/microsoft-research\/v1\/researchers?person=Nikunj Raghuvanshi"},{"type":"text","value":"Sidney Fels","user_id":0,"rest_url":false}],"msr_impact_theme":[],"msr_research_lab":[],"msr_event":[],"msr_group":[],"msr_project":[546345],"publication":[],"video":[],"msr-tool":[],"msr_publication_type":"inproceedings","related_content":{"projects":[{"ID":546345,"post_title":"Project Triton","post_name":"project-triton","post_type":"msr-project","post_date":"2018-12-03 12:03:07","post_modified":"2024-04-03 12:34:47","post_status":"publish","permalink":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/project\/project-triton\/","post_excerpt":"Project Triton performs physical simulation to provide sound propagation for games and mixed reality. It creates an accurate acoustic rendering at practical CPU cost and provides designer control.","_links":{"self":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/546345"}]}}]},"_links":{"self":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/546369","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item"}],"about":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-research-item"}],"version-history":[{"count":1,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/546369\/revisions"}],"predecessor-version":[{"id":546399,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/546369\/revisions\/546399"}],"wp:attachment":[{"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/media?parent=546369"}],"wp:term":[{"taxonomy":"msr-research-highlight","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-research-highlight?post=546369"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=546369"},{"taxonomy":"msr-publication-type","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-publication-type?post=546369"},{"taxonomy":"msr-publisher","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-publisher?post=546369"},{"taxonomy":"msr-focus-area","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-focus-area?post=546369"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=546369"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=546369"},{"taxonomy":"msr-field-of-study","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-field-of-study?post=546369"},{"taxonomy":"msr-conference","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-conference?post=546369"},{"taxonomy":"msr-journal","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-journal?post=546369"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=546369"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/cm-edgetun.pages.dev\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=546369"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}