Deep Machine Learning-based Analysis for Intelligent Phonetic Language Recognition

Yumei Liu; Qiang Luo

doi:10.12694/scpe.v25i3.2710

PDF

Published: Apr 12, 2024

DOI: https://doi.org/10.12694/scpe.v25i3.2710

Keywords:

prosody management, machine learning, speech analysis, lexical focus

Yumei Liu

Chongqing City Vocational College, Chongqing, China

Qiang Luo

Chongqing Creation Vocational College, Chongqing, China

Abstract

Modern speech generating systems can produce results that are almost as visually realistic as actual sounds. They still require further production management. This research presents a paradigm for managing prosodic output using explicit, unambiguous, and understandable parameters. We utilize this strategy to emphasize key words and provide a variety of architectural possibilities based on a richness of labelled resources. In an objective voice, we compare the options for producing data with or without labels. We assess them using listening tests that demonstrate our ability to retain the same level of naturalness while effectively attaining regulated concentration over a specific area.

Issue

Vol. 25 No. 3 (2024)

Section

Special Issue - Deep Learning-Based Advanced Research Trends in Scalable Computing

Article Sidebar

Main Article Content

Abstract

Article Details