Fig 4 - available via license: CC BY
Content may be subject to copyright.
Source publication
This work reports development of a MIDI-to-Singing song synthesis that will produce audio files from MIDI data and arbitrary Romaji lyrics in Japanese. The MIDI-to-Singing system relies on the Flinger (Festival singer) for singing voice synthesis. Originally, this MIDI-to-Singing system was developed by English. Based on some Japanese pronunciation...
Context in source publication
Context 1
... and the Editor automatically converts the lyrics into phonetic symbols by looking into a built-in pronunciation dictionary. If the word consists of two or more syllables, the Editor automatically decomposes it into syllables. The user can easily add vibrato in the Editor. A screenshot of the MIDI-to-Singing process and score editor is shown in Fig. 4(a) and 4(b). https://doi.org/10.1051/matecconf/201820102006 ICI 2017 ...
Similar publications
This study examines how prosodic features evoke the spacial aspects of interactional meanings of well-known social types in Mainland China. Prosodic features (duration, pitch, voice quality) of the scripted performances of 18 prominent social types in China were measured acoustically and grouped by cluster analysis. Commonalities among types within...
Artificial bandwidth extension (BWE) is still an important topic, especially in the automotive domain where consumers experience a dramatic degradation in voice quality when a wideband call suddenly falls back to 8-kHz GSM. This happens e.g. due to poor network coverage in the countryside. The aim of BWE is to bridge the perceived voice quality gap...
Purpose:
This study aimed to verify the best speech material for the AVQI for Brazilian Portuguese language and identify the best validity results between the auditory perceptual judgment (APJ) and the AVQI score on different speech materials.
Methods:
We recorded voice samples of 50 individuals (dysphonic and vocally healthy) of several continu...
Every speech signal carries implicit information about the emotions, which can be extracted by speech processing methods. In this paper, we propose an algorithm for extracting features that are independent from the spoken language and the classification method to have comparatively good recognition performance on different languages independent fro...
This paper reports on the prosody of rhetorical questions (RQs) and information-seeking questions (ISQs) in German for two question types, polar questions and constituent questions (henceforth wh-questions). The results are as follows: Phonologically, polar RQs were mainly realized with H-% (high plateau), while polar ISQs mostly ended in H-^H% (hi...