Fig 4 - available via license: CC BY
Content may be subject to copyright.
(a) Process of a MIDI-to-Singing system. (b) Score editor of MIDI-to-Singing.

(a) Process of a MIDI-to-Singing system. (b) Score editor of MIDI-to-Singing.

Source publication
Article
Full-text available
This work reports development of a MIDI-to-Singing song synthesis that will produce audio files from MIDI data and arbitrary Romaji lyrics in Japanese. The MIDI-to-Singing system relies on the Flinger (Festival singer) for singing voice synthesis. Originally, this MIDI-to-Singing system was developed by English. Based on some Japanese pronunciation...

Context in source publication

Context 1
... and the Editor automatically converts the lyrics into phonetic symbols by looking into a built-in pronunciation dictionary. If the word consists of two or more syllables, the Editor automatically decomposes it into syllables. The user can easily add vibrato in the Editor. A screenshot of the MIDI-to-Singing process and score editor is shown in Fig. 4(a) and 4(b). https://doi.org/10.1051/matecconf/201820102006 ICI 2017 ...

Similar publications

Article
Full-text available
This study examines how prosodic features evoke the spacial aspects of interactional meanings of well-known social types in Mainland China. Prosodic features (duration, pitch, voice quality) of the scripted performances of 18 prominent social types in China were measured acoustically and grouped by cluster analysis. Commonalities among types within...
Conference Paper
Full-text available
Artificial bandwidth extension (BWE) is still an important topic, especially in the automotive domain where consumers experience a dramatic degradation in voice quality when a wideband call suddenly falls back to 8-kHz GSM. This happens e.g. due to poor network coverage in the countryside. The aim of BWE is to bridge the perceived voice quality gap...
Article
Full-text available
Purpose: This study aimed to verify the best speech material for the AVQI for Brazilian Portuguese language and identify the best validity results between the auditory perceptual judgment (APJ) and the AVQI score on different speech materials. Methods: We recorded voice samples of 50 individuals (dysphonic and vocally healthy) of several continu...
Preprint
Full-text available
Every speech signal carries implicit information about the emotions, which can be extracted by speech processing methods. In this paper, we propose an algorithm for extracting features that are independent from the spoken language and the classification method to have comparatively good recognition performance on different languages independent fro...
Article
Full-text available
This paper reports on the prosody of rhetorical questions (RQs) and information-seeking questions (ISQs) in German for two question types, polar questions and constituent questions (henceforth wh-questions). The results are as follows: Phonologically, polar RQs were mainly realized with H-% (high plateau), while polar ISQs mostly ended in H-^H% (hi...