(a) Process of a MIDI-to-Singing system. (b) Score editor of MIDI-to-Singing.

Source publication

Fig. 1. Japanese MIDI-to-Singing Demo Websites:...

Fig. 4. (a) Process of a MIDI-to-Singing system. (b) Score editor of...

Building a Japanese MIDI-to-Singing song synthesis using an English male voice

Article

Full-text available

Jan 2018

Hung-Che Shen

This work reports development of a MIDI-to-Singing song synthesis that will produce audio files from MIDI data and arbitrary Romaji lyrics in Japanese. The MIDI-to-Singing system relies on the Flinger (Festival singer) for singing voice synthesis. Originally, this MIDI-to-Singing system was developed by English. Based on some Japanese pronunciation...

Context 1

... and the Editor automatically converts the lyrics into phonetic symbols by looking into a built-in pronunciation dictionary. If the word consists of two or more syllables, the Editor automatically decomposes it into syllables. The user can easily add vibrato in the Editor. A screenshot of the MIDI-to-Singing process and score editor is shown in Fig. 4(a) and 4(b). https://doi.org/10.1051/matecconf/201820102006 ICI 2017 ...

View in full-text

Figure 1. Spectrogram and pitch contour for laoganbu "Retired Cadre"

Placing social types through prosodic variation: An investigation of spatial meanings in Mainland China

Article

Full-text available

Mar 2019

Robert Xu

This study examines how prosodic features evoke the spacial aspects of interactional meanings of well-known social types in Mainland China. Prosodic features (duration, pitch, voice quality) of the scripted performances of 18 prominent social types in China were measured acoustically and grouped by cluster analysis. Commonalities among types within...

Feature Selection for DNN-based Bandwidth Extension

Conference Paper

Full-text available

Mar 2018

Artificial bandwidth extension (BWE) is still an important topic, especially in the automotive domain where consumers experience a dramatic degradation in voice quality when a wideband call suddenly falls back to 8-kHz GSM. This happens e.g. due to poor network coverage in the countryside. The aim of BWE is to bridge the perceived voice quality gap...

Acoustic Voice Quality Index - AVQI para o português brasileiro: análise de diferentes materiais de fala

Article

Full-text available

Feb 2019

Purpose: This study aimed to verify the best speech material for the AVQI for Brazilian Portuguese language and identify the best validity results between the auditory perceptual judgment (APJ) and the AVQI score on different speech materials. Methods: We recorded voice samples of 50 individuals (dysphonic and vocally healthy) of several continu...

A Study of Language and Classifier-independent Feature Analysis for Vocal Emotion Recognition

Preprint

Full-text available

Nov 2018

Every speech signal carries implicit information about the emotions, which can be extracted by speech processing methods. In this paper, we propose an algorithm for extracting features that are independent from the spoken language and the classification method to have comparatively good recognition performance on different languages independent fro...

The Prosody of Rhetorical and Information-Seeking Questions in German

Article

Full-text available

Oct 2019

This paper reports on the prosody of rhetorical questions (RQs) and information-seeking questions (ISQs) in German for two question types, polar questions and constituent questions (henceforth wh-questions). The results are as follows: Phonologically, polar RQs were mainly realized with H-% (high plateau), while polar ISQs mostly ended in H-^H% (hi...

(a) Process of a MIDI-to-Singing system. (b) Score editor of MIDI-to-Singing.

Context in source publication

Similar publications