John E. Volkmann's scientific contributions


Publications (2)


The relation of pitch to frequency: A revised scale
  • Article

January 1940 · 14 Reads · 58 Citations

The American Journal of Psychology

Stanley Smith Stevens · John E. Volkmann

Citations (2)


... Given a waveform x(t) ∈ ℝ^T at 16 kHz, we compute its short-time Fourier transform (STFT) [40] with a Hann window of size 2048, and a hop size δ = 384 (i.e., a time resolution of Δt = 24 ms based on [34]). We then map it to 229 mel-frequency bins [41] in the 50 Hz-8000 Hz range and take the logarithm, keeping an input representation as a log-mel spectrogram X(f, t′) ∈ ℝ^(229×T′), where T′ = T/δ is the resulting "compact" time domain. ...

Reference:

Machine Learning Techniques in Automatic Music Transcription: A Systematic Survey
The Relation of Pitch to Frequency; A Revised Scale
  • Citing Article
  • January 1940

The American Journal of Psychology

... Given a waveform x(t) ∈ ℝ^T at 16 kHz, we compute its STFT [14] with a Hann window of size 2048, and a hop size δ = 384 (i.e. a time resolution of Δt = 24 ms). We then map it to 229 mel-frequency bins [30] in the 50 Hz-8000 Hz range, and take the logarithm, yielding our input representation: a log-mel spectrogram X(f, t′) ∈ ℝ^(229×T′), where T′ = T/δ is the resulting "compact" time domain (see Figure 1). We also compute the first time-derivative Ẋ(f, t′) := X(f, t′) − X(f, t′−1) and concatenate it to X, forming the CNN input. ...

The relation of pitch to frequency: A revised scale
  • Citing Article
  • January 1940

The American Journal of Psychology
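
The preprocessing described in the citing excerpts above can be sketched in a few lines of NumPy. This is a minimal illustration, not the citing authors' code. The sample rate, window size, hop size, bin count and frequency range come from the excerpts. Everything else is an assumption made for this sketch: the HTK-style mel formula (the excerpts do not say which mel variant is used), the use of a power rather than magnitude spectrum, the lack of padding, the small `eps` inside the logarithm, the zero-filled first frame of the time-derivative, and stacking X and its derivative along a new leading axis. All function names are hypothetical.

```python
import numpy as np

SR = 16_000               # sample rate in Hz (from the excerpts)
N_FFT = 2048              # Hann window size (from the excerpts)
HOP = 384                 # hop size delta; 384 / 16000 s = 24 ms
N_MELS = 229              # number of mel-frequency bins (from the excerpts)
F_MIN, F_MAX = 50.0, 8000.0


def hz_to_mel(f):
    # Assumed HTK-style formula; the excerpts do not specify the mel variant.
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)


def mel_filterbank():
    """Triangular filters whose centres are equally spaced on the mel axis."""
    fft_freqs = np.linspace(0.0, SR / 2.0, N_FFT // 2 + 1)
    edges = mel_to_hz(np.linspace(hz_to_mel(F_MIN), hz_to_mel(F_MAX), N_MELS + 2))
    lower, centre, upper = edges[:-2, None], edges[1:-1, None], edges[2:, None]
    rising = (fft_freqs[None, :] - lower) / (centre - lower)
    falling = (upper - fft_freqs[None, :]) / (upper - centre)
    return np.maximum(0.0, np.minimum(rising, falling))  # (N_MELS, N_FFT // 2 + 1)


def log_mel_with_delta(x, eps=1e-10):
    """Return X and its first time-derivative, stacked as (2, N_MELS, T')."""
    n_frames = 1 + (len(x) - N_FFT) // HOP  # frames that fit without padding
    idx = np.arange(N_FFT)[None, :] + HOP * np.arange(n_frames)[:, None]
    frames = x[idx] * np.hanning(N_FFT)[None, :]
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2      # (T', N_FFT // 2 + 1)
    log_mel = np.log(mel_filterbank() @ power.T + eps)    # (N_MELS, T')
    delta = np.zeros_like(log_mel)                        # first frame left at 0
    delta[:, 1:] = log_mel[:, 1:] - log_mel[:, :-1]       # X(f, t') - X(f, t'-1)
    return np.stack([log_mel, delta])


if __name__ == "__main__":
    one_second = np.sin(2 * np.pi * 440.0 * np.arange(SR) / SR)
    print(log_mel_with_delta(one_second).shape)
```

Without padding, the number of frames is 1 + (T − 2048) // 384, which approaches the excerpts' T′ = T/δ only for long signals; a padded or centred STFT would give a slightly different count.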