Tomoko Matsui

Tomoko Matsui
The Institute of Statistical Mathematics

Doctor of Engineering

About

160
Publications
19,786
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,224
Citations
Additional affiliations
November 1998 - December 2002
Advanced Telecommunications Research Institute
Position
  • Senior Researcher
April 1988 - December 2002
NTT
Position
  • Researcher

Publications

Publications (160)
Article
Full-text available
Accurate temperature forecasting is critical for various sectors, yet traditional methods struggle with complex atmospheric dynamics. Deep neural networks (DNNs), especially transformer-based DNNs, offer potential advantages, but face challenges with domain adaptation across different geographical regions. We evaluated the effectiveness of DNN-base...
Article
Full-text available
We devised a data-driven framework for uncovering hidden control strategies used by an evolutionary system described by an evolutionary probability distribution. This innovative framework enables deciphering of the concealed mechanisms that contribute to the progression or mitigation of such situations as the spread of COVID-19. Novel algorithms ar...
Article
Full-text available
Introduction The worldwide COVID-19 pandemic, which began in December 2019 and has lasted for almost 3 years now, has undergone many changes and has changed public perceptions and attitudes. Various systems for predicting the progression of the pandemic have been developed to help assess the risk of COVID-19 spreading. In a case study in Japan, we...
Article
Full-text available
Addressing biases in observed data is a major challenge in statistical and machine learning applications. This challenge also exists in recommendation systems, and various methods based on causal inference are being investigated. We investigate a collaborative filtering technique that robustly predicts ratings from biased observation. Utilizing the...
Article
Full-text available
The COVID-19 pandemic, which began in December 2019, progressed in a complicated manner and thus caused problems worldwide. Seeking clues to the reasons for the complicated progression is necessary but challenging in the fight against the pandemic. We sought clues by investigating the relationship between reactions on social media and the COVID-19...
Article
Full-text available
A practical algorithm has been developed for closeness analysis of sequential data that combines closeness testing with algorithms based on the Markov chain tester. It was applied to reported sequential data for COVID-19 to analyze the evolution of COVID-19 during a certain time period (week, month, etc.).
Article
Full-text available
In the era of open data, Poisson and other count regression models are increasingly important. Still, conventional Poisson regression has remaining issues in terms of identifiability and computational efficiency. Especially, due to an identification problem, Poisson regression can be unstable for small samples with many zeros. Provided this, we dev...
Article
Full-text available
Hybrid recommendation, which is based on collaborative filtering and supplemented with auxiliary content information, is being actively researched due to its ability to overcome the cold-start problem. Many proposed hybrid methods make recommendations using Gaussian distribution-based collaborative filtering even though they handle variables that t...
Article
The classical DICE model is a widely accepted integrated assessment model for the joint modeling of economic and climate systems, where all model state variables evolve over time deterministically. We reformulate and solve the DICE model as an optimal control dynamic programming problem with six state variables (related to the carbon concentration,...
Preprint
Full-text available
The classical DICE model is a widely accepted integrated assessment model for the joint modeling of economic and climate systems, where all model state variables evolve over time deterministically. We reformulate and solve the DICE model as an optimal control dynamic programming problem with six state variables (related to the carbon concentration,...
Article
This paper consider the penalized least squares estimators with convex penalties or regularization norms. We provide sparsity oracle inequalities for the prediction error for a general convex penalty and for the particular cases of Lasso and Group Lasso estimators in a regression setting. The main contribution is that our oracle inequalities are es...
Preprint
The COVID-19 pandemic, which began in December 2019, progressed in a complicated manner and thus caused problems worldwide. Seeking clues to the reasons for the complicated progression is necessary but challenging in the fight against the pandemic. We sought clues by investigating the relationship between reactions on social media and the COVID-19...
Article
Full-text available
A class of models for non-Gaussian spatial random fields is explored for spatial field reconstruction in environmental and sensor network monitoring. The family of models explored utilises a class of transformation functions known as Tukey g-and-h transformations to create a family of warped spatial Gaussian process models which can support various...
Article
Full-text available
Statistical analysis of speech is an emerging area of machine learning. In this paper, we tackle the biometric challenge of Automatic Speaker Verification (ASV) of differentiating between samples generated by two distinct populations of utterances, those of an authentic human voice and those generated by a synthetic one. Solving such an issue throu...
Preprint
A class of models for non-Gaussian spatial random fields is explored for spatial field reconstruction in environmental and sensor network monitoring. The family of models explored utilises a class of transformation functions known as the Tukey g-and-h transformations to create a family of warped spatial Gaussian process models which can support var...
Preprint
Full-text available
A practical algorithm has been developed for closeness analysis of sequential data that combines closeness testing with algorithms based on the Markov chain tester. It was applied to reported sequential data for COVID-19 to analyze the evolution of COVID-19 during a certain time period (week, month, etc.).
Article
Full-text available
As with the advancement of geographical information systems, non-Gaussian spatial data sets are getting larger and more diverse. This study develops a general framework for fast and flexible non-Gaussian regression, especially for spatial/spatiotemporal modeling. The developed model, termed the compositionally-warped additive mixed model (CAMM), co...
Preprint
Full-text available
In the era of open data, Poisson and other count regression models are increasingly important. Provided this, we develop a closed-form inference for an over-dispersed Poisson regression, especially for (over-dispersed) Bayesian Poisson wherein the exact inference is unobtainable. The approach is derived via mode-based log-Gaussian approximation. Un...
Preprint
Full-text available
As with the advancement of geographical information systems, non-Gaussian spatial data is getting larger and more diverse. Considering this background, this study develops a general framework for fast and flexible non-Gaussian regression, especially for spatial/spatiotemporal modeling. The developed model, termed the compositionally-warped additive...
Article
Extreme weather events can arrive unannounced and cause immense harm for communities. Especially in cities where many people live in close proximity, events like flash flooding, windstorms or even heat waves can cause property damage, overworking of the emergency infrastructure and death. Unfortunately, because climate change continues to alter wea...
Article
Full-text available
Real-time heatwave risk management with fine-grained spatial resolution is important for analysis of urban heat island (UHI) effects and local heatwaves. This study analyzed the spatio-temporal behavior of ground temperatures and developed methods for modeling them. The developed models consider two higher-order stochastic spatial properties (skewn...
Preprint
Full-text available
The statistical quantification of temperature processes for the analysis of urban heat island (UHI) effects and local heat-waves is an increasingly important application domain in smart city dynamic modelling. This leads to the increased importance of real-time heatwave risk management on a fine-grained spatial resolution. This study attempts to an...
Article
We develop extensions that introduce regression structure to the multi-factor stochastic models of commodity futures price term structure dynamics. We demonstrate the accuracy with which these models can be calibrated to oil futures data and how they improve on existing models both in model fit and in model interpretation. We found leading observab...
Article
Full-text available
This study develops an approach for optimizing the size/scale of microgrids used in electricity sharing around each residence by considering the uncertainty between the electricity supply from photovoltaics and electricity demand. Uncertainties are quantified using simulations that consider actual daily variations in supply and demand. The develope...
Article
Full-text available
The objective of this study is to map direct and indirect seasonal urban carbon emissions using spatial micro Big Data, regarding building and transportation energy-use activities in Sumida, Tokyo. Building emissions were estimated by considering the number of stories, composition of use (e.g., residence and retail), and other factors associated wi...
Conference Paper
Full-text available
It has been shown that by combining the acoustic and artic-ulatory information significant performance improvements in automatic speech recognition (ASR) task can be achieved. In practice, however, articulatory information is not available during recognition and the general approach is to estimate it from the acoustic signal. In this paper, we prop...
Article
Full-text available
Real-time urban climate monitoring provides useful information that can be utilized to help urban management personnel to monitor and adapt their precautionary measures to extreme events, including urban heatwaves. Fortunately, recently created social media platforms, such as Twitter, furnish real-time and high-resolution spatial information that m...
Article
Full-text available
Automatic emotion recognition from speech has been focused mainly on identifying categorical or static affect states, but the spectrum of human emotion is continuous and time-varying. In this paper, we present a recognition system for dynamic speech emotion based on state-space models (SSMs). The prediction of the unknown emotion trajectory in the...
Conference Paper
The development of the so-called intelligent tire has changed the role of the tire. Here we discuss a real-time road condition classification system that employs monitoring tire acceleration. Because the tire acceleration is non-stationary and is warped non-linearly in the time domain, we applied the time alignment algorithm to it similarly to spee...
Conference Paper
Automatic emotion recognition from speech has been focused mainly on identifying categorical or static affect states, but the spectrum of human emotion is continuous and time-varying. In this paper, we present a recognition system for dynamic speech emotion based on state-space models (SSMs). The prediction of the unknown emotion trajectory in the...
Article
Real-time urban climate monitoring provides useful information that can be utilized to help monitor and adapt to extreme events, including urban heatwaves. Typical approaches to the monitoring of climate data include weather station monitoring and remote sensing. However, climate monitoring stations are very often distributed spatially in a sparse...
Article
Full-text available
We develop new algorithms for spatial field reconstruction, exceedance level estimation and classification in heterogeneous (mixed analog & digital sensors) Wireless Sensor Networks (WSNs). We consider spatial physical phenomena which are observed by a heterogeneous WSN, meaning that it consists partially of sparsely deployed high-quality sensors a...
Chapter
Full-text available
Gaussian Processes (GPs) are Bayesian nonparametric models that are becoming more and more popular for their superior capabilities to capture highly nonlinear data relationships in various tasks ranging from classical regression and classification to dimension reduction, novelty detection and time series analysis. Here, we introduce Gaussian proces...
Chapter
This chapter provides a tutorial overview of some modern applications of the statistical modeling that can be developed based upon spatial wireless sensor network data. We then develop a range of new results relating to two important problems that arise in spatial field reconstructions from wireless sensor networks. The first new result allows one...
Book
This book provides a modern introductory tutorial on specialized methodological and applied aspects of spatial and temporal modeling. The areas covered involve a range of topics which reflect the diversity of this domain of research across a number of quantitative disciplines. For instance, the first chapter deals with non-parametric Bayesian infer...
Book
This book provides a modern introductory tutorial on specialized theoretical aspects of spatial and temporal modeling. The areas covered involve a range of topics which reflect the diversity of this domain of research across a number of quantitative disciplines. For instance, the first chapter provides up-to-date coverage of particle association me...
Article
Copula models have started to be explored in wireless communications, however to date the properties they offer have not been proven or verified on real data experiments. In this paper we provide the first real evidence that the features they offer will provide beneficial modeling capabilities in wireless channel models, which are not just theoreti...
Conference Paper
We investigate a discrimination method for invalid and valid inputs, received by a speech-oriented guidance system operating in a real environment. Invalid inputs include background voices, which are not directly uttered to the system, and nonsense utterances. Such inputs should be rejected beforehand. We have reported methods using not only the li...
Chapter
In this work, we address the topic classification of spoken inquiries in Japanese that are received by a guidance system operating in a real environment, with a semi-supervised learning approach based on a transductive support vector machine (TSVM). Manual data labeling, which is required for supervised learning, is a costly process, and unlabeled...
Conference Paper
We develop a new framework for explicitly modelling the threshold exceedence levels of the spatial stochastic process being monitored by a sensor network. Our framework also allows incorporating additional observed features as explanatory factors for the behaviour of the spatial stochastic process, and in particular the probability of exceedence of...
Article
Full-text available
Gaussian Processes (GPs) are Bayesian nonparametric models that are becoming more and more popular for their superior capabilities to capture highly nonlinear data relationships in various tasks, such as dimensionality reduction, time series analysis, novelty detection, as well as classical regression and classification tasks. In this paper, we inv...
Article
This paper describes the temporal music emotion recogni- tion system developed at the University of Aizu for the Emo- tion in Music task of the MediaEval 2014 benchmark evalua- tion campaign. The arousal-valence trajectory prediction is cast as a time series ltering task and is modeled by a state- space models. These models include standard linear...
Article
Full-text available
Availability of large amounts of raw unlabeled data has sparked the recent surge in semi-supervised learning research. In most works, however, it is assumed that labeled and unlabeled data come from the same distribution. This restriction is removed in the self-taught learning algorithm where unlabeled data can be different, but nevertheless have s...
Conference Paper
We present a novel generative model for audio event transcription that recognizes “events” on audio signals including multiple kinds of overlapping sounds. In the proposed model, firstly, the overlapping audio events are modeled based on nonnegative matrix factorization into which Bayesian nonparametric approaches: the Markov Indian buffet process...
Conference Paper
We propose a novel application of a family of non-parametric statistical models to estimate head-related transfer functions (HRTFs) using spatial-temporal Gaussian processes (GPs). In this approach, we model the head-related impulse response (HRIR) utilizing non-parametric regression via a GP. The challenge posed by this problem involves accurate m...
Conference Paper
Full-text available
In this paper we introduce Gaussian Process (GP) models for music genre classification. Gaussian Processes are widely used for various regression and classification tasks, but there are relatively few studies where GPs are applied in the audio signal processing systems. The GP models are non-parametric discriminative classifiers similar to the well...
Article
We develop a novel algorithm to estimate a spatial-temporal transfer function of a time-domain room impulse response for reverberation in closed environments. This novel approach involves developing two non-parametric models, one for the early phase and the other for the late phase for reverberation. These models are based on a composite of two Gau...
Article
In this work, we address the topic classification of spoken inquiries in Japanese that are received by a speech-oriented guidance system operating in a real environment. The classification of spoken inquiries is often hindered by automatic speech recognition (ASR) errors, the sparseness of features and the shortness of spontaneous speech utterances...
Article
Full-text available
This paper describes a design of spoken term detection (STD) studies and their evaluating framework at the STD sub-task of the NTCIR-9 IR for Spoken Documents (SpokenDoc) task. STD is the one of information access technologies for spoken documents. The goal of the STD sub-task is to rapidly detect presence of a given query term, consisting of word...
Conference Paper
Full-text available
Availability of large amounts of raw unlabeled data has sparked the recent surge in semi-supervised learning re-search. In most works, however, it is assumed that labeled and unlabeled data come from the same distribution. This restriction is removed in the self-taught learning approach where unlabeled data can be different, but nevertheless have s...
Conference Paper
Full-text available
Availability of large amounts of raw unlabeled data has sparked the recent surge in semi-supervised learning research. In most works, however, it is assumed that labeled and unlabeled data come from the same distribution. This restriction is removed in the self-taught learning approach where unlabeled data can be different, but nevertheless have si...
Article
We have already reported a corpus similarity visualization method based on the corpus attribute using multidimensional scaling that makes it easy for users to utilize various speech corpora. In this paper, we present a revised visualization method that is based on a ring structure like a planisphere. By using only a mouse, a user can choose appropr...
Conference Paper
Our goal is to point out usability problems in web pages in order to improve the web usability. We investigate the relation between user interaction behaviors in web-viewing and evaluation results of web usability by subjects. And we extract discriminative patterns for user interaction behaviors in visited web pages with low usability by using the...
Article
Full-text available
We investigate a novel gradient-based musical feature ex-tracted using a scale-invariant feature transform. This fea-ture enables dynamic information in music data to be effec-tively captured time-independently and frequency-independently. It will be useful for various music applica-tions such as genre classification, music mood classification, and...
Article
Full-text available
Example-based question answering (QA) is an ef-fective approach for real-world spoken dialogue systems. A limitation of an example-based QA is that a system cannot appropriately respond to a user's question, if a similar question-answer pair does not exist in the question and answer database (QADB). For a robust spoken dialogue system, it is import...
Article
Stacked generalization is a method that allows combining output of multiple classifiers using a second-level classification, minimizing the generalization error of first-level classifiers and achieving greater predictive accuracy. In a previous work, we compared the performance of support vector machine (SVM) with radial basis function (RBF) kernel...
Article
The purpose of this work is to reduce the cost of the web usability evaluation by usability testing. The cost will reduce by detecting low usability web pages. We analyzed empirically to find detectable metrics from the quantitative data including eye movement. We investigate the relation between the quantitative data about the behavior of users an...
Conference Paper
Full-text available
This work addresses the classification in topics of utterances in Japanese, received by a speech-oriented guidance system operating in a real environment. For this, we compare the performance of Support Vector Machine and PrefixSpan Boosting, against a conventional Maximum Entropy classification method. We are interested in evaluating their strengt...
Article
Hidden Markov models (HMMs) are powerful generative models for sequential data that have been used in automatic speech recognition for more than two decades. Despite their popularity, HMMs make inaccurate assumptions about speech signals, thereby limiting the achievable performance of the conventional speech recognizer. Penalized logistic regressio...
Article
In this paper, we propose a novel semi-supervised speaker identification method that can alleviate the influence of non-stationarity such as session dependent variation, the recording environment change, and physical conditions/emotions. We assume that the voice quality variants follow the covariate shift model, where only the voice feature distrib...
Article
Penalized logistic regression (PLR) is a well-founded discriminative classifier with long roots in the history of statistics. Speech classification with PLR is possible with an appropriate choice of map from the space of feature vector sequences into the Euclidean space. In this talk, one such map is presented, namely, the one that maps into vector...
Conference Paper
Full-text available
The sequence kernel has been shown to be a promising kernel function for learning from sequential data such as speech and DNA. However, it is not scalable to massive datasets due to its high computational cost. In this paper, we propose a method of approximating the sequence kernel that is shown to be computationally very efficient. More specifical...
Article
Full-text available
Speech corpora are indispensable to speech research. There are several data centers in the world that serve as repositories for various speech corpora. However, they use different specification items for their corpora, and so it is difficult to compare their corpora. It would be more convenient for corpus users if the data centers were to use a com...
Article
In this work, we address the classification in topics of utterances in Japanese received by a speech-oriented guidance system operating in a real environment. The implementation of this kind of systems requires the collection and manual labeling of actual user's utterances, which is a costly process. Because of this, we are interested in evaluating...
Article
The purpose of this study is to visualize the similarities among multiple speech corpora. In order for users to easily utilize various speech corpora, we reported a visualization method based on the corpus attribute using MDS. We had proposed the eight attributes as the speech corpus features. However, these attributes contained no acoustical featu...
Conference Paper
In this paper, we propose a novel semisupervised speaker identification method that can alleviate the influence of non-stationarity such as session dependent variation, the recording environment change, and physical condition/emotion. We assume that the utterance variation follows the covariate shift model, where only the utterance sample distribut...
Conference Paper
Full-text available
We investigate a method using support vector machines (SVMs) with walk-based graph kernels for high-level feature extraction from images. In this method, each image is first segmented into a finite set of homogeneous segments and then represented as a segmentation graph where each vertex is a segment and edges connect adjacent segments. Given a set...
Chapter
Full-text available
A two-step approach to continuous speech recognition using logistic regression on speech segments has been presented. In the first step, a set of hidden Markov models (HMMs) is used in conjunction with the Viterbi algorithm in order to generate an N-best list of sentence hypotheses for the utterance to be recognized. In the second step, each senten...
Conference Paper
Full-text available
We investigate a novel method for speaker verification with non-audible murmur (NAM) segments. NAM is recorded using a special microphone placed on the neck and is hard for other people to hear. We have already reported a method based on a support vector machine (SVM) using NAM segments to use a keyword phrase effectively. To further exploit keywor...
Article
Full-text available
In this paper, we propose a sequence kernel with fast computation. The kernel is approximately cal-culated by using a mean vector in feature space. We further studied on log normalization of a sequence kernel to avoid the diagonal dominance problem in this paper. In text-independent speaker identification experiments with 10 male speakers, our appr...
Conference Paper
We investigated a speaker verification method that uses non-audible murmur (NAM) segments using newly collected data and obtained several findings that will be useful when speaker verification systems are made in practice. NAM is recorded using a special microphone placed on the surface of the body, so it includes almost no external noise and is ha...
Conference Paper
Full-text available
We propose in this paper a new family of kernels to handle time series, notably speech data, within the framework of kernel methods which includes popular algorithms such as the support vector machine. These kernels elaborate on the well known dynamic time warping (DTW) family of distances by considering the same set of elementary operations, namel...

Network

Cited By