• Home
  • Vinay Kumar Mittal
Vinay Kumar Mittal

Vinay Kumar Mittal
Indian Institute of Information Technology Chittoor, Sri City, A.P., India · Research Centre for Smart Cities

Ph.D. (ECE)

About

85
Publications
71,648
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,049
Citations

Publications

Publications (85)
Chapter
Speech is a signal through which communication is done. In our daily lives, everything we need to express either through our actions or speech. However, many things we express through our speech. Speech may be degraded by the addition of background noise. However, the noise that is added has to be reduced or removed in order to get clear speech. Th...
Chapter
Today, for treatment of any disease, the patient needs to undergo many tests. The extraction of biomedical signal is done initially, and then, depending on the report, the diagnosis or treatment can be preferred. There are many biomedical signals which we extract for accurate analysis and treatment. The biomedical signal that deals with heart disea...
Article
Full-text available
Identification of the native language from speech segment of a second language utterance, that is manifested as a distinct pattern of articulatory or prosodic behavior, is a challenging task. A method of classification of speakers, based on the regional English accent, is proposed in this paper. A database of English speech, spoken by the native sp...
Article
Children with autism spectrum disorder (ASD) produce speech sounds different from that of Normal or non-ASD children. Hence, analyzing acoustic features can help characterizing the ASD speech signals. In this study, the distinguishing characteristics of speech production are examined for ASD affected children, with comparison to Normal children’s s...
Chapter
Many aspects of speech can provide information about particular speaker’s characteristics. This paper presents a novel method of automatic classification of speakers based upon their regional language accent. Present study uses English spoken as second language by speakers of four South Indian Languages. A new data set is developed by following a s...
Conference Paper
Emotion classification from emotional speech continues to be a challenging research domain. Few research studies have attempted to discriminate amongst a set of emotions, and categorize for valence, activation and dominance. Discriminating between high-arousal and low-arousal emotions is itself challenging, but discriminating emotions within each s...
Conference Paper
Full-text available
The verbal children affected with autism spectrum disorder (ASD) often shows some notable acoustic patterns. This paper represents the classification of autism speech, i.e., the speech signal of children affected with ASD. In addition, this work specifically aims to classify the speech signals of non-native Indo English speakers (children) affected...
Conference Paper
Full-text available
Children affected with autism spectrum disorder (ASD) produce speech that consists of distinctive acoustic patterns, as compared to normal children. Hence, acoustic analyses can help classifying speech of ASD affected children from that of normal children. In this study, the aim is to identify those discriminating characteristics of speech producti...
Conference Paper
Full-text available
This study explores different acoustic features for characterizing the speech signals of the children affected with autism spectrum disorder (ASD). An `Autism Speech Dataset' is collected from the children with ASD, for over a year, to study the changes in the English vowels' regions. Changes in speech production features are examined for character...
Conference Paper
Autism speech has distinct acoustic patterns, different from normal speech. Analyzing acoustic features derived from the speech of children affected with autism spectrum disorder (ASD) can help its early detection. In this study, a comparative analysis of the discriminating acoustic characteristics is carried out between ASD affected and normal chi...
Conference Paper
Children with autism spectrum disorder (ASD) have difficulty in producing the speech that is different from the speech of normal children. Most children with ASD have difficulty in proper communication of information, their thoughts or their emotional state. Only a few studies have been carried out towards acoustic analysis of ASD speech. Objective...
Conference Paper
Identification of emotions from human speech can be attempted by focusing upon three aspects of emotional speech: valence, arousal and dominance. In this paper, changes in the production characteristics of emotional speech are examined to discriminate between the high-arousal and low-arousal emotions, and amongst emotions within each of these categ...
Research
Autism spectrum disorder (ASD) is associated with children’s speech. There are cases that children with ASD cannot express or understand any emotional state, and hence are not able to communicate like the children with typical speech. But very less attention has been paid so far towards its acoustic analysis. This study aims at characterizing the a...
Conference Paper
Emotion recognition is a rapidly growing research domain in recent years. Unlike humans, machines lack the abilities to perceive and show emotions. But human-computer interaction can be improved by automated emotions recognition, thereby reducing the need of human intervention. In this paper, four basic emotions (Anger, Happy, Fear and Neutral) are...
Conference Paper
It is challenging to quantitatively evaluate the effect of music on human mind. In this paper, effect of music on human mind is studied using brainwaves. Changes in alpha and beta brainwave signal patterns are analyzed for meditation and attention states. Parameters mean, standard deviation and normalized standard deviation are used. First, the eff...
Conference Paper
Music is known to affect different states of the human mind, for example, in calming one’s mind and leading it to a blissful state. In this study, we examine the effect of music on the states of human mind. Changes in the alpha and beta brainwaves patterns are examined. These changes are compared for ‘attention’ and ‘meditation’ state of mind. An e...
Conference Paper
Music is known to affect different states of the human mind, for example, in calming one's mind and leading it to a blissful state. In this study, we examine the effect of music on the states of human mind. Changes in the alpha and beta brainwaves patterns are examined. These changes are compared for 'attention' and 'meditation' state of mind. An e...
Conference Paper
Speech signal, that is produced by the human speech production system, carries emotions that the humans can perceive easily. In this paper, we aim to classify the four basic emotions, namely happy, anger, fear and neutral, by analyzing changes in the speech production features in the vowel regions of speech signal. A Telugu emotional speech databas...
Conference Paper
Automatic mechanical robots can perform tasks in different environmental conditions that are tedious, monotonous and sometimes hazardous for human beings. These robots can help in mitigating the risk to precious human lives, and also function as substitute for humans to perform some routine, and arduous tasks that need long hours. A Multimodal Smar...
Conference Paper
In this paper, we propose a navigation system consisting of a novel multi-sensor fusion method for calculating precise and accurate aerial coordinates and orientation, of a quadcopter in indoor and GPS-silent environments. A prototype system is developed that is composed of 2 modules: Simultanious Localization and Mapping (SLAM) system that uses Or...
Conference Paper
Cough is the powerful mechanism of human body to clear the central airways. Cough is often triggered by the mucus that drains down the back of the throat. An infection in the lungs or upper airway passages can cause cough. This paper describes the characteristics of ailment (cold) cough with respect to simulated (healthy) cough sound signals. Stand...
Conference Paper
In this paper, we propose a robotic video surveillance system. A prototype system is implemented on an aerial vehicle, a quadcopter. It is capable of following a person or any moving object, while simultaneously localizing it, i.e., measuring the coordinates of the quadcopter on a scaled map. The system can operate well, even if it is flying in an...
Conference Paper
Enhancing the home security by remote control means is a cutting-edge research area in the domain of Internet of Things (IoT’s). The necessity of security is increasing these days, ranging from thefts, burglary, accidents, LPG gas leakage and fire detection etc, which all are important aspects of a Home Security System. In this paper, a prototype M...
Conference Paper
In this paper, we develop a remotely controlled robotic arm with 4 degree of freedom (D.O.F) that is wirelessly controlled using four control mechanisms, i.e, Voice Control, Smart Phone-Tilt Control, Remote control and Hand Gesture Control. Wireless technologies such as Bluetooth and Wi-Fi are used to access the Quad-Controlled Robotic Arm (QCRA)....
Conference Paper
Shouted speech signals have been studied mostly for utterances or word segments. In the production features derived over these segments, the differences between normal and shouted speech may sometimes be masked by variations due to pauses and unvoiced regions. Also, our recent study of electroglottograph signals has highlighted the usefulness of ex...
Conference Paper
Wireless operated spy-robots can be immensely useful if they can be controlled remotely over a larger operating ranges. Availability of multiple modalities for their wireless control operation can further enhance their capabilities and the range of applications. In this paper we develop a prototype spy-robot that can be controlled remotely, using m...
Conference Paper
Full-text available
Cough is an important symptom in many diseases and at times is the only major symptom to diagnose some particular ailments. Cough is the powerful mechanism of human body to clear the central airways. Analyzing the cough type, its intensity and sound, the medical experts can estimate enough details about the ailment and appropriate cure. Hence, it s...
Conference Paper
Full-text available
In recent years, the evolving wireless technology, cheaper micro-controllers, smart cities concept and ‘Internet of Things’ (IoTs) have given way to the need of online wireless management systems for smart weather stations (SWS). In this paper, we develop an ‘Online Smart Weather Station System’ for studying the correlation amongst multiple weather...
Conference Paper
Multi-modal controls of robots can be of immense help to the human beings, because of the diverse modes of communication they provide to humans for control of the robots. Ease of operation and user convenience can be other advantages. In this paper, we develop a prototype smart robot whose movements are controlled by voice-commands and gesture-comm...
Conference Paper
Robotic assistants can help human beings in reducing the manual efforts and the risk factor in hazardous situations. The robotic assistants can be made smart by facilitating their control of operation using multiple modalities. In this paper, we develop a smart robotic assistant (SRA) whose body is controlled using tilt-gesture of a smart-phone, an...
Conference Paper
Object tracking robots, if can be controlled smartly through voice, can be of tremendous help for the physically handicapped people. In this paper, we try to implement a voice-controlled object tracking robot. A speech recognition system is used to recognize a set of predefined commands such as forward, backward, left, right and rotation at a parti...
Article
Smart robotic assistants help human beings in reducing the manual efforts in day-to-day tasks and the risk to precious human lives in hazardous situations. In this paper, we develop a smart robotic assistant that operates on human voice commands, given remotely by using an Android platform based smart IoT device. The real-time signal processing of...
Article
Personal robotic assistants help reducing the manual efforts being put by humans in their day-to-day tasks. In this paper, we develop a voice-controlled personal assistant robot. The human voice commands are given to the robotic assistant remotely, by using a smart mobile phone. The robot can perform different movements, turns, start/stop operation...
Conference Paper
Personal robotic assistants (PRA) can help reducing the manual human efforts. Ease of their control and operation may help their effective utilization. In this paper, the prototype of a smart PRA is developed, whose operation is controlled using tilt-gesture of a smart phone as well as human voice commands. The tilt-gestures control the direction o...
Article
Full-text available
The feasibility of representing the excitation source characteristics in expressive voice signals by an aperiodic sequence of impulses in the time domain is examined in this paper. In particular, the aperiodic components of excitation of expressive voices, like the Noh voice, are examined in some detail. The aperiodic component is extracted from th...
Conference Paper
Full-text available
Proportional Integral Differential (PID) feedback systems are known for their robustness, accuracy and stability. These systems are used in a wide variety of applications. In this paper, we explore the possibility of using a PID architecture in robotic 3D navigation systems. The system developed can be implemented for robotic applications that requ...
Conference Paper
In the modern era, robots have become an integral part of human life. In a world where humans and robots need to coexist, it is important to evolve more natural and easy communication mechanisms for human-machine interaction. The communication mechanism needs to be ‘easy’ more for humans, than machines. One such mechanism can be developed by using...
Conference Paper
Full-text available
Infant cry is a biological signal through which an infant communicates with its care-giving environment. It also contains valuable information about the state of the infant. Infants produce this sound in response to a stimuli, which could be pain, discomfort, emotional need of attention, ailment, environmental factors or hunger/thirst. Signal proce...
Conference Paper
Full-text available
Robots have been making inroads to human life in almost all spheres. Spybots can be immensely useful for unmanned surveillance and covert spying operations. If online streaming of the spied data can be made feasible, that would be an added advantage. In this paper, we propose an unmanned Spybot that can be controlled remotely from web-page based co...
Article
In the past decade, robotic applications in human life have made significant progress. However, mobility of robots and user convenience of their control is still a challenge. Utility of robots for a physically challenged person, with practicality and ease of operation, is another issue. In this paper, a robotic solution is proposed for utilization...
Article
Full-text available
Proportional integral differential (PID) feedback systems are known for their robustness, accuracy and stability. These systems are used in a wide variety of applications. In this paper, we explore the possibility of using a PID architecture in robotic 2D navigation systems. The prototype system developed can be implemented for robotic applications...
Conference Paper
Full-text available
The progress in the areas of research like emotion recognition, identification, synthesis , etc., relies heavily on the development and structure of the database. This paper addresses some of the key issues in development of the emotion databases. A new audiovisual emotion (AVE) database is developed. The database consists of audio , video and audi...
Article
Full-text available
Characteristics of glottal vibration are affected by the obstruction to the flow of air through the vocal tract system. The obstruction to the airflow is determined by the nature, location, and extent of constriction in the vocal tract during production of voiced sounds. The effects of constriction on glottal vibration are examined for six differen...
Conference Paper
Full-text available
Laughter in speech has been studied mostly relying upon the spectral representation like formants and harmonics derived from the short-time spectrum. Significant changes appear to take place in the characteristics of glottal source of excitation during the production of laughter, but these changes have not been explored much. In this study, we exam...
Conference Paper
Full-text available
In this paper, we study the significance of aperiodicity in the pitch-perception of expressive voices such as Noh voice and laughter signals. The excitation source characteristics in the production of these signals is represented in terms of a sequence of impulses. The impulse sequence is derived from the acoustic signal using a modified zero-frequ...
Conference Paper
Full-text available
Automatic detection of shout in continuous speech is a chal- lenging task. In our recent study, the characteristics of shout and normal speech signals are examined along with the electroglottograph (EGG) signals. The study highlights the changes in the characteristics of both the excitation source and the vocal tract system during production of sho...
Article
In this paper, the production characteristics of laughter are analysed at call and bout levels. Data of natural laughter is examined using electroglottograph (EGG) and acoustic signals. Nonspeech-laugh and laughed-speech are analysed in comparison with normal speech using features derived from the EGG and acoustic signals. Analysis of EGG signal is...
Article
Full-text available
In this paper characteristics of speech produced at different loudness levels are analyzed in terms of changes in the glottal excitation. Four loudness levels are considered in this study, namely, soft, normal, loud, and shout. The distinct changes in the excitation of the shout signal are analyzed using electroglottograph signals. The open and clo...
Conference Paper
Full-text available
Shouted speech or screaming signals have been studied mostly through spectral representation such as melcepstral coefficients. Intuitive evidence that the characteristics of the excitation source may vary in the case of shouted speech has drawn little attention yet. In this paper we examine how the characteristics of both components of speech produ...
Conference Paper
Full-text available
The objective of this study is to understand the relative impor-tance of different components of speech that contribute to per-ception of emotion in speech. The four components considered in this study relate to the vocal tract system, excitation source and suprasegmental (pitch and duration) information. For this study, data collected from an arti...
Conference Paper
Full-text available
This paper aims to understand the components of speech that contribute to emotion characteristics in speech. Four components of speech (vocal tract, excitation, duration and intonation) are considered in this study. A Flexible Analysis Synthesis Tool (FAST) is developed to modify the features of an utterance from neutral to emotion or from emotion...
Article
Full-text available
Recent studies have indicated changes in the glottal excitation source characteristics apart from vocal tract resonances due to tongue tip trilling. In this paper we study the significance of changing vocal tract system and the associated glottal excitation source characteristics due to trilling, from perception point of view. These studies are mad...

Network

Cited By