Figure 2 - uploaded by Duvvuri B K Kamesh Duvvuri
Block diagram of text-to-speech device. 

Source publication
Article
Full-text available
The present paper has introduced an innovative, efficient, real-time and cost-effective technique that enables users to hear the contents of text images instead of reading through them. It combines the concepts of Optical Character Recognition (OCR) and a Text-to-Speech Synthesizer (TTS) on a Raspberry Pi. This kind of system helps visually impaired peop...
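The OCR-to-TTS pipeline the abstract describes can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the Tesseract and eSpeak command-line tools are assumed stand-ins for the unnamed OCR engine and speech synthesizer, and `clean_text` is a hypothetical helper.

```python
import re
import subprocess

def clean_text(raw: str) -> str:
    """Drop characters OCR commonly misreads and collapse whitespace."""
    text = re.sub(r"[^A-Za-z0-9.,;:'\"!? -]", " ", raw)
    return re.sub(r"\s+", " ", text).strip()

def image_to_speech(image_path: str) -> str:
    """Run Tesseract OCR on the image, then speak the result with eSpeak."""
    raw = subprocess.run(
        ["tesseract", image_path, "stdout"],
        capture_output=True, text=True, check=True,
    ).stdout
    text = clean_text(raw)
    subprocess.run(["espeak", text], check=True)  # assumed TTS back end
    return text
```

On a Raspberry Pi both tools are installable through the package manager and run comfortably on the board, which fits the cost argument the abstract makes.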

Similar publications

Thesis
Full-text available
Image and speech processing is one of the trending research areas in machine learning that contributes immensely to the field of artificial intelligence. It enhances raw images received from gadgets such as a camera or a mobile phone in normal day-to-day life for various applications. The conversion of images to text as well as speech can be of great bene...
Article
Full-text available
Background Following neoadjuvant chemotherapy, surgical resection is one of the most preferred treatment options for locally advanced gastric cancer patients. However, the optimal time interval between chemotherapy and surgery is unclear. This review aimed to identify the optimal time interval between neoadjuvant chemotherapy and surgery for advanc...
Article
Full-text available
Recent end-to-end text-to-speech synthesis (TTS) systems have successfully synthesized high-quality speech. However, TTS speech intelligibility degrades in noisy environments because most of these systems were not designed to handle noisy environments. Several works attempted to address this problem by using offline fine-tuning to adapt their TTS t...

Citations

... Venkateswarlu et al. [11] investigated the efficiency and cost-effectiveness of developing a Raspberry Pi-based system for text-to-speech translation to assist visually impaired students in their studies. The system is simple to use, as the Raspberry Pi is a credit-card-sized device with full computing capability. ...
Conference Paper
Full-text available
Most blind and visually impaired students in third-world countries still use mechanical Braille for their education. With the advancement of technology and the spread of electronic communication, paper-based Braille is not effective and efficient enough. The Raspberry Pi-based Braille keyboard design with audio output is a low-cost electronic keyboard whose main features are to vocalize Braille characters written by a visually impaired student and display them on an LCD screen. Proposed to promote an interactive educational experience among students, teachers and parents, the Braille keyboard is affordable and cost-effective with advanced features. The design of the device is simple, as it is based on Raspberry Pi technology. The user hears the output after a short buzzer beep when the character typing process is finished. gTTS (Google Text-to-Speech) is a Python package, and Google Translate's text-to-speech API is used to convert text to speech. The data is displayed on an LCD screen for the non-visually impaired (teacher/parent). The Braille keyboard design is simulated in the Proteus simulation program. This work focuses on developing, for later stages, a Braille keyboard that allows users to use the Braille writing system to enter text and communicate with digital devices.
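The keyboard's core step, mapping a pressed six-dot chord to a character before handing it to gTTS, amounts to a lookup table. A minimal sketch, assuming cells arrive as sets of dot numbers (1-6); only the letters a-e are filled in here:

```python
# Each Braille cell is encoded as the set of raised dots, numbered 1-6.
BRAILLE_TO_CHAR = {
    frozenset({1}): "a",
    frozenset({1, 2}): "b",
    frozenset({1, 4}): "c",
    frozenset({1, 4, 5}): "d",
    frozenset({1, 5}): "e",
    # ... remaining cells of the Braille alphabet
}

def decode_cell(pressed_dots) -> str:
    """Map the dots pressed on the six-key keyboard to a character;
    unknown chords fall back to '?'."""
    return BRAILLE_TO_CHAR.get(frozenset(pressed_dots), "?")
```

The decoded character would then be passed to gTTS for vocalization and echoed on the LCD for the sighted teacher or parent, as the abstract describes.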
... Here each of the characters is matched against its corresponding template and saved as normalized text. The recognized text is then converted into speech [4] through a headset using a TTS engine. For object detection, the captured image is passed to the R-CNN layer. ...
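Template matching of the kind the excerpt describes can be illustrated on tiny binarized glyphs: score each template by the fraction of agreeing pixels and keep the best label. This is a toy sketch with hypothetical 3x3 templates, not the cited system's implementation:

```python
def match_score(glyph, template):
    """Fraction of pixels where the binarized glyph agrees with the template."""
    hits = sum(g == t
               for row_g, row_t in zip(glyph, template)
               for g, t in zip(row_g, row_t))
    return hits / (len(glyph) * len(glyph[0]))

def recognize(glyph, templates):
    """Return the label of the best-matching template."""
    return max(templates, key=lambda label: match_score(glyph, templates[label]))
```

In practice the glyphs would first be segmented and size-normalized so that every candidate is compared against templates on the same grid.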
Article
The main aim of our project is to develop a portable Raspberry Pi-based gadget for object detection with relative motion and distance. This technology converts a sequence of real-time objects into a series of text, which can be stored in a database and used to assist visually impaired people as well as for various security purposes. For that purpose, the conversion system is proposed in this project. Our system operates in two different modes: one detects the classes of nearby objects with the help of an R-CNN network, and the other detects obstacles using an ultrasonic sensor. The device includes three buttons for mode selection, and the system operates on the basis of the selected mode. It includes a camera to capture an image as input; the input image is then passed to the R-CNN, which recognizes the number of objects inside the image, their classes and types, and any text written inside, which can then be passed to a database for storage.
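The obstacle-detection mode's distance computation is standard for ultrasonic rangers: the sensor reports a round-trip echo time, so halving it and multiplying by the speed of sound gives the range. A sketch, assuming an HC-SR04-style sensor (the abstract only says "ultrasonic sensor"):

```python
SPEED_OF_SOUND_M_S = 343.0  # in air at roughly 20 degrees C

def echo_to_distance_cm(echo_time_s: float) -> float:
    """Convert a round-trip ultrasonic echo time to distance in centimetres.

    The pulse travels out to the obstacle and back, so the one-way
    time is half the measured echo time.
    """
    one_way_s = echo_time_s / 2
    return one_way_s * SPEED_OF_SOUND_M_S * 100
```

A 10 ms echo thus corresponds to about 1.7 m, comfortably within the few-metres range such sensors cover.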
... Using machine learning technologies, speech synthesis in TTS has supported the artificial rendering of human-like speech in computer systems. The main purpose of text-to-speech synthesis is to convert text, consisting of natural language sentences, into the spoken form of the identical text as produced by a native speaker of the language [8,111]. As newer applications grow in the market, the need for speech synthesis has increased. ...
Article
Full-text available
Text-to-speech (TTS) systems have come a long way in the last decade and are now a popular research topic for creating various human-computer interaction systems. A range of speech synthesis models for various languages and motive applications is available, based on domain requirements. Recent developments in speech synthesis have primarily been attributed to deep learning-based techniques, which have improved a variety of application scenarios, including intelligent speech interaction, chatbots, and conversational artificial intelligence (AI). This survey article discusses text-to-speech systems as an active topic of study that has achieved significant progress in the recent decade, particularly for Indian and non-Indian languages. Furthermore, the study covers the lifecycle of text-to-speech systems as well as the platforms developed for them. We performed an efficient search for survey articles published up to May 2021 in Web of Science, PubMed, Scopus, EBSCO (Elton B. Stephens Company) and Google Scholar on text-to-speech systems in various languages based on different approaches. This survey article offers a study of the contributions made by various researchers to Indian and non-Indian language text-to-speech systems and the techniques used to implement them, along with the associated challenges in designing TTS systems. The work also compares different language text-to-speech systems based on quality metrics such as recognition rate, accuracy, TTS score, precision, recall, and F1-score. Further, the study summarizes existing ideas and their shortcomings, emphasizing the scope of future research in Indian and non-Indian language TTS, which may assist beginners in designing robust TTS systems.
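The quality metrics the survey compares — precision, recall, and F1-score — reduce to overlap counts between a system hypothesis and a reference. A minimal sketch of how they might be computed at the token level (the survey itself does not prescribe an implementation):

```python
from collections import Counter

def precision_recall_f1(predicted, reference):
    """Token-level precision, recall, and F1 between a hypothesis and a
    reference transcription, using multiset overlap of tokens."""
    pred, ref = Counter(predicted), Counter(reference)
    overlap = sum((pred & ref).values())          # tokens found in both
    precision = overlap / max(sum(pred.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1
```

Precision penalizes spurious tokens in the hypothesis, recall penalizes missed tokens in the reference, and F1 is their harmonic mean.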
... So, it leads to an inaccessible state. [10][11][12][13] suggested an attributes-based approach that yields a low-dimensional, fixed-length aggregated representation of word images and strings, so they are fast to compare and compute. It allows us to perform queries such as query-by-string. ...
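The fixed-length attribute representation the excerpt refers to can be illustrated with a toy version: embed word strings into one shared vector space and rank by cosine similarity, which is what makes query-by-string cheap. The single-level character-presence embedding below is a deliberately simplified stand-in for the real attribute representations in the cited work:

```python
ALPHABET = "abcdefghijklmnopqrstuvwxyz"

def embed(word):
    """Toy attribute vector: which letters of the alphabet occur in the word."""
    word = word.lower()
    return [1.0 if c in word else 0.0 for c in ALPHABET]

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = sum(a * a for a in u) ** 0.5
    nv = sum(b * b for b in v) ** 0.5
    return dot / (nu * nv) if nu and nv else 0.0

def query_by_string(query, lexicon):
    """Rank lexicon entries by embedding similarity to the query string."""
    q = embed(query)
    return max(lexicon, key=lambda w: cosine(q, embed(w)))
```

Because every word, whether it originated as an image or as a typed string, maps to the same fixed-length vector, retrieval is a nearest-neighbour search rather than a per-pair image comparison.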
Article
Full-text available
The lack of Braille resources in this advanced world has tied the hands of visually impaired people and kept them from soaring. This paper takes those users into consideration and presents a solution that helps every individual, especially the blind, in reading books and text from real-time images. The solution converts text obtained from text documents and real-world entities into aural output, which lends a hand in reading text. The main idea is to build software using a novel methodology in which an OCR engine receives input images and converts them into intermediate textual output that is given to Google Translate to produce audio output via earphones.
... Technology is developing very fast and making things easier (Arlinwibowo, Retnawati, Kartowagiran, & Kassymova, 2020;Hapsari et al., 2018). With technology, text information can be converted into voice so that it can be accessed by visually impaired people (Edward et al., 2018;Shetake et al., 2014;Venkateswarlu et al., 2016). However, changing text to audio cannot be implemented immediately because visually impaired people have different orientations (Pring, 2008;Salisbury, 2008) and experiences (Pandey, 2018). ...
Article
Full-text available
This study aims to develop an Android-based math exercises application for the visually impaired. This is development research carried out in the following steps: (1) preliminary research, (2) prototyping stage, and (3) assessment phase. The research was conducted between April 2020 and December 2020. The material chosen for the application was plane figures as taught in grade 8. The research process involved six experts in assessing the product: three mathematics education experts to assess the validity of the mathematical content, two blind-education experts to assess content suitability and accessibility for the visually impaired, and one IT expert to assess product performance. The product was tested on nine visually impaired students. The quality of the teaching materials is based on three basic aspects: feasibility, practicality, and effectiveness. The conclusions of this study are: (1) the product has good quality, because it has been declared feasible by experts and practical, as seen from the enthusiastic response and student testimonials, and is effective because it can be used both to learn and to measure abilities; (2) the application is divided into three sections: a preamble (containing the opening tune and instructions for use), practice questions, and results, with development based on two elements, namely accessibility and compatibility of the content with the cognition of the visually impaired; (3) the question page consists of the question (read aloud when entering the page and repeated when the user taps the question section) with the question number beneath it, answer choices arranged two by two (read aloud when pressed by the user), and an answer-lock button at the very bottom; and (4) the visually impaired want an application that has a simple operating system, challenges the user, and serves two functions, namely measuring their abilities and facilitating their learning.
... The whole work is implemented using a Raspberry Pi, which helps to read the text and convert it into audio form. Several research works have been carried out to detect characters using optical character recognition, and the detected text has been converted to speech using different text-to-speech converters [12][13][14][15][16]. ...
... On our planet of 6.5 billion humans, 265 million are visually impaired, of whom 39 million are completely blind, i.e. have no vision at all, and 225 million have mild or severe visual impairment (WHO, 2010). It has been projected that by 2022 these numbers will rise to 75 million blind and 200 million people with visual impairment [7]. ...
Article
Full-text available
The idea presented in this paper is proposed as an application of OCR. It acts as a lifesaver for visually challenged people. The key feature of this system is its ability to capture an image of a real-world environment using a camera and recognize the characters present in the captured image. The setup is constructed using OpenCV. The identified characters are converted into an audio output that is helpful for visually challenged people. The characters identified in the image captured by the device are converted to a string using Tesseract. The string is then converted to speech using a text-to-speech (TTS) module. An important feature of this OCR system is that the whole system is made portable; in a way, it acts as artificial vision for the blind through the audio output generated by the system.
... Automatic Speech Recognition (ASR) is the process by which computers transform a speech signal into a sequence of words and phrases [11], known as speech-to-text conversion. Figure 3 presents the basic architecture of a speech recognition system, where phonemes are the basic units of acoustic information in feature extraction. ...
... IBM calls this training a skill, where a conversational model is established consisting of entities, intents, and dialogues. a) Intents: The intents, as mentioned in [11], represent the purpose of a user's entry. An intent is defined for each type of user request that the application is required to support. ...
... The image processing module captures the image and converts it into text, and the voice processing module converts the text into audio. To overcome this limitation, a convolutional neural network is proposed in this paper for reading medicine names [15][16][17][18][19]. In Section II the methodology is discussed, followed by simulation results in Section III. ...
... Lip tracking is done based on references from previous data, so it is much more reliable and requires fewer resources than lip finding. The Active Appearance Model (AAM) is employed to extract the locations of specific points of the face from every frame of the video sequence [5]. The shape information extracted by the AAM from the face image is used to compute a set of suitable parameters that describe the appearance of facial features. ...