Bangla Consonants.

Source publication

Constraints in Developing a Complete Bengali Optical Character Recognition System

Preprint

Mar 2020

Technological advancement has made it effortless to digitize hard copies of media with optical character recognition (OCR) systems. As OCR systems are used constantly, converting printed or handwritten documents and books has become simple and time-efficient. To be fully functional, a Bengali OCR system needs to overcome some constr...

Context 1

... way. It is the key to a better mechanism that is time-efficient, effortless and productive. Though Bangla is a popular language, it does not have a proper OCR system, unlike languages such as English. Bangla is a complex language, and its writing structure differs from that of other languages. The Bangla script has consonants (Fig. 1), vowels (Fig. 2), modified vowels (Fig. 3) and around 170 compound characters (Fig. 4) [17]. Such a complex writing structure needs a better segmentation process for conversion into digital media, hence the applications for it is ...

Context 2

... from a book or document, the image may contain some portion outside the text page. One challenge is to identify the text in the image and crop the image so that the unwanted parts outside the text are eliminated. When binarized, these unwanted parts produce chunks of black pixels that result in poor line segmentation. Fig. 10 shows an unwanted chunk of black pixels marked in a red ...

Context 3

... solve the problem, the input image is cropped. To crop the image appropriately, we performed page-layout analysis to locate the text. Fig. 11 shows the step-by-step procedure for cropping a text image, which has been described ...

Context 4

... dewarping is associated with perspective correction. Geometric distortion of the lines in a captured image is a common real-life scenario. Curved lines caused by the camera's view angle or a warped page lead to poor line segmentation, as most of the lines overlap with each other. As a result, multiple lines get segmented as a single line. Fig. 12 shows such a problematic scenario. The steps of our proposed image dewarp algorithm are as ...

Context 5

... is calculated depending on the shape of the span and the angle at which it is distorted. With the estimated parameter, a coordinate transformation is applied to make the lines parallel and horizontal. Finally, we optimize the remapping of the span to minimize the re-projection error using scipy.optimize.minimize, a general minimization interface that includes derivative-free methods such as Powell and Nelder-Mead. Fig. 13 shows an example of a geometrically distorted image before and after the use of image ...

Context 6

... are detected easily from an image horizontally. First, the pixel values of each row of the image are summed and compared. Lines are segmented where the row sum is close to zero (Fig. ...
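
The row-sum idea in this excerpt can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes a binarized image stored as a list of rows in which ink pixels are 1 and background pixels are 0, and the function name `segment_lines` and the `threshold` parameter are hypothetical.

```python
def segment_lines(binary, threshold=0):
    """Return (start_row, end_row) pairs, end exclusive, one per text line.

    Assumption: `binary` is a list of rows with 1 for ink and 0 for background.
    """
    row_sums = [sum(row) for row in binary]   # horizontal projection profile
    lines, start = [], None
    for y, total in enumerate(row_sums):
        if total > threshold and start is None:
            start = y                          # first inked row of a new line
        elif total <= threshold and start is not None:
            lines.append((start, y))           # a near-empty row closes the line
            start = None
    if start is not None:                      # line touching the bottom edge
        lines.append((start, len(row_sums)))
    return lines


page = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1],
    [1, 1, 0, 1, 1],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [1, 0, 1, 1, 0],
]
print(segment_lines(page))  # [(1, 3), (5, 6)]
```

Raising `threshold` above zero would let rows with a little residual noise still count as gaps, which is one way to read "close to zero".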

Context 7

... we faced another challenge when a single page contains multiple font sizes: the lines are not all segmented properly. When multiple font sizes are present in the same image, line segmentation is performed for the larger font size. As a result, not all lines are segmented correctly. As shown in Fig. 15, the first two lines get segmented together due to different font ...

Context 8

... are segmented easily from the segmented line images in a vertical manner. First, the pixel values of each column of a segmented line image are summed. Words are segmented where the column sum is close to zero (Fig. ...
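
A comparable sketch for the column-wise case is shown below. It is only an illustration under stated assumptions: ink pixels are 1 and background pixels are 0, and the `min_gap` parameter (the number of consecutive empty columns required before a word is closed) is a hypothetical addition that the excerpt does not mention.

```python
def segment_words(line_img, min_gap=1):
    """Return (start_col, end_col) pairs, end exclusive, one per word.

    Assumption: `line_img` is a list of rows with 1 for ink, 0 for background.
    `min_gap` is a hypothetical knob: how many empty columns end a word.
    """
    col_sums = [sum(col) for col in zip(*line_img)]  # vertical projection profile
    words, start, gap = [], None, 0
    for x, total in enumerate(col_sums):
        if total > 0:
            if start is None:
                start = x                  # first inked column of a word
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:             # gap is wide enough: close the word
                words.append((start, x - gap + 1))
                start, gap = None, 0
    if start is not None:                  # word touching the right edge
        words.append((start, len(col_sums) - gap))
    return words


line = [
    [1, 1, 0, 0, 1, 1, 1],
    [0, 1, 0, 0, 1, 0, 1],
]
print(segment_words(line))             # [(0, 2), (4, 7)]
print(segment_words(line, min_gap=3))  # [(0, 7)]
```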

Context 9

... separate each character, we need to detect the matra line and then remove it. To detect the matra line properly, we divide the word image horizontally into two halves. The matra line is detected in the upper half of the image, in rows whose pixel sum is greater than 60%. Fig. 17 shows the region of the matra line for a ...
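
One possible reading of this rule is sketched below. The excerpt does not say what the 60% is measured against, so the sketch assumes it means 60% of the word image's width; it also assumes ink pixels are 1 and background pixels are 0. The helper names `find_matra_rows` and `remove_matra` are hypothetical.

```python
def find_matra_rows(word_img, ratio=0.6):
    """Return indices of upper-half rows whose ink count exceeds ratio * width.

    Assumptions: 1 is ink, 0 is background, and the 60% rule is relative to
    the image width (the source excerpt does not specify this).
    """
    height = len(word_img)
    width = len(word_img[0]) if height else 0
    upper_half = word_img[: height // 2]       # only search the top half
    return [y for y, row in enumerate(upper_half) if sum(row) > ratio * width]


def remove_matra(word_img, matra_rows):
    """Return a copy of the image with the given rows blanked out."""
    rows = set(matra_rows)
    return [[0] * len(row) if y in rows else list(row)
            for y, row in enumerate(word_img)]


word = [
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 0],   # dense row in the upper half: matra candidate
    [0, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [1, 1, 1, 1, 1, 1],   # dense, but in the lower half, so it is ignored
    [0, 0, 0, 0, 0, 0],
]
matra = find_matra_rows(word)
print(matra)                         # [1]
print(remove_matra(word, matra)[1])  # [0, 0, 0, 0, 0, 0]
```

Restricting the search to the upper half keeps dense strokes lower in the word from being mistaken for the matra line.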

Context 10

... matra line, we get an open space between the characters, as the characters are no longer connected to each other by the matra line. Characters can then be detected from the image vertically. First, the pixel values of each column are summed. Character segmentation is performed where the column sum is close to zero (Fig. 18). A near-zero criterion is used, rather than exactly zero, because in a few cases only a portion of the matra line gets removed. Character segmentation is considered correct if a consonant, a vowel or a compound character is segmented alone or together with a modified ...
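
The effect of the "close to zero" criterion can be illustrated with a small sketch. It assumes ink pixels are 1 and background pixels are 0; the `tolerance` parameter, which treats columns with at most that many ink pixels as gaps, is a hypothetical stand-in for whatever threshold the authors used.

```python
def segment_characters(word_img, tolerance=1):
    """Return (start_col, end_col) pairs, end exclusive, one per character.

    Assumption: columns whose ink count is <= `tolerance` count as gaps, so a
    leftover matra pixel does not glue two characters together.
    """
    col_sums = [sum(col) for col in zip(*word_img)]
    chars, start = [], None
    for x, total in enumerate(col_sums):
        is_ink = total > tolerance
        if is_ink and start is None:
            start = x                      # first column of a character
        elif not is_ink and start is not None:
            chars.append((start, x))       # near-empty column closes it
            start = None
    if start is not None:                  # character touching the right edge
        chars.append((start, len(col_sums)))
    return chars


glyphs = [
    [0, 0, 1, 0, 0],   # leftover matra pixel in column 2
    [1, 1, 0, 1, 1],
    [1, 0, 0, 0, 1],
    [1, 1, 0, 1, 1],
]
print(segment_characters(glyphs, tolerance=0))  # [(0, 5)]
print(segment_characters(glyphs, tolerance=1))  # [(0, 2), (3, 5)]
```

With a strict zero threshold the residual pixel merges the two characters into one segment; allowing a small tolerance separates them.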

Context 11

... 18). This is because, in a few cases, the matra line is removed only partially, so segmentation is done where the column pixel sum is close to zero. Character segmentation is considered correct if a consonant, a vowel or a compound character is segmented alone or together with a modified vowel. Fig. 19 shows examples of some correctly segmented ...

Context 12

... line, word and character accuracy of the multiple-font step was not taken from the image in Fig. 20; the accuracy of this step was measured on the image in Fig. 15. The accuracy comparison of the methods clearly shows the importance of each step and how all these steps together are the key to a better segmentation ...

Context 13

... we straighten a curved line, a few words may stay tilted. In such cases, the matra line goes undetected because the horizontal pixel sum criterion does not work. This makes it harder to eliminate the matra line. Fig. 21 shows the region of the matra line detected for a slightly tilted word, along with the row-wise black pixel histogram. Fig. 22 shows the region of the matra line that stays undetected and was not removed. As a result, we fail to segment such words into characters properly. Fig. 23 shows some examples where the character segmentation of the ...
