Parse graph corresponding to the expression in Figure 2.

Source publication

Recognizing handwritten mathematics via fuzzy parsing

Article

Full-text available

Jul 2010

We present a new approach to multi-dimensional parsing using relational grammars and fuzzy sets. A fast, incremental parsing algorithm is developed, motivated by the two-dimensional structure of written mathematics. Our approach makes no hard deci-sions; recognized math expressions are reported to the user in ranked order. A flexible correction mec...

Context 1

... graphical representation of the branching structure that represents these derivations is shown in Figure 3. In the figure, the arrows indicate derivation. ...

View in full-text

Two characterisation results of multiple context-free grammars and their application to parsing

Thesis

Full-text available

Feb 2020

Tobias Denkinger

An extension of Świdziński's grammar of Polish

Article

Full-text available

Jan 2005

Maciej Ogrodniczuk

The article presents a series of proposed extensions to Świdzińki’s formal grammar of Polish which were introduced in the course of automated syntax verification of a corpus of Polish expressions.

Content-Structure Correspondence: A Generic Representation for Heterogeneous Structured Document

Article

Full-text available

Dec 2011

This on the web, most structured document collections consist of documents from different sources and marked up with different types of structures. The diversity of structures has lead to the emergence of heterogeneous structured documents. The heterogeneity of structured documents poses new challenges for document representation in structured docu...

A General Parsing Algorithm with Context Matching for Context-Sensitive Graph Grammars

Conference Paper

Full-text available

Jul 2020

Semantic Interpretation of Requirements through Cognitive Grammar and Configuration

Conference Paper

Full-text available

Dec 2014

Many attempts have been made to apply Natural Language Processing to requirements specifications. However, typical approaches rely on shallow parsing to identify object-oriented elements of the specifications (e.g. classes, attributes, and methods). As a result, the models produced are often incomplete, imprecise, and require manual revision and va...

Machine learning and non-machine learning methods in mathematical recognition systems: Two decades’ systematic literature review

Article

Full-text available

Aug 2023
MULTIMED TOOLS APPL

Tools based on machine learning (ML) have seen widespread application in the prediction and categorization of mathematical symbols and phrases. The purpose of this work is to conduct a comprehensive analysis of the machine learning and non-machine learning strategies that are currently in use for the recognition of mathematical expressions. (MEs). The authors collected and analyzed research studies on the recognition of MEs (and closely related models and issues as well), which are published from January 2000 to December 2022 in the SLR. The review has nominated 98 primary studies out of the extracted 202 studies after heedful filtering using inclusion/exclusion criteria and quality assessment. The pertinent data is derived from IEEE explore, Science Direct, Wiley, Scopus, ACM Digital Library, etc. For assiduously reviewing and synthesizing the data, the authors used grounded theory and other qualitative and quantitative techniques. The analysis reveals that the support vector machine as an ML model with CROHME as the dataset and expression recognition rate as an accuracy metric is frequently used in the chosen studies. Recognition is typically fragmented down into three stages—segmenting symbols, recognizing symbols, and analyzing structures—in non-ML studies. In conclusion, this work aims to synthesize the results of existing research to provide a summary of the state-of-the-art in recognizing handwritten MEs.

A dive in white and grey shades of ML and non-ML literature: a multivocal analysis of mathematical expressions

Article

Full-text available

Dec 2022
ARTIF INTELL REV

With the advent and advancement of machine learning and deep learning techniques, machine-based recognition systems for mathematical text have captivated the attention of the research community for the last four decades. Mathematical Expression Recognition systems have been identified based on terms of their techniques, approach, dataset, and accuracies. This study majorly targets a rigorous review of both the published form of literature and the least attended literature, i.e., grey literature. Apart from the digital libraries, the papers and other instances of information have been gathered from the grey sources like google patents, archives, technical reports, app stores, etc., culminating in 262 instances. After the heedful filtration imposed on both white and grey literature, the final pool of studies has been investigated for eight formulated research questions. The answers extracted have been analyzed, providing both quantitative and qualitative insights. The analysis and surveys have systematically summed up the potentials of both white and grey shades of literature present on MER and brought exciting extractions out of 155 formal white literature and 107 grey sources. The survey extracts and brings out the highlighting observations after analysis, which sublimates the fact that 52% of grey literature is composed of mobile applications and user interfaces, whereas the published 63% of white data is presently concentrated in 39 different conferences, and the prominent conference is ICDAR (#30). A list of challenges and open issues has been extracted for directing future research dimensions.

Unfuzzying Fuzzy Parsing

Conference Paper

Full-text available

Jan 2014

Traditional parsing has always been a focus of discussion among the computer science community. Numerous techniques and algorithms have been proposed along these years, but they require that input texts are correct according to a specific grammar. However, in some cases it's necessary to cope with incorrect or unpredicted inputs that raise ambiguities, making traditional parsing unsuitable. These situations led to the emergence of robust parsing theories, where fuzzy parsing gains relevance. Robust parsing comes with a price by losing precision and decaying performance, as multiple parses of the input may be necessary while looking for an optimal one. In this short paper we briefly describe the main robust parsing techniques and end up proposing a different solution to deal with fuzziness of input texts. It is based on automata where states represent contexts and edges represent potential matches (of constructs of interest) inside those contexts. It is expected that such an approach reduces recognition time and ambiguity as contexts reduce the search space by defining a smaller domain for constructs of interest. Such benefits may be a great addition to the robust parsing area with application on program comprehension, among other research fields.

Two-dimensional (2D) languages and application to handwritten graphical parsing

Article

Aug 2013

Bingwei Wu

A global learning approach for an online handwritten mathematical expression recognition system

Article

Nov 2012
PATTERN RECOGN LETT

Despite the recent advances in handwriting recognition, handwritten two-dimensional (2D) languages are still a challenge. Electrical schemas, chemical equations and mathematical expressions (MEs) are examples of such 2D languages. In this case, the recognition problem is particularly difficult due to the two dimensional layout of the language. This paper presents an online handwritten mathematical expression recognition system that handles mathematical expression recognition as a simultaneous optimization of expression segmentation, symbol recognition, and 2D structure recognition under the restriction of a mathematical expression grammar. The originality of the approach is a global strategy allowing learning mathematical symbols and spatial relations directly from complete expressions. A new contextual modeling is proposed for combining syntactic and structural information. Those models are used to find the most likely combination of segmentation/recognition hypotheses proposed by a 2D segmentation scheme. Thus, models are based on structural information concerning the symbol layout. The system is tested with a new public database of mathematical expressions which was used in the CHROME competition. We have also produced a large base of semi-synthetic expressions which are used to train and test the global learning approach. We obtain very promising results on both synthetic and real expressions databases, as well as in the recent CHROME competition.

A new approach for recognizing handwritten mathematics using relational grammars and fuzzy sets

Article

Jun 2012
INT J DOC ANAL RECOG

We present a new approach for parsing two-dimensional input using relational grammars and fuzzy sets. A fast, incremental parsing algorithm is devel-oped, motivated by the two-dimensional structure of written mathematics. The approach reports all identifi-able parses of the input. The parses are represented as a fuzzy set, in which the membership grade of a parse measures the similarity between it and the handwrit-ten input. To identify and report parses efficiently, we adapt and apply existing techniques such as rectangular partitions and shared parse forests, and introduce new ideas such as relational classes and interchangeability. We also present a correction mechanism which allows users to navigate parse results and choose the correct interpretation in case of recognition errors or ambigu-ity. Such corrections are incorporated into subsequent incremental recognition results. Finally, we include two empirical evaluations of our recognizer. One uses a novel user-oriented correction count metric, while the other replicates the CROHME 2011 math recognition con-test. Both evaluations demonstrate the effectiveness of our proposed approach.

Is the iPad useful for sketch input? A comparison with the Tablet PC.

Conference Paper

Full-text available

Aug 2011

Despite the increasing prevalence of touch-based tablet devices, little attention has been paid to what effects, if any, this form factor has on sketch behaviours in general and on sketch recognizers in particular. We investigate the title question through an empirical study in the context of mathematical expression recognition. Using a corpus of thirty expressions drawn on Tablet PC and iPad by thirty writers, we show that characteristics of sketching and drawing differ depending on platform. While recognition is most accurate on the Tablet PC due to its technical superiority, recognition is feasible on mobile touch-based devices. However, there are characteristics of sketching on multi-touch tablets that differ, and these physical characteristics of writing impact recognition accuracy. Together, our observations inform the broader Sketch Recognition community as they design systems targeted to multi-touch tablets.

Parse graph corresponding to the expression in Figure 2.

Context in source publication

Similar publications

Citations