The σ-automaton corresponding to the automaton in Fig. 1. It accepts all edit strings that transform a word of (10 + 010)*0 into a different word of (10 + 010)*0 using only substitution errors.

Source publication

Maximal Error-Detecting Capabilities of Formal Languages.

Article

Jan 2008

A (combinatorial) channel is a set of pairs of words describing all the possible input-output channel situations. We introduce the concept "maximal error-detecting capability" of a given language, with respect to a certain class of channels, which is simply a maximal channel for which the given language is error-detecting. The new concept is int...

Context 1

... states of $A$. As the detailed view of these transitions is not needed here, we simplify the notation by using expressions of the form $(\bar{p}, (x/y), \bar{q})$ for the transitions of $A_\sigma$. We also note that $A_\sigma$ consists of at most $2s_A^2$ states, where $s_A$ is the number of states in $A$, and contains only states that are reachable from the start state (see Fig. 2). The next statement refers specifically to thin languages, as they are special when it comes to error-detection for channels of the form $\sigma(m, l)$. A language $K$ is called thin if any two different words of $K$ have different lengths [8]. Obviously a thin language is error-detecting for $\sigma(m, l)$, for all possible values of the parameters ...
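The remark about thin languages can be checked on small finite examples. The following is a minimal sketch, not the construction from the paper: the function names are hypothetical, and the channel is simplified to "at most m substitutions in the whole word", which is an assumption and not the paper's definition of σ(m, l). Since substitutions preserve length, two words of different lengths can never be related by such a channel.

```python
from itertools import combinations


def is_thin(words):
    """Return True if any two different words have different lengths."""
    lengths = [len(w) for w in set(words)]
    return len(lengths) == len(set(lengths))


def substitution_distance(u, v):
    """Number of positions where u and v differ, or None if the lengths
    differ (substitutions alone cannot turn u into v in that case)."""
    if len(u) != len(v):
        return None
    return sum(a != b for a, b in zip(u, v))


def is_error_detecting_for_substitutions(words, m):
    """True if no word can be turned into a different word of the set
    using at most m substitutions (simplified channel, see lead-in)."""
    for u, v in combinations(set(words), 2):
        d = substitution_distance(u, v)
        if d is not None and 1 <= d <= m:
            return False
    return True


# Words of (10 + 010)*0: "100100" = 10.010.0 and "010100" = 010.10.0
thin_sample = ["0", "100", "0100"]
same_length_pair = ["100100", "010100"]
```

For instance, `thin_sample` is thin and passes the check for any m, while the two words in `same_length_pair` have equal length and differ in two positions, so the check fails for m = 2. That pair is the kind of situation the edit strings in the figure caption describe.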

LDQL: A Query Language for the Web of Linked Data (Extended Version)

Technical Report

Jul 2015

In this paper, we propose LDQL, that is, a language to query Linked Data on the World Wide Web. The novelty of LDQL is that it enables a user to express separately (i) patterns that describe the expected query result, and (ii) Web navigation paths that select the data sources to be used for computing the result. We show that LDQL is strictly more e...

Channels with Synchronization/Substitution Errors and Computation of Error Control Codes

Article

Jan 2016

We present a randomized algorithm that takes as input two positive integers $N,\ell$ and a channel (=specification of the errors permitted), and computes an error-detecting, or -correcting, block code having up to $N$ codewords of length $\ell$. The channel could allow any rational combination of substitution and synchronization errors. Moreover, if the algorithm finds fewer than $N$ codewords then those codewords constitute a code that, with high probability, is close to maximal (in a certain precise sense defined here). We also present some components of an open source Python package in which several code-related concepts have been implemented. A methodological contribution is the presentation of how various error combinations can be expressed formally and processed algorithmically.

Formal descriptions of code properties: Decidability, complexity, implementation

Article

Apr 2012
INT J FOUND COMPUT S

The branch of coding theory that is based on formal languages has produced several methods for defining code properties, including word relations, dependence systems, implicational conditions, trajectories, and language inequations. Of those, the latter three can be viewed as formal methods in the sense that a certain formal expression can be used to denote a code property. Here we present a formal method which is based on transducers. Each transducer of a certain type defines/describes a desired code property. The method provides simple and uniform decision procedures for the basic questions of property satisfaction and maximality for regular languages. Our work includes statements about the hardness of deciding some of the problems involved. It turns out that maximality can be hard to decide even for "classical" code properties of finite languages. We also present an initial implementation of a LAnguage SERver capable of deciding the satisfaction problem for a given transducer code property and regular language.

Computing Maximal Error Detecting Capabilities of Regular Languages

Article

Jan 2010

A (combinatorial) channel γ consists of pairs of words representing all possible input-output situations of the channel. In an earlier paper, [5], we formalized the intuitive concept of "largest set of errors" detectable by a given language L by defining the maximal error-detecting capabilities of L with respect to a given class of channels, and we showed how to compute all maximal error-detecting capabilities of a given regular language with respect to the class of rational channels and a class of channels involving only the substitution-error type. In this paper we resolve the problem for channels involving errors of any combination of the basic types substitution, insertion, deletion. We also consider the problem of finding the inverses of these channels, in view of the fact that L is error-detecting for γ if and only if it is error-detecting for the inverse of γ.
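The closing observation about inverses can be illustrated on a finite channel given explicitly as a set of (input, output) pairs. This is only a sketch under simplifying assumptions: the helper names are hypothetical, the channel and language are finite, and any special treatment of the empty word in the formal definition is ignored. The condition "no pair of the channel relates two different words of L" is symmetric in input and output, which is why swapping the pairs does not change the answer.

```python
def inverse(channel):
    """Swap input and output in every pair of the channel."""
    return {(v, u) for (u, v) in channel}


def is_error_detecting(language, channel):
    """True if the channel never maps a word of the language
    to a different word of the language."""
    lang = set(language)
    return all(u == v for (u, v) in channel if u in lang and v in lang)


# A small hypothetical channel: each pair is (input word, output word).
channel = {("00", "01"), ("01", "01"), ("11", "10")}

detecting_language = {"00", "11"}      # no pair links two of its words
non_detecting_language = {"00", "01"}  # ("00", "01") links two of its words
```

On this example both `is_error_detecting(L, channel)` and `is_error_detecting(L, inverse(channel))` give the same result for each of the two languages.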

Descriptional Complexity of Error Detection

Chapter

Nov 2017

The neighbourhood of a language L consists of all strings that are within a given distance from a string of L. For example, additive distances or the prefix-distance are regularity preserving in the sense that the neighbourhood of a regular language is always regular. For error detection and error correction applications an important question is to determine the size of the minimal deterministic finite automaton (DFA) needed to recognize the neighbourhood of a language recognized by an n state DFA. This paper surveys recent work on the state complexity of neighbourhoods of regularity preserving distances.

State Complexity of Neighbourhoods and Approximate Pattern Matching

Conference Paper

Jul 2015

The neighbourhood of a language L with respect to an additive distance consists of all strings that have distance at most the given radius from some string of L. We show that the worst case (deterministic) state complexity of a radius r neighbourhood of a language recognized by an n state nondeterministic finite automaton A is $(r+2)^n$. The lower bound construction uses an alphabet of size linear in n. We show that the worst case state complexity of the set of strings that contain a substring within distance r from a string recognized by A is $(r+2)^{n-2} + 1$.
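To make the notion of a radius r neighbourhood concrete, the sketch below tests membership by brute force for a finite language. It assumes the Levenshtein edit distance as the distance (the abstract does not fix a specific one), and all function names are hypothetical. The last helper merely evaluates the expression $(r+2)^n$ quoted above; it does not construct any automaton.

```python
def edit_distance(u, v):
    """Levenshtein distance via the standard row-by-row dynamic program."""
    prev = list(range(len(v) + 1))
    for i, a in enumerate(u, start=1):
        cur = [i]
        for j, b in enumerate(v, start=1):
            cur.append(min(prev[j] + 1,              # delete a from u
                           cur[j - 1] + 1,           # insert b into u
                           prev[j - 1] + (a != b)))  # substitute or match
        prev = cur
    return prev[-1]


def in_neighbourhood(word, language, r):
    """True if word is within distance r of some word of the finite language."""
    return any(edit_distance(word, x) <= r for x in language)


def neighbourhood_state_bound(n, r):
    """Evaluate the worst-case expression (r + 2)^n from the abstract."""
    return (r + 2) ** n


sample_language = {"100", "0100"}
```

With `sample_language`, the word "1100" lies in the radius 1 neighbourhood (one substitution away from "0100"), whereas "1111" does not.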

Automata for Codes

Conference Paper

Jul 2013

Helmut Jürgensen

We survey the actual and potential rôles of automata in the modelling of information transmission systems and, in particular, in the encoder, channel and decoder components of such systems. Our focus is on applications of codes in such systems and on the relevance of automaton theoretic methods to these applications. We discuss, for example, the issues of error-detection, fault-tolerance and error-correction for variable-length codes. Beyond reviewing known work in a possibly new setting, we also present some recent results on fault-tolerant decoders for systems in which synchronization errors are likely. We conclude with a kind of research programme, a list of rather general open problems requiring solutions.

Computing Maximal Error-detecting Capabilities and Distances of Regular Languages

Article

Dec 2010
FUND INFORM

A (combinatorial) channel consists of pairs of words representing all possible input-output channel situations. In a past paper, we formalized the intuitive concept of “largest amount of errors” detectable by a given language L, by defining the maximal error-detecting capabilities of L with respect to a given class of channels, and we showed how to compute all maximal error-detecting capabilities (channels) of a given regular language with respect to the class of rational channels and a class of channels involving only the substitution-error type. In this paper we resolve the problem for channels involving any combination of the basic error types: substitution, insertion, deletion. Moreover, we consider the problem of finding the inverses of these channels, in view of the fact that L is error-detecting for γ if and only if it is error-detecting for the inverse of γ. We also discuss a natural method of reducing the problem of computing (inner) distances of a given regular language L to the problem of computing maximal error-detecting capabilities of L.
