Home
Bulgarian Academy of Sciences
Institute for Bulgarian Language (IBL)
Tsvetana Dimitrova

Tsvetana Dimitrova
Bulgarian Academy of Sciences | BAS · Institute for Bulgarian Language (IBL)

Ph.D.

About

Publications

9,656

Reads

Citations

Skills and Expertise

Corpus Linguistics

March 2010 - present

Bulgarian Academy of Sciences

Institute for Bulgarian Language (IBL)
Bulgaria

Position

Professor (Assistant)

June 2008 - February 2010

Bulgarian Academy of Sciences

Institute for Bulgarian Language (IBL)
Bulgaria

Position

Research Assistant

August 2003 - April 2008

Norwegian University of Science and Technology

Field of study

Linguistics

August 1996 - August 2001

Sofia University "St. Kliment Ohridski"

Field of study

Bulgarian Language and Literature

Publications

Verbs of transfer of possession in FrameNet

Conference Paper

Full-text available

May 2024

Tsvetana Dimitrova

Българските клитики: история и настояще

Book

Full-text available

Feb 2023

Tsvetana Dimitrova

Predicative constructions and State Semantics: a corpus study on Bulgarian and Russian

Chapter

Full-text available

Nov 2022

Research on the Basic Verbal Vocabulary in Bulgarian for Students in the Initial Stage of Education through Online Games

Article

Sep 2022

The article offers an approach for tracking the knowledge and skills for the use of verbs that are deemed part of the basic vocabulary of students in the initial stage of education through language tasks. There are 5 types of tasks that will be conducted in the form of an online game in 4 variants. They are aimed at researching basic competencies i...

State predicatives: conceptions, classifications, problems

Article

Full-text available

Dec 2021

The study aims at presenting the predicatives of state in linguistic research. The existing descriptions of the predicatives expressing state are analyzed in the context of the semantic typology predicatives with a view of their structure and the scope of the semantic field to which they belong. Several classifications are considered that take into...

On the Diachrony of the Clitic Cluster in Bulgarian

Article

Full-text available

Dec 2021

Tsvetana Dimitrova

The article traces back the formation of the clitic cluster in Bulgarian starting from the Old Church Slavonic through Middle Bulgarian up to the Early Modern Bulgarian and beyond. It offers a hypothetical two-layer structure of the cluster – with the main layer consisting of a (pronominal) core and a (verbal) periphery, and a secondary layer hosti...

TOWARDS A SEMANTIC NETWORK ENRICHED WITH A VARIETY OF SEMANTIC RELATIONS

Book

Full-text available

Aug 2020

The Conjunction ‘Makar’ in Bulgarian Monuments Dated Between the 15th and 17th Centuries (Союз «макар» в болгарских памятниках XV-XVII вв.)

Article

Full-text available

Aug 2020

The article discusses the semantics and structure of the sentences introduced by the conjunction "макар" (although) drawing upon data excerpted from Bulgarian monuments dated between the 15th and 17th centuries. In this early period, "макар" was a newly introduced conjunction which later extended its use to become a widely used concessive conjuncti...

Sententional Negation and Clitics in Middle Bulgarian

Conference Paper

Full-text available

Jul 2020

We give a short survey of the clitic syntax and the placement of the general negation in Middle Bulgarian, with focus on Wallachian Bulgarian letters (ca. 1386 ―1509 AD). These issues are inter-dependent. The placement of the negation marker не in the clause has an impact over the clitic-internal ordering. Auxiliary clitics tend to be placed differ...

Tagging Historic Bulgarian Texts: Experiments and Challenges

Conference Paper

Full-text available

Feb 2020

Бележки върху изграждането на клитичния комплекс в историята на българския език

Conference Paper

Full-text available

Jan 2020

Tsvetana Dimitrova

Предложения с союзом МАКАР в болгарских памятниках XV-XVII вв.

Conference Paper

Full-text available

Feb 2019

Hear about Verbal Multiword Expressions in the Bulgarian and the Romanian Wordnets Straight from the Horse’s Mouth

Conference Paper

Jan 2019

The semantic classification of adjectives in the Bulgarian Wordnet: Towards a multiclass approach

Article

Full-text available

Dec 2018

The semantic classification of adjectives in the Bulgarian Wordnet: Towards a multiclass approach The paper presents an attempt at semantic classification of adjectives in the Bulgarian wordnet. Although designed for the Bulgarian wordnet, the classification can be applied to other wordnets which are developed in parallel to the Princeton WordNet....

Цветана Димитрова (БАН), Андрей Бояджиев (СУ). Коллекции параллельных примеров языковых явлений. 12 международный симпозиум „Русистика в современном мире“

Presentation

Full-text available

Dec 2018

Компютърна лингвистика, корпусна лингвистика, лингвистическа анотация

Classification of Adjectives in BulNet: Notes on an Effort

Conference Paper

Full-text available

Jun 2017

The paper presents an overview of an attempt at the semantic classification of adjectives in the Bulgarian Wordnet based on the information that is already available in WordNet, and other classifications proposed in the literature (classifications in the linguistic literature for Bulgarian and approaches implemented by other wordnets, more precisel...

Adjectives in Wordnet: Semantic Issues

Conference Paper

Full-text available

Oct 2016

The paper presents some preliminary observations on the classification of the adjectives in WordNet for a discussion on the principles applied. The insights support a work-in-progress on the development and introduction of a more detailed classification of the adjectives in the (Bulgarian) WordNet for enriching it with further information about the...

Metadata Extraction, Representation and Management within the Bulgarian National Corpus

Conference Paper

Full-text available

May 2016

This paper presents the extraction, representation and management of metadata in the Bulgarian National Corpus. We briefly present the current state of the Corpus and the general principles on which its development lies: uniformity, diversity of text samples, automatic compilation, extensive metadata, multi-layered linguistic annotation. The releva...

Automatic Prediction of Morphosemantic Relations

Conference Paper

Full-text available

Jan 2016

This paper presents a machine learning method for automatic identification and classification of morphosemantic relations (MSRs) between verb and noun synset pairs in the Bulgarian WordNet (BulNet). The core training data comprise 6,641 morphosemantically related verb–noun literal pairs from BulNet. The core data were preprocessed quality-wise by a...

Hydra for Web: A Browser for Easy Access to Wordnets

Conference Paper

Full-text available

Jan 2016

This paper presents a web interface for wordnets named Hydra for Web which is built on top of Hydra – an open source tool for wordnet development – by means of modern web technologies. It is a Single Page Application with simple but powerful and convenient GUI. It has two modes for visualisation of the language correspondences of searched (and foun...

HYDRA FOR WEB: A MULTILINGUAL WORDNET VIEWER

Conference Paper

Full-text available

Nov 2015

This paper presents Hydra for Web – a web interface for wordnets (and lexical-semantic databases with similar relational structure). Hydra for web is built on top of Hydra – an open source tool for wordnet development – and is a single page application with a simple GUI. It has two modes – single and parallel – for visualisation of the language cor...

NOUN-VERB DERIVATION IN THE BULGARIAN, ROMANIAN AND ENGLISH WORDNETS – A COMPARATIVE APPROACH

Conference Paper

Full-text available

Nov 2015

In the context of developing wordnets and using them in various applications, we have been enriching the Romanian and Bulgarian resources with morphosemantic relations that can aid broadening the wordnet content and improving the possible NLP applications. In this paper, we build on our previous results, adding to our presentation data from English...

Automatic Classification of WordNet Morphosemantic Relations

Conference Paper

Full-text available

Sep 2015

This paper presents work in progress on a machine learning method for classification of morphosemantic relations between verb and noun synsets. The training data comprises 5,584 verb–noun synset pairs from the Bulgarian WordNet, where the morphosemantic relations were automatically transferred from the Princeton Word-Net morphosemantic database. Th...

Genitive-Dative Syncretism in the History of the Bulgarian Language. Towards an Analysis

Article

Full-text available

Jan 2015

In this article, we trace the diachronic phases of so-called genitive-dative syncretism in Old Bulgarian, a phenomenon which marks the beginning of the process of disintegration of the Case system in the history of Bulgarian. We base our research on a corpus study (comprising the texts of Codex Marianus, Codex Zographensis and Codex Suprasliensis)...

Rule-based Person Named Entity Recognition for Bulgarian

Conference Paper

Jan 2015

Historical Corpora of Bulgarian Language and Second Position Markers

Conference Paper

Full-text available

Sep 2014

This paper demonstrates how historical corpora can be used in researching language phenomena. We exemplify the advantages and disadvantages through exploring three of the available corpora that contain textual sources of Old and Middle Bulgarian language to shed light on some aspects of the development of two words of ambiguous class. We discuss th...

Title Project: E-Reference Tools for Vet Trainers in Food Industry

Article

Apr 2014

Tsvetana Dimitrova

Title Project: Innovative E-Learning in Reparative Medicine

Article

Apr 2014

Tsvetana Dimitrova

New Opportunities for Vocational Training: Job Oriented E-Learning in Biotechnology and Environment Protection (Jobel-Bio)

Article

Apr 2014

Tsvetana Dimitrova

Coping with derivation in the Bulgarian wordnet

Article

Full-text available

Jan 2014

The paper motivates a strategy for identification and annotation of derivational relations in the Bulgarian wordnet that aims at coping with the complex morphology of the language in an elegant way. Our method involves transfer of the Princeton WordNet (morpho)semantic relations into the Bulgarian wordnet, at the level of the synset, and further de...

The Bulgarian National Corpus: Theory and Practice in Corpus Design

Article

Full-text available

Dec 2012

The paper discusses several key concepts related to the development of corpora and reconsiders them in light of recent developments in NLP. On the basis of an overview of present-day corpora, we conclude that the dominant practices of corpus design do not utilise adequately the technologies and, as a result, fail to meet the demands of corpus lingu...

Bulgarian-English Sentence- and Clause-Aligned Corpus

Conference Paper

Full-text available

Nov 2012

The paper presents the partially automatically annotated and fully manually validated Bulgarian-English Sentence- and Clause-Aligned Corpus. The discussion covers the motivation behind the corpus development, the structure and content of the corpus, illustrated by statistical data, the segmentation and alignment strategy and the tools used in the c...

Application of clause alignment for statistical machine translation

Conference Paper

Full-text available

Jul 2012

The paper presents a new resource light flexible method for clause alignment which combines the Gale-Church algorithm with internally collected textual information. The method does not resort to any pre-developed linguistic resources which makes it very appropriate for resource light clause alignment. We experiment with a combination of the method...

Design and Development of the Bulgarian Sense-Annotated Corpus.

Conference Paper

Full-text available

Apr 2011

The Old Bulgarian Noun Phrase

Book

Jan 2011

Tsvetana Dimitrova

"The Old Bulgarian Noun Phrase: Towards an Annotation Specification" addresses the issue of application of modern linguistic approaches to historical language data in a corpora-oriented approach. The study combines a linguistic analysis of Old Bulgarian diachronic data with a proposal for an annotation specification for the nominal categories. Ca....