MCA plot : a simultaneous representation of pre-adjectival vs. pre-determiner constructions (active, in black), intensifiers (active, in red), text modes (active, in green), text types (active, in blue), and semantic classes (illustrative, in cyan)

Source publication

Forms and meanings of intensification: a multifactorial comparison of quite and rather

Article

Full-text available

Aug 2015

Guillaume Desagulier

To capture usage-based relations between near-synonyms, I cluster collocation data using exploratory multifactorial methods. My investigation is restricted to quite and rather in the contexts where they intensify adjectives in the British National Corpus. I use correspondence analysis and multiple correspondence analysis to visualize and interpret...

Context 1

... variable 'text_info' was ignored. Figure 4 maps the first two dimensions of the data. Along the horizontal axis, pre-determiner quite is correlated with positive connotations ('value_desirable', 'dimension_position', 'importance', 'psych_stim_good', 'adequacy_suitability', 'physical_property_good'), with the exception of 'cost_high'. ...

View in full-text

Context 2

... 'psych_stim_good', or 'physical_property_good') whereas pre- adjectival rather is found with their negative counterparts ('value_undesirable', 'psych_stim_bad', 'physical_property_bad'). Admittedly, not all the oppositions listed in Table 6 are found in the first two dimensions of the MCA plot in Figure 4 (for example, the 'luck_good' vs. 'luck_bad' is absent from this second dataset). It suggests that the behavior of the alternation differs slightly from the more general behavior of quite and rather. ...

View in full-text

Detecting which variables alter component interpretation across multiple groups: A resampling-based method

Article

Full-text available

Apr 2019

In psychology, many studies measure the same variables in different groups. In case of a large number of variables and when a strong a priori idea about underlying latent constructs is lacking, researchers often start with reducing the variables to a few principal components in an exploratory way. Herewith, one often wants to evaluate whether the c...

Evolution of Market Uncertainty around Earnings Announcements

Article

Full-text available

Jan 2001

This paper investigates theoretically and empirically the dynamics of the implied volatility (or implied standard deviation - ISD) around earnings announcements dates. The volatility implied by option prices can be interpreted as the level of volatility expected by the market over the remaining life of the option. We propose a theoretical framework...

What is a STEM job? How Different Interpretations of the Acronym Result in Disparate Labor Market Projections How Different Interpretations of the Acronym Result in Disparate Labor Market Projections

Technical Report

Full-text available

Jan 2014

KEY FINDINGS: 1. Estimates of STEM jobs in the United States vary from 5.4 million to 26 million, depending on which occupations are included under the STEM umbrella and how occupations are defined. This results in wildly disparate projections for jobs, wages, and required education for what may appear to be a single cluster of occupations (i.e., S...

Construction of a happiness index using polytomous item response theory models in a survey

Conference Paper

Full-text available

Jun 2013

In this paper, we present an alternative to derive univariate indices by using polytomous item response theory models in data from surveys. Particularly, the Samejima's graded response model [12] was used. A real data set from the Social European Survey database was fitted. In order to interpret the index a multiple correspondence analysis and and...

Figure 4. The transformation of flavors upon a mutation on node j can...

Figure 6. Meson generated by anticomposition. Here m − c = 0. This rule...

Higher Cluster Categories and QFT Dualities

Article

Full-text available

Nov 2017

We present a unified mathematical framework that elegantly describes minimally SUSY gauge theories in even dimension, ranging from $6d$ to $0d$, and their dualities. This approach combines recent developments on graded quiver with potentials, higher Ginzburg algebras and higher cluster categories (also known as $m$-cluster categories). Quiver mutat...

A corpus-based study of maximizer–adjective patterns in Croatian

Article

Full-text available

Mar 2024
LANG SCI

Ivan Lacić

Maximizers represent a subclass of degree modifiers that convey the highest degree to which a property can be carried out. This paper studies five Croatian near-synonymous maximizers (all meaning “completely, totally”), viz. posve, potpuno, sasvim, skroz, and totalno, as a part of <maximizer + adjective> construction. It is assumed here that analysed pairings act as (semi)-prefabricated units with maximizers that impose particular modes of construal. To analyse the subtle semantic differences of examined maximizers, we shall turn to the distributional hypothesis and examine contexts in which maximizers occur. Using a combination of analytical statistics (collostructional analysis) and multifactorial methods (hierarchical agglomerative cluster analysis and correspondence analysis), we aim to examine similarities (proximities) and differences (distances) between analysed constructions in order to understand intricate relationships among maximizers, fostering valuable insights into their semantics. The findings of this study provide insight into the interplay of the Croatian maximizers and adjectives.

Review : Desagulier. 2017. Corpus Linguistics and Statistics with R: Introduction to Quantitative Methods in Linguistics

Article

Aug 2019
Corpora

Graham Ranger

Can word vectors help corpus linguists?

Article

Full-text available

Jul 2019

Guillaume Desagulier

Two recent methods based on distributional semantic models (DSMs) have proved very successful in learning high-quality vector representations of words from large corpora: word2vec and GloVe. Once trained on a very large corpus, these algorithms produce distributed representations for words in the form of vectors. DSMs based on deep learning and neural networks have proved efficient in representing the meaning of individual words. In this paper, I assess to what extent state-of-the-art word-vector semantics can help corpus linguists annotate large datasets for semantic classes. Although word vectors suggest exciting opportunities for resolving semantic annotation issues, there is still room for improvement in terms of the representation of polysemy, homonymy, and multiword expressions.

Notions of Statistical Testing

Chapter

Nov 2017

Guillaume Desagulier

In this chapter, you will learn the basics of statistical thinking, namely inferential statistics and statistical testing. These fundamentals will serve as a basis for the following chapters.

Clustering Methods

Chapter

Nov 2017

Guillaume Desagulier

In this chapter, I introduce clustering techniques Their aim is to form clusters of objects so that similar objects are grouped in the same clusters and different objects are grouped in different clusters. I also introduce the concept of a network graph which, although not a clustering technique, is a useful, related addition to your corpus linguistics tool repository.

MCA plot : a simultaneous representation of pre-adjectival vs. pre-determiner constructions (active, in black), intensifiers (active, in red), text modes (active, in green), text types (active, in blue), and semantic classes (illustrative, in cyan)

Contexts in source publication

Similar publications

Citations