Figure 4 - uploaded by Guillaume Desagulier
Content may be subject to copyright.
MCA plot : a simultaneous representation of pre-adjectival vs. pre-determiner constructions (active, in black), intensifiers (active, in red), text modes (active, in green), text types (active, in blue), and semantic classes (illustrative, in cyan)  

MCA plot : a simultaneous representation of pre-adjectival vs. pre-determiner constructions (active, in black), intensifiers (active, in red), text modes (active, in green), text types (active, in blue), and semantic classes (illustrative, in cyan)  

Source publication
Article
Full-text available
To capture usage-based relations between near-synonyms, I cluster collocation data using exploratory multifactorial methods. My investigation is restricted to quite and rather in the contexts where they intensify adjectives in the British National Corpus. I use correspondence analysis and multiple correspondence analysis to visualize and interpret...

Contexts in source publication

Context 1
... variable 'text_info' was ignored. Figure 4 maps the first two dimensions of the data. Along the horizontal axis, pre-determiner quite is correlated with positive connotations ('value_desirable', 'dimension_position', 'importance', 'psych_stim_good', 'adequacy_suitability', 'physical_property_good'), with the exception of 'cost_high'. ...
Context 2
... 'psych_stim_good', or 'physical_property_good') whereas pre- adjectival rather is found with their negative counterparts ('value_undesirable', 'psych_stim_bad', 'physical_property_bad'). Admittedly, not all the oppositions listed in Table 6 are found in the first two dimensions of the MCA plot in Figure 4 (for example, the 'luck_good' vs. 'luck_bad' is absent from this second dataset). It suggests that the behavior of the alternation differs slightly from the more general behavior of quite and rather. ...

Similar publications

Article
Full-text available
In psychology, many studies measure the same variables in different groups. In case of a large number of variables and when a strong a priori idea about underlying latent constructs is lacking, researchers often start with reducing the variables to a few principal components in an exploratory way. Herewith, one often wants to evaluate whether the c...
Article
Full-text available
This paper investigates theoretically and empirically the dynamics of the implied volatility (or implied standard deviation - ISD) around earnings announcements dates. The volatility implied by option prices can be interpreted as the level of volatility expected by the market over the remaining life of the option. We propose a theoretical framework...
Technical Report
Full-text available
KEY FINDINGS: 1. Estimates of STEM jobs in the United States vary from 5.4 million to 26 million, depending on which occupations are included under the STEM umbrella and how occupations are defined. This results in wildly disparate projections for jobs, wages, and required education for what may appear to be a single cluster of occupations (i.e., S...
Conference Paper
Full-text available
In this paper, we present an alternative to derive univariate indices by using polytomous item response theory models in data from surveys. Particularly, the Samejima's graded response model [12] was used. A real data set from the Social European Survey database was fitted. In order to interpret the index a multiple correspondence analysis and and...
Article
Full-text available
We present a unified mathematical framework that elegantly describes minimally SUSY gauge theories in even dimension, ranging from $6d$ to $0d$, and their dualities. This approach combines recent developments on graded quiver with potentials, higher Ginzburg algebras and higher cluster categories (also known as $m$-cluster categories). Quiver mutat...

Citations

... This principle asserts that a correlation between distributional similarity and meaning similarity enables us to infer the latter from the former. Drawing inspiration from Desagulier's (2014Desagulier's ( , 2015 methodological approach, this study aims to explore distinctions among the aforementioned maximizers by examining their collocational profiles as indicative of their divergent semantics. The working hypothesis, in alignment with Desagulier's framework (2014), postulates that an overlap in collocation preferences among skroz, sasvim, posve, potpuno, and totalno would suggest a shared conceptual content, classifying them as near-synonyms. ...
... This paper adopts the methodology proposed by Desagulier (2014Desagulier ( , 2015 and combines analytical statistics with multifactorial methods. Analytical statistics involves the application of various statistical methods and techniques to analyse and interpret data, encompassing descriptive statistics, inferential statistics, regression analysis, correlation analysis, and more. ...
... This approach should enable us to understand these three elements' interplay better. In line with Desagulier's (2014Desagulier's ( , 2015 approach, this study will utilise the output of MDCA as input for correspondence analysis (CA). ...
Article
Full-text available
Maximizers represent a subclass of degree modifiers that convey the highest degree to which a property can be carried out. This paper studies five Croatian near-synonymous maximizers (all meaning “completely, totally”), viz. posve, potpuno, sasvim, skroz, and totalno, as a part of <maximizer + adjective> construction. It is assumed here that analysed pairings act as (semi)-prefabricated units with maximizers that impose particular modes of construal. To analyse the subtle semantic differences of examined maximizers, we shall turn to the distributional hypothesis and examine contexts in which maximizers occur. Using a combination of analytical statistics (collostructional analysis) and multifactorial methods (hierarchical agglomerative cluster analysis and correspondence analysis), we aim to examine similarities (proximities) and differences (distances) between analysed constructions in order to understand intricate relationships among maximizers, fostering valuable insights into their semantics. The findings of this study provide insight into the interplay of the Croatian maximizers and adjectives.
... The preceding lines summarise the contents of CLSR but do not render justice to its real and many strengths. First among these is the way in which Desagulier systematically contextualises the features of R as they are introduced, with concrete applications of the program on linguistic datasets that are both original (his own studies of the position of intensifying adjectives, Desagulier [2014Desagulier [ , 2015) and borrowed from other linguists' work (e.g. Cheshire [2007] on general extenders or Tagliamonte and Hudson [1999] on be like). ...
Article
Full-text available
Two recent methods based on distributional semantic models (DSMs) have proved very successful in learning high-quality vector representations of words from large corpora: word2vec and GloVe. Once trained on a very large corpus, these algorithms produce distributed representations for words in the form of vectors. DSMs based on deep learning and neural networks have proved efficient in representing the meaning of individual words. In this paper, I assess to what extent state-of-the-art word-vector semantics can help corpus linguists annotate large datasets for semantic classes. Although word vectors suggest exciting opportunities for resolving semantic annotation issues, there is still room for improvement in terms of the representation of polysemy, homonymy, and multiword expressions.
Chapter
In this chapter, you will learn the basics of statistical thinking, namely inferential statistics and statistical testing. These fundamentals will serve as a basis for the following chapters.
Chapter
In this chapter, I introduce clustering techniques Their aim is to form clusters of objects so that similar objects are grouped in the same clusters and different objects are grouped in different clusters. I also introduce the concept of a network graph which, although not a clustering technique, is a useful, related addition to your corpus linguistics tool repository.