Clusters and Correspondences. A comparison of two ...
Transcript of Clusters and Correspondences. A comparison of two ...
![Page 1: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/1.jpg)
Clusters and Correspondences. A comparison of two exploratory statistical techniques for
semantic description
Dylan Glynn
University of Leuven RU Quantitative Lexicology and Variational Linguistics
![Page 2: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/2.jpg)
Aim of Study
Compare two simple techniques for exploratory multivariate analysis of semantic structure
Show that quantitative semantic analysis is possible
![Page 3: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/3.jpg)
Cognitive Linguistics
Symbolic unit
Form-meaning pairs - no formal modules (Langacker 1987, Fillmore & al. 1988)
Encyclopaedic semantics
No semantic modules – meaning is all conception and perception (Fillmore 1985, Lakoff 1987)
Entrenchment No grammar – language is usage
…no language system, social langue, or individual competence
![Page 4: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/4.jpg)
Quantitative Approaches to Semantic Structure within
Cognitive Linguistics Polysemy Lexical –
Gries (2006) run Glynn (2008) hassle
Synonymy Constructional –
Gries (1999) VPCxs, Heylen (2005) Middle Field Cxs, Grondelaers & al. (2007) 'there' Cxs Speelman & Geeraerts (forth.) Causative Cxs
Lexical –
Divjak (2006) intend verbs, Divjak & Gries (2006) try verbs, Newman & Rice (2004) posture verbs Newman & Rice (2004) prepositions
![Page 5: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/5.jpg)
Hierarchical Cluster Analysis
HCA shows grouping 2-way tables agglomerative
distance matrix possibility of significance testing (via bootstrapping)
HCA visualisation dendograms different distance measures = emphasis different groupings discrete groups = misleading semantic description
![Page 6: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/6.jpg)
Cluster Analysis
![Page 7: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/7.jpg)
Multiple Correspondence Analysis
MCA shows correlations
n-way tables canonical correlation distance matrix
MCA visualisation
correspondence maps proximity = correlation conflated multiple spaces = misleading proximity
![Page 8: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/8.jpg)
Multiple Correspondence Analysis
transitive
Trans w/o ob
Intrans w/o ob Intrans
Adjectivee
![Page 9: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/9.jpg)
Corpus and Annotation
LiveJournal Corpus Online personal diaries Very large, unparsed British vs. American is distinguished, but little register variation Some gender bias toward woman, probably restricted to middle class, 15-25 year olds. Annotation 3 parameters- Semantic, Formal, and Social 120 values 20 variables 2000 occurrences
![Page 10: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/10.jpg)
Annoy, Bother, Hassle Breaking Down Lemmata
Transitive Saw quite a few people I knew, including the awful stalker guy who's been hassling me ...
Transitive Oblique If you hassle me about my kinky hair, I'll cut it all off. hat in hand, humble, almost begging .
Intransitive Officer McCoy, me and him was hassling and my gun went off, hitting him somewhere ...
Nominal Mass ... because it saves all that ammoying hassle of SOD'S-BLOODY-LAW!!!!!!
Nominal Count I rarely paint my nails(It can be such a hassle!)
Adjective Attributive It's a very hassily event to do.
Adjective Predicative She will not take part in Saturday's 5000m race, saying she is tired and bothered
Gerund the technical know-how to do this sort of hassling ...
![Page 11: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/11.jpg)
Annoy, Bother, Hassle Breaking Down Lemmata
Form Occurrences
Count Noun hassle (hassle_count) 146
Mass Noun hassle (hassle_mass) 217
Gerund hassle (hassle_gerund) 40
Predicative Adjective bother (bother_pred) 124
Intransitive bother (bother_intrans) 222
Transitive annoy (annoy_trans) 449
Transitive hassle (hassle_trans) 274
Transitive bother (bother_trans) 275
![Page 12: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/12.jpg)
Annoy, Bother, Hassle Indirect Semantic Variable: Agent Type
Agent Type
- Human Specific so im hassling you instead of your mum, haha!
- Human Non-Specific but we started to have more people hassling us.
- Institution Well, the Church bothers me quite often,
- Activity - Event It bothers me everytime by boyfreind talks to, or about his ex girlfriends
-Thing I pulled it out but the mouse annoys me too much...
- Abstract State of Affairs I have been open to him about everything else except that part.. however, it bothers me and I'm caught in between
![Page 13: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/13.jpg)
Annoy, Bother, Hassle Agglomerative Hierarchical Cluster Analysis (Dist: Euclidean/ Met: Average)
Construction-Lexeme Agent Type
![Page 14: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/14.jpg)
Annoy, Bother, Hassle Multiple Correspondence Analysis
Construction-Lexeme Agent Type
![Page 15: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/15.jpg)
Annoy, Bother, Hassle Agglomerative Hierarchical Cluster Analysis "pvclust"
2 kinds of p-values: AU (Approximately Unbiased) determined by multiscale bootstrap resampling BP (Bootstrap Probability) value determined by normal bootstrap resampling.
![Page 16: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/16.jpg)
Annoy, Bother, Hassle PV Agglomerative Hierarchical Cluster Analysis (Dist: Euclidean/ Met: Ward)
Construction-Lexeme Agent Type
![Page 17: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/17.jpg)
Annoy, Bother, Hassle Direct Semantic Variables: Cause, Affect, Humour
Cause of Event - expenditure of energy - imposition - imposition / request - interruption - request - condemnation - tease Affect on Patient - anger - repetition / boring - concern - thought - emotional pain - physical pain Humour - - Use of humour in the example - No use of humour in the example
![Page 18: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/18.jpg)
Annoy, Bother, Hassle Agglomerative Hierarchical Cluster Analysis (Dist: Euclidean/ Met: Average)
Construction-Lexeme Dialect Cause Affect Humour – less forms
![Page 19: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/19.jpg)
Annoy, Bother, Hassle Multiple Correspondence Analysis
Construction-Lexeme Dialect Cause Affect Humour - less forms
![Page 20: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/20.jpg)
Annoy, Bother, Hassle PV Agglomerative Hierarchical Cluster Analysis (Dist: Euclidean/ Met: Ward)
Construction-Lexeme Cause Affect - less forms
![Page 21: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/21.jpg)
Annoy, Bother, Hassle Bivariate Correspondence Analysis
Construction-Lexeme Cause Affect - less forms
bother trans
![Page 22: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/22.jpg)
Russian Adjectival Constructions Discrepancies between HCA and MCA
![Page 23: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/23.jpg)
Russian Adjectival Constructions Discrepancies between HCA and MCA
![Page 24: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/24.jpg)
Annoy, Bother, Hassle Bivariate Correspondence Analysis
Construction-Lexeme Cause Affect - less forms
![Page 25: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/25.jpg)
Annoy, Bother, Hassle
Detail of Correspondence Analysis Usage Cluster 1 Class Form
Transitive annoy Transitive bother
Affect Features anger repetition
concern thought emotional pain physical pain interruption aesthetic
![Page 26: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/26.jpg)
Annoy, Bother, Hassle
Detail of Correspondence Analysis
Usage Cluster 2 Class Forms
Transitive hassle
Cause - Affect Features imposition request imposition request tease condemn
![Page 27: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/27.jpg)
Annoy, Bother, Hassle
Detail of Correspondence Analysis
Usage Cluster 3 Class Forms
Count Noun hassle Mass Noun hassle Gerund hassle Adjective bother Intransitive bother
Affect Features energy agitation
![Page 28: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/28.jpg)
Summary
Pros and Cons for HCA and MCA in Quantitative Approaches to Cog. Sem.
HCA - groups usage patterns relative to features
+ Possibility for significance testing + Clear visualisations - 'Blind' Clustering
- Discrete Grouping MCA - maps usage patterns relative to visualised features
+ ‘Analogue’ representation of associations + Correlations visible - Misleading visualisations - No significance testing
![Page 29: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/29.jpg)
Summary
Quantitative Semantic Study A combination of formal, indirect semantic and direct semantic tagging is possible and can produce coherent verifiable results Although semantic analysis is more subjective than formal analysis, if we are to describe all of language, then we should also include semantic features
![Page 30: Clusters and Correspondences. A comparison of two ...](https://reader034.fdocuments.us/reader034/viewer/2022052700/628e9fdce1a96a249058f1a9/html5/thumbnails/30.jpg)
for further information: http://wwwling.arts.kuleuven.ac.be/qlvl/
http://perswww.kuleuven.be/dylan_glynn