AMU Faculty of English is pleased to invite you to a lecture by Dr. Mariusz Kaminski Welcoming "The development of the code R for comparative analysis of dictionary definitions."
This presentation aims to introduce a source code that I have written in R programming language for the purposes of a comparison of definitions from six different dictionaries. In a step-by-step fashion, I will explain successive stages of writing the script. I will show how to automatically generate a random sample of dictionary pages, clean up unnecessary symbols from the definitions, carry out tokenisation, generate a frequency table, and create a matrix for hierarchical cluster analysis (HCA) and correspondence analysis (CA). The analyses will generate two graphs: a dendrogram showing which dictionaries are alike in terms of the distribution of word frequencies (HCA), and a graph showing which words are characteristic of each dictionary (CA).
The lecture is offered within the framework of the Faculty's PhD programme, but it is open to any interested parties.
Informację wprowadził/a: Julia Ruminiecka