Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.
-
Upload
grant-smith -
Category
Documents
-
view
226 -
download
4
Transcript of Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.
![Page 1: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/1.jpg)
Introducing Corpus Linguistics: AntConc and Project Gutenberg.
Dr Glenn Hadikin
![Page 2: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/2.jpg)
• Download two magazines• Conduct a ‘keyword’ query
![Page 3: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/3.jpg)
What is corpus linguistics?
Corpus linguistics is the study of large bodies of naturally occurring text that are ‘visible’ to corpus analysis software.
![Page 4: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/4.jpg)
![Page 5: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/5.jpg)
![Page 6: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/6.jpg)
• https://www.gutenberg.org
![Page 7: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/7.jpg)
![Page 8: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/8.jpg)
![Page 9: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/9.jpg)
![Page 10: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/10.jpg)
When you see this press ctr a to highlight it all and then ctr c to copy it all
![Page 11: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/11.jpg)
Open up Wordpad and press ctr v to dump all the text to Wordpad
![Page 12: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/12.jpg)
Press ‘save as’, choose ‘plain text’ and give it a filename such as boysandgirl.txt
![Page 13: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/13.jpg)
• That’s how I got the boysandgirls.txt file on the website.
• The girls.txt file followed the same procedure but is a copy of ‘The Girl’s Own Paper’ from 1886
![Page 14: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/14.jpg)
![Page 15: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/15.jpg)
Go to ‘file’ and open ‘boysandgirls.txt’
![Page 16: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/16.jpg)
You can type any common word in to the search box at the bottom and see if it’s working okay.
![Page 17: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/17.jpg)
Go to ‘tool preferences’, ‘add files’, upload ‘girls.txt’ and press ‘load’ – this is called a reference file
![Page 18: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/18.jpg)
Before any keyword analysis you must create a ‘wordlist’
![Page 19: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/19.jpg)
• Any guesses what words or ideas will be key in ‘boysandgirls’ compared with ‘girls’?
![Page 20: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/20.jpg)
![Page 21: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/21.jpg)
Click on a word to explore further…
![Page 22: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/22.jpg)
You can go back to ‘tool preferences’ and press ‘swap’ for opposite case.
![Page 23: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/23.jpg)
Check there are 1117 occurrences of ‘the’ to make sure the files have swapped correctly.
![Page 24: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/24.jpg)
![Page 25: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/25.jpg)
If the ‘boysandgirls’ keyword list comes back (with ‘illustrated’ at the top) go back to ‘tool preferences’, clear and reload the
‘boysandgirls’ reference corpus.
![Page 26: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/26.jpg)
• Would similar patterns come up in 21st century books?
![Page 27: Introducing Corpus Linguistics: AntConc and Project Gutenberg. Dr Glenn Hadikin.](https://reader035.fdocuments.us/reader035/viewer/2022062313/56649d755503460f94a55349/html5/thumbnails/27.jpg)
Thank you – all invited to our book launch in Blackwells book shop tomorrow at 5pm.