Making Sense of Large Social Media Corpora: Keywords, Topics, Sentiment, and Hashtags in the Coronavirus Twitter Corpus

Loading...
Thumbnail Image

Date

2024

Journal Title

Journal ISSN

Volume Title

Publisher

Palgrave Macmillan

Abstract

This open access book offers a comprehensive overview of available techniques and approaches to explore large social media corpora, using as an illustrative case study the Coronavirus Twitter corpus. First, the author describes in detail a number of methods, strategies, and tools that can be used to access, manage, and explore large Twitter/X corpora, including both user-friendly applications and more advanced methods that involve the use of data management skills and custom programming scripts. He goes on to show how these tools and methods are applied to explore one of the largest Twitter datasets on the COVID-19 pandemic publicly released, covering the two years when the pandemic had the strongest impact on society. Specifically, keyword extraction, topic modelling, sentiment analysis, and hashtag analysis methods are described, contrasted, and applied to extract information from the Coronavirus Twitter Corpus. The book will be of interest to students and researchers in fields that make use of big data to address societal and linguistic concerns, including corpus linguistics, sociology, psychology, and economics.

Description

This book is freely available in OAPEN (Online Library and Publication Platform) at: https://library.oapen.org/handle/20.500.12657/90421 Creative Commons License: CC BY 4.0

Keywords

Media studies, Communication studies, Internet, Digital media

Citation

Moreno-Ortiz, A. (2024). Making Sense of Large Social Media Corpora: Keywords, Topics, Sentiment, and Hashtags in the Coronavirus Twitter Corpus. Palgrave Macmillan. https://library.oapen.org/handle/20.500.12657/90421