Overcoming Language Barriers: Assessing the Potential of Machine Translation and Topic Modeling for the Comparative Analysis of Multilingual Text Corpora
Options
BORIS DOI
Date of Publication
May 20, 2019
Publication Type
Article
Division/Institute
Subject(s)
Series
Communication methods and measures
ISSN or ISBN (if monograph)
1931-2458
Publisher
Taylor & Francis
Language
English
Publisher DOI
Description
This study assesses the potential of topic models coupled with machine translation for comparative communication research across language barriers. From a methodological point of view, the robustness of a combined approach is examined. For this purpose the results of different machine translation services (Google Translate vs. DeepL) as well as methods (full-text vs. term-by-term) are compared. From a substantive point of view, the integratability of the approach into comparative study designs is tested. For this, the online discourses about climate change in Germany, the United Kingdom, and the United States are compared. First, the results show that the approach is relatively robust and second, that integration in comparative study designs is not a problem. It is concluded that this as well as the relatively moderate costs in terms of time and money makes the strategy to couple topic models with machine translation a valuable addition to the toolbox of comparative communication researchers.
File(s)
File | File Type | Format | Size | License | Publisher/Copright statement | Content | |
---|---|---|---|---|---|---|---|
Reber_2019_Overcoming language barriers.pdf | text | Adobe PDF | 2.57 MB | publisher | published | ||
Reber_2019_Overcoming-language-barriers_accepted-manuscript.pdf | text | Adobe PDF | 1.27 MB | publisher | accepted |