Cross-National Information Attacks: A Two-Decade Analysis of Troll Behavior in Korea

2026-06-22Social and Information Networks

Social and Information NetworksComputation and LanguageCryptography and Security
AI summary

The authors created a machine learning system to identify and understand fake or harmful comments in South Korean online news over 20 years. Their tool can spot comments likely written by foreign trolls by looking at where they come from, the emotions they use, and who they target. They also explain why the system thinks a comment is suspicious by highlighting parts of the text. The study found these trolls mostly use morally charged language to criticize local politicians, which gets lots of attention and might increase political division. This method helps online platforms better identify and manage harmful comments with clear explanations.

machine learningforeign influence operationsonline trollingmoral-emotional framingexplainable AIlongitudinal analysispolitical polarizationuser engagementtext classificationplatform governance
Authors
Jaehong Kim, Hyeonseung Kim, Jiseon Kim, Alice Oh, Thorsten Holz, Wonjae Lee, Meeyoung Cha
Abstract
Coordinated foreign influence operations pose a growing threat to online platforms, but detecting state-linked troll activity and tracking its evolution remain challenging. This paper presents an explainable machine learning framework for theory-guided detection and longitudinal analysis of suspected trolling within Korean online news comment sections. Our hierarchical model classifies comments along three dimensions central to influence campaigns: foreign origin, moral-emotional framing, and target country. To support explainability, it also extracts brief span-level textual evidence that provides human-interpretable rationales. We apply the approach to 112M South Korean news comments authored by 4M users over nearly 20 years, identifying 23,998 accounts exhibiting behavior consistent with coordinated manipulation. Analyzing these accounts, we find that they predominantly rely on morally condemning rhetoric rather than direct promotion of foreign-aligned narratives; this rhetoric receives significantly higher user engagement. Among the highest-engagement comments, the moral condemnation most frequently targets domestic political figures (e.g., presidents or party leaders) on both the left and the right, potentially amplifying polarization. Our framework supports transparent platform governance through explainable, evidence-based moderation. These observed rhetorical and engagement patterns can inform how platforms and observatories prioritize defenses and intervene before harmful narrative-target combinations achieve widespread reach.