A novel strategy for quickly identifying twitter trolls

By accounting for distinctive patterns of repetition, as few as 50 tweets are needed for detection

Date:: August 12, 2020
Source:: PLOS
Summary:: Two algorithms that account for distinctive use of repeated words and word pairs require as few as 50 tweets to accurately distinguish deceptive 'troll' messages from those posted by public figures.
Share:: Facebook Twitter Pinterest LinkedIN Email

FULL STORY

Two algorithms that account for distinctive use of repeated words and word pairs require as few as 50 tweets to accurately distinguish deceptive "troll" messages from those posted by public figures. Sergei Monakhov of Friedrich Schiller University in Jena, Germany, presents these findings in the open-access journal PLOS ONE on August 12, 2020.

Troll internet messages aim to achieve a specific purpose, while also masking that purpose. For instance, in 2018, 13 Russian nationals were accused of using false personas to interfere with the 2016 U.S. presidential election via social media posts. While previous research has investigated distinguishing characteristics of troll tweets -- such as timing, hashtags, and geographical location -- few studies have examined linguistic features of the tweets themselves.

Monakhov took a sociolinguistic approach, focusing on the idea that trolls have a limited number of messages to convey, but must do so multiple times and with enough diversity of wording and topics to fool readers. Using a library of Russian troll tweets and genuine tweets from U.S. congresspeople, Monakhov showed that these troll-specific restrictions result in distinctive patterns of repeated words and word pairs that are different from patterns seen in genuine, non-troll tweets.

Then, Monakhov tested an algorithm that uses these distinctive patterns to distinguish between genuine tweets and troll tweets. He found that the algorithm required as few as 50 tweets for accurate identification of trolls versus congresspeople. He also found that the algorithm correctly distinguished troll tweets from tweets by Donald Trump -- which although provocative and "potentially misleading," according to Twitter, are not crafted to hide his purpose.

This new strategy for quickly identifying troll tweets could help inform efforts to combat hybrid warfare while preserving freedom of speech. Further research will be needed to determine whether it can accurately distinguish troll tweets from other types of messages that are not posted by public figures.

Monakhov adds: "Though troll writing is usually thought of as being permeated with recurrent messages, its most characteristic trait is an anomalous distribution of repeated words and word pairs. Using the ratio of their proportions as a quantitative measure, one needs as few as 50 tweets for identifying internet troll accounts."

RELATED TOPICS
- Computers & Math
- Science & Society
  - Political Science
  - Popular Culture

RELATED TERMS

Story Source:

Materials provided by PLOS. Note: Content may be edited for style and length.

Journal Reference:

Sergei Monakhov. Early detection of internet trolls: Introducing an algorithm based on word pairs / single words multiple repetition ratio. PLOS ONE, 2020; 15 (8): e0236832 DOI: 10.1371/journal.pone.0236832

Cite This Page:

PLOS. "A novel strategy for quickly identifying twitter trolls." ScienceDaily. ScienceDaily, 12 August 2020. <www.sciencedaily.com/releases/2020/08/200812144149.htm>.

PLOS. (2020, August 12). A novel strategy for quickly identifying twitter trolls. ScienceDaily. Retrieved February 6, 2026 from www.sciencedaily.com/releases/2020/08/200812144149.htm

PLOS. "A novel strategy for quickly identifying twitter trolls." ScienceDaily. www.sciencedaily.com/releases/2020/08/200812144149.htm (accessed February 6, 2026).

Explore More

from ScienceDaily

A novel strategy for quickly identifying twitter trolls

By accounting for distinctive patterns of repetition, as few as 50 tweets are needed for detection

Explore More

Breaking

Trending Topics

Strange & Offbeat