Cross-Domain Learning for Classifying Propaganda in Online Contents

Wang, Liqiang; Shen, Xiaoyu; de Melo, Gerard; Weikum, Gerhard

Wang, L., Shen, X., de Melo, G., & Weikum, G. (2020). Cross-Domain Learning for Classifying Propaganda in Online Contents. Retrieved from https://arxiv.org/abs/2011.06844.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/21.11116/0000-0007-FEBF-5
Version Permalink: https://hdl.handle.net/21.11116/0000-0007-FEC0-2

Genre: Paper

Files

arXiv:2011.06844.pdf (Preprint), 469KB

File Permalink:
https://hdl.handle.net/21.11116/0000-0007-FEC1-1

Name:
arXiv:2011.06844.pdf

Description:
File downloaded from arXiv at 2021-02-17 10:56

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Copyright Date:
-

Copyright Info:
-

License:
http://creativecommons.org/licenses/by/4.0/

Creators

Creators:
Wang, Liqiang¹, Author
Shen, Xiaoyu¹, Author
de Melo, Gerard², Author
Weikum, Gerhard¹, Author

Affiliations:
¹ Databases and Information Systems, MPI for Informatics, Max Planck Society, ou_24018
² External Organizations, ou_persistent22

Content

Free keywords: Computer Science, Computation and Language, cs.CL

Abstract: As news and social media exhibit an increasing amount of manipulative
polarized content, detecting such propaganda has received attention as a new
task for content analysis. Prior work has focused on supervised learning with
training data from the same domain. However, as propaganda can be subtle and
keeps evolving, manual identification and proper labeling are very demanding.
As a consequence, training data is a major bottleneck. In this paper, we tackle
this bottleneck and present an approach to leverage cross-domain learning,
based on labeled documents and sentences from news and tweets, as well as
political speeches with a clear difference in their degrees of being
propagandistic. We devise informative features and build various classifiers
for propaganda labeling, using cross-domain learning. Our experiments
demonstrate the usefulness of this approach, and identify difficulties and
limitations in various configurations of sources and targets for the transfer
step. We further analyze the influence of various features, and characterize
salient indicators of propaganda.

Details

Language(s): eng - English

Dates: Created: 2020-11-13; Modified: 2020-11-22; Published Online: 2020

Publication Status: Published online

Pages: 11 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 2011.06844
URI: https://arxiv.org/abs/2011.06844
BibTex Citekey: Wang_2011.06844

Degree: -
