26(1) / 2021 / 6 / pp. 79-104
中文新聞文本之宣傳手法標記與分析
The Analysis and Annotation of Propaganda Techniques in Chinese News Texts
Authors
Meng-Hsien Shih (Institute of Taiwan Languages and Language Teaching, National Tsing Hua University; Center for General Education, National Chung Cheng University)
Ren-feng Duann* (Center for General Education, National Taitung University)
Siaw-Fong Chung (Department of English, National Chengchi University)
Chinese Abstract
新聞媒體常在政治新聞文本中運用宣傳手法(propaganda techniques)表達媒體本身之政治立場,企圖影響讀者之立場。目前尚無具宣傳手法標記之中文語料供立場分析,本文以可解釋性的方式,人工細部標記中文新聞文本所使用之宣傳手法、並以 Bootstrap 方式擴展標記規模的資料集,再分別以人工檢核與先導實驗來確保標記資料集之效能。透過單純貝式分類器搭配基本的詞袋特徵進行訓練後,機器判讀行段是否包含宣傳手法的準確率達 74.26%。本宣傳手法之人工標記資料已公開釋出,可應用於未來機器訓練與學習預測新文本之立場。
English Abstract
News media often employ propaganda techniques in political news texts to express their own political stance and to influence that of their audience. No Chinese corpus annotated with propaganda techniques is yet available for stance analysis. In this paper, taking an explainable approach, we manually annotated the propaganda techniques used in Chinese political news texts and enlarged the dataset by bootstrapping from a small set of manually annotated data. To ensure the validity of the dataset, we manually corrected the bootstrapped annotations and ran a pilot machine-learning experiment using a naïve Bayes classifier trained with bag-of-words features. The classifier reached a precision of 74.26% on the binary task of deciding whether a text segment contains a propaganda technique. The manually annotated data are available online and can be used to train models that predict the stance of new texts.
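The pilot experiment summarized in the abstract pairs a naïve Bayes classifier with bag-of-words features for a binary decision (with or without a propaganda technique). The following is a minimal sketch of that general technique, not the authors' implementation. The class name `NaiveBayesBoW`, the whitespace tokenizer, the add-one smoothing, the label names, and the toy English sentences are all assumptions made for illustration; none of them comes from the released dataset, and real Chinese text would first need word segmentation.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Whitespace tokenizer (assumption: the input is already word-segmented)."""
    return text.split()


class NaiveBayesBoW:
    """Multinomial naive Bayes over bag-of-words counts with add-one smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # number of documents per class
        self.word_counts = defaultdict(Counter)  # per-class word frequencies
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(labels)
        return self

    def log_scores(self, text):
        """Return the unnormalized log posterior of each class for one text."""
        vocab_size = len(self.vocab)
        scores = {}
        for label, n_docs in self.class_counts.items():
            total_words = sum(self.word_counts[label].values())
            score = math.log(n_docs / self.total_docs)  # log prior
            for token in tokenize(text):
                if token not in self.vocab:
                    continue  # words never seen in training are ignored
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Hypothetical toy data, invented for this sketch only.
train_texts = [
    "traitors betray our great nation",
    "shameful traitors destroy the nation",
    "the council approved the budget today",
    "the ministry published the report today",
]
train_labels = ["propaganda", "propaganda", "none", "none"]

model = NaiveBayesBoW().fit(train_texts, train_labels)
print(model.predict("traitors destroy our nation"))
print(model.predict("the council published the budget"))
```

With a labeled corpus in place of the toy sentences, the same `fit` and `predict` calls would apply to held-out segments, and precision could then be computed by comparing the predictions with the gold labels.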
Chinese Keywords
情感(立場)分析; 語言資源; 宣傳手法; 台灣新聞媒體
English Keywords
Sentiment (Stance) Analysis; Language Resource; Propaganda Techniques; Taiwan News Media