中国韩国日本在线观看免费,A级尤物一区,日韩精品一二三区无码,欧美日韩少妇色

當(dāng)前位置:主頁 > 文藝論文 > 廣告藝術(shù)論文 >

基于聚類分析的微博廣告發(fā)布者識別

發(fā)布時間:2018-10-14 18:40
【摘要】:微博空間存在大量的廣告內(nèi)容,這些信息嚴(yán)重影響著普通用戶的用戶體驗(yàn)和相關(guān)的研究工作,F(xiàn)有研究多使用支持向量機(jī)(SVM)或隨機(jī)森林等分類算法對廣告微博進(jìn)行處理,然而分類方法中人工標(biāo)注大數(shù)據(jù)量訓(xùn)練集存在困難,因此提出基于聚類分析的微博廣告發(fā)布者識別方法:對于用戶維度,針對微博廣告發(fā)布者通過發(fā)布大量普通微博來稀釋其廣告內(nèi)容的現(xiàn)象,提出核心微博的概念,通過提取核心微博主題及其對應(yīng)的微博序列,計算用戶特征和對應(yīng)微博的文本特征,并使用聚類算法對特征進(jìn)行聚類,從而識別微博廣告發(fā)布者。實(shí)驗(yàn)結(jié)果顯示,所提方法準(zhǔn)確率為92%,召回率為97%,F值為95%,證明所提方法在廣告內(nèi)容被人為稀釋的情況下能準(zhǔn)確地識別微博廣告發(fā)布者,可以為微博垃圾信息識別、清理等工作提供理論支持和實(shí)用方法。
[Abstract]:Weibo space has a large amount of advertising content, which seriously affects the user experience and related research work of ordinary users. In recent studies, support vector machine (SVM) (SVM) or random forest classification algorithms are often used to deal with advertising Weibo. However, it is difficult to manually annotate large amount of data training set in classification methods. Therefore, this paper puts forward a method of identifying Weibo advertisement publishers based on cluster analysis: for the user dimension, aiming at the phenomenon that a large number of ordinary Weibo advertisers dilute their advertising content by publishing a large number of ordinary Weibo, this paper puts forward the concept of the core Weibo. By extracting the core Weibo theme and its corresponding Weibo sequence, the user features and the corresponding text features are calculated, and then the features are clustered by clustering algorithm, so as to identify the advertiser. The experimental results show that the accuracy of the proposed method is 92 and the recall rate is 97 and F is 95. It is proved that the proposed method can accurately identify the advertisement publisher Weibo under the condition that the advertising content is artificially diluted, and can identify the spam information for Weibo. Cleaning work provides theoretical support and practical methods.
【作者單位】: 南京大學(xué)軟件學(xué)院
【基金】:江蘇省產(chǎn)學(xué)研前瞻性聯(lián)合研究項目(BY2015069-03)~~
【分類號】:TP391.1


本文編號:2271296

資料下載
論文發(fā)表

本文鏈接:http://www.lk138.cn/wenyilunwen/guanggaoshejilunwen/2271296.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶cffb3***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com