Detecting Marionette Microblog Users for Improved Information Credibility

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:coldcoffee_10
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
In this paper, we propose to detect a special group of microblog users: the “marionette” users, who are created or employed by backstage “puppeteers”, either through programs or manually. Unlike normal users that access microblog for information sharing or social communication, the marionette users perform specific tasks to e financial profits. For example, they follow certain users to increase their “statistical popularity”, or retweet some tweets to amplify their “statistical impact”. The fabricated follower or retweet counts not only mislead normal users to wrong information, but also seriously impair microblog-based applications, such as hot tweets selection and expert finding. In this paper, we study the important problem of detecting marionette users on microblog platforms. This problem is challenging because puppeteers are employing complicated strategies to generate marionette users that present similar behaviors as normal users. To tackle this challenge, we propose to take into account two types of discriminative information: 1) individual user tweeting behavior and 2) the social interactions among users. By integrating both information into a semi-supervised probabilistic model, we can e昇ectively distinguish marionette users from normal ones. By applying the proposed model to one of the most popular microblog platforms (Sina Weibo) in China, we find that the model can detect marionette users with F-measure close to 0.9. In addition, we apply the proposed model to calculate the marionette ratio of the top 200 most followed microbloggers and the top 50 most retweeted posts in Sina Weibo. To accelerate the detecting speed and reduce feature generation cost, we further propose a light-weight model which utilizes fewer features to identify marionettes from retweeters.
其他文献
1.最佳的配比是多少不管是什么样的汤,太稀了都会显得寡淡无味,太稠了样子、口感都不好,所以说水的用量是关键.一般来说,8g/包的蔬菜蛋花汤需要用200 mL~300 mL热水.用适宜的
(1)番茄番茄可以有效减少患胰腺癌等癌症的几率,是最佳的维生素C食源。(2)菠菜由于富含铁及维生素B,能有效防治血管方面疾病,并能预防盲眼症。(3)坚果不仅可以提高好的胆固醇
对肠道有功能而又不能经口进食的患者,常用鼻饲法灌注食物、药物和水.传统方法是用50 mL注射器灌入流质,因胃管尾端呈漏斗状,空针乳头太小与胃管衔接不紧,灌注时流质易从空隙
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
俗话说,好吃不过饺子。味道鲜美、馅料丰富的饺子一直是人们心中的美味佳肴,尤其是逢年过节,饺子更是餐桌上必不可少的主角。饺子虽然好吃,但也有其不利于健康之处。一般来说
为了解抗菌药物临床应用情况,为医院在抗菌药物管理方面提供依据,本研究对我院2005年12月至2006年11月出院病历进行抗菌药物使用调查,现报告如下。1资料与方法1.1一般资料:调
通过田间试验 ,研究了冬小麦氮、磷营养特征、土壤养分的动态变化及二者的关系。结果表明 ,返青前冬小麦吸收氮、磷很少 ,返青后均迅速增多 ,至孕穗~灌浆初期达一生吸收的高峰
目的:探讨电子束CT(EBCT)在心脏良性非粘液瘤性肿瘤诊断中的价值。方法:对1995- 07~1999- 02,经EBCT检查并手术、病理证实的7 例心脏良性非粘液瘤性肿瘤的EBCT、超声心动图结果与手术、病理所见进行回顾性分析。7
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊