TY - JOUR AU - Bobicev, Victoria AU - Sokolova, Marina AU - El Emam, Khaled AU - Jafer, Yasser AU - Dewar, Brian AU - Jonker, Elizabeth AU - Matwin, Stan PY - 2013 DA - 2013/10/03 TI -医学论坛上的匿名海报可以重新识别吗?JO - J Med Internet Res SP - e215 VL - 15 IS - 10kw -隐私KW -个人健康信息KW -医疗论坛KW -文本数据挖掘AB -背景:医疗论坛的参与者经常在他们的在线帖子中透露自己的个人健康信息。为了放心地透露敏感的个人健康信息,一些参与者可能会通过匿名发帖来隐藏自己的身份。他们可以使用无法轻易追踪到自己的假身份、昵称或假名来做到这一点。然而,每个人的写作风格都有其独特的特点,通过作者归因分析可以确定匿名用户的真实身份。虽然之前有关于作者身份归属问题的研究,但在医学论坛上,关于自动作者身份归属的研究一直很缺乏。本文的重点是证明在医学论坛中,基于字符的作者归因方法比基于单词的方法更好。目的:目标是建立一个系统,准确地属性的作者发表在医学论坛上的消息。Authorship Attributor系统使用文本分析技术来抓取医学论坛,并自动关联由同一作者撰写的消息。作者属性处理非结构化文本,而不考虑文档类型、上下文和内容。 Methods: The messages were labeled by nicknames of the forum participants. We evaluated the system’s performance through its accuracy on 6000 messages gathered from 2 medical forums on an in vitro fertilization (IVF) support website. Results: Given 2 lists of candidate authors (30 and 50 candidates, respectively), we obtained an F score accuracy in detecting authors of 75% to 80% on messages containing 100 to 150 words on average, and 97.9% on longer messages containing at least 300 words. Conclusions: Authorship can be successfully detected in short free-form messages posted on medical forums. This raises a concern about the meaningfulness of anonymous posting on such medical forums. Authorship attribution tools can be used to warn consumers wishing to post anonymously about the likelihood of their identity being determined. SN - 14388871 UR - //www.mybigtv.com/2013/10/e215/ UR - https://doi.org/10.2196/jmir.2514 UR - http://www.ncbi.nlm.nih.gov/pubmed/24091380 DO - 10.2196/jmir.2514 ID - info:doi/10.2196/jmir.2514 ER -
Baidu
map