TY - JOUR AU - Sokolova, Marina AU - El Emam, Khaled AU - Arbuckle, Luk AU - Neri, Emilio AU - Rose, Sean AU - Jonker, Elizabeth PY - 2012 DA - 2012/07/09 TI - P2P手表:点对点文件共享网络中的个人健康信息检测JO - J Med Internet Res SP - e95 VL - 14 IS - 4kw -隐私KW -个人健康信息KW -自然语言处理,文本数据挖掘AB -背景:点对点(P2P)文件共享网络的用户可能会因疏忽而泄露个人健康信息(PHI)。除了可能对受影响的个人造成伤害外,这还可能增加健康信息保管人的数据泄露风险。抓取P2P网络的自动PHI检测工具可以识别PHI并提醒保管人。虽然之前已经有关于电子健康记录中个人信息检测的工作,但对于异构用户文件中PHI自动检测的研究一直很缺乏。目的:建立一个能够准确检测P2P文件共享网络中文件PHI值的系统。该系统,我们称之为P2P Watch,使用文本处理技术的管道来自动检测通过P2P网络交换的文件中的PHI。无论文件格式、文档类型和内容如何,P2P Watch都会处理非结构化文本。方法:开发P2P Watch,对P2P网络上交换的文本文件中的PHI值进行提取和分析。如果文本包含关于一个人的可识别信息(例如,姓名和出生日期)和此人健康的具体信息(例如,诊断、处方和医疗程序),我们将文本标记为PHI。 We evaluated the system’s performance through its efficiency and effectiveness on 3924 files gathered from three P2P networks. Results: P2P Watch successfully processed 3924 P2P files of unknown content. A manual examination of 1578 randomly selected files marked by the system as non-PHI confirmed that these files indeed did not contain PHI, making the false-negative detection rate equal to zero. Of 57 files marked by the system as PHI, all contained both personally identifiable information and health information: 11 files were PHI disclosures, and 46 files contained organizational materials such as unfilled insurance forms, job applications by medical professionals, and essays. Conclusions: PHI can be successfully detected in free-form textual files exchanged through P2P networks. Once the files with PHI are detected, affected individuals or data custodians can be alerted to take remedial action. SN - 1438-8871 UR - //www.mybigtv.com/2012/4/e95/ UR - https://doi.org/10.2196/jmir.1898 UR - http://www.ncbi.nlm.nih.gov/pubmed/22776692 DO - 10.2196/jmir.1898 ID - info:doi/10.2196/jmir.1898 ER -
Baidu
map