TY -非盟的黄Yanqun AU -王,倪盟——张,志强盟——刘,宏磊AU -范,小路AU -魏,局域网非盟-陈,回族PY - 2021 DA - 2021/7/23 TI -病人表示从结构化电子病历基于嵌入技术:开发和验证研究乔-地中海JMIR通知SP - e19905六世- 9 - 7 KW -电子医疗记录KW - Skip-gram KW -特性表示KW -病人表示KW -中风AB -背景:由于数据表示的多样性、稀疏性和高维性,结构化电子病历(sEMR)数据的二次使用已经成为一个挑战。构建有效的sEMR数据表示对于后续的数据应用变得越来越重要。目的:将自然语言处理领域的嵌入技术应用于sEMR数据表示,探讨基于嵌入的特征和患者表示在临床应用中的可行性和优越性。方法:整个训练语料库包括104752例住院患者病历,13757个疾病诊断、体格检查和程序、实验室检查、用药等医学概念。使用Skip-gram算法将每个医学概念嵌入到200维实数向量中,并从记录的20次洗刷医学概念中进行一些自适应更改。患者记录中所有医学概念的向量的平均值代表患者。对于基于嵌入的特征表示评估,我们使用医学概念向量之间的余弦相似性来捕获医学概念之间潜在的临床关联。我们进一步对脑卒中患者进行了聚类分析,以评估和比较基于嵌入的患者表征。无监督评价采用Hopkins统计量、Silhouette指数(SI)和Davies-Bouldin指数,有监督评价采用准确率、召回率和F1评分。 Results: The dimension of patient representation was reduced from 13,757 to 200 using the embedding-based representation. The average cosine similarity of the selected disease (subarachnoid hemorrhage) and its 15 clinically relevant medical concepts was 0.973. Stroke patients were clustered into two clusters with the highest SI (0.852). Clustering analyses conducted on patients with the embedding representations showed higher applicability (Hopkins statistic 0.931), higher aggregation (SI 0.862), and lower dispersion (Davies-Bouldin index 0.551) than those conducted on patients with reference representation methods. The clustering solutions for patients with the embedding-based representation achieved the highest F1 scores of 0.944 and 0.717 for two clusters. Conclusions: The feature-level embedding-based representations can reflect the potential clinical associations among medical concepts effectively. The patient-level embedding-based representation is easy to use as continuous input to standard machine learning algorithms and can bring performance improvements. It is expected that the embedding-based representation will be helpful in a wide range of secondary uses of sEMR data. SN - 2291-9694 UR - https://medinform.www.mybigtv.com/2021/7/e19905 UR - https://doi.org/10.2196/19905 UR - http://www.ncbi.nlm.nih.gov/pubmed/34297000 DO - 10.2196/19905 ID - info:doi/10.2196/19905 ER -
Baidu
map