@Article{info:doi/10.2196/jmir.4912,
author="Jay, Caroline and Harper, Simon and Dunlop, Ian and Smith, Sam and Sufi, Shoaib and Goble, Carole and Buchan, Iain",
title="Natural Language Search Interfaces: Health Data Needs Single-Field Variable Search",
journal="J Med Internet Res",
year="2016",
month="Jan",
day="14",
volume="18",
number="1",
pages="e13",
keywords="searching behavior; search engine; research data archives",
abstract="Background: Data discovery, particularly the discovery of key variables and their inter-relationships, is key to secondary data analysis and, in turn, to the evolving field of data science. Interface designers have presumed that their users are domain experts, and so they have provided complex interfaces to support these ``experts.'' Such interfaces hark back to a time when searches needed to be accurate the first time, as there was a high computational cost associated with each search. Our work is part of a governmental research initiative between the medical and social research funding bodies to improve the use of social data in medical research. Objective: The cross-disciplinary nature of data science can make no assumptions regarding the domain expertise of a particular scientist, whose interests may intersect multiple domains. Here we consider the common requirement for scientists to seek archived data for secondary analysis. This has more in common with search needs of the ``Google generation'' than with their single-domain, single-tool forebears. Our study compares a Google-like interface with traditional ways of searching for noncomplex health data in a data archive. Methods: Two user interfaces are evaluated for the same set of tasks in extracting data from surveys stored in the UK Data Archive (UKDA). One interface, Web search, is ``Google-like,'' enabling users to browse, search for, and view metadata about study variables, whereas the other, traditional search, has standard multioption user interface. Results: Using a comprehensive set of tasks with 20 volunteers, we found that the Web search interface met data discovery needs and expectations better than the traditional search. A task {\texttimes} interface repeated measures analysis showed a main effect indicating that answers found through the Web search interface were more likely to be correct (F1,19=37.3, P<.001), with a main effect of task (F3,57=6.3, P<.001). Further, participants completed the task significantly faster using the Web search interface (F1,19=18.0, P<.001). There was also a main effect of task (F2,38=4.1, P=.025, Greenhouse-Geisser correction applied). Overall, participants were asked to rate learnability, ease of use, and satisfaction. Paired mean comparisons showed that the Web search interface received significantly higher ratings than the traditional search interface for learnability (P=.002, 95{\%} CI [0.6-2.4]), ease of use (P<.001, 95{\%} CI [1.2-3.2]), and satisfaction (P<.001, 95{\%} CI [1.8-3.5]). The results show superior cross-domain usability of Web search, which is consistent with its general familiarity and with enabling queries to be refined as the search proceeds, which treats serendipity as part of the refinement. Conclusions: The results provide clear evidence that data science should adopt single-field natural language search interfaces for variable search supporting in particular: query reformulation; data browsing; faceted search; surrogates; relevance feedback; summarization, analytics, and visual presentation.",
issn="1438-8871",
doi="10.2196/jmir.4912",
url="//www.mybigtv.com/2016/1/e13/",
url="https://doi.org/10.2196/jmir.4912",
url="http://www.ncbi.nlm.nih.gov/pubmed/26769334"
}