@Article{信息:doi 10.2196 / /公共健康。5869,作者=“Nguyen, Quynh C和Li,大鹏和孟,Hsien-Wen和Kath, Suraj和Nsoesie, Elaine和Li, fei和Wen, Ming”,标题=“从地理标记的Twitter数据中构建幸福,饮食和身体活动指标的国家社区数据集”,期刊=“JMIR公共卫生监测”,年=“2016”,月=“10”,日=“17”,卷=“2”,数=“2”,页=“e158”,关键词=“社交媒体;Twitter消息;健康行为;幸福;食物;背景:研究表明,人们生活、娱乐和工作的地点会影响健康和福祉。然而,缺乏社区数据,特别是缺乏跨地域及时和一致的数据,阻碍了了解社区对健康的影响。社交媒体数据为社区研究提供了可能的新数据资源。目的:本研究的目的是根据地理标记的Twitter数据,建立一个包含区域级幸福和健康行为指标的国家社区数据库。 Methods: We utilized Twitter's streaming application programming interface to continuously collect a random 1{\%} subset of publicly available geolocated tweets for 1 year (April 2015 to March 2016). We collected 80 million geotagged tweets from 603,363 unique Twitter users across the contiguous United States. We validated our machine learning algorithms for constructing indicators of happiness, food, and physical activity by comparing predicted values to those generated by human labelers. Geotagged tweets were spatially mapped to the 2010 census tract and zip code areas they fall within, which enabled further assessment of the associations between Twitter-derived neighborhood variables and neighborhood demographic, economic, business, and health characteristics. Results: Machine labeled and manually labeled tweets had a high level of accuracy: 78{\%} for happiness, 83{\%} for food, and 85{\%} for physical activity for dichotomized labels with the F scores 0.54, 0.86, and 0.90, respectively. About 20{\%} of tweets were classified as happy. Relatively few terms (less than 25) were necessary to characterize the majority of tweets on food and physical activity. Data from over 70,000 census tracts from the United States suggest that census tract factors like percentage African American and economic disadvantage were associated with lower census tract happiness. Urbanicity was related to higher frequency of fast food tweets. Greater numbers of fast food restaurants predicted higher frequency of fast food mentions. Surprisingly, fitness centers and nature parks were only modestly associated with higher frequency of physical activity tweets. Greater state-level happiness, positivity toward physical activity, and positivity toward healthy foods, assessed via tweets, were associated with lower all-cause mortality and prevalence of chronic conditions such as obesity and diabetes and lower physical inactivity and smoking, controlling for state median income, median age, and percentage white non-Hispanic. Conclusions: Machine learning algorithms can be built with relatively high accuracy to characterize sentiment, food, and physical activity mentions on social media. Such data can be utilized to construct neighborhood indicators consistently and cost effectively. Access to neighborhood data, in turn, can be leveraged to better understand neighborhood effects and address social determinants of health. We found that neighborhoods with social and economic disadvantage, high urbanicity, and more fast food restaurants may exhibit lower happiness and fewer healthy behaviors. ", issn="2369-2960", doi="10.2196/publichealth.5869", url="http://publichealth.www.mybigtv.com/2016/2/e158/", url="https://doi.org/10.2196/publichealth.5869", url="http://www.ncbi.nlm.nih.gov/pubmed/27751984" }
Baidu
map