Healthc Inform Res.  2018 Oct;24(4):300-308. 10.4258/hir.2018.24.4.300.

Google Search Trends Predicting Disease Outbreaks: An Analysis from India

Affiliations
  • 1Department of Community Medicine, Kalpana Chawla Government Medical College and Hospital, Karnal, India.
  • 2Department of Biostatistics, Post Graduate Institute of Medical Education and Research (PGIMER), Chandigarh, India.
  • 3Department of Community Medicine, Vardhman Mahavir Medical College and Safdarjung Hospital, New Delhi, India.
  • 4State Integrated Disease Surveillance Project (IDSP) Cell, Department of Health, Haryana, India.
  • 5Integrated Disease Surveillance Project (IDSP), Chandigarh Administration, Chandigarh, India.
  • 6Department of Community Medicine, School of Public Health, Post Graduate Institute of Medical Education and Research (PGIMER), Chandigarh, India. selvkathir@gmail.com

Abstract


OBJECTIVES
Prompt detection is a cornerstone in the control and prevention of infectious diseases. The Integrated Disease Surveillance Project of India identifies outbreaks, but it does not exactly predict outbreaks. This study was conducted to assess temporal correlation between Google Trends and Integrated Disease Surveillance Programme (IDSP) data and to determine the feasibility of using Google Trends for the prediction of outbreaks or epidemics.
METHODS
The Google search queries related to malaria, dengue fever, chikungunya, and enteric fever for Chandigarh union territory and Haryana state of India in 2016 were extracted and compared with presumptive form data of the IDSP. Spearman correlation and scatter plots were used to depict the statistical relationship between the two datasets. Time trend plots were constructed to assess the correlation between Google search trends and disease notification under the IDSP
RESULTS
Temporal correlation was observed between the IDSP reporting and Google search trends. Time series analysis of the Google Trends showed strong correlation with the IDSP data with a lag of −2 to −3 weeks for chikungunya and dengue fever in Chandigarh (r > 0.80) and Haryana (r > 0.70). Malaria and enteric fever showed a lag period of −2 to −3 weeks with moderate correlation.
CONCLUSIONS
Similar results were obtained when applying the results of previous studies to specific diseases, and it is considered that many other diseases should be studied at the national and sub-national levels.

Keyword

Disease Outbreaks; Communicable Diseases; Information Technology; Public Health Surveillance; Epidemiological Monitoring

MeSH Terms

Communicable Diseases
Dataset
Dengue
Disease Notification
Disease Outbreaks*
Epidemiological Monitoring
India*
Malaria
Public Health Surveillance
Typhoid Fever

Figure

  • Figure 1 Screen shot of Google Trends website depicting the search strategy used for observing the pattern generated for the searches related to dengue in Chandigarh, 2016 (Map shows the geographical pattern of the searches made).

  • Figure 2 Correlational plots between Google Trends and IDSP (Integrated Disease Surveillance Programme) data for Haryana (A) and Chandigarh (B) in 2016.

  • Figure 3 Line diagram depicting Google Trends and IDSP data for major febrile illnesses for Haryana (left) and Chandigarh (right) in 2016.


Cited by  2 articles

Prediction of COVID-19 Outbreaks Using Google Trends in India: A Retrospective Analysis
U Venkatesh, Periyasamy Aravind Gandhi
Healthc Inform Res. 2020;26(3):175-184.    doi: 10.4258/hir.2020.26.3.175.

Keyword Trends for Mother–Child Oral Health in Korea Based on Social Media Big Data from Naver
Jung-Eun Park, Ja-Won Cho, Jong-Hwa Jang
Healthc Inform Res. 2020;26(3):212-219.    doi: 10.4258/hir.2020.26.3.212.


Reference

1. Ministry of Health & Family Welfare, Government of India. Integrated Disease Surveillance Programme [Internet]. New Delhi: Ministry of Health & Family Welfare;c2018. cited at 2018 Jul 4. Available from: http://www.idsp.nic.in/.
2. Milinovich GJ, Williams GM, Clements AC, Hu W. Internet-based surveillance systems for monitoring emerging infectious diseases. Lancet Infect Dis. 2014; 14(2):160–168.
Article
3. Hoffman DL, Novak TP, Venkatesh A. Has the Internet become indispensable? Commun ACM. 2004; 47(7):37–42.
Article
4. Telecom Regulatory Authority of India. The Indian Telecom Services Performance Indicators, April-June 2017 [Internet]. New Delhi: Telecom Regulatory Authority of India;2017. cited at 2018 Jul 4. Available from: http://trai.gov.in/sites/default/files/Performance_Indicator_Reports_28092017.pdf.
5. Hellawell GO, Turner KJ, Le Monnier KJ, Brewster SF. Urology and the Internet: an evaluation of internet use by urology patients and of information available on urological topics. BJU Int. 2000; 86(3):191–194.
Article
6. Tonsaker T, Bartlett G, Trpkov C. Health information on the Internet: goldmine or minefield? Can Fam Physician. 2014; 60(5):407–408.
7. Hesse BW, Moser RP, Rutten LJ. Surveys of physicians and electronic health information. N Engl J Med. 2010; 362(9):859–860.
Article
8. Diaz JA, Griffith RA, Ng JJ, Reinert SE, Friedmann PD, Moulton AW. Patients' use of the Internet for medical information. J Gen Intern Med. 2002; 17(3):180–185.
Article
9. Moretti FA, Oliveira VE, Silva EM. Access to health information on the Internet: a public health issue? Rev Assoc Med Bras (1992). 2012; 58(6):650–658.
10. Choi H, Varian H. Predicting the present with Google Trends. Econ Rec. 2012; 88:2–9.
Article
11. Bragazzi NL. A Google Trends-based approach for monitoring NSSI. Psychol Res Behav Manag. 2013; 7:1–8.
Article
12. Kang M, Zhong H, He J, Rutherford S, Yang F. Using Google Trends for influenza surveillance in South China. PLoS One. 2013; 8(1):e55205.
Article
13. StatCounter Global Stats. Search Engine Market Share India [Internet]. Dublin: StatCounter;2017. cited at 2018 Sep 27. Available from: http://gs.statcounter.com/search-engine-market-share/all/india/2016.
14. Cervellin G, Comelli I, Lippi G. Is Google Trends a reliable tool for digital epidemiology? Insights from different clinical settings. J Epidemiol Glob Health. 2017; 7(3):185–189.
Article
15. Longini IM Jr, Nizam A, Xu S, Ungchusak K, Hanshaoworakul W, Cummings DA, et al. Containing pandemic influenza at the source. Science. 2005; 309(5737):1083–1087.
Article
16. Ferguson NM, Cummings DA, Cauchemez S, Fraser C, Riley S, Meeyai A, et al. Strategies for containing an emerging influenza pandemic in Southeast Asia. Nature. 2005; 437(7056):209–214.
Article
17. Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidemics using search engine query data. Nature. 2009; 457(7232):1012–1014.
Article
18. Polgreen PM, Chen Y, Pennock DM, Nelson FD. Using internet searches for influenza surveillance. Clin Infect Dis. 2008; 47(11):1443–1448.
Article
19. Samaras L, Garcia-Barriocanal E, Sicilia MA. Syndromic surveillance models using web data: the case of scarlet fever in the UK. Inform Health Soc Care. 2012; 37(2):106–124.
Article
20. Desai R, Hall AJ, Lopman BA, Shimshoni Y, Rennick M, Efron N, et al. Norovirus disease surveillance using Google Internet query share data. Clin Infect Dis. 2012; 55(8):e75–e78.
Article
21. Ortiz JR, Zhou H, Shay DK, Neuzil KM, Fowlkes AL, Goss CH. Monitoring influenza activity in the United States: a comparison of traditional surveillance systems with Google Flu Trends. PLoS One. 2011; 6(4):e18687.
Article
22. Hulth A, Rydevik G. Web query-based surveillance in Sweden during the influenza A(H1N1)2009 pandemic, April 2009 to February 2010. Euro Surveill. 2011; 16(18):19856.
Article
Full Text Links
  • HIR
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2024 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr