Kosin Med J.  2025 Mar;40(1):21-30. 10.7180/kmj.24.160.

Common statistical methods used in medical research

Affiliations
  • 1Biostatistics Collaboration Unit, Yonsei University College of Medicine, Seoul, Korea

Abstract

This paper aims to review the statistical methods that are widely used in medical and clinical research. Statistical analysis is crucial for ensuring the reliability of research results; therefore, methods should be carefully tailored to research interests and objectives, depending on the type and characteristics of the data. This paper explains various types of data, discusses how to confirm whether the data satisfy these methods’ specific assumptions, and elucidates the meaning and interpretation of univariable and multivariable analysis. Finally, it presents corresponding examples to help other researchers understand how these methods are applied in real-world studies and select appropriate methods that ensure both research quality and reliability.

Keyword

Categorical data; Continuous data; Multivariable analysis; Univariable analysis

Figure

  • Fig. 1. Visualization of associations between variables. (A) Visualization of the relationship between categorical and continuous variables when using the parametric method: a bar graph with the two groups’ mean and standard deviation values is commonly used to visualize the results of comparing means between groups. Each bar represents the mean of the corresponding group, and the whiskers indicate the standard deviation. (B) Visualization of the relationship between categorical and continuous variables when using the non-parametric method: a box plot with the median (interquartile range) of the two groups is commonly used to visualize the results of comparing means between groups. (C) Visualization of the relationship between continuous and continuous variables: a scatter plot is frequently presented with the results of correlation analysis or univariable linear regression to illustrate the association between two continuous variables. (D) Visualization of the relationship between categorical and categorical variables: a bar graph representing the outcome rates of groups is typically plotted with results from the chi-square test. The proportion of each outcome is shown by each bar.


Reference

References

1. Im SI. How to write case reports in medicine. Kosin Med J. 2022; 37:102–6. DOI: 10.7180/kmj.22.109.
2. Kang G, Kim SE. How to write an original article in medicine and medical science. Kosin Med J. 2022; 37:96–101. DOI: 10.7180/kmj.22.105.
3. Lee HS. Ethical issues in clinical research and publication. Kosin Med J. 2022; 37:278–82. DOI: 10.7180/kmj.22.132.
4. Kim DJ, Kil SY, Son J, Lee HS. How to conduct well-designed clinical research. Kosin Med J. 2022; 37:187–91. DOI: 10.7180/kmj.22.122.
5. Das KR, Imon AH. A brief review of tests for normality. Am J Theor Appl Stat. 2016; 5:5–12. DOI: 10.11648/j.ajtas.20160501.12.
6. Kwak SG, Park SH. Normality test in clinical research. J Rheum Dis. 2019; 26:5–11. DOI: 10.4078/jrd.2019.26.1.5.
7. Rosner B. Fundamentals of biostatistics. 8th ed. Cengage Learning;2015.
8. Krithikadatta J. Normal distribution. J Conserv Dent. 2014; 17:96–7. DOI: 10.4103/0972-0707.124171. PMID: 24554873.
9. Shaffer JP. Multiple hypothesis testing. Annu Rev Psychol. 1995; 46:561–84. DOI: 10.1146/annurev.ps.46.020195.003021.
10. Yang JM, Park YS, Chung SP, Chung HS, Lee HS, You JS, et al. Implementation of a clinical pathway based on a computerized physician order entry system for ischemic stroke attenuates off-hour and weekend effects in the ED. Am J Emerg Med. 2014; 32:884–9. DOI: 10.1016/j.ajem.2014.04.049. PMID: 24928408.
11. Hinkle DE, Wiersma W, Jurs SG. Applied statistics for the behavioral sciences. 5th ed. Houghton Mifflin;2002.
12. Ridker PM, Rifai N, Cook NR, Bradwin G, Buring JE. Non-HDL cholesterol, apolipoproteins A-I and B100, standard lipid measures, lipid ratios, and CRP as risk factors for cardiovascular disease in women. JAMA. 2005; 294:326–33. DOI: 10.1001/jama.294.3.326. PMID: 16030277.
13. Fisher RA. Statistical methods for research workers. In : Kotz S, Johnson NL, editors. Breakthroughs in statistics: methodology and distribution. Springer;1992. p. 66–70.
14. Draper NR, Smith H. Applied regression analysis. John Wiley & Sons;1998.
15. Harrell FE. Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis. Springer;2001.
16. Seber GA, Lee AJ. Linear regression analysis. John Wiley & Sons;2012.
17. Daoud JI. Multicollinearity and regression analysis. J Phys Conf Ser. 2017; 949:012009.
18. Shrestha N. Detecting multicollinearity in regression analysis. Am J Appl Math Stat. 2020; 8:39–42. DOI: 10.12691/ajams-8-2-1.
19. Mehmood T, Liland KH, Snipen L, Saebo S. A review of variable selection methods in partial least squares regression. Chemom Intell Lab Syst. 2012; 118:62–9. DOI: 10.1016/j.chemolab.2012.07.010.
20. Jeon S, Yang J, Lee HS. Statistical mistakes commonly made when writing medical articles. J Korean Soc Radiol. 2023; 84:866–78. DOI: 10.3348/jksr.2022.0108. PMID: 37559816.
21. Armitage P, Berry G, Matthews JN. Statistical methods in medical research. 4th ed. Wiley-Blackwell;2001.
22. Myers RH. Classical and modern regression with applications. 2nd ed. Duxbury Press;1990.
23. Nam HS, Kim EY, Kim SH, Kim YD, Kim J, Lee HS, et al. Prediction of thrombus resolution after intravenous thrombolysis assessed by CT-based thrombus imaging. Thromb Haemost. 2012; 107:786–94. DOI: 10.1160/th11-08-0585. PMID: 22318312.
24. Kleinbaum DG, Klein M. Logistic regression: a self-learning text. 2nd ed. Springer;2002.
25. Elkin ME, Zhu X. Predictive modeling of clinical trial terminations using feature engineering and embedding learning. Sci Rep. 2021; 11:3446. DOI: 10.1038/s41598-021-82840-x. PMID: 33568706.
26. Faraway JJ. Extending the linear model with R: generalized linear, mixed effects and nonparametric regression models. 2nd ed. Chapman and Hall/CRC;2016.
27. Fagerland MW, Hosmer DW. A generalized Hosmer-Lemeshow goodness-of-fit test for multinomial logistic regression models. Stata J. 2012; 12:447–53. DOI: 10.1177/1536867x1201200307.
28. Kumar R, Indrayan A. Receiver operating characteristic (ROC) curve for medical researchers. Indian Pediatr. 2011; 48:277–87. DOI: 10.1007/s13312-011-0055-4. PMID: 21532099.
29. Friedman LM, Furberg CD, DeMets DL, Reboussin DM, Granger CB. Fundamentals of clinical trials. 5th ed. Springer;2015.
30. Kim J, Song TJ, Song D, Lee HS, Nam CM, Nam HS, et al. Interarm blood pressure difference and mortality in patients with acute ischemic stroke. Neurology. 2013; 80:1457–64. DOI: 10.1212/wnl.0b013e31828cf87c. PMID: 23516316.
31. Nakagawa S, Cuthill IC. Effect size, confidence interval and statistical significance: a practical guide for biologists. Biol Rev Camb Philos Soc. 2007; 82:591–605. DOI: 10.1111/j.1469-185x.2007.00027.x. PMID: 17944619.
Full Text Links
  • KMJ
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2025 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr