COMPARATIVE ANALYSIS OF SINGLE-SAMPLE HYPOTHESIS TESTING: CRITICAL EVALUATION OF FREQUENTIST APPROACHES AND BAYESIAN INFERENCE ON SIMULATED DATA

Pardomuan Robinson Sihombing

doi:10.64930/jstar.v6i1.142

Penulis

Pardomuan Robinson Sihombing BPS-Statistics Indonesia https://orcid.org/0000-0002-1597-9077

DOI:

https://doi.org/10.64930/jstar.v6i1.142

Kata Kunci:

Bayes Factor; Cohen's d; Bayesian Inference; One-Sample Test

Abstrak

The validity of statistical inference is a key pillar in data-driven decision making, but it is often threatened by the inappropriate selection of methods for non-ideal data. This study aims to evaluate the performance of single-sample hypothesis testing methods by comparing the frequentist paradigm (Student's t-test, Wilcoxon signed-rank test, sign test) and the Bayesian paradigm (Bayes factor). Through Monte Carlo simulations using R Studio with 1,000 iterations, this study investigates statistical power, Type I error rate, and the accuracy of effect size estimates (Cohen's d, Rank-Biserial Correlation, Cohen's g) under Normal, Heavy-tailed (t-Student), and Skewed (Log-normal) distribution conditions with sample variations . The results show that under the t-Student distribution ( ), the Wilcoxon test consistently outperforms the T-test in terms of power (0.514 vs. 0.416 at n ). Another crucial finding is the bias in Cohen's d estimation on Log-normal data, which tends to underestimate the actual impact of location when compared to Rank-Biserial Correlation. The Bayesian approach proved to be more conservative but provided better inference stability in large samples

Unduhan

Data unduhan belum tersedia.

Referensi

Blair, R. . C., & Higgins, J. J. (1980). A Comparison of the Power of Wilcoxon ’ s Rank-Sum Statistic to That of Student ’ s t Statistic under Various Nonnormal Distributions. Journal of Educational Statistics, 5(4), 309–335.

[2] Bridge, P. D., & Sawilowsky, S. S. (1999). Increasing physicians’ awareness of the impact of statistics on research outcomes: comparative power of the t-test and Wilcoxon Rank-Sum test in small samples applied research. Journal of Clinical Epidemiology, 52(3), 229–235.

[3] Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences. In Educacao e Sociedade (2nd ed.). Lawrence Erlbaum Associates. http://www.biblioteca.pucminas.br/teses/Educacao_PereiraAS_1.pdf%0Ahttp://www.anpocs.org.br/portal/publicacoes/rbcs_00_11/rbcs11_01.htm%0Ahttp://repositorio.ipea.gov.br/bitstream/11058/7845/1/td_2306.pdf%0Ahttps://direitoufma2010.files.wordpress.com/2010/

[4] Jeffreys, H. (1961). Theory of probability. Oxford University Press.

[5] Kerby, D. S. (2014). The Simple Difference Formula: An Approach to Teaching Nonparametric Correlation. Comprehensive Psychology, 3(1), 1–9. https://doi.org/10.2466/11.it.3.1

[6] Morey, R. D., & Rouder, J. N. (2011). Bayes Factor Approaches for Testing Interval Null Hypotheses. Psychological Methods, 16(4), 406–419. https://doi.org/10.1037/a0024377

[7] Morris, T. P., White, I. R., & Crowther, M. J. (2019). Using simulation studies to evaluate statistical methods. Statistics in Medicine, 38(11), 2074–2102. https://doi.org/10.1002/sim.8086

[8] Rouder, J. N., Speckman, P. L., Sun, D., Morey, R. D., & Iverson, G. (2009). Bayesian t tests for accepting and rejecting the null hypothesis. Psychonomic Bulletin & Review, 16(2), 225–237. https://doi.org/10.3758/PBR.16.2.225

[9] Sellke, T., Bayarri, M. J., & Berger, J. O. (2001). Calibration of p values for testing precise null hypotheses. The American Statistician, 55(1), 62–71.

[10] Setiawan, E. P., & Sukoco, H. (2021). Exploring First Year University Students’ Statistical Literacy: A Case on Describing and Visualizing Data. Journal on Mathematics Education, 12(3), 427–448.

[11] Siegel, S. (1997). Statistik Nonparametrik untuk Ilmu-ilmu Sosial. PT Gramedia Pustaka.

[12] Tomczak, M., & Tomczak, E. (2014). The need to report effect size estimates revisited. An overview of some recommended measures of effect size. Trends in Sport Sciences, 1(21), 19–25.

[13] Walpole, R. E. (2012). Probability & Statistics for Engineers & Scientists. Pearson.

[14] Wasserstein, R. L., & Lazar, N. A. (2016). The ASA ’ s statement on p-values : context , process , and purpose. The American Statistician, 70(2), 70(2), 129–133. https://doi.org/10.1080/00031305.2016.1154108

COMPARATIVE ANALYSIS OF SINGLE-SAMPLE HYPOTHESIS TESTING: CRITICAL EVALUATION OF FREQUENTIST APPROACHES AND BAYESIAN INFERENCE ON SIMULATED DATA

Penulis

DOI:

Kata Kunci:

Abstrak

Unduhan

Referensi

Unduhan

Diterbitkan

Cara Mengutip

Terbitan

Bagian

Lisensi

MENU UTAMA

INFORMASI

BAHASA

ISSN

DIINDEKS OLEH

TOOLS