H-statistic with winsorized modified one-step M-estimator as central tendency measure

Two-sample independent t-test and ANOVA are classical procedures which are widely used to test the equality of two groups and more than two groups respectively. However, these parametric procedures are easily affected by non-normality, becoming more obvious when heterogeneity of variances and unbala...

Full description

Saved in:
Bibliographic Details
Main Author: Ong, Gie Xao
Format: Thesis
Language:eng
eng
Published: 2017
Subjects:
Online Access:https://etd.uum.edu.my/6993/1/s811088_01.pdf
https://etd.uum.edu.my/6993/2/s811088_02.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-uum-etd.6993
record_format uketd_dc
institution Universiti Utara Malaysia
collection UUM ETD
language eng
eng
advisor Syed Yahaya, Sharipah Soaad
Abdullah, Suhaida
topic QA273-280 Probabilities
Mathematical statistics
spellingShingle QA273-280 Probabilities
Mathematical statistics
Ong, Gie Xao
H-statistic with winsorized modified one-step M-estimator as central tendency measure
description Two-sample independent t-test and ANOVA are classical procedures which are widely used to test the equality of two groups and more than two groups respectively. However, these parametric procedures are easily affected by non-normality, becoming more obvious when heterogeneity of variances and unbalanced group sizes exist. It is well known that the violation in the assumption of the tests will lead to inflation in Type I error rate and decreasing in the power of test. Nonparametric procedures like Mann-Whitney and Kruskal-Wallis may be the alternative to the parametric procedures, however, loss of information occur due to the ranking data. In mitigating these problems, robust procedures can be used as the other alternative. One of the procedures is H-statistic. When used with modified one-step M-estimator (MOM), the test statistic (MOM-H) produces good control of Type I error rate even under small sample size but inconsistent under certain conditions investigated. Furthermore, power of test is low which might be due to the trimming process. In this study, MOM was winsorized (WMOM) to retain the original sample size. The Hstatistic when combines with WMOM as the central tendency measure (WMOM-H) shows better control of Type I error rate as compared to MOM-H especially under balanced design regardless of the shape of distributions. It also performs well under highly skewed and heavy tailed distribution for unbalanced design. On top of that, WMOM-H also generates better power value, as compared to MOM-H and ANOVA under most of the conditions investigated. WMOM-H also has better control of Type I error rates with no liberal value (>0.075) compared to the parametric (t-test and ANOVA) and nonparametric (Mann-Whitney and Kruskal-Wallis) procedures. In general, this study demonstrates that winsorization process (WMOM) is able to improve the performance of H-statistic in terms of controlling Type I error rate and increasing power of test.
format Thesis
qualification_name masters
qualification_level Master's degree
author Ong, Gie Xao
author_facet Ong, Gie Xao
author_sort Ong, Gie Xao
title H-statistic with winsorized modified one-step M-estimator as central tendency measure
title_short H-statistic with winsorized modified one-step M-estimator as central tendency measure
title_full H-statistic with winsorized modified one-step M-estimator as central tendency measure
title_fullStr H-statistic with winsorized modified one-step M-estimator as central tendency measure
title_full_unstemmed H-statistic with winsorized modified one-step M-estimator as central tendency measure
title_sort h-statistic with winsorized modified one-step m-estimator as central tendency measure
granting_institution Universiti Utara Malaysia
granting_department Awang Had Salleh Graduate School of Arts & Sciences
publishDate 2017
url https://etd.uum.edu.my/6993/1/s811088_01.pdf
https://etd.uum.edu.my/6993/2/s811088_02.pdf
_version_ 1747828142215528448
spelling my-uum-etd.69932021-08-18T05:36:30Z H-statistic with winsorized modified one-step M-estimator as central tendency measure 2017 Ong, Gie Xao Syed Yahaya, Sharipah Soaad Abdullah, Suhaida Awang Had Salleh Graduate School of Arts & Sciences Awang Had Salleh Graduate School of Arts and Sciences QA273-280 Probabilities. Mathematical statistics Two-sample independent t-test and ANOVA are classical procedures which are widely used to test the equality of two groups and more than two groups respectively. However, these parametric procedures are easily affected by non-normality, becoming more obvious when heterogeneity of variances and unbalanced group sizes exist. It is well known that the violation in the assumption of the tests will lead to inflation in Type I error rate and decreasing in the power of test. Nonparametric procedures like Mann-Whitney and Kruskal-Wallis may be the alternative to the parametric procedures, however, loss of information occur due to the ranking data. In mitigating these problems, robust procedures can be used as the other alternative. One of the procedures is H-statistic. When used with modified one-step M-estimator (MOM), the test statistic (MOM-H) produces good control of Type I error rate even under small sample size but inconsistent under certain conditions investigated. Furthermore, power of test is low which might be due to the trimming process. In this study, MOM was winsorized (WMOM) to retain the original sample size. The Hstatistic when combines with WMOM as the central tendency measure (WMOM-H) shows better control of Type I error rate as compared to MOM-H especially under balanced design regardless of the shape of distributions. It also performs well under highly skewed and heavy tailed distribution for unbalanced design. On top of that, WMOM-H also generates better power value, as compared to MOM-H and ANOVA under most of the conditions investigated. WMOM-H also has better control of Type I error rates with no liberal value (>0.075) compared to the parametric (t-test and ANOVA) and nonparametric (Mann-Whitney and Kruskal-Wallis) procedures. In general, this study demonstrates that winsorization process (WMOM) is able to improve the performance of H-statistic in terms of controlling Type I error rate and increasing power of test. 2017 Thesis https://etd.uum.edu.my/6993/ https://etd.uum.edu.my/6993/1/s811088_01.pdf text eng public https://etd.uum.edu.my/6993/2/s811088_02.pdf text eng public masters masters Universiti Utara Malaysia Abdullah, S., Syed Yahaya, S. S., & Othman, A. R. (2011). Modified Alexander- Govern test as alternative to t-test and ANOVA F test. Sains Malaysiana, 40(10), 1187-1192. Ahmad Mahir, R.., & Al-Khazaleh, A. M. H. (2009). New method to estimate missing data by using the asymmetrical winsorized mean in a time series. Applied Mathematical Sciences, 3(35), 1715–1726. Alan, O., Phyllis, S., & John, Q. (2008). The importance of teaching power in statistical hypothesis testing. Paper session presented at the Northeast Decision Sciences Annual Meeting, Brooklyn, New York. Alexander, R. A. & Govern, D. M. (1994). A new and simpler approximation for ANOVA under variance heterogeneity. Journal of Educational Statistic, 19(2), 91–101. doi: 10.3102/10769986019002091 Babu, G. J., Padmanabhan, A. R., & Puri, M. L. (1999). Robust one-way ANOVA under possibly non-regular conditions. Biometrical Journal. 41(3), 321-339. Beran, R. (1986). Simulated power functions. The Annals of Statistics, 14(1), 151-173. Box, G. E. P. (1953). Non-normality and tests of variances. Biometrika, 40(3/4), 318–355. doi: 10.2307/2333350 Box, G. E. P. (1954). Some theorems on quadratic forms applied in the study of analysis of variance problems, I. Effect of inequality of variance in the oneway classification. Annals of Mathematical Statistics, 25(2), 290 – 302. Bradley, J. V. (1968). Distribution-free statistical tests. Englewood Cliffs, NJ: Prentice Hall. Bradley, J. V. (1978). Robustness?. British Journal of Mathematical and Statistical Psychology, 31(2), 144-152. Chernick, M. R. (2008). Bootstrap methods: a guide for practitioners and researchers (2nd ed.). Newtown, PA: Wiley-Interscience. Clark-Carter, D. (1997). The account taken of statistical power in research. British Journal of Psychology, 88(1), 71-83. doi: 10.1111/j.2044-8295.1997.tb02621.x Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Lawrence Erlbaum Associates. Cohen, J. (1992a). A power primer. Psychological Bulletin, 112(1), 155-159. doi: 10.1037/0033-2909.112.1.155 Cohen, J. (1992b). Statistical power analysis. Current Directions in Psychological Science, 1(3), 98-101. Davison, A. C., & Hinkley, D. V. (1997). Bootstrap methods and their applications. (Cambridge series in statistical and probabilistic mathematics). United States of America: Cambridge University Press. Dixon W. J. (1960). Simplified estimation from censored normal samples. The Annals of Mathematical Statistics, 31(2), 385-391. Dixon W. J., & Tukey J. W. (1968). Approximate behavior of the distribution of winsorized t (trimming/winsorization 2). Technometrics, 10(1), 83-98. doi: 10.2307/1266226 Efron, B., & Tibshirani, R. J. (1986). Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy. Statistical Science, 1(1), 54 – 77. Efron, B., & Tibshirani, R. J. (1993). An introduction to the bootstrap. United States of America: Chapman & Hall Inc. Erceg-Hurn, D. M., & Mirosevich, V. M. (2008). Modern robust statistical methods. American Psychologist, 63(7), 591-601. Fan, W., & Hancock, G. R. (2012). Robust means modeling: an alternative for hypothesis testing of independent means under variance heterogeneity and nonnormality. Journal of Educational and Behavioral Statistics, 37(1), 137–156. doi: 10.3102/1076998610396897 Guo, Jiin-Huarng & Luh, Wei-Ming (2000). An invertible transformation twosample trimmed t-statistic under heterogeneity and nonnormality. Statistic & Probability Letters. 49(1), 1-7. doi:10.1016/S0167-7152(00)00022-5 Haddad, F. S., Syed Yahaya S. S., & Alfaro J. L. (2013). Alternative Hotelling's T2 charts using winsorized modified one-step M-estimator. Quality and Reliability Engineering International, 29(4), 583-593. doi: 10.1002/qre.1407 Hall, P. (1986). On the number of bootstrap simulations required to construct a confidence interval. The Annals of Statistics, 14 (4), 1453-1462. Hall, P., & Padmanabhan, A. R. (1992). On the bootstrap and the trimmed mean. Journal of Multivariate Analysis, 41(1), 132-153. doi:10.1016/0047-259X(92)90062-K Hampel, F. (1968). Contribution to the Theory of Robust Estimation. (Ph.D. thesis). University of California, Berkeley. Hayes, A. F. (2005). Statistical methods for communication science. New Jersey: Lawrence Erlbaum Associates, Inc. Hoaglin, D. C. (1985). Summarizing shape numerically: The g-and h-distributions. In D. Hoaglin, F. Mosteller & J. Tukey (Eds), Exploring data tables, trends, and shapes (pp. 461–513). New York, NY: Wiley. Hogg, R. V. (1974). Adaptive robust procedures: a partial review and some suggestions for future applications and theory. Journal of the American Statistical Association, 69(348), 909-923. Huber, P. J. (1964). Robust estimation of a location parameter. Annals of Mathematical Statistics, 35(1), 73 – 101. doi:10.1214/aoms/1177703732 Huber, P. J. (1981). Robust Statistics. New York: Wiley. Keselman, H. J., Huberty, C. J., Lix, L. M., Olejnik, S., Cribbie, R. A., Donahue, B., …Levin, J. R. (1998). Statistical practices of educational researchers: An analysis of their ANOVA, MANOVA, and ANCOVA analyses. Review of Educational Research, 68(3), 350-386. doi: 10.3102/00346543068003350 Keselman, H. J., Wilcox, R. R., Lix, L. M., Algina, J., & Fradette, K. (2007). Adaptive robust estimation and testing. British Journal of Mathematical and Statistical Psychology, 60(2), 267–293. doi: 10.1348/000711005X63755 Keselman, H. J., Wilcox, R. R., Othman, A. R., & Fradette, K. (2002). Trimming, transforming statistics, and bootstrapping: circumventing the biasing effects of heteroscedasticity and non-normality. Journal of Modern Applied Statistical Methods, 1(2), 288 – 399. Keselman, H. J., Othman, A. R., Wilcox, R. R., & Fradette, K. (2004). The new and improved two-sample t-test. American Psychological Society, 15(1), 57-51. Keselman, H. J., Algina, J., Lix, L., Wilcox, R. R., & Deering, K. (2008). A generally robust approach for testing hypotheses and setting confidence intervals for effect sizes. Psychological Methods, 13(2), 110-129. doi: 10.1037/1082-989X.13.2.110. Kulinskaya, E., Staudte, R. G., & Gao, H. (2003). Power approximations in testing for unequal means in a one-way ANOVA weighted for unequal variances. Communications in Statistics - Theory and Method, 32(12), 2353-2371. doi: 10.1081/STA-120025383 Lix, L. M., & Keselman, H. J. (1998). To trim or not to trim: tests of location equality under heteroscedasticity and non-normality. Educational and Psychological Measurement, 58(3), 409-442. doi: 10.1177/0013164498058003004 Manly, B. F. J. (2007). Randomization, bootstrap and Monte Carlo methods in biology (3rd ed.). Boca Raton, FL: Chapman & Hall/CRC. Md Yusof, Z, Abdullah, S., Syed Yahaya, S. S., & Othman, A. R. (2011). Testing the equality of central tendency measures using various trimming strategies. African Journal of Mathematics and Computer Science Research,4(1), 32-38. Md Yusof, Z., Abdullah, S., & Syed Yahaya, S. S. (2012a). Type I error rate of parametric, robust and nonparametric methods for two group cases. World Applied Sciences Journal, 16(12), 1817-1819. Md Yusof, Z, Abdullah, S., Syed Yahaya, S. S., & Othman, A. R. (2012b), A robust alternative to the t-test. Modern Applied Science, 6(5), 27-33. doi:10.5539/mas.v6n5p27 Mendes, M., & Akkartal, E., (2010). Comparison of ANOVA F and WELCH tests with their respective permutation versions in terms of Type I error rate and test power. Kafkas Univ Vet Faj Derg, 16(5). 711-716. Mendes, M., & Yigit, S. (2012), Comparison of ANOVA-F and ANOM tests with regard to Type I error rate and test power. Journal of Statistical Computation and Simulation, 83(11), 1-12. doi: 10.1080/00949655.2012.679942 Murphy, K. R., Myors, B. & Wolach, A. (2008). Statistical power analysis: a simple and general model for traditional and modern hypothesis tests (3rd ed.). New York: Routledge. Othman, A. R., Keselman, H. J., Padmanabhan, A. R., Wilcox, R. R & Fradette, K. (2004). Comparing measures of the "typical" score across treatment groups. British Journal of Mathematical and Statistical Psychology. 57(2), 215-234. Reed, J. F., & Stark, D. B. (1996). Hinge estimators of location: robust to asymmetry. Computer Methods and Programs Biomedicine, 49(1), 11-17. Rivest, L. P. (1994). Statistical properties of Winsorized means for skewed distributions. Biometrika, 81(2), 373-383. doi: 10.2307/2336967 Rogan, J. C., & Keselman, H.J. (1977). Is the ANOVA F-test robust to variance heterogeneity when sample sizes are equal? An investigation via a coefficient of variation. American Educational Research Journal, 14(4), 493 –498. doi: 10.3102/00028312014004493 Rosenberger, J. L., & Gasko, M. (1983). Comparing location estimators: Trimmed means, medians, and trimean. In D. Hoaglin, F. Mosteller, & J. Tukey (Eds.), Understanding robust and exploratory data analysis (pp. 297– 336). New York: Wiley. Rousseeuw, P. J., & Croux, C. (1993). Alternatives to the median absolute deviation. Journal of the American Statistical Association, 88(424), 1273-1283. doi: 10.1080/01621459.1993.10476408 Sawilowsky, S.S. (1990). Nonparametric tests of interaction in experimental design. Review of Educational Research, 60(1), 91-126. doi: 10.3102/00346543060001091 Sawilowsky, S. S., & Blair, R. C. (1992). A more realistic look at the robustness and Type II error properties of the t-test to departures from population normality. Psychological Bulletin, 111(2), 352-360. doi: 10.1037/0033-2909.111.2.352 Schneider, P. J., & Penfield, D. A. (1997). Alexander and Govern's approximation: providing an alternative to ANOVA under variance heterogeneity. The Journal of Experimental Education, 65(3), 271-286. Scheffe, H. (1959). The Analysis of Variance. New York: Wiley. Schrader, R. M., & Hettmansperger, T. P. (1980). Robust Analysis of Variance Based Upon a Likelihood Ratio Criterion. Biometrika, 67(1), 93-101. doi: 10.1093/biomet/67.1.93 Sharma, D., & Kibria, B. M. G., (2012). On some test statistics for testing homogeneity of variances: a comparative study. Journal of Statistical Computation and Simulation, 83(10), 1-20. doi: 10.1080/00949655.2012.675336 Siegel, S. (1957). Nonparametric statistics. The American Statistician, 11(3), 13-19. Snedecor, G. W., & Cochran, W. G. (1980). Statistical methods (7th ed.). Ames, IA: Iowa University Press. Spector, P. E. (1993). SAS programming for researchers and social scientists. Newbury Park: Sage Publication Inc. Staudte, R. G., & Sheather, S. J. (1990). Robust estimation and testing. New York : John Wiley & Sons Inc. Syed Yahaya, S. S., Othman, A. R. & Keselman, H. J. (2004a). Testing the equality of location parameters for skewed distributions using S1 with high breakdown robust scale estimators. In M. Hubert, G. Pison, A. Struyf, & S. Van Aelst (Eds.), Theory and Applications of Recent Robust Methods, Series: Statistics for Industry and Technology (pp. 319-328). Basel: Birkhauser. Syed Yahaya, S. S., Othman, A. R., & Keselman, H. J. (October, 2004b). An alternative approach for testing location measures in the one-way independent group design. Paper presenting session of the International Conference on Statistics and Mathematics and Its Applications in the Development of Science and Technology. Bandung, Indonesia. Syed Yahaya, S. S. (2005). Robust statistical procedures for testing the equality of central tendency parameters under skewed distributions. (Unpublished Doctoral thesis). Universiti Sains Malaysia, Malaysia. Syed Yahaya, S. S., Othman, A. R., & Keselman, H. J. (2006). Comparing the “typical score” across independent groups based on different criteria for trimming. Metodološki zvezki, 3(1), 49-62. Tomarkin, A. J., & Serlin, R. C. (1986). Comparison of ANOVA alternatives under variance heterogeneity and specific noncentrality structures. Psychological Bulletin. 99(1), 90 – 99. Tukey J. W., & McLaughlin D. H. (1963). Less vulnerable confidence and significance procedures for location based on a single sample: trimming/winsorization 1. Sankhyā: The Indian Journal of Statistics, Series A, 25(3), 331-352. Welch, B. L. (1951). On the comparison of several mean values: an alternative approach. Biometrika, 38(3/4), 330-336. Westfall, P. H., & Young, S. S. (1993). Resampling-based multiple testing. New York: Wiley. Wilcox, R. R. (1994). A one-way random effects model for trimmed means. Psychometrika, 59 (3), 289-307. Wilcox, R. R. (1995). ANOVA: the practical importance of heteroscedastic methods, using trimmed means versus means, and designing simulation studies. British Journal of Mathematical and Statistical Psychology, 48(1), 99-114. doi: 10.1111/j.2044-8317.1995.tb01052.x Wilcox, R. R. (1997). Introduction to robust estimation and hypothesis testing (3rd ed.). New York: Academic Press. Wilcox, R. R. (1998). How many discoveries have been lost by ignoring modern statistical methods?. American Psychologist, 53(3), 300–314. doi: 10.1037//0003-066X.53.3.300 Wilcox, R. R. (2003). Applying contemporary statistical techniques. San Diego: Academic Press. Wilcox, R. R. (2012). Introduction to robust estimation and hypothesis testing (3rd ed.). New York: Academic Press. Wilcox, R. R., Charlin, V. L., & Thompson, K. L. (1986). New Monte Carlo results on the robustness of the ANOVA F, W and F* statistics. Communications in Statistics-Simulations, 15(4), 933-943. doi: 10.1080/03610918608812553 Wilcox, R. R., & Keselman, H. J. (2002). Power analyses when comparing trimmed means. Journal of Modern Applied Statistical Methods, 1(1), 24-31. Wilcox, R.R., & Keselman, H. J. (2003a). Modern robust data analysis methods: measures of central tendency. Psychological Methods, 8(3), 254–274. doi: 10.1037/1082-989X.8.3.254 Wilcox, R. R., & Keselman H. J. (2003b). Repeated measures ANOVA based on a modified one-step M-estimator. Journal of British Mathematical and Statistical Psychology, 56(1), 15 – 26. doi: 10.1348/000711003321645313 Wilcox, R. R., Keselman, H. J., & Kowalchuk, R. K. (1998). Can test for treatment group equality be improved? The bootstrap and trimmed means conjecture. British Journal of Mathematical and Statistical Psychology, 51(1), 123-134. doi: 10.1111/j.2044-8317.1998.tb00670.x Wilcox, R. R., Keselman H. J., Muska, J., & Cribbie, R. (2000). Repeated measures ANOVA: Some new results on comparing trimmed means and means. British Journal of Mathematical and Statistical Psychology, 53(1), 69-82. Yang, K., Li, J., & Gao, H. (2006). The impact of sample imbalance on identifying differentially expressed genes. BMC Bioinformatics, 7(4), S8. doi: 10.1186/1471-2105-7-S4-S8