• Title/Summary/Keyword: statistical validity test

An Assessment of Statistical Validity of Articles Published in the Journal of Korean Acupuncture & Moxibustion Society - from 1984 to 2002 - (대한침구학회지 논문의 통계적 오류에 관한 연구)

  • Lee, Seung-deok
    • Journal of Acupuncture Research
    • /
    • v.21 no.1
    • /
    • pp.176-188
    • /
    • 2004
  • This study investigated the statistical validity of medical articles that used statistical techniques such as the t-test, analysis of variance, correlation analysis, regression analysis, and the chi-square test. A total of 429 original articles using those methods were selected from the Journal of Korean Acupuncture & Moxibustion Society, published from 1984 to 2002, and their statistical procedures were reviewed. The results are summarized as follows: 1. Of the 429 articles, 93 (21.68%) did not describe the statistical method in detail. 2. 53 articles (12.53%) did not report the p-value correctly; 245 articles (57.11%) reported mean $\pm$ standard error (Mean $\pm$ SEM) and 109 articles reported mean $\pm$ standard deviation (Mean $\pm$ SD). All 23 articles using nonparametric techniques made an error in reporting central tendency or dispersion. 3. Of 292 articles, 175 (59.93%) and 14 (4.79%) made an error in describing equal variances and normal distribution, respectively. 4. 99 (50%) of 185 articles misused the t-test, and 4 of 5 articles misused the chi-square test. 5. 28 (73.68%) of 38 articles using discrete variables misused a parametric technique such as the t-test or ANOVA. Of 125 articles with paired samples, 2 misused the independent t-test and 1 misused the Mann-Whitney U test. 6. 20 articles using analysis of variance did not perform multiple comparisons.

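The abstract above separates articles reporting mean $\pm$ SD from those reporting mean $\pm$ SEM. As a minimal sketch of why the two are not interchangeable, the following computes both for one sample. The function name `describe` and the data are hypothetical and are not taken from the reviewed articles.

```python
import math
import statistics

def describe(sample):
    """Return (mean, sd, sem) for a list of numbers."""
    n = len(sample)
    mean = statistics.mean(sample)
    sd = statistics.stdev(sample)   # sample standard deviation (n - 1 denominator)
    sem = sd / math.sqrt(n)         # standard error of the mean
    return mean, sd, sem

# Hypothetical measurements, for illustration only.
values = [4.0, 6.0, 8.0, 10.0, 12.0]
mean, sd, sem = describe(values)
print(f"mean +/- SD  = {mean:.2f} +/- {sd:.2f}")
print(f"mean +/- SEM = {mean:.2f} +/- {sem:.2f}")
```

The SD describes the spread of the observations, while the SEM describes the precision of the estimated mean and shrinks as the sample grows, so a report should state which one is shown.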

A Study on the Statistical Model Validation using Response-adaptive Experimental Design (반응적응 시험설계법을 이용하는 통계적 해석모델 검증 기법 연구)

  • Jung, Byung Chang;Huh, Young-Chul;Moon, Seok-Jun;Kim, Young Joong
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2014.10a
    • /
    • pp.347-349
    • /
    • 2014
  • Model verification and validation (V&V) is a current research topic that aims to build computational models with high predictive capability by addressing the general concepts, processes, and statistical techniques involved. The hypothesis test for validity check is one of the model validation techniques and gives a guideline for evaluating the validity of a computational model when only limited experimental data exist because of restricted test resources (e.g., time and budget). The hypothesis test for validity check mainly employs Type I error, the risk of rejecting a valid computational model, for the validity evaluation, since quantifying Type II error is not feasible in model validation. However, Type II error, the risk of accepting an invalid computational model, deserves serious consideration for engineered products whose predicted results carry high risk. This paper proposes a technique, named the response-adaptive experimental design, that reduces Type II error by adaptively designing the experimental conditions for the validation experiment. A tire tread block problem and a numerical example are employed to show the effectiveness of the response-adaptive experimental design for validity evaluation.

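To make the two error types in the abstract above concrete, here is a minimal Monte Carlo sketch. It is not the response-adaptive design proposed in the paper: it assumes a plain two-sided z-test with known standard deviation, where the null hypothesis is that the model prediction (`mu0`) matches the mean of the experimental data. The names `rejects` and `error_rates` and all parameter values are hypothetical.

```python
import random
import statistics
from statistics import NormalDist

def rejects(sample, mu0, sigma, alpha=0.05):
    """Two-sided z-test of H0: true mean == mu0, with known sigma."""
    n = len(sample)
    z = (statistics.mean(sample) - mu0) / (sigma / n ** 0.5)
    critical = NormalDist().inv_cdf(1 - alpha / 2)
    return abs(z) > critical

def error_rates(n, shift, trials=2000, seed=0):
    """Estimate (Type I rate, Type II rate) by simulation.

    Type I: the model is valid (data mean 0) but the test rejects it.
    Type II: the model is invalid (data mean = shift) but the test accepts it.
    """
    rng = random.Random(seed)
    type1 = sum(
        rejects([rng.gauss(0.0, 1.0) for _ in range(n)], 0.0, 1.0)
        for _ in range(trials)
    ) / trials
    type2 = sum(
        not rejects([rng.gauss(shift, 1.0) for _ in range(n)], 0.0, 1.0)
        for _ in range(trials)
    ) / trials
    return type1, type2

for n in (5, 40):
    t1, t2 = error_rates(n, shift=0.5)
    print(f"n={n}: Type I ~ {t1:.3f}, Type II ~ {t2:.3f}")
```

In this simplified setting the Type I rate stays near the chosen significance level, while the Type II rate depends strongly on how many validation experiments are available, which is the limitation the abstract describes.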

Statistical Approach to Test Construct Validity and Obtain Weights for the Children's Dietary Life Recognition and Practice Index (우리나라 초등학교 어린이의 식생활 인지.실천 수준 평가지표 구성타당도 평가 및 산정방법 연구)

  • Kwon, Se-Hyug;Kim, Hye-Young P.;Lee, Jung-Sug;Kwa, Tong-Kyung;Chung, Hae-Rang;Choi, Young-Sun;Kang, Myung-Hee
    • Journal of Nutrition and Health
    • /
    • v.44 no.1
    • /
    • pp.41-48
    • /
    • 2011
  • Constructs with seven latent evaluation indicators and 18 observable survey questions were developed by food and nutrition experts to calculate a food safety recognition and practice index for children. The purpose of this study was to suggest statistical approaches to test the construct validity of these constructs, obtain weights for the evaluation indicators, and develop questionnaires to calculate a children's food recognition and practice index. Survey data from 2,400 fifth-grade elementary school students were used as empirical results. Test validity was evaluated by exploratory factor analysis and confirmed to be highly significant by confirmatory factor analysis [i.e., linear structural relations (LISREL) analysis]. Standardized path coefficients from the LISREL analysis were suggested as the basis for the weights, and the weights were compared with those obtained using the AHP and Delphi methods.

Reliability and Validity Study on the Korean Version of the Fullerton Advanced Balance Scale (한국어판 플러턴 어드밴스드 균형 척도의 신뢰도와 타당도 연구)

  • Kim, Gyoung-mo
    • Physical Therapy Korea
    • /
    • v.23 no.1
    • /
    • pp.31-37
    • /
    • 2016
  • Background: An assessment tool developed in another country should be translated into Korean using rigorous methodological approaches before it is used in Korea. Because these procedures alone are insufficient for establishing cross-cultural and linguistic equivalence, statistical methods are also needed. The Fullerton Advanced Balance Scale was translated into Korean and its content validity was verified through the back-translation method, but its reliability and validity have not yet been established by statistical methods. Objectives: The purpose of this study was to investigate the reliability and validity of the Korean version of the Fullerton Advanced Balance Scale (KFAB) by statistical methods in elderly people. Methods: A total of 97 elderly adults (39 males and 58 females) participated in this study. Internal consistency of the KFAB was measured using Cronbach's alpha, and an intraclass correlation coefficient (ICC) was used to assess test-retest reliability between the two measurement sessions. Concurrent validity was measured by comparing the KFAB responses with the Korean version of the Berg Balance Scale (KBBS) using the Spearman correlation coefficient. Construct validity of the KFAB was measured using exploratory factor analysis to evaluate the unidimensionality of the questionnaire. The significance level was set at $\alpha=.05$. Results: The internal consistency of the KFAB was found to be adequate (Cronbach's alpha = .96), and test-retest reliability was excellent, as evidenced by the high ICC (r=.996). Concurrent validity showed a high correlation between the KFAB and KBBS (r=.89, p<.001). Construct validity was evaluated using exploratory factor analysis. The result of the Bartlett test of sphericity was statistically significant (p<.001), and the Kaiser-Meyer-Olkin measure of sampling adequacy was .93. Exploratory factor analysis revealed only one dominant factor, which explained 76.43% of the variance. Conclusion: The KFAB is a reliable, valid, and appropriate tool for measuring balance function in elderly people.
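The internal-consistency measure named in the abstract above, Cronbach's alpha, can be sketched from its standard formula, $\alpha = \frac{k}{k-1}\left(1 - \frac{\sum_i s_i^2}{s_T^2}\right)$, where $k$ is the number of items, $s_i^2$ the variance of item $i$, and $s_T^2$ the variance of the total score. The function name `cronbach_alpha` and the scores below are hypothetical and are not KFAB data.

```python
import statistics

def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents-by-items score table.

    rows: list of lists, one inner list of item scores per respondent.
    """
    k = len(rows[0])                                   # number of items
    items = list(zip(*rows))                           # one tuple of scores per item
    item_variances = [statistics.variance(col) for col in items]
    total_variance = statistics.variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)

# Hypothetical scores: 5 respondents, 4 items each.
scores = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [4, 4, 4, 3],
    [1, 2, 1, 2],
    [3, 3, 4, 4],
]
print(f"Cronbach's alpha = {cronbach_alpha(scores):.3f}")
```

Alpha approaches 1 when the items rise and fall together across respondents, which is why a high value is read as evidence of internal consistency.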

Bootstrap Median Tests for Right Censored Data

  • Park, Hyo-Il;Na, Jong-Hwa
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.4
    • /
    • pp.423-433
    • /
    • 2000
  • In this paper, we consider applying the bootstrap method to median test procedures for right-censored data. To do this, we show that the median test statistic can be represented by the difference of two sample medians. We then review the resampling methods for censored data and propose test procedures under the location translation assumption and for the Behrens-Fisher problem. We also compare our procedures with another resampling method, the so-called permutation test, through an example. Finally, we show the validity of the bootstrap median test procedure in the appendix.

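The abstract above bases the test on the difference of two sample medians. The sketch below shows that idea for complete (uncensored) data only; it does not handle right censoring and is not the authors' procedure. It assumes a location-shift model, imposes the null hypothesis by centring each sample at its own median, and resamples within each group. The function name `bootstrap_median_test` and the data are hypothetical.

```python
import random
import statistics

def bootstrap_median_test(x, y, n_boot=2000, seed=0):
    """Two-sided bootstrap test for equal medians (uncensored data only).

    Returns (observed median difference, bootstrap p-value).
    """
    rng = random.Random(seed)
    med_x, med_y = statistics.median(x), statistics.median(y)
    observed = med_x - med_y
    # Impose H0: shift each sample so that its median is zero.
    x0 = [v - med_x for v in x]
    y0 = [v - med_y for v in y]
    extreme = 0
    for _ in range(n_boot):
        xb = rng.choices(x0, k=len(x0))   # resample with replacement
        yb = rng.choices(y0, k=len(y0))
        diff = statistics.median(xb) - statistics.median(yb)
        if abs(diff) >= abs(observed):
            extreme += 1
    # Add-one correction keeps the p-value strictly positive.
    return observed, (extreme + 1) / (n_boot + 1)

# Hypothetical samples, for illustration only.
group_a = [5.1, 6.3, 4.8, 7.0, 5.9, 6.4, 5.5]
group_b = [6.9, 7.4, 8.1, 6.6, 7.8, 7.2, 8.4]
diff, p_value = bootstrap_median_test(group_a, group_b)
print(f"median difference = {diff:.2f}, bootstrap p-value = {p_value:.4f}")
```

A permutation test, which the abstract mentions as a comparison, would instead pool the two samples and reassign group labels, and so relies on exchangeability under the null hypothesis.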

On the Goodness-of-fit Test in Regression Using the Difference Between Nonparametric and Parametric Fits

  • Hong, Chang-Kon;Joo, Jae-Seon
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.1
    • /
    • pp.1-14
    • /
    • 2001
  • This paper discusses the choice of the weight function of the Härdle and Mammen statistic in the nonparametric goodness-of-fit test for a regression curve. For this purpose, we modify the Härdle and Mammen statistic and derive its asymptotic distribution. Some results on the test statistic from the wild bootstrapped sample are also obtained. We check the validity of these results through a Monte Carlo experiment. Finally, we study the power of the test and compare it with that of the Härdle and Mammen test through simulation.


The consideration for methods of statistical analysis about the thesis published in the journal of korean oriental medical Ophthalmology & Otolaryngology & Dermatology from 2003 to 2005 (2003년부터 2005년까지 안이비인후피부과 학회지에 게재된 논문들의 통계적 분석 방법에 대한 고찰)

  • Kim, Keoo-Seok;Nam, Hae-Jung;Park, Owe-Suk;Kim, Hee-Jeong;Cha, Jae-Hoon;Kim, Yoon-Bum
    • The Journal of Korean Medicine Ophthalmology and Otolaryngology and Dermatology
    • /
    • v.19 no.3 s.31
    • /
    • pp.134-145
    • /
    • 2006
  • Objective: This study investigated what assumptions and conditions are required to apply statistical techniques such as descriptive statistics, the t-test, analysis of variance, correlation analysis, regression analysis, and the chi-square test, and evaluated whether they are used correctly in the research process. Methods: One or more statistical analysis methods were used in 91 of the 162 papers selected from the Journal of Korean Oriental Medical Ophthalmology & Otolaryngology & Dermatology from April 2003 to December 2005. We classified the statistical analysis methods used in these 91 papers (clinical and experimental studies) and assessed the validity of the statistical techniques with a checklist of 34 items (3 items for descriptive statistics, 6 for the t-test, 7 each for analysis of variance, correlation analysis, and regression analysis, and 4 for the chi-square test). Results: 1. Of the 162 selected papers, 65 (40%) were experimental trials, 55 (34%) were case reports, 26 (16%) were clinical trials, and 16 (10%) were reviews; 91 of them used statistical techniques. 2. One or more statistical analysis methods were used in the experimental and clinical studies. When the 125 units of statistical analysis in the 91 papers were classified by technique, the independent-sample t-test accounted for 33 (26%), descriptive statistics alone for 28 (22%), one-way ANOVA for 15 (12%), and nonparametric tests for 10 (8%). 3. Of the 23 cases that carried out one-way ANOVA, 15 used a multiple comparison method (Scheffe: 6 (26%), Duncan: 4 (17%), Dunnett: 3 (13%), Tukey: 2 (9%)), and 8 (35%) did not. 4. Of the 63 cases using statistical techniques other than descriptive statistics, 5 (8%) were appropriate and the other 58 (92%) were inappropriate, indicating serious misuse of statistics in the journal. 5. In experimental and clinical studies (excluding descriptive statistics), 31 cases (34%) had a sample size below 10, and 41 cases (45%, including culture studies) did not mention the sample size. 6. Examples of statistical errors included the wrong choice of statistical technique and failure to check standard assumptions (such as normal distribution, equal variance, and independence). Conclusions: We investigated the validity of the statistical analysis methods in the journal using a 34-item checklist and suggested correct statistical analysis methods. Education on statistical analysis methods and their precise application should be promoted to enhance the objectivity and reliability of articles and to better serve the purpose of scientific study.


Multi-facet Analysis on Validity of Sasang Type Diagnostic Test (사상체질 진단검사 타당성 분석에 대한 연구)

  • Lee, Soo-Jin;Kim, Myoung-Geun;Chae, Han
    • The Journal of Korean Medicine
    • /
    • v.29 no.1
    • /
    • pp.7-14
    • /
    • 2008
  • Purpose: The purpose of this study was to develop generalized validity evaluation methods and terms for Sasang type diagnostic tests. Methods: A generalized statistical evaluation model for Sasang typology was suggested, and generalized validity evaluation indices were proposed with this model. Results: The usefulness of validity measures such as sensitivity and specificity was confirmed by a systematic review of data from previously reported studies. Conclusion: Major obstacles to multi-facet analysis and systematic review of Sasang type diagnostic tests are discussed in light of this test validity study.

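Sensitivity and specificity, the validity measures named in the abstract above, follow directly from a two-by-two table of test result against reference diagnosis. The sketch below is generic and does not reproduce the paper's generalized model for multi-type Sasang classification; the function name `sensitivity_specificity` and the counts are hypothetical.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    tp: reference positive, test positive
    fn: reference positive, test negative
    tn: reference negative, test negative
    fp: reference negative, test positive
    """
    sensitivity = tp / (tp + fn)   # share of true positives detected
    specificity = tn / (tn + fp)   # share of true negatives correctly ruled out
    return sensitivity, specificity

# Hypothetical counts for one type versus all other types.
sens, spec = sensitivity_specificity(tp=40, fn=10, tn=45, fp=5)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

For a test with more than two outcome types, one common convention is to compute these indices once per type, treating that type as positive and all others as negative.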

Statistical Errors of Articles Published in the Journal of Oriental Rehabilitation Medicine(I) (한방재활의학과학회지의 통계적 오류에 관한 고찰(I))

  • Park, Tae-Yong;Heo, Tae-Young;Shin, Byung-Cheul
    • Journal of Korean Medicine Rehabilitation
    • /
    • v.20 no.4
    • /
    • pp.105-130
    • /
    • 2010
  • Objectives: The purpose of this study was to assess errors in the statistical methods used in the Journal of Oriental Rehabilitation Medicine (JORM) and to identify the types of errors in statistical analysis. Methods: We reviewed quantitative articles published in the JORM from January 2005 through October 2009. Articles that did not use statistical analysis, such as literature studies, case studies, and review articles, were excluded. A total of 296 articles was reviewed. We evaluated the adequacy and validity of the statistical techniques with a checklist established by modifying Lee's checklist, and three statistical evaluators assessed the articles together to minimize bias. Results: Of the 222 articles, 213 used inferential and descriptive statistics. Of the articles adopting descriptive and inferential statistics, 80% were found to contain statistical errors. On average, each article used 1.7 statistical methods. The most frequently employed statistics were, in order, Student's t-test, one-way ANOVA, Pearson correlation analysis, the Mann-Whitney U test, the paired t-test, and the chi-square test. The statistics most frequently associated with errors followed a similar order. The most common statistical errors were as follows: 1. absence of a normality test, 2. confusion between paired and unpaired tests, 3. wrong choice of repeated measures analysis without consideration of time variables, 4. increase of Type I error through inappropriate multiple testing, 5. inappropriate use of discrete or categorical data instead of continuous data in correlation analysis, 6. poor consideration of the basic assumptions of the chi-square test, 7. confusion between frequency comparison and mean comparison, 8. mentioning a statistical technique without using it. Conclusions: We found various mistakes and misuses in the application of statistical methodologies in the articles published in the JORM. Careful consideration of statistical use and review by statistics specialists are warranted to improve the quality of the JORM.
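The second error listed above, confusing paired and unpaired tests, can change the test statistic substantially. As a minimal sketch, the following computes a paired t statistic and an unpaired (Welch) t statistic on the same before-and-after measurements. Only the statistics are computed, not p-values; the function names `paired_t` and `welch_t` and the data are hypothetical.

```python
import math
import statistics

def paired_t(before, after):
    """t statistic for paired samples, based on within-subject differences."""
    diffs = [a - b for a, b in zip(after, before)]
    se = statistics.stdev(diffs) / math.sqrt(len(diffs))
    return statistics.mean(diffs) / se

def welch_t(group1, group2):
    """t statistic for two independent samples (Welch, unequal variances)."""
    se = math.sqrt(
        statistics.variance(group1) / len(group1)
        + statistics.variance(group2) / len(group2)
    )
    return (statistics.mean(group2) - statistics.mean(group1)) / se

# Hypothetical scores for the same five subjects before and after treatment.
before = [10, 12, 14, 16, 18]
after = [11, 14, 15, 18, 19]
print(f"paired t    = {paired_t(before, after):.3f}")
print(f"unpaired t  = {welch_t(before, after):.3f}")
```

When the same subjects are measured twice, the paired statistic removes between-subject variability, so treating such data as independent groups can hide an effect that the paired analysis would detect.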

The estimation of cholesterol intake in elderly: reliability and validity of short, Semi-Quantitative Food Frequency Questionnaire (SQ-FFQ)

  • Nindya, Triska Susila;Mahmudiono, Trias;Rachmah, Qonita
    • Journal of Nutrition and Health
    • /
    • v.54 no.1
    • /
    • pp.95-103
    • /
    • 2021
  • Purpose: High intake of cholesterol leads to cardiovascular disruption. Estimating the actual intake of cholesterol can be beneficial for nutrition intervention. This research aimed to develop a Semi-Quantitative Food Frequency Questionnaire (SQ-FFQ) to estimate cholesterol intake and to analyze its reliability and validity. Methods: The SQ-FFQ was developed by sorting high-cholesterol food items in the Indonesian food database and considering the availability of those food items. A total of 30 older adults were randomly chosen from a Public Health Center in Jagir District, Surabaya, Indonesia to test its validity. Reliability was tested by administering the same SQ-FFQ twice over a one-month period, while validity was tested by comparing the SQ-FFQ results with a 6-day food record. Reliability was analyzed using the paired t-test, the intraclass correlation coefficient (ICC), and Cronbach's α as a measure of internal consistency. Validity of the developed SQ-FFQ was analyzed using the paired t-test and Bland-Altman analysis. Results: The two administrations of the SQ-FFQ showed good agreement based on the paired t-test (p = 0.200), ICC (0.609), and Cronbach's α (0.757). Strong agreement was found for most food items, but agreement for egg yolk and fried duck was poor. Significant differences were found for those two food items (p = 0.001 and p < 0.001, respectively), with mean differences of -25.3 mg and 46.2 mg. The developed SQ-FFQ2 also showed strong agreement with the 6-day food records based on the paired t-test and Bland-Altman analysis. Conclusion: This baseline research provides a reasonably valid and repeatable measure of cholesterol intake that can be widely used in nutrition and public health studies, especially in Indonesia. No previous study in Indonesia has developed tools to estimate cholesterol intake.
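The Bland-Altman analysis mentioned in the abstract above summarizes agreement between two measurement methods by the mean difference (bias) and the 95% limits of agreement, bias $\pm$ 1.96 times the standard deviation of the differences. The sketch below computes these three numbers; the function name `bland_altman` and the intake values are hypothetical and are not taken from the study.

```python
import statistics

def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) of agreement for paired data."""
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = statistics.mean(diffs)       # mean difference between methods
    sd = statistics.stdev(diffs)        # sample SD of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd

# Hypothetical daily cholesterol intake (mg) for six participants.
ffq = [210.0, 185.0, 250.0, 300.0, 175.0, 230.0]
food_record = [200.0, 190.0, 240.0, 320.0, 170.0, 225.0]
bias, lower, upper = bland_altman(ffq, food_record)
print(f"bias = {bias:.1f} mg, limits of agreement = [{lower:.1f}, {upper:.1f}] mg")
```

Whether the resulting limits are acceptable is a subject-matter judgment about how large a disagreement matters in practice, not something the calculation itself decides.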