• Title/Summary/Keyword: Chi-Square

A Note on the Chi-Square Test for Multivariate Normality Based on the Sample Mahalanobis Distances

  • Park, Cheolyong
    • Journal of the Korean Statistical Society / v.28 no.4 / pp.479-488 / 1999
  • Moore and Stubblebine (1981) suggested a chi-square test for multivariate normality based on cell counts calculated from the sample Mahalanobis distances. They derived the limiting distribution of the test statistic only for the case of equiprobable cells. Using conditional limit theorems, we derive the limiting distribution of the statistic as well as the asymptotic normality of the cell counts. These distributions remain valid even when the cells are not equiprobable. Finally, we apply the method to a real data set.
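
The cell-count construction described in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not the paper's procedure: it handles the bivariate case (p = 2), where the chi-square(2) quantile function has the closed form -2 ln(1 - q), and it takes the equiprobable cell boundaries from that large-sample reference distribution. The function name and the simulated data are hypothetical. No p-value is computed, because the limiting distribution of the statistic is the subject of the paper and is not reproduced here.

```python
import math
import random


def mahalanobis_cell_statistic(points, n_cells=4):
    """Pearson-type statistic from squared sample Mahalanobis distances (p = 2 only)."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    # Entries of the sample covariance matrix S (divisor n - 1).
    sxx = sum((x - mx) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in points) / (n - 1)
    det = sxx * syy - sxy ** 2
    # Squared distance d^2 = (v - mean)^T S^{-1} (v - mean), using the 2x2 inverse.
    d2 = [
        (syy * (x - mx) ** 2 - 2 * sxy * (x - mx) * (y - my) + sxx * (y - my) ** 2) / det
        for x, y in points
    ]
    # Equiprobable cell boundaries from chi-square(2) quantiles: -2 * ln(1 - q).
    bounds = [-2.0 * math.log(1.0 - k / n_cells) for k in range(1, n_cells)]
    counts = [0] * n_cells
    for d in d2:
        counts[sum(d > b for b in bounds)] += 1
    expected = n / n_cells
    stat = sum((c - expected) ** 2 / expected for c in counts)
    return counts, stat


# Simulated (hypothetical) bivariate normal sample, used only to exercise the function.
random.seed(0)
sample = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(200)]
counts, stat = mahalanobis_cell_statistic(sample)
print(counts, stat)
```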

Criteria of Association Rule based on Chi-Square for Nominal Database

  • Park, Hee-Chang;Lee, Ho-Soon
    • Korean Data and Information Science Society: Conference Proceedings / 2004.04a / pp.25-38 / 2004
  • Association rule mining searches for interesting relationships among items in a given database. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control. There are three primary quality measures for an association rule: support, confidence, and lift. In this paper we present the relation between a measure of association based on the chi-square statistic and the association rule criteria for a nominal database, and we propose objective criteria for association.
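
As a rough illustration of the quantities named in this abstract, the sketch below computes support, confidence, and lift for a rule A -> B, together with the usual Pearson chi-square statistic of the corresponding 2x2 table. The function name and the transaction counts are hypothetical, and the paper's own proposed criteria are not reproduced.

```python
def rule_measures(n_ab, n_a, n_b, n):
    """Support, confidence, lift and 2x2 chi-square for the rule A -> B."""
    support = n_ab / n
    confidence = n_ab / n_a
    lift = confidence / (n_b / n)
    # 2x2 table cells: a = A and B, b = A only, c = B only, d = neither.
    a, b, c = n_ab, n_a - n_ab, n_b - n_ab
    d = n - a - b - c
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    return support, confidence, lift, chi2


# Hypothetical counts: 1000 transactions, 200 contain A, 250 contain B, 100 contain both.
support, confidence, lift, chi2 = rule_measures(100, 200, 250, 1000)
print(support, confidence, lift, chi2)
```

A lift of 1 corresponds to independence of A and B, in which case the chi-square statistic is 0.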

A Nonparametric Test for the Parallelism of Regression Lines Based on Kendall's Tau (Kendall의 Tau에 의한 회귀직선의 평행성에 관한 비모수 검정)

  • Song, Moon-Sup
    • Journal of the Korean Statistical Society / v.7 no.1 / pp.17-26 / 1978
  • For testing $\beta_i=\beta, i=1,...,k$, in the regression model $Y_{ij} = \alpha_i + \beta_ix_{ij} + e_{ij}, j=1,...,n_i$, a simple and robust test based on Kendall's tau is proposed. Its asymptotic distribution is proved to be chi-square under the null hypothesis and noncentral chi-square under an appropriate sequence of alternatives. For the optimal designs, the asymptotic relative efficiency of the proposed procedure with respect to the least squares procedure is the same as that of the Wilcoxon test with respect to the t-test.

Minimum Chi-square estimation and the bootstrap (최소카이제곱추정과 붓스트랩)

  • 정한영;이기원;구자용
    • The Korean Journal of Applied Statistics / v.7 no.2 / pp.269-277 / 1994
  • The bootstrap approximation is compared with the ordinary asymptotic method in the context of minimum chi-square estimation, through application to a real problem. A fixed interval search method is shown to be superior to a random interval search method and to the Newton-Raphson method. All the procedures are implemented as S-Plus functions.
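
A minimal sketch of minimum chi-square estimation follows. It assumes that an evenly spaced grid search is a fair stand-in for the "fixed interval search" mentioned in the abstract, which may not match the authors' S-Plus implementation. The one-parameter cell model and the observed counts are hypothetical.

```python
def pearson_chi2(observed, probs):
    """Pearson chi-square distance between observed counts and model cell probabilities."""
    n = sum(observed)
    return sum((o - n * p) ** 2 / (n * p) for o, p in zip(observed, probs))


def cell_probs(theta):
    # Hypothetical one-parameter model: Binomial(2, theta) cell probabilities.
    return [(1 - theta) ** 2, 2 * theta * (1 - theta), theta ** 2]


def min_chi2_grid(observed, steps=1000):
    # Evaluate the criterion on an evenly spaced grid in (0, 1) and keep the minimizer.
    grid = [i / steps for i in range(1, steps)]
    return min(grid, key=lambda t: pearson_chi2(observed, cell_probs(t)))


# Hypothetical counts chosen to match theta = 0.3 exactly (49, 42, 9 out of 100).
theta_hat = min_chi2_grid([49, 42, 9])
print(theta_hat)
```

A bootstrap comparison of the kind the abstract describes would resample the counts and repeat the minimization for each resample; that step is omitted here.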

Tests for Uniformity : A Comparative Study

  • Rahman, Mezbahur;Chakrobartty, Shuvro
    • Journal of the Korean Data and Information Science Society / v.15 no.1 / pp.211-218 / 2004
  • The problem of assessing whether a data set comes from a specific distribution has received a good deal of attention. This topic is critically important for uniform distributions. Several parametric tests are compared. These tests can also be used to test the randomness of a sample. The Anderson-Darling $A^2$ statistic is found to be the most powerful.
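
For reference, the Anderson-Darling statistic for a fully specified Uniform(0, 1) null hypothesis can be computed as in the sketch below. It uses the standard formula $A^2 = -n - \frac{1}{n}\sum_{i=1}^{n}(2i-1)\left[\ln u_{(i)} + \ln(1-u_{(n+1-i)})\right]$. The function name is hypothetical, and no critical values are given because the abstract does not state any.

```python
import math


def anderson_darling_uniform(sample):
    """A^2 statistic for H0: the sample is i.i.d. Uniform(0, 1).

    All values must lie strictly between 0 and 1.
    """
    u = sorted(sample)
    n = len(u)
    s = sum(
        (2 * i - 1) * (math.log(u[i - 1]) + math.log(1.0 - u[n - i]))
        for i in range(1, n + 1)
    )
    return -n - s / n


# A single observation at 0.5 gives A^2 = -1 + 2 ln 2.
a2 = anderson_darling_uniform([0.5])
print(a2)
```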

A Sequence of Improvement over the Lindley Type Estimator with the Cases of Unknown Covariance Matrices

  • Kim, Byung-Hwee;Baek, Hoh-Yoo
    • Communications for Statistical Applications and Methods / v.12 no.2 / pp.463-472 / 2005
  • In this paper, the problem of estimating a $p$-variate ($p \ge 4$) normal mean vector is considered in a decision-theoretic setup. Using a simple property of the noncentral chi-square distribution, a sequence of estimators dominating the Lindley type estimator in the case of unknown covariance matrices is produced, and each improved estimator is better than the previous one.

Polymorphisms in Heat Shock Proteins A1B and A1L (HOM) as Risk Factors for Oesophageal Carcinoma in Northeast India

  • Saikia, Snigdha;Barooah, Prajjalendra;Bhattacharyya, Mallika;Deka, Manab;Goswami, Bhabadev;Sarma, Manash P;Medhi, Subhash
    • Asian Pacific Journal of Cancer Prevention / v.16 no.18 / pp.8227-8233 / 2016
  • Background: To investigate polymorphisms in heat shock proteins A1B and A1L (HOM) and the associated risk of oesophageal carcinoma in Northeast India. Materials and Methods: The study includes oesophageal cancer (ECA) patients attending the general outpatient department (OPD) and endoscopic unit of Gauhati Medical College. Patients were diagnosed based on endoscopic and histopathological findings. Genomic DNA was typed for HSPA1B1267 and HSPA1L2437 SNPs using the polymerase chain reaction with restriction fragment length polymorphisms. Results: A total of 78 cases and 100 age-sex matched healthy controls were included in the study, with a male:female ratio of 5:3 and a mean age of $61.4 \pm 8.5$ years. Clinico-pathological evaluation showed that 84% had squamous cell carcinoma and 16% had adenocarcinoma. Dysphagia grades 4 (43.5%) and 5 (37.1%) were observed by endoscopic and histopathological evaluation. The frequency of genomic variation of A1B from wild type A/A to heterozygous A/G and mutant G/G showed a positive association [chi sq=19.9, p<0.05], and the allelic frequency also showed a significant correlation [chi sq=10.3 for cases vs. controls, OR=0.32, $p \leq 0.05$]. The genomic variation of A1L from wild type T/T to heterozygous T/C and mutant C/C was found to be positively associated [chi sq=7.02, p<0.05] with development of ECA. In the analysis of allelic frequency, there was no significant association [chi sq=3.19, OR=0.49, p=0.07]. Among all the risk factors, betel quid [OR=9.79, chi square=35.0, p<0.05], tobacco [OR=2.95, chi square=10.6, p<0.05], and smoking [OR=3.23, chi square=10.1, p<0.05] showed significant differences between consumers and non-consumers regarding ECA development. Alcohol did not show any significant association independently [OR=1.34, chi square=0.69, p=0.4]. Conclusions: The present study provides marked evidence that polymorphisms of the HSP70 A1B and HSP70 A1L genes are associated with the development of ECA in a population in Northeast India, with A1B having the stronger influence. Betel quid consumption was found to be a highly significant risk factor, followed by smoking and tobacco chewing. Although alcohol was not a potent risk factor independently, alcohol consumption along with tobacco, smoking and betel nut was found to contribute to the development of ECA.
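
The p-values quoted in this abstract can be related to the reported chi-square statistics with a short sketch. The degrees of freedom are an assumption, since the abstract does not state them: 1 for the allelic and risk-factor comparisons (2x2 tables) and 2 for the three-genotype comparisons. The helper name is hypothetical, and only the closed-form cases df = 1 and df = 2 are handled.

```python
import math


def chi2_upper_tail(x, df):
    """Upper-tail probability of a chi-square variable (closed forms for df 1 and 2 only)."""
    if df == 1:
        return math.erfc(math.sqrt(x / 2.0))
    if df == 2:
        return math.exp(-x / 2.0)
    raise ValueError("this sketch handles only df = 1 or df = 2")


# Statistics as reported in the abstract; the df values are assumed, not reported.
p_a1l_allele = chi2_upper_tail(3.19, 1)    # reported as p=0.07
p_alcohol = chi2_upper_tail(0.69, 1)       # reported as p=0.4
p_a1l_genotype = chi2_upper_tail(7.02, 2)  # reported as p<0.05
print(p_a1l_allele, p_alcohol, p_a1l_genotype)
```

Under these assumed degrees of freedom the computed values agree with the rounded p-values given in the abstract.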

Bag of Visual Words Method based on PLSA and Chi-Square Model for Object Category

  • Zhao, Yongwei;Peng, Tianqiang;Li, Bicheng;Ke, Shengcai
    • KSII Transactions on Internet and Information Systems (TIIS) / v.9 no.7 / pp.2633-2648 / 2015
  • The problems of visual-word synonymy and ambiguity always exist in conventional object categorization methods based on the bag of visual words (BoVW) model. Besides, noisy visual words, the so-called "visual stop-words", degrade the semantic resolution of the visual dictionary. In view of this, a novel bag of visual words method based on PLSA and a chi-square model is proposed for object categorization. Firstly, Probabilistic Latent Semantic Analysis (PLSA) is used to analyze the semantic co-occurrence probability of visual words, infer the latent semantic topics in images, and obtain the latent topic distributions induced by the words. Secondly, the KL divergence is adopted to measure the semantic distance between visual words, which yields semantically related homoionyms. Then, an adaptive soft-assignment strategy is used to realize the soft mapping between SIFT features and some homoionyms. Finally, the chi-square model is introduced to eliminate the "visual stop-words" and reconstruct the visual vocabulary histograms. Moreover, an SVM (Support Vector Machine) is applied to accomplish object classification. Experimental results indicate that the synonymy and ambiguity problems of visual words can be overcome effectively. The discriminative ability of the visual semantic resolution and the object classification performance are both substantially improved compared with traditional methods.

Pearson-type Chi-square Test on the Joint Orientations from Different Depths in Boreholes (시추공 영상자료와 카이제곱 검정을 이용한 절리 방향성의 수직적 변화양상에 관한 정량적 평가)

  • Kim, Ki-Seog;Park, Young-Do;Park, Yeon-Jun
    • Tunnel and Underground Space / v.18 no.3 / pp.185-193 / 2008
  • We have carried out Pearson-type chi-square tests on the orientation data of joints from different depths in order to assess the homogeneity of joint orientations obtained from a borehole. The orientation data were collected from two non-foliated massive granitic gneisses in South Korea, because the orientations of joints in folded metamorphic rocks, for example, are controlled by foliation and also change as the orientation of the foliation changes through folding. Borehole images were used to analyze the orientations of individual joints. The orientation data were subdivided into upper level data and lower level data. The data from these two levels were plotted on a patch net consisting of 21 orientation patches. Then, the two patterns on the patch net were analyzed using a contingency table. From the chi-square test on the data collected from the two sites, we found that some data sets show statistically meaningful differences in joint orientations. Since joints are one of the important parameters in determining the physical properties of rock masses, in situ investigation of joints is desirable in geotechnical investigation and also in the design of subsurface structures (e.g. tunnels and underground storage facilities).
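
The contingency-table comparison of the upper-level and lower-level patterns can be sketched as a Pearson chi-square test of homogeneity. The function name and the counts below are hypothetical; the study uses 21 orientation patches, whereas this toy table has only 3 columns. Patches that are empty at both levels would need to be dropped first, since they give a zero expected count.

```python
def contingency_chi2(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


# Hypothetical joint counts per orientation patch: row 0 = upper level, row 1 = lower level.
chi2, df = contingency_chi2([[10, 20, 30], [30, 20, 10]])
print(chi2, df)
```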

A Study on the Emission Characteristics and Prediction of VOCs (Volatile Organic Compounds) using Small Chamber Method (소형챔버법을 이용한 휘발성유기화합물(VOCs) 방출특성 및 예측에 관한 연구)

  • Pang, Seung-Ki;Sohn, Jang-Yeul;Lee, Kwang-Ho
    • KIEAE Journal / v.4 no.4 / pp.11-18 / 2004
  • In this study, a measurement system was developed for measuring pollutants emitted from building materials, and specimens were made with concrete, gypsum board, mortar and wall paper. The characteristics of VOCs and the TVOC concentration and Emission Factor as functions of time were assessed, and the following conclusions were drawn. (1) In predicting the TVOC concentration decrease of specimen 7, with the wall paper attached to the concrete, the graph becomes linear when the y-axis values are converted to a log scale, and the prediction equation can be expressed as $y = 34906\,e^{-0.0093 \cdot \mathrm{time}}$. Moreover, the chi-square value was 0.83, a relatively high value, indicating that the TVOC concentration can be properly predicted if the same materials are used indoors. (2) In predicting the VOCs Emission Factor decrease of specimen 7, the prediction equation can be expressed as $EF = 15111\,e^{-0.0093 \cdot \mathrm{time}}$, and the chi-square value was 0.83. (3) In predicting the TVOC concentration decrease of specimen 7, the prediction equation can be taken to be $y = 254323\,(1-e^{-0.1046 \cdot \mathrm{time}})$, and the chi-square value was 0.994, a significantly high value, indicating that the indoor TVOC concentration can be properly predicted if the same materials are used indoors. Furthermore, predicting the concentration decrease from the cumulative value of the hourly measured concentration is considered more accurate than using the hourly measured value directly. (4) In predicting the Emission Factor decrease with cumulative hourly Emission Factor data, the chi-square value was higher than when the hourly Emission Factor data were used directly. Therefore, predicting the Emission Factor with cumulative hourly data can provide a more reliable prediction equation than using the hourly data directly.
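
The log-linearization mentioned in conclusion (1) can be sketched as an ordinary least-squares fit of ln(y) against time. This is an assumption about the fitting approach, not the authors' procedure. The function name is hypothetical, and the data points are noise-free values generated from the equation quoted in the abstract rather than measurements.

```python
import math


def fit_exponential(times, values):
    """Least-squares fit of ln(y) = ln(a) - k * t; returns (a, k) for y = a * exp(-k * t)."""
    logs = [math.log(v) for v in values]
    n = len(times)
    t_mean = sum(times) / n
    log_mean = sum(logs) / n
    slope = sum((t - t_mean) * (l - log_mean) for t, l in zip(times, logs)) / sum(
        (t - t_mean) ** 2 for t in times
    )
    intercept = log_mean - slope * t_mean
    return math.exp(intercept), -slope


# Synthetic, noise-free points from y = 34906 * exp(-0.0093 * time), as quoted in the abstract.
times = list(range(0, 200, 10))
values = [34906 * math.exp(-0.0093 * t) for t in times]
a, k = fit_exponential(times, values)
print(a, k)
```

With real measurements the fit would not be exact, and the residuals on the log scale would indicate how well the single-exponential form describes the decay.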