• Title/Summary/Keyword: conditional test

Search Result 192, Processing Time 0.023 seconds

TAKES: Two-step Approach for Knowledge Extraction in Biomedical Digital Libraries

  • Song, Min
    • Journal of Information Science Theory and Practice
    • /
    • v.2 no.1
    • /
    • pp.6-21
    • /
    • 2014
  • This paper proposes a novel knowledge extraction system, TAKES (Two-step Approach for Knowledge Extraction System), which integrates advanced techniques from Information Retrieval (IR), Information Extraction (IE), and Natural Language Processing (NLP). In particular, TAKES adopts a novel keyphrase extraction-based query expansion technique to collect promising documents. It also uses a Conditional Random Field-based machine learning technique to extract important biological entities and relations. TAKES is applied to biological knowledge extraction, particularly retrieving promising documents that contain Protein-Protein Interaction (PPI) and extracting PPI pairs. TAKES consists of two major components: DocSpotter, which is used to query and retrieve promising documents for extraction, and a Conditional Random Field (CRF)-based entity extraction component known as FCRF. The present paper investigated research problems addressing the issues with a knowledge extraction system and conducted a series of experiments to test our hypotheses. The findings from the experiments are as follows: First, the author verified, using three different test collections to measure the performance of our query expansion technique, that DocSpotter is robust and highly accurate when compared to Okapi BM25 and SLIPPER. Second, the author verified that our relation extraction algorithm, FCRF, is highly accurate in terms of F-Measure compared to four other competitive extraction algorithms: Support Vector Machine, Maximum Entropy, Single POS HMM, and Rapier.

On Effect of Nonnormality on Size of Test for Dimensionality in Discriminant Analysis

  • Changha Hwang
    • Communications for Statistical Applications and Methods
    • /
    • v.3 no.3
    • /
    • pp.25-30
    • /
    • 1996
  • In discriminant analysis the procedures commonly used to estimate the dimensionality involve testing a sequence of dimensionality hypotheses. There is a problem with the size of the test since dimensionality hypotheses are tested sequentially and thus they are actually conditional tests. The focus of this paper is to investigate in asymptotic sense what happens to the sequential testing procedure if the assumption of normality does not hold.

  • PDF

A NEW INDEX OF DIMENSIONALITY - DETECT

  • Kim, Hae-Rim
    • The Pure and Applied Mathematics
    • /
    • v.3 no.2
    • /
    • pp.141-154
    • /
    • 1996
  • A data-driven index of dimensionality for an educational or psychological test - DETECT, short for Dimensionality Evaluation To Enumerate Contributing Traits, is proposed in this paper. It is based on estimated conditional covariances of item pairs, given score on remaining test items. Its purpose is to detect whatever multidimensionality structure exists, especially in the case of approximate simple structure. It does so by assigning items to relatively dimensionally homogeneous clusters via attempted maximization of the DETECT over all possible item cluster partitions. The performance of DETECT is studied through real and simulated data analyses.

  • PDF

Bootstrap-Based Test for Volatility Shifts in GARCH against Long-Range Dependence

  • Wang, Yu;Park, Cheolwoo;Lee, Taewook
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.5
    • /
    • pp.495-506
    • /
    • 2015
  • Volatility is a variation measure in finance for returns of a financial instrument over time. GARCH models have been a popular tool to analyze volatility of financial time series data since Bollerslev (1986) and it is said that volatility is highly persistent when the sum of the estimated coefficients of the squared lagged returns and the lagged conditional variance terms in GARCH models is close to 1. Regarding persistence, numerous methods have been proposed to test if such persistency is due to volatility shifts in the market or natural fluctuation explained by stationary long-range dependence (LRD). Recently, Lee et al. (2015) proposed a residual-based cumulative sum (CUSUM) test statistic to test volatility shifts in GARCH models against LRD. We propose a bootstrap-based approach for the residual-based test and compare the sizes and powers of our bootstrap-based CUSUM test with the one in Lee et al. (2015) through simulation studies.

Goodness-of-fit test for mean and variance functions

  • Jung, Sin-Ho;Lee, Kee-Won
    • Journal of the Korean Statistical Society
    • /
    • v.26 no.2
    • /
    • pp.199-210
    • /
    • 1997
  • Using regression methods based on quasi-likelihood equation, one only needs to specify the conditional mean and variance functions for the response variable in the analysis. In this paper, an omnibus lack-of-fit test is proposed to test the validity of these two functions. Our test is consistent against the alternative under which either the mean or the variance is not the one specified in the null hypothesis. The large-sample null distribution of our test statistics can be approximated through simulations. Extensive numerical studies are performed to demonstrate that the new test preserves the prescribed type I error probability. Power comparisons are conducted to show the advantage of the new proposal.

  • PDF

An Analysis of the Absolute Vs. Conditional Convergency Hypothesis and the Determinants of Labor Productivity in Manufacturing Industries: The Korean Case (16개 광역시도별 제조업 부문에 대한 절대적 및 조건부 수렴가설 검증 및 생산성 결정요인 분석)

  • Park, Chuhwan;Shin, Kwang Ha
    • International Area Studies Review
    • /
    • v.17 no.4
    • /
    • pp.89-106
    • /
    • 2013
  • In this paper, we analysed the absolute and conditional convergency hypothesis and the determinants of productivity in manufacturing industries from 2000 to 2009 with 16 provinces and metro-cities by using panel analysis. In terms of convergency hypothesis test, the results show that both of the convergency hypothesis, the absolute vs. conditional hypothesis, reject the null hypothesis(H0) implying the labor productivity of the 16 province and metro-cities converged to the steady state equilibrium. Also, the speed of the absolute and conditional convergency for the 16 province and metro-cities are average 4.4% and 0.73% respectively. In addition, the results of the determinants of the labor productivity in manufacturing industry show that human capital and manufacturing location coefficient affect to the value- added per capita significantly, but government expenditure per capita doesn't affect to the value- added per capita. As for the total factor productivity, government expenditure per capita and fixed capital per capita are important factors, but research and development doesn't. Hence the government has to revise the balanced regional development policy to develop regional manufacturing industries for the vulnerable regions. Also, it requires more study regarding income disparities and productivity.

Comparison to Automobile Pilot Lamp by Accelerated Life Test (가속수명시험을 통한 자동차용 파일럿램프의 비교평가)

  • Shin, Min-Gyung;Wei, Shin-Hwan;Kim, Hyung-Min
    • Journal of Applied Reliability
    • /
    • v.8 no.2
    • /
    • pp.75-85
    • /
    • 2008
  • In this paper, we compared domestic with foreign pilot lamps installed on the instrument board or electronic modules (car audio, air-conditional system, etc.) of an automotive vehicle by an accelerated life test in order to estimate the life of domestic pilot lamps. An accelerated life test method was developed and the relation of the life and voltage stress was analyzed. The main results are as follows; i) $B_{10}$ life of pilot lamp is above 5,000 hours, vehicle travel time for 10 years. ii) the life of domestic pilot lamp is longer than that of foreign thing. iii) the life distribution of domestic pilot lamp is wider than that of foreign thing. iv) it is possible to promote import replacement of automobile pilot lamp.

  • PDF

A new extension of Lindley distribution: modified validation test, characterizations and different methods of estimation

  • Ibrahim, Mohamed;Yadav, Abhimanyu Singh;Yousof, Haitham M.;Goual, Hafida;Hamedani, G.G.
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.5
    • /
    • pp.473-495
    • /
    • 2019
  • In this paper, a new extension of Lindley distribution has been introduced. Certain characterizations based on truncated moments, hazard and reverse hazard function, conditional expectation of the proposed distribution are presented. Besides, these characterizations, other statistical/mathematical properties of the proposed model are also discussed. The estimation of the parameters is performed through different classical methods of estimation. Bayes estimation is computed under gamma informative prior under the squared error loss function. The performances of all estimation methods are studied via Monte Carlo simulations in mean square error sense. The potential of the proposed model is analyzed through two data sets. A modified goodness-of-fit test using the Nikulin-Rao-Robson statistic test is investigated via two examples and is observed that the new extension might be used as an alternative lifetime model.

Conditional Probability Based Early Termination of Recursive Coding Unit Structures in HEVC (HEVC의 재귀적 CU 구조에 대한 조건부 확률 기반 고속 탐색 알고리즘)

  • Han, Woo-Jin
    • Journal of Broadcast Engineering
    • /
    • v.17 no.2
    • /
    • pp.354-362
    • /
    • 2012
  • Recently, High Efficiency Video Coding (HEVC) is under development jointly by MPEG and ITU-T for the next international video coding standard. Compared to the previous standards, HEVC supports variety of splitting units, such as coding unit (CU), prediction unit (PU), and transform unit (TU). Among them, it has been known that the recursive quadtree structure of CU can improve the coding efficiency while the encoding complexity is increased significantly. In this paper, a simple conditional probability to predict the early termination condition of recursive unit structure is introduced. The proposed conditional probability is estimated based on Bayes' formula from local statistics of rate-distortion costs in encoder. Experimental results show that the proposed method can reduce the total encoding time by about 32% according to the test configuration while the coding efficiency loss is 0.4%-0.5%. In addition, the encoding time can be reduced by 50% with 0.9% coding efficiency loss when the proposed method was used jointly with HM4.0 early CU termination algorithm.

3rd, 4th and 5th Graders' Probability Understanding (초등학교 3, 4, 5학년 학생들의 확률 이해 실태)

  • Yoon, Hye-Young;Lee, Kwang-Ho
    • Education of Primary School Mathematics
    • /
    • v.14 no.1
    • /
    • pp.69-79
    • /
    • 2011
  • The purpose of this study is to analyze 3rd, 4th and 5th graders' probability understanding and raise issues concerning instructional methods and search for the possibility of learning probability. For the purpose, a descriptive study through pencil-and-paper test regarding fairness, sample space, probability of event, probability comparison, independence and conditional probability was conducted. The following conclusions were drawn from the results obtained in this study. First, the 3rd, 4th, and 5th grade students scored the highest in the sample space questions. In descending order of skill, the students scored the highest in sample space following probability of events, fairness and probability comparison. Second, however, the level of independence understanding was low. There was no meaningful differences between grades and the conditional probability was the least understood. The independence is difficult to develop naturally according to cognitive development. The conditional probability recognizing the probability of an event changes in non-replacement situations was very difficult for these students. Third, there were significant differences between the 5th graders and the 3rd and 4th graders in the probability comparison questions. It shows that 5th graders understand the concept of proportion when they compare equal ratio probability of an event. The 3rd graers could do different ratio probability of an event more easily than equal ratio probability of an event after they were instructed on probability comparison.