• Title/Summary/Keyword: Statistical Information

Search Result 6,996, Processing Time 0.038 seconds

The Study on Application of Data Gathering for the site and Statistical analysis process (초기 데이터 분석 로드맵을 적용한 사례 연구)

  • Choi, Eun-Hyang;Ree, Sang-Bok
    • Proceedings of the Korean Society for Quality Management Conference
    • /
    • 2010.04a
    • /
    • pp.226-234
    • /
    • 2010
  • In this thesis, we present process that remove mistake of data before statistical analysis. If field data which is not simple examination about validity of data, we cannot believe analyzed statistics information. As statistical analysis information is produced based on data to be input in statistical analysis process, the data to be input should be free of error. In this paper, we study the application of statistical analysis road map that can enhance application on site by organizing basic theory and approaching on initial data exploratory phase, essential step before conducting statistical analysis. Therefore, access to statistical analysis can be enhanced and reliability on result of analysis can be secured by conducting correct statistical analysis.

  • PDF

On Line LS-SVM for Classification

  • Kim, Daehak;Oh, KwangSik;Shim, Jooyong
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.595-601
    • /
    • 2003
  • In this paper we propose an on line training method for classification based on least squares support vector machine. Proposed method enables the computation cost to be reduced and the training to be peformed incrementally, With the incremental formulation of an inverse matrix in optimization problem, current information and new input data can be used for building the new inverse matrix for the estimation of the optimal bias and Lagrange multipliers, so the large scale matrix inversion operation can be avoided. Numerical examples are included which indicate the performance of proposed algorithm.

Automatic Mapping Between Large-Scale Heterogeneous Language Resources for NLP Applications: A Case of Sejong Semantic Classes and KorLexNoun for Korean

  • Park, Heum;Yoon, Ae-Sun
    • Language and Information
    • /
    • v.15 no.2
    • /
    • pp.23-45
    • /
    • 2011
  • This paper proposes a statistical-based linguistic methodology for automatic mapping between large-scale heterogeneous languages resources for NLP applications in general. As a particular case, it treats automatic mapping between two large-scale heterogeneous Korean language resources: Sejong Semantic Classes (SJSC) in the Sejong Electronic Dictionary (SJD) and nouns in KorLex. KorLex is a large-scale Korean WordNet, but it lacks syntactic information. SJD contains refined semantic-syntactic information, with semantic labels depending on SJSC, but the list of its entry words is much smaller than that of KorLex. The goal of our study is to build a rich language resource by integrating useful information within SJD into KorLex. In this paper, we use both linguistic and statistical methods for constructing an automatic mapping methodology. The linguistic aspect of the methodology focuses on the following three linguistic clues: monosemy/polysemy of word forms, instances (example words), and semantically related words. The statistical aspect of the methodology uses the three statistical formulae ${\chi}^2$, Mutual Information and Information Gain to obtain candidate synsets. Compared with the performance of manual mapping, the automatic mapping based on our proposed statistical linguistic methods shows good performance rates in terms of correctness, specifically giving recall 0.838, precision 0.718, and F1 0.774.

  • PDF

A Comparison Study on Statistical Modeling Methods (통계모델링 방법의 비교 연구)

  • Noh, Yoojeong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.5
    • /
    • pp.645-652
    • /
    • 2016
  • The statistical modeling of input random variables is necessary in reliability analysis, reliability-based design optimization, and statistical validation and calibration of analysis models of mechanical systems. In statistical modeling methods, there are the Akaike Information Criterion (AIC), AIC correction (AICc), Bayesian Information Criterion, Maximum Likelihood Estimation (MLE), and Bayesian method. Those methods basically select the best fitted distribution among candidate models by calculating their likelihood function values from a given data set. The number of data or parameters in some methods are considered to identify the distribution types. On the other hand, the engineers in a real field have difficulties in selecting the statistical modeling method to obtain a statistical model of the experimental data because of a lack of knowledge of those methods. In this study, commonly used statistical modeling methods were compared using statistical simulation tests. Their advantages and disadvantages were then analyzed. In the simulation tests, various types of distribution were assumed as populations and the samples were generated randomly from them with different sample sizes. Real engineering data were used to verify each statistical modeling method.

Data Processing Performance Measurement by Protocol Changed in Power SCADA System (전력감시제어설비의 프로토콜 변경에 따른 데이터처리 성능측정)

  • Lee Yong-Doo;Choi Seong-Man;Yoo Cheol-Jung;Chang Ok-Bae
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11b
    • /
    • pp.517-519
    • /
    • 2005
  • 전력설비의 증가 및 전력감시제어설비의 대용량 및 통합화에 따른 데이터처리 및 데이터통신에 대한 정보 요구량 증가와 이기종 컴퓨터간 및 네트워크 상호간의 높은 데이터처리 속도를 요구하고 있는 실정이다. 이러한 전력설비의 증가로 인하여 전력감시제어설비 또한 새로운 변화와 시스템 개선이 필요하다고 본다. 이에 따라 EMS의 부담을 경감시키고 공급신뢰도의 향상을 위하여 변전소들이 점차 무인화되면서 원격소장치의 프로토콜 변경으로 인한 트래픽의 발생정도 및 트래픽량에 의한 응답처리속도에 대하여 알아 보았다. 이러한 결과 고속 대용량화의 한계에 대비하는 대처방안임을 확신하고 전력감시제어설비 시스템의 안전성을 극대화할 수 있는 계기를 마련하고자 한다.

  • PDF

The Errors of Population Projections for Korea on Korean Information Statistical System

  • Yoon, Yong-Hwa;Kim, Jong-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.2
    • /
    • pp.419-427
    • /
    • 2007
  • Recently, Korean National Statistical Office submits the results of population projections for Korea from 1960 to 2050 year. The purpose of this paper is to suggest the reasonable assumptions for the survey of population, and then to detect the errors of the surveyed population (1960-2005) on Korean Information Statistical System.

  • PDF

Two Properties of Ancillary Statistics

  • Lee, Yong-Goo
    • Journal of the Korean Statistical Society
    • /
    • v.17 no.2
    • /
    • pp.93-100
    • /
    • 1988
  • Two properties of ancillary statistics are considered. One is to find a role of ancillary statistics in the statistical inference by showing that the ancillary statistic can recover the lost information and to give a criteria for comparing the conditional inference with unconditional inference. The other is to find an ancillary statistic of translation model and its relationship with observed Fisher information.

  • PDF

Comparison between nonlinear statistical time series forecasting and neural network forecasting

  • Inkyu;Cheolyoung;Sungduck
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.1
    • /
    • pp.87-96
    • /
    • 2000
  • Nonlinear time series prediction is derived and compared between statistic of modeling and neural network method. In particular mean squared errors of predication are obtained in generalized random coefficient model and generalized autoregressive conditional heteroscedastic model and compared with them by neural network forecasting.

  • PDF

On the Design of Statistical Software in the Network Environment

  • Han, Beom-Soo;Ahn, Jeong-Yong;Han, Kyung-Soo
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.1
    • /
    • pp.167-174
    • /
    • 2002
  • Computer network provides a powerful infrastructure for information sharing and the development of the statistical software with new concepts. In this paper, we discuss the design concepts of the statistical software in the network environment.