• 제목/요약/키워드: Data Bias

검색결과 1,738건 처리시간 0.029초

학습과 예측의 유전 제어: 플라즈마 식각공정 데이터 모델링에의 응용 (Genetic Control of Learning and Prediction: Application to Modeling of Plasma Etch Process Data)

  • 우형수;곽관웅;김병환
    • 제어로봇시스템학회논문지
    • /
    • 제13권4호
    • /
    • pp.315-319
    • /
    • 2007
  • A technique to model plasma processes was presented. This was accomplished by combining the backpropagation neural network (BPNN) and genetic algorithm (GA). Particularly, the GA was used to optimize five training factor effects by balancing the training and test errors. The technique was evaluated with the plasma etch data, characterized by a face-centered Box Wilson experiment. The etch outputs modeled include Al etch rate, AI selectivity, DC bias, and silica profile angle. Scanning electron microscope was used to quantify the etch outputs. For comparison, the etch outputs were modeled in a conventional fashion. GABPNN models demonstrated a considerable improvement of more than 25% for all etch outputs only but he DC bias. About 40% improvements were even achieved for the profile angle and AI etch rate. The improvements demonstrate that the presented technique is effective to improving BPNN prediction performance.

집단지성(Collective Intelligence)과 의사결정의 편향성 (Collective Intelligence and Human Decision Bias)

  • 한주희;신경식;채상미
    • Journal of Information Technology Applications and Management
    • /
    • 제22권2호
    • /
    • pp.113-122
    • /
    • 2015
  • Collective intelligence can be an influential factor of decision-making based on collaboration and information exchange between individuals. Our study explores whether collective intelligence can mitigate the loss aversion effect, bias and error in human judgment, and collective intelligence in online communities can reduce the loss aversion effect. Our community settings display both individual-level and group-level loss aversion effect, investigate effective collective intelligence characteristics like investment commitment, participant experience. Using a multi-method approach our research comprises a web-based experiment with 100 participants investing 3 situations from a real-world community, data from a survey measuring loss aversion behavior of participants. The results suggest the loss aversion effect mitigates under the online-circumstance. Overall, our results suggest that, while collective intelligence mitigates the loss aversion effect, participants do not transfer these results to other settings.

Pooling shrinkage estimator of reliability for exponential failure model using the sampling plan (n, C, T)

  • Al-Hemyari, Z.A.;Jehel, A.K.
    • International Journal of Reliability and Applications
    • /
    • 제12권1호
    • /
    • pp.61-77
    • /
    • 2011
  • One of the most important problems in the estimation of the parameter of the failure model, is the cost of experimental sampling units, which can be reduced by using any prior information available about ${\theta}$, and devising a two-stage pooling shrunken estimation procedure. We have proposed an estimator of the reliability function (R(t)) of the exponential model using two-stage time censored data when a prior value about the unknown parameter (${\theta}$) is available from the past. To compare the performance of the proposed estimator with the classical estimator, computer intensive calculations for bias, mean squared error, relative efficiency, expected sample size and percentage of the overall sample size saved expressions, were done for varying the constants involved in the proposed estimator (${\tilde{R}}$(t)).

  • PDF

Minimum Hellinger Distance Estimation and Minimum Density Power Divergence Estimation in Estimating Mixture Proportions

  • Pak, Ro-Jin
    • Journal of the Korean Data and Information Science Society
    • /
    • 제16권4호
    • /
    • pp.1159-1165
    • /
    • 2005
  • Basu et al. (1998) proposed a new density-based estimator, called the minimum density power divergence estimator (MDPDE), which avoid the use of nonparametric density estimation and associated complication such as bandwidth selection. Woodward et al. (1995) examined the minimum Hellinger distance estimator (MHDE), proposed by Beran (1977), in the case of estimation of the mixture proportion in the mixture of two normals. In this article, we introduce the MDPDE for a mixture proportion, and show that both the MDPDE and the MHDE have the same asymptotic distribution at a model. Simulation study identifies some cases where the MHDE is consistently better than the MDPDE in terms of bias.

  • PDF

기혼여성의 노동공급 결정요인에 관한 연구 (An Analysis on the Determinants of Labor Supply for Married Women)

  • 김지경;조유현
    • 대한가정학회지
    • /
    • 제39권2호
    • /
    • pp.15-24
    • /
    • 2001
  • The purpose of this research was to investigate the factors affecting the labor supply of married women. Based on the theoretical review of the process for the labor supply of married women and the review of previous research, the emperical specification was deduced as a function of husband's income, assets, education and age, the number of children and home ownership. The data of this research was collected with questionnaire in 1998. The data consisted of the answers by 200 married women. For the measurement of the emperical specification, Logit, Tobit, and Selection Bias Corrected Regression which modifies selection bias were used. Although several different discussions can made depending on the measurement method, the emperical result of this research showed that the labor supply of married women is explained by husband's income, assets, the level of education and work experience.

  • PDF

Estimation of Denominators- a New Approach for Calculating of Various Rates in Cancer Registries

  • Haroon, A.S.;Gupta, S.M.;Tyagi, B.B.;Farhat, J.
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제13권7호
    • /
    • pp.3229-3232
    • /
    • 2012
  • In this study, cancer incidence data were assessed to provide various rates of five year age groups for a given year, lying between two census years. The individual exponential growth rate method is most useful in both population-based and non-population cased cancer registries in India to estimate the population by five yearly age groups and also find the rates of crude rates, age standard rates and cumulative rates. This method has been shown to endure from bias and often results sacrificing the overall growth rate and correction factor must be needful in five year age group population to maintain it. A second method, the difference distribution method is also able to maintain the overall growth rate and overcome the bias in estimation of five yearly age group populations. From this point of view these methods serving a new technique for population estimation by five yearly age groups for inter census years.

On the Negative Estimates of Direct and Maternal Genetic Correlation - A Review

  • Lee, C.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제15권8호
    • /
    • pp.1222-1226
    • /
    • 2002
  • Estimates of genetic correlation between direct and maternal effects for weaning weight of beef cattle are often negative in field data. The biological existence of this genetic antagonism has been the point at issue. Some researchers perceived such negative estimate to be an artifact from poor modeling. Recent studies on sources affecting the genetic correlation estimates are reviewed in this article. They focus on heterogeneity of the correlation by sex, selection bias caused from selective reporting, selection bias caused from splitting data by sex, sire by year interaction variance, and sire misidentification and inbreeding depression as factors contributing sire by year interaction variance. A biological justification of the genetic antagonism is also discussed. It is proposed to include the direct-maternal genetic covariance in the analytical models.

GPS 주파수간 편이 추정정확도 분석 (Estimation Accuracy Analysis of GPS Inter-Frequency Biases)

  • 김민우;김정래;허문범
    • 항공우주시스템공학회지
    • /
    • 제4권1호
    • /
    • pp.19-22
    • /
    • 2010
  • The accuracy and integrity of global navigation satellite systems (GNSS) can be improved by using GNSS augmentation systems. Large ionospheric spatial gradient, during ionosphere storm, is a major threat for using GNSS augmentation systems by increasing the spatial decorrelation between a reference system and users. Ionosphere decorrelation behavior can be analyzed by using dual frequency GPS data. GNSS receivers have their own biases, called inter-frequency bias (IFB) between dual(P1 and P2) frequencies and they must be accurately estimated before computing ionosphere delays. GPS network data in Korea is used to compute each receiver's IFB, and their estimation accuracy and variability are analyzed. IFB estimation methodology to apply for ionosphere gradient analysis is discussed.

  • PDF

A Study on Split Variable Selection Using Transformation of Variables in Decision Trees

  • Chung, Sung-S.;Lee, Ki-H.;Lee, Seung-S.
    • Journal of the Korean Data and Information Science Society
    • /
    • 제16권2호
    • /
    • pp.195-205
    • /
    • 2005
  • In decision tree analysis, C4.5 and CART algorithm have some problems of computational complexity and bias on variable selection. But QUEST algorithm solves these problems by dividing the step of variable selection and split point selection. When input variables are continuous, QUEST algorithm uses ANOVA F-test under the assumption of normality and homogeneity of variances. In this paper, we investigate the influence of violation of normality assumption and effect of the transformation of variables in the QUEST algorithm. In the simulation study, we obtained the empirical powers of variable selection and the empirical bias of variable selection after transformation of variables having various type of underlying distributions.

  • PDF

Comparison of Latin Hypercube Sampling and Simple Random Sampling Applied to Neural Network Modeling of HfO2 Thin Film Fabrication

  • Lee, Jung-Hwan;Ko, Young-Don;Yun, Il-Gu;Han, Kyong-Hee
    • Transactions on Electrical and Electronic Materials
    • /
    • 제7권4호
    • /
    • pp.210-214
    • /
    • 2006
  • In this paper, two sampling methods which are Latin hypercube sampling (LHS) and simple random sampling were. compared to improve the modeling speed of neural network model. Sampling method was used to generate initial weights and bias set. Electrical characteristic data for $HfO_2$ thin film was used as modeling data. 10 initial parameter sets which are initial weights and bias sets were generated using LHS and simple random sampling, respectively. Modeling was performed with generated initial parameters and measured epoch number. The other network parameters were fixed. The iterative 20 minimum epoch numbers for LHS and simple random sampling were analyzed by nonparametric method because of their nonnormality.