• Title/Summary/Keyword: Regression Analysis Method

Search Result 4,614, Processing Time 0.03 seconds

Modeling of Indium Tin Oxide(ITO) Film Deposition Process using Neural Network (신경회로망을 이용한 ITO 박막 성장 공정의 모형화)

  • Min, Chul-Hong;Park, Sung-Jin;Yoon, Neung-Goo;Kim, Tae-Seon
    • Journal of the Korean Institute of Electrical and Electronic Material Engineers
    • /
    • v.22 no.9
    • /
    • pp.741-746
    • /
    • 2009
  • Compare to conventional Indium Tin Oxide (ITO) film deposition methods, cesium assisted sputtering method has been shown superior electrical, mechanical, and optical film properties. However, it is not easy to use cesium assisted sputtering method since ITO film properties are very sensitive to Cesium assisted equipment condition but their mechanism is not yet clearly defined physically or mathematically. Therefore, to optimize deposited ITO film characteristics, development of accurate and reliable process model is essential. For this, in this work, we developed ITO film deposition process model using neural networks and design of experiment (DOE). Developed model prediction results are compared with conventional statistical regression model and developed neural process model has been shown superior prediction results on modeling of ITO film thickness, sheet resistance, and transmittance characteristics.

Classification and Regression Tree Analysis for Molecular Descriptor Selection and Binding Affinities Prediction of Imidazobenzodiazepines in Quantitative Structure-Activity Relationship Studies

  • Atabati, Morteza;Zarei, Kobra;Abdinasab, Esmaeil
    • Bulletin of the Korean Chemical Society
    • /
    • v.30 no.11
    • /
    • pp.2717-2722
    • /
    • 2009
  • The use of the classification and regression tree (CART) methodology was studied in a quantitative structure-activity relationship (QSAR) context on a data set consisting of the binding affinities of 39 imidazobenzodiazepines for the α1 benzodiazepine receptor. The 3-D structures of these compounds were optimized using HyperChem software with semiempirical AM1 optimization method. After optimization a set of 1481 zero-to three-dimentional descriptors was calculated for each molecule in the data set. The response (dependent variable) in the tree model consisted of the binding affinities of drugs. Three descriptors (two topological and one 3D-Morse descriptors) were applied in the final tree structure to describe the binding affinities. The mean relative error percent for the data set is 3.20%, compared with a previous model with mean relative error percent of 6.63%. To evaluate the predictive power of CART cross validation method was also performed.

Fuzzy Regression Analysis by Fuzzy Neual Networks: Application to Quality Evaluation Problem (퍼지 신경망에 의한 퍼지 회귀분석:품질 평가 문제에의 응용)

  • 권기택
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.4 no.2
    • /
    • pp.7-13
    • /
    • 1999
  • This paper propose a fuzzy regression method using fuzzy neural networks when a membership value is attached to each input -output pair. First, an architecture of fuzzy neural networks with fuzzy weights and fuzzy biases is shown. Next, a cost function is defined using the fuzzy output from the fuzzy neural network and the corresponding target output with a membership value. A learning algorithm is derived from the cost function. The derived learning algorithm trains the fuzzy neural network so that the level set of the fuzzy output includes the target output. Last, the proposed method is applied to the quality evaluation problem of injection molding

  • PDF

Credit Scoring Using Splines (스플라인을 이용한 신용 평점화)

  • Koo Ja-Yong;Choi Daewoo;Choi Min-Sung
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.3
    • /
    • pp.543-553
    • /
    • 2005
  • Linear logistic regression is one of the most widely used method for credit scoring in credit risk management. This paper deals with credit scoring using splines based on Logistic regression. Linear splines and an automatic basis selection algorithm are adopted. The final model is an example of the generalized additive model. A simulation using a real data set is used to illustrate the performance of the spline method.

Linear regression under log-concave and Gaussian scale mixture errors: comparative study

  • Kim, Sunyul;Seo, Byungtae
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.6
    • /
    • pp.633-645
    • /
    • 2018
  • Gaussian error distributions are a common choice in traditional regression models for the maximum likelihood (ML) method. However, this distributional assumption is often suspicious especially when the error distribution is skewed or has heavy tails. In both cases, the ML method under normality could break down or lose efficiency. In this paper, we consider the log-concave and Gaussian scale mixture distributions for error distributions. For the log-concave errors, we propose to use a smoothed maximum likelihood estimator for stable and faster computation. Based on this, we perform comparative simulation studies to see the performance of coefficient estimates under normal, Gaussian scale mixture, and log-concave errors. In addition, we also consider real data analysis using Stack loss plant data and Korean labor and income panel data.

Enhancement of Text Classification Method (텍스트 분류 기법의 발전)

  • Shin, Kwang-Seong;Shin, Seong-Yoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.155-156
    • /
    • 2019
  • Traditional machine learning based emotion analysis methods such as Classification and Regression Tree (CART), Support Vector Machine (SVM), and k-nearest neighbor classification (kNN) are less accurate. In this paper, we propose an improved kNN classification method. Improved methods and data normalization achieve the goal of improving accuracy. Then, three classification algorithms and an improved algorithm were compared based on experimental data.

  • PDF

A Bayesian joint model for continuous and zero-inflated count data in developmental toxicity studies

  • Hwang, Beom Seuk
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.2
    • /
    • pp.239-250
    • /
    • 2022
  • In many applications, we frequently encounter correlated multiple outcomes measured on the same subject. Joint modeling of such multiple outcomes can improve efficiency of inference compared to independent modeling. For instance, in developmental toxicity studies, fetal weight and number of malformed pups are measured on the pregnant dams exposed to different levels of a toxic substance, in which the association between such outcomes should be taken into account in the model. The number of malformations may possibly have many zeros, which should be analyzed via zero-inflated count models. Motivated by applications in developmental toxicity studies, we propose a Bayesian joint modeling framework for continuous and count outcomes with excess zeros. In our model, zero-inflated Poisson (ZIP) regression model would be used to describe count data, and a subject-specific random effects would account for the correlation across the two outcomes. We implement a Bayesian approach using MCMC procedure with data augmentation method and adaptive rejection sampling. We apply our proposed model to dose-response analysis in a developmental toxicity study to estimate the benchmark dose in a risk assessment.

A Study on a Working Pattern Analysis Prototype using Correlation Analysis and Linear Regression Analysis in Welding BigData Environment (용접 빅데이터 환경에서 상관분석 및 회귀분석을 이용한 작업 패턴 분석 모형에 관한 연구)

  • Jung, Se-Hoon;Sim, Chun-Bo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.9 no.10
    • /
    • pp.1071-1078
    • /
    • 2014
  • Recently, information providing service using Big Data is being expanded. Big Data processing technology is actively being academic research to an important issue in the IT industry. In this paper, we analyze a skilled pattern of welder through Big Data analysis or extraction of welding based on R programming. We are going to reduce cost on welding work including weld quality, weld operation time by providing analyzed results non-skilled welder. Welding has a problem that should be invested long time to be a skilled welder. For solving these issues, we apply connection rules algorithms and regression method to much pattern variable for welding pattern analysis of skilled welder. We analyze a pattern of skilled welder according to variable of analyzed rules by analyzing top N rules. In this paper, we confirmed the pattern structure of power consumption rate and wire consumption length through experimental results of analyzed welding pattern analysis.

A Verification on the Statistical Significance between Groups Using Regression Analysis (회귀분석을 이용한 집단 간 통계적 유의성 검증에 관한 연구)

  • Nam, Soo-Tai;Shin, Seong-Yoon;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2019.05a
    • /
    • pp.163-164
    • /
    • 2019
  • In this study, we compared the differences between groups in a model that investigates the effect of smartphone users on the intent to use continuously. There are various methodologies for group difference analysis, but in this study, we try to verify the size comparison of regression analysis $R^2$. In order to analyze the difference between groups, we try to prove through hypothesis test whether there is a meaningful difference in the intention of continuous use of Korean and Chinese smartphone users collected through previous research. The results of the analysis are useful as a method to determine whether smartphone users in China and Korea are aware of differences or not. According to this procedure, first, the formula for calculating Z-transformation of Fisher and Z-score test statistic calculation formula were used. However, this methodology is also used in the verification of control effect using correlation coefficient. Also, the theoretical implications are presented based on the analysis results.

  • PDF

Curriculum of Basic Data Science Practices for Non-majors (비전공자 대상 기초 데이터과학 실습 커리큘럼)

  • Hur, Kyeong
    • Journal of Practical Engineering Education
    • /
    • v.12 no.2
    • /
    • pp.265-273
    • /
    • 2020
  • In this paper, to design a basic data science practice curriculum as a liberal arts subject for non-majors, we proposed an educational method using an Excel(spreadsheet) data analysis tool. Tools for data collection, data processing, and data analysis include Excel, R, Python, and Structured Query Language (SQL). When it comes to practicing data science, R, Python and SQL need to understand programming languages and data structures together. On the other hand, the Excel tool is a data analysis tool familiar to the general public, and it does not have the burden of learning a programming language. And if you practice basic data science practice with Excel, you have the advantage of being able to concentrate on acquiring data science content. In this paper, a basic data science practice curriculum for one semester and weekly Excel practice contents were proposed. And, to demonstrate the substance of the educational content, examples of Linear Regression Analysis were presented using Excel data analysis tools.