• 제목/요약/키워드: stepwise variable selection

검색결과 53건 처리시간 0.023초

Prediction of Thermal Decomposition Temperature of Polymers Using QSPR Methods

  • Ajloo, Davood;Sharifian, Ali;Behniafar, Hossein
    • Bulletin of the Korean Chemical Society
    • /
    • 제29권10호
    • /
    • pp.2009-2016
    • /
    • 2008
  • The relationship between thermal decomposition temperature and structure of a new data set of eighty monomers of different polymers were studied by multiple linear regression (MLR). The stepwise method was used in order to variable selection. The best descriptors were selected from over 1400 descriptors including; topological, geometrical, electronic and hybrid descriptors. The effect of number of descriptors on the correlation coefficient (R) and F-ratio were considered. Two models were suggested, one model having four descriptors ($R^2$ = 0.894, $Q^2_{cv}$ = 0.900, F = 172.1) and other model involving 13 descriptors ($R^2$ = 0.956, $Q^2_{cv}$ = 0.956, F = 125.4).

A Study on the Fault Process and Equipment Analysis of Plastic Ball Grid Array Manufacturing Using Data-Mining Techniques

  • Sim, Hyun Sik
    • Journal of Information Processing Systems
    • /
    • 제16권6호
    • /
    • pp.1271-1280
    • /
    • 2020
  • The yield and quality of a micromanufacturing process are important management factors. In real-world situations, it is difficult to achieve a high yield from a manufacturing process because the products are produced through multiple nanoscale manufacturing processes. Therefore, it is necessary to identify the processes and equipment that lead to low yields. This paper proposes an analytical method to identify the processes and equipment that cause a defect in the plastic ball grid array (PBGA) during the manufacturing process using logistic regression and stepwise variable selection. The proposed method was tested with the lot trace records of a real work site. The records included the sequence of equipment that the lot had passed through and the number of faults of each type in the lot. We demonstrated that the test results reflect the real situation in a PBGA manufacturing process, and the major equipment parameters were then controlled to confirm the improvement in yield; the yield improved by approximately 20%.

플라즈마 정보인자 기반 가상계측을 통한 Si 식각률의 첫 장 효과 분석 (Analysis of First Wafer Effect for Si Etch Rate with Plasma Information Based Virtual Metrology)

  • 유상원;권지원
    • 반도체디스플레이기술학회지
    • /
    • 제20권4호
    • /
    • pp.146-150
    • /
    • 2021
  • Plasma information based virtual metrology (PI-VM) that predicts wafer-to-wafer etch rate variation after wet cleaning of plasma facing parts was developed. As input parameters, plasma information (PI) variables such as electron temperature, fluorine density and hydrogen density were extracted from optical emission spectroscopy (OES) data for etch plasma. The PI-VM model was trained by stepwise variable selection method and multi-linear regression method. The expected etch rate by PI-VM showed high correlation coefficient with measured etch rate from SEM image analysis. The PI-VM model revealed that the root cause of etch rate variation after the wet cleaning was desorption of hydrogen from the cleaned parts as hydrogen combined with fluorine and decreased etchant density and etch rate.

정신과 입원환자의 행동변화에 영향을 주는 요소에 관한 연구 (A Study of the Factor on Behavioral Change of the Psychiatric in-patient)

  • 이소우;김태경
    • 대한간호학회지
    • /
    • 제14권2호
    • /
    • pp.84-92
    • /
    • 1984
  • This article examined relationships between selected variables, such as demographic background, care, treatment variables, environmental characteristics, and patient's daily behavior and mood change. Relationship were determined between independent variabltherapeutic-rapeutie approach, demographic data, environmental management approach-,and dependent variable-patient's daily behavioral and mood change. 35 patients selected within some criteria in a psychiatric ward, were obserbed during 5 weeks by use of Wyatt's Behavior & Mood Rating Scale ac-cording to the object of the study. At the same time, the frequence of the care and treatment were collected. Criteria for sample selection and independent variables as an influential factor to the patient behavioral change, based on a literature revienw and clinical experiences. Pearson's correlation and multiple regression analysis were used to determine the influfntial factors to the patient behavioral change. Systematic reading (r=.8324), Psychiatrist's individual interview (r=.5764), tranquilizer (r=.3441) and hospitalization processing date (r=.4143) were related with patient's behavioral change. That is these 4 variables can be said to influence to the patient's behavior and mood. A stepwise multiple regression analysis of the effect of the independent varibles of systematic reading, psychintrists individual interview, tranquilizer and hospitalization processing date on the dependent variable, patient's behavioral change was carried out. Systematic reading with on R²of. 69 revealed to be the main influential factor to the patient's behavior and mood change, as the next factor psychiatrist individual interview. A total inclusion of these factors revealed a 73% prediction for the patient's behavior and mood change. But the most influential factor was the interaction of the systematic reading and psychiatrist's individual interview.

  • PDF

플라즈마 정보인자를 활용한 SiO2 식각 깊이 가상 계측 모델의 특성 인자 역할 분석 (Role of Features in Plasma Information Based Virtual Metrology (PI-VM) for SiO2 Etching Depth)

  • 장윤창;박설혜;정상민;유상원;김곤호
    • 반도체디스플레이기술학회지
    • /
    • 제18권4호
    • /
    • pp.30-34
    • /
    • 2019
  • We analyzed how the features in plasma information based virtual metrology (PI-VM) for SiO2 etching depth with variation of 5% contribute to the prediction accuracy, which is previously developed by Jang. As a single feature, the explanatory power to the process results is in the order of plasma information about electron energy distribution function (PIEEDF), equipment, and optical emission spectroscopy (OES) features. In the procedure of stepwise variable selection (SVS), OES features are selected after PIEEDF. Informative vector for developed PI-VM also shows relatively high correlation between OES features and etching depth. This is because the reaction rate of each chemical species that governs the etching depth can be sensitively monitored when OES features are used with PIEEDF. Securing PIEEDF is important for the development of virtual metrology (VM) for prediction of process results. The role of PIEEDF as an independent feature and the ability to monitor variation of plasma thermal state can make other features in the procedure of SVS more sensitive to the process results. It is expected that fault detection and classification (FDC) can be effectively developed by using the PI-VM.

회귀계수의 유의성 검정방법에 따른 설계강우량 시간분포 분석 (Temporal distritution analysis of design rainfall by significance test of regression coefficients)

  • 박진희;이재준
    • 한국수자원학회논문집
    • /
    • 제55권4호
    • /
    • pp.257-266
    • /
    • 2022
  • 국지성 호우 및 설계빈도 이상 강우의 증가로 침수피해가 매년 증가하고 있으며 이에 따라 홍수 조절 및 방어를 위한 수공구조물의 중요성이 증가하고 있다. 수공구조물은 목적과 성능에 따른 설계가 이루어지고 있고 홍수량이 중요한 산정 요소이나 국내에서는 관측자료의 신뢰성 부족 및 데이터의 부족으로 인하여 수공구조물 설계를 위한 수문해석 입력자료로 사용되는 설계강우량은 정확한 확률강우량의 산정과 시간분포가 중요한 요소로 작용한다. 실무에서는 Huff의 4분위 방법의 누가우량백분율을 이용하여 설계강우량의 시간분포 회귀식을 산정하고 있으며 분위별 곡선에 대한 회귀식은 전반적으로 정확도가 높게 나타나는 6차 다항회귀식을 일률적으로 사용하고 있다. 본 연구에서는 실무에서 일반적으로 설계강우량의 시간분포를 위해 사용하고 있는 Huff의 4분위 방법의 누가우량백분율을 이용하여 통계 모델링에서 간결함의 원리에 따라 변수선택법을 이용하여 시간분포 회귀식을 유도하였으며, 유의성 검정을 통한 시간분포 회귀식의 검증을 실시하였다. 변수선택법과 유의성 검정을 통한 시간분포 회귀식 산정 결과 전진선택법과 후방제거법의 장점을 모두 가지고 있는 단계선택법을 이용하여 시간분포 회귀식을 유도하는 것이 가장 적합한 것으로 분석되었다.

Designing Hypothesis of 2-Substituted-N-[4-(1-methyl-4,5-diphenyl-1H-imidazole-2-yl)phenyl] Acetamide Analogs as Anticancer Agents: QSAR Approach

  • Bedadurge, Ajay B.;Shaikh, Anwar R.
    • 대한화학회지
    • /
    • 제57권6호
    • /
    • pp.744-754
    • /
    • 2013
  • Quantitative structure-activity relationship (QSAR) analysis for recently synthesized imidazole-(benz)azole and imidazole - piperazine derivatives was studied for their anticancer activities against breast (MCF-7) cell lines. The statistically significant 2D-QSAR models ($r^2=0.8901$; $q^2=0.8130$; F test = 36.4635; $r^2$ se = 0.1696; $q^2$ se = 0.12212; pred_$r^2=0.4229$; pred_$r^2$ se = 0.4606 and $r^2=0.8763$; $q^2=0.7617$; F test = 31.8737; $r^2$ se = 0.1951; $q^2$ se = 0.2708; pred_$r^2=0.4386$; pred_$r^2$ se = 0.3950) were developed using molecular design suite (VLifeMDS 4.2). The study was performed with 18 compounds (data set) using random selection and manual selection methods used for the division of the data set into training and test set. Multiple linear regression (MLR) methodology with stepwise (SW) forward-backward variable selection method was used for building the QSAR models. The results of the 2D-QSAR models were further compared with 3D-QSAR models generated by kNN-MFA, (k-Nearest Neighbor Molecular Field Analysis) investigating the substitutional requirements for the favorable anticancer activity. The results derived may be useful in further designing novel imidazole-(benz)azole and imidazole-piperazine derivatives against breast (MCF-7) cell lines prior to synthesis.

한우 거세우 고기 관능평가 데이터의 로지스틱 회귀분석 (Logistic Regressions with Sensory Evaluation Data about Hanwoo Steer Beef)

  • 이혜정;김재희
    • 응용통계연구
    • /
    • 제23권5호
    • /
    • pp.857-870
    • /
    • 2010
  • 국립축산과학원에서는 2006년 부터 2008년 까지 전국 소비자들을 대상으로 한우 거세우 표본 시료에 대한 관능 평가 조사를 실시하여 데이터를 수집하였으며 본 연구에서는 한우 관능 평가 데이터에 대해 사회 인구학적 요인과 한국 소비자들의 맛 평가에 대한 연관성을 탐구하고자 한다. 소비자 거주지역, 연령, 성별, 직업, 월수입과 쇠고기 부위를 설명변수로 맛등급 평가를 반응변수로 이항 다중 로지스틱 모형과 다항 다중 로지스틱 모형을 적합하고 회귀계수별 유의성 검정과 적합도 검정을 실시한다. 단계별 변수 선택으로 최종 모형을 선택하고 반응변수 범주에 대한 오즈비를 계산하여 맛등급과 설명변수들 간의 관련성을 파악한다. 또한 맛과 관련 있는 연속형 변수를 설명변수로 포함한 경우에 대해서도 이항 다중 로지스틱 모형과 다항 다중 로지스틱 모형을 적합하고 비교한다. 그 결과 거주 지역, 연령, 월수입과 쇠고기 부위 변수들이 선택되었으며 영남지역에서 맛에 대한 오즈가 큰 편이며 수입이 많고 연령이 높을수록 맛에 대한 오즈가 작은 편이었다. 요리법으로는 탕에 대한 구이의 오즈비가 큰 편이며 쇠고기 부위별로는 우둔에 비해서 등심이 다른 부위들 보다 맛에 대한 차이가 크다고 볼 수 있다. 연속형 변수로는 연도가 맛등급에 큰 영향을 미치는 변수로 나타났다.

DIVERGENT SELECTION FOR POSTWEANING FEED CONVERSION IN ANGUS BEEF CATTLE V. PREDICTION OF FEED CONVERSION USING WEIGHTS AND LINEAR BODY MEASUREMENTS

  • Park, N.H.;Bishop, M.D.;Davis, M.E.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제7권3호
    • /
    • pp.441-448
    • /
    • 1994
  • Postweaning performance data were obtained on 187 group fed purebred Angus calves from 12 selected sires (six high and six low feed conversion sires) in 1985 and 1986. The objective of this portion of the study was to develop prediction equations for feed conversion from a stepwise regression analysis. Variables measured were on-test weight (ONTSTWT), on-test age (ONTSTAG), five weights by 28-d periods, seven linear body measurements: heart girth (HG), hip height (HH), head width (HDW), head length (HDL), muzzle circumference (MC), length between hooks and pins (HOPIN) and length between shoulder and hooks (SHHO), and backfat thickness (BF). Stepwise regressions for maintenance adjusted feed conversion (ADJFC) and unadjusted feed conversion (UNADFC) over the first 140 d of the test, and total feed conversion (FC) until progeny reached 8.89 mm of back fat were obtained separately by conversion groups and sexes and for combined feed conversion groups and sexes. In general, weights were more important than linear body measurements in prediction of feed utilization. To some extent this was expected as weight is related directly to gain which is a component of feed conversion. Weight at 112 d was the most important variable in prediction of feed conversion when data from both feed conversion groups and sexes were combined. Weights at 84 and 140 d were important variables in prediction of UNADFC and FC, respectively, of bulls. ONTSTWT and weight at 140 d had the highest standardized partial regression coefficients for UNADFC and ADJFC, respectively, of heifers. Results indicated that linear measurements, such as MC, HDL and HOPIN, are useful in prediction of feed conversion when feed in takes are unavailable.

범주형 재무자료에 대한 신용평가모형 검증 비교 (Validation Comparison of Credit Rating Models for Categorized Financial Data)

  • 홍종선;이창혁;김지훈
    • Communications for Statistical Applications and Methods
    • /
    • 제15권4호
    • /
    • pp.615-631
    • /
    • 2008
  • 재무자료에 대한 신용평가모형은 각각의 재무변수를 평활한 예측부도율로 변환하여 사용한다. 본 연구에서는 연속형 재무자료를 변환하여 설정된 신용평가모형의 문제점을 살펴보고, 연속형 재무변수를 다양한 형태로 범주화한 신용평가모형들을 제안한다. 범주형 재무자료를 사용해서 개발한 여러 종류의 신용평가모형들의 성과를 다양한 적합성 검증 방법으로 비교하고, 범주형 재무자료를 이용한 신용평가모형의 유용성을 토론한다.