• 제목/요약/키워드: Multiple-Linear-Regression

검색결과 1,745건 처리시간 0.028초

Estimation of genetic parameters and trends for production traits of dairy cattle in Thailand using a multiple-trait multiple-lactation test day model

  • Buaban, Sayan;Puangdee, Somsook;Duangjinda, Monchai;Boonkum, Wuttigrai
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제33권9호
    • /
    • pp.1387-1399
    • /
    • 2020
  • Objective: The objective of this study was to estimate the genetic parameters and trends for milk, fat, and protein yields in the first three lactations of Thai dairy cattle using a 3-trait,-3-lactation random regression test-day model. Methods: Data included 168,996, 63,388, and 27,145 test-day records from the first, second, and third lactations, respectively. Records were from 19,068 cows calving from 1993 to 2013 in 124 herds. (Co) variance components were estimated by Bayesian methods. Gibbs sampling was used to obtain posterior distributions. The model included herd-year-month of testing, breed group-season of calving-month in tested milk group, linear and quadratic age at calving as fixed effects, and random regression coefficients for additive genetic and permanent environmental effects, which were defined as modified constant, linear, quadratic, cubic and quartic Legendre coefficients. Results: Average daily heritabilities ranged from 0.36 to 0.48 for milk, 0.33 to 0.44 for fat and 0.37 to 0.48 for protein yields; they were higher in the third lactation for all traits. Heritabilities of test-day milk and protein yields for selected days in milk were higher in the middle than at the beginning or end of lactation, whereas those for test-day fat yields were high at the beginning and end of lactation. Genetics correlations (305-d yield) among production yields within lactations (0.44 to 0.69) were higher than those across lactations (0.36 to 0.68). The largest genetic correlation was observed between the first and second lactation. The genetic trends of 305-d milk, fat and protein yields were 230 to 250, 25 to 29, and 30 to 35 kg per year, respectively. Conclusion: A random regression model seems to be a flexible and reliable procedure for the genetic evaluation of production yields. It can be used to perform breeding value estimation for national genetic evaluation in the Thai dairy cattle population.

중회귀 모형을 이용한 울산지역 오존 포텐셜 모형의 설계 및 평가 (Design and Assessment of an Ozone Potential Forecasting Model using Multi-regression Equations in Ulsan Metropolitan Area)

  • 김유근;이소영;임윤규;송상근
    • 한국대기환경학회지
    • /
    • 제23권1호
    • /
    • pp.14-28
    • /
    • 2007
  • This study presented the selection of ozone ($O_3$) potential factors and designed and assessed its potential prediction model using multiple-linear regression equations in Ulsan area during the springtime from April to June, $2000{\sim}2004$. $O_3$ potential factors were selected by analyzing the relationship between meterological parameters and surface $O_3$ concentrations. In addition, cluster analysis (e.g., average linkage and K-means clustering techniques) was performed to identify three major synoptic patterns (e.g., $P1{\sim}P3$) for an $O_3$ potential prediction model. P1 is characterized by a presence of a low-pressure system over northeastern Korea, the Ulsan was influenced by the northwesterly synoptic flow leading to a retarded sea breeze development. P2 is characterized by a weakening high-pressure system over Korea, and P3 is clearly associated with a migratory anticyclone. The stepwise linear regression was performed to develop models for prediction of the highest 1-h $O_3$ occurring in the Ulsan. The results of the models were rather satisfactory, and the high $O_3$ simulation accuracy for $P1{\sim}P3$ synoptic patterns was found to be 79, 85, and 95%, respectively ($2000{\sim}2004$). The $O_3$ potential prediction model for $P1{\sim}P3$ using the predicted meteorological data in 2005 showed good high $O_3$ prediction performance with 78, 75, and 70%, respectively. Therefore the regression models can be a useful tool for forecasting of local $O_3$ concentration.

고소성 해성점토지반의 압축지수에 대한 비교 연구 (A Comparison Study on Compression Index of Marine Clay with High-Plasticity)

  • 정길수;박병수;홍영길;유남재
    • 산업기술연구
    • /
    • 제25권A호
    • /
    • pp.57-65
    • /
    • 2005
  • In this paper, for the highly plastic marine soft clay distributed in west and southern coast of Korean peninsula of Kwangyang and Busan New Port areas, correlation between compression index and other indices representing geotechnical engineering properties such as liquid limit, void ratio and natural water content were analyzed. Appropriate empirical equations of being able to estimate the compressibility of clays in the specific areas were proposed and compared with other existing empirical ones. For analyses of the data and test results, data for marine clays were used from areas of the South Container Port of the Busan New Port, East Breakwater, Passenger Quay, Jungma Reclamation and Reclamation Containment in the 3rd stage in Kwangyang. In order to find the best regression model by using the commercially available software, MS EXCEL 2000, results obtained from the simple linear regression analysis, using the values of liquid limit, initial void ratio and natural water content as independent variables, were compared with the existing empirical equations. Multiple linear regression was also performed to find the best fit regression curves for compression index and other soil properties by combining those independent variables. On the other hands, another software of SPSS for non-linear regression was used to analyze the correlations between compression index and other soil properties.

  • PDF

Evaluation of Sigumjang Aroma by Stepwise Multiple Regression Analysis of Gas Chromatographic Profiles

  • Choi, Ung-Kyu;Kwon, O-Jun;Lee, Eun-Jeong;Son, Dong-Hwa;Cho, Young-Je;Im, Moo-Hyeog;Chung, Yung-Gun
    • Journal of Microbiology and Biotechnology
    • /
    • 제10권4호
    • /
    • pp.476-481
    • /
    • 2000
  • A linear correlation, by the stepwise multiple regression analysis, was found between the sensory test of Sigumjang aroma and the gas chromatographic data which were transformed with logarithm. GC data is the most objective method to evaluate Sigumjang aroma. A multiple correlation coefficient and a determination coefficient of more than 0.9 were obtained at the 9th and 13th steps, respectively. At step 31, the coefficient of determination level of 0.95 was attained. The accuracy of its estimation became higher as the number of the variables entered into the regression model increased. Over 90% of the Sigumjang aroma was explained by 13 compounds indentified on GC. The contributing proportion of the peak 26 was the highest followed by peaks 57 (9.27%), 29 (7.51%), 54 (6.01%), 8 (5.99%), 49 (4.97%), and 13 (4.11%).

  • PDF

Some Remarks on the Likelihood Inference for the Ratios of Regression Coefficients in Linear Model

  • Kim, Yeong-Hwa;Yang, Wan-Yeon;Kim, M.J.;Park, C.G.
    • Journal of the Korean Data and Information Science Society
    • /
    • 제15권1호
    • /
    • pp.251-261
    • /
    • 2004
  • The paper focuses primarily on the standard linear multiple regression model where the parameter of interest is a ratio of two regression coefficients. The general model includes the calibration model, the Fieller-Creasy problem, slope-ratio assays, parallel-line assays, and bioequivalence. We provide an orthogonal transformation (cf. Cox and Reid (1987)) of the original parameter vector. Also, we give some remarks on the difficulties associated with likelihood based confidence interval.

  • PDF

근적외선 분광분석법을 이용한 LAS (linear alkyl benzene sulfonate)의 정량분석법 (Near Infrared Spectroscopy of LAS (linear alkyl benzene sulfonate))

  • 조창희;최병기;김효진
    • Environmental Analysis Health and Toxicology
    • /
    • 제15권1_2호
    • /
    • pp.39-43
    • /
    • 2000
  • Linear alkyl sulfonates (LAS) constitute a large fraction of the surfactants used in cleaning processes in households, trade and industry Despite the industrial significance and the possible environmental impact of these compounds, the fast and inexpensive determination of LAS concentrations is still a difficult task. In this study, near infrared (NIR) spectroscopy which is a rapid spectroscopic analysis method compared with a traditional analytical method for the measurement of LAS concentration such as HPLC, GC and standard wet chemistry method. NIR spectra of LAS between 0.313 and 25.0% (w/v) in water were utilized to develop a calibration model. The best results (R = 0.998, SEP = 0.244% (w/v)) obtained by using partial least-squares regression with spectral data treatment and 2nd derivatization were comparable to the results (SEC = 0.186% (w/v), SEP = 0.206% (w/v)) obtained by using multiple linear least-squares regression (MLR). However, models based on derivative spectra have no significant advantage with MLR.

  • PDF

Interrelation of Retention Factor of Amino-Acids by QSPR and Linear Regression

  • Lee, Seung-Ki;Polyakova, Yulia;Row, Kyung-Ho
    • Bulletin of the Korean Chemical Society
    • /
    • 제24권12호
    • /
    • pp.1757-1762
    • /
    • 2003
  • The interrelation between retention factors of several L-amino acids and their physico-chemical and structural properties can be determined in chromatographic research. In this paper we describe a predictor for retention factors with various properties of the L-amino acids. Eighteen L-amino acids are included in this study, and retention factors are measured experimentally by RP-HPLC. Binding energy ($E_b$), hydrophobicity (log P), molecular refractivity (MR), polarizability (${\alpha}$), total energy ($E_t$), water solubility (log S), connectivity index (${\chi}$) of different orders and Wiener index (w) are theoretically calculated. Relationships between these properties and retention factors are established, and their predictive and interpretive ability are evaluated. The equation of the relationship between retention factors and various descriptors of L-amino acids is suggested as linear and multiple linear form, and the correlation coefficients estimated are relatively higher than 0.90.

머신러닝 알고리즘 기반의 의료비 예측 모델 개발 (Development of Medical Cost Prediction Model Based on the Machine Learning Algorithm)

  • Han Bi KIM;Dong Hoon HAN
    • Journal of Korea Artificial Intelligence Association
    • /
    • 제1권1호
    • /
    • pp.11-16
    • /
    • 2023
  • Accurate hospital case modeling and prediction are crucial for efficient healthcare. In this study, we demonstrate the implementation of regression analysis methods in machine learning systems utilizing mathematical statics and machine learning techniques. The developed machine learning model includes Bayesian linear, artificial neural network, decision tree, decision forest, and linear regression analysis models. Through the application of these algorithms, corresponding regression models were constructed and analyzed. The results suggest the potential of leveraging machine learning systems for medical research. The experiment aimed to create an Azure Machine Learning Studio tool for the speedy evaluation of multiple regression models. The tool faciliates the comparision of 5 types of regression models in a unified experiment and presents assessment results with performance metrics. Evaluation of regression machine learning models highlighted the advantages of boosted decision tree regression, and decision forest regression in hospital case prediction. These findings could lay the groundwork for the deliberate development of new directions in medical data processing and decision making. Furthermore, potential avenues for future research may include exploring methods such as clustering, classification, and anomaly detection in healthcare systems.

보행자 전용도로의 이용자 경관만족 요인분석 -분당 신도시를 중심으로- (The Analysis of User's Degree on Landscape Satisfaction Factors for Pedestrian Road -Case Study of Bun-Dang New Town-)

  • 김대현
    • 한국환경복원기술학회지
    • /
    • 제4권2호
    • /
    • pp.1-10
    • /
    • 2001
  • The purpose of this study was to investigate factors and variables which have significant effects on landscape satisfaction of urban pedestrian road in Bun-dang new town and to suggest basic information for urban pedestrian road design. These works consist of two phase. First, we tested the Hye-Cheon college students' degree of landscape satisfaction for 37 spots of urban pedestrian road and then selected 10 sports slide by the Sturges' formula. Second, we analysed factors and variables on landscape satisfaction of urban pedestrian road using the semantic differential scale method and then processed using descriptive analysis, factor analysis and multiple linear regression analysis. The major findings of this study can be summarized as follows; 1) The difference of landscape adjectives between the highest score of landscape satisfaction slide and the lowest score landscape satisfaction slide were diversity of vegetation, plenty of the shade of a tree, naturalness and cleanness. 2) Diversity of vegetation, width of road, freedom of danger and diversity of environment can be significant variables of major effects on landscape satisfaction of urban pedestrian road by using the multiple linear regression analysis. 3) Factors covering the landscape satisfaction of urban pedestrian road have been found to be Environment of urban pedestrian road and Constitution of urban pedestrian road. By using the Varimaxs' rotation factor analysis for the number of factors' cumulative percentage has been obtained as 64%. 4) Environment of urban pedestrian road and Constitution of urban pedestrian road can be significant factors of major effects on landscape satisfaction of urban pedestrian road by using the multiple linear regression analysis. In conclusion, the landscape satisfaction factors and variables of urban pedestrian road need to be considered in plan or design the urban pedestrian road.

  • PDF

Prediction of unconfined compressive and Brazilian tensile strength of fiber reinforced cement stabilized fly ash mixes using multiple linear regression and artificial neural network

  • Chore, H.S.;Magar, R.B.
    • Advances in Computational Design
    • /
    • 제2권3호
    • /
    • pp.225-240
    • /
    • 2017
  • This paper presents the application of multiple linear regression (MLR) and artificial neural network (ANN) techniques for developing the models to predict the unconfined compressive strength (UCS) and Brazilian tensile strength (BTS) of the fiber reinforced cement stabilized fly ash mixes. UCS and BTS is a highly nonlinear function of its constituents, thereby, making its modeling and prediction a difficult task. To establish relationship between the independent and dependent variables, a computational technique like ANN is employed which provides an efficient and easy approach to model the complex and nonlinear relationship. The data generated in the laboratory through systematic experimental programme for evaluating UCS and BTS of fiber reinforced cement fly ash mixes with respect to 7, 14 and 28 days' curing is used for development of the MLR and ANN model. The data used in the models is arranged in the format of four input parameters that cover the contents of cement and fibers along with maximum dry density (MDD) and optimum moisture contents (OMC), respectively and one dependent variable as unconfined compressive as well as Brazilian tensile strength. ANN models are trained and tested for various combinations of input and output data sets. Performance of networks is checked with the statistical error criteria of correlation coefficient (R), mean square error (MSE) and mean absolute error (MAE). It is observed that the ANN model predicts both, the unconfined compressive and Brazilian tensile, strength quite well in the form of R, RMSE and MAE. This study shows that as an alternative to classical modeling techniques, ANN approach can be used accurately for predicting the unconfined compressive strength and Brazilian tensile strength of fiber reinforced cement stabilized fly ash mixes.