• Title/Summary/Keyword: regression function

Semiparametric kernel logistic regression with longitudinal data

  • Shim, Joo-Yong;Seok, Kyung-Ha
    • Journal of the Korean Data and Information Science Society / v.23 no.2 / pp.385-392 / 2012
  • Logistic regression is a well-known binary classification method in statistical learning. Mixed-effect regression models are widely used to analyze correlated data such as those found in longitudinal studies. We consider kernel extensions of logistic regression with semiparametric fixed effects and parametric random effects. Estimation is performed through the penalized likelihood method based on the kernel trick, with a focus on efficient computation and effective hyperparameter selection. Cross-validation techniques are employed to select the optimal hyperparameters. Numerical results are then presented to indicate the performance of the proposed procedure.

Price Response Function with Price-Dependent Quality Evaluation at Segment Level (가격을 품질의 지표로 사용하는 세분시장의 가격반응함수 추출)

  • Kwak, Young-Sik;Lee, Yun-Kyung;Nam, Yong-Sik
    • Journal of Global Scholars of Marketing Science / v.16 no.2 / pp.77-94 / 2006
  • The purpose of this study was to identify consumers who use the price level as an indicator of product quality and to calibrate the price response function with price-dependent quality evaluation. To this end, the home theater market in China was segmented using a mixture regression model, and the price response function was calibrated at the segment level. Based on the type of price response function, each segment was allocated to one of two groups: the group that uses the price level as a quality indicator or the group that does not. The characteristics of the two groups were then compared in terms of product attributes and demographic variables.

Improving Estimative Capability of Software Development Effort using Radial Basis Function Network (RBF 망 이용 소프트웨어 개발 노력 추정 성능향상)

  • Lee, Sang-Un;Park, Yeong-Mok;Park, Jae-Hong
    • The KIPS Transactions:PartD / v.8D no.5 / pp.581-586 / 2001
  • An increasingly important facet of software development is the ability to estimate the associated cost and effort of development early in the development life cycle. The most commonly used procedure for estimating software development effort and cost has been linear regression analysis. Because of software complexity and the variety of development environments, however, such effort and cost estimates are often grossly inaccurate. Nonlinear methods hold the greatest promise for achieving this objective. This paper therefore presents an RBF (radial basis function) network model that is able to represent the nonlinear relationship for software development effort. The research describes appropriate RBF network modeling in the context of a case study of 24 software development projects. The paper also compares the RBF network model with a regression analysis model; the RBF network model was the most accurate of the models compared.

Association Between Cadmium Exposure and Liver Function in Adults in the United States: A Cross-sectional Study

  • Hong, Dongui;Min, Jin-Young;Min, Kyoung-Bok
    • Journal of Preventive Medicine and Public Health / v.54 no.6 / pp.471-480 / 2021
  • Objectives: Cadmium is widely used, leading to extensive environmental and occupational exposure. Unlike other organs, for which the harmful and carcinogenic effects of cadmium have been established, the hepatotoxicity of cadmium remains unclear. Some studies detected correlations between cadmium exposure and hepatotoxicity, but others concluded that they were not associated. Thus, we investigated the relationship between cadmium and liver damage in the general population. Methods: In total, 11 838 adult participants from National Health and Nutrition Examination Survey 1999-2015 were included. Urinary cadmium levels and the following liver function parameters were measured: alanine aminotransferase (ALT), aspartate aminotransferase (AST), gamma glutamyl transferase (GGT), total bilirubin (TB), and alkaline phosphatase (ALP). Linear and logistic regression analyses were performed to assess the associations between urinary cadmium concentrations and each liver function parameter after adjusting for age, sex, race/ethnicity, annual family income, smoking status, alcohol consumption status, physical activity, and body mass index. Results: The covariate-adjusted results of the linear regression analyses showed significant positive relationships between log-transformed urinary cadmium levels and each log-transformed liver function parameter, where beta±standard error of ALT, AST, GGT, TB, and ALP were 0.049±0.008 (p<0.001), 0.030±0.006 (p<0.001), 0.093±0.011 (p<0.001), 0.034±0.009 (p<0.001), and 0.040±0.005 (p<0.001), respectively. Logistic regression also revealed statistically significant results. The odds ratios (95% confidence intervals) of elevated ALT, AST, GGT, TB, and ALP per unit increase in log-transformed urinary cadmium concentration were 1.360 (1.210 to 1.528), 1.307 (1.149 to 1.486), 1.520 (1.357 to 1.704), 1.201 (1.003 to 1.438), and 1.568 (1.277 to 1.926), respectively. 
Conclusions: Chronic exposure to cadmium showed positive associations with liver damage.
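
The odds ratios quoted above relate to logistic-regression coefficients through OR = exp(beta). The sketch below illustrates that relationship by back-calculating a coefficient and standard error from the ALT figures in the abstract. It assumes the reported interval is a Wald-type 95% interval that is symmetric on the log scale, which the abstract does not state; the function names are illustrative only.

```python
import math

Z_95 = 1.959964  # two-sided 95% quantile of the standard normal


def odds_ratio_with_ci(beta, se, z=Z_95):
    """Return (OR, lower, upper) for a logistic-regression coefficient."""
    return (math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se))


def coefficient_from_or(odds_ratio, lower, upper, z=Z_95):
    """Back out (beta, se) from an OR and its CI.

    Assumption: the CI is a Wald interval, symmetric on the log scale.
    """
    beta = math.log(odds_ratio)
    se = (math.log(upper) - math.log(lower)) / (2.0 * z)
    return beta, se


# ALT figures quoted in the abstract: OR 1.360 (1.210 to 1.528).
beta_alt, se_alt = coefficient_from_or(1.360, 1.210, 1.528)
or_alt, lo_alt, hi_alt = odds_ratio_with_ci(beta_alt, se_alt)
print(f"beta={beta_alt:.3f}, se={se_alt:.3f}")
print(f"OR={or_alt:.3f} ({lo_alt:.3f} to {hi_alt:.3f})")
```

If the round trip reproduces the published interval to within rounding, the Wald assumption is at least consistent with the reported numbers; it does not confirm how the authors computed them.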

Optimized Neural Network Weights and Biases Using Particle Swarm Optimization Algorithm for Prediction Applications

  • Ahmadzadeh, Ezat;Lee, Jieun;Moon, Inkyu
    • Journal of Korea Multimedia Society / v.20 no.8 / pp.1406-1420 / 2017
  • Artificial neural networks (ANNs) play an important role in the fields of function approximation, prediction, and classification. ANN performance is critically dependent on the input parameters, including the number of neurons in each layer, and the optimal values of weights and biases assigned to each neuron. In this study, we apply the particle swarm optimization method, a popular optimization algorithm for determining the optimal values of weights and biases for every neuron in different layers of the ANN. Several regression models, including general linear regression, Fourier regression, smoothing spline, and polynomial regression, are conducted to evaluate the proposed method's prediction power compared to multiple linear regression (MLR) methods. In addition, residual analysis is conducted to evaluate the optimized ANN accuracy for both training and test datasets. The experimental results demonstrate that the proposed method can effectively determine optimal values for neuron weights and biases, and high accuracy results are obtained for prediction applications. Evaluations of the proposed method reveal that it can be used for prediction and estimation purposes, with a high accuracy ratio, and the designed model provides a reliable technique for optimization. The simulation results show that the optimized ANN exhibits superior performance to MLR for prediction purposes.
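
As a rough illustration of the idea in this abstract, the sketch below uses a basic particle swarm to search for the weight and bias of a single linear neuron by minimizing mean squared error. It is not the authors' network or configuration: the swarm size, inertia and acceleration coefficients, search range, and toy data are all assumptions chosen for a small runnable example.

```python
import random


def mse(params, xs, ys):
    """Mean squared error of a single linear neuron y = w*x + b."""
    w, b = params
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def pso(loss, dim, n_particles=20, n_iters=200,
        inertia=0.7, c_personal=1.5, c_global=1.5, seed=0):
    """Minimize `loss` over `dim` parameters with a basic global-best PSO."""
    rng = random.Random(seed)
    pos = [[rng.uniform(-5, 5) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]          # personal best positions
    best_val = [loss(p) for p in pos]       # personal best losses
    g = min(range(n_particles), key=lambda i: best_val[i])
    g_pos, g_val = best_pos[g][:], best_val[g]  # global best so far
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity blends momentum, pull to personal best, pull to global best.
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * r1 * (best_pos[i][d] - pos[i][d])
                             + c_global * r2 * (g_pos[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            val = loss(pos[i])
            if val < best_val[i]:
                best_val[i], best_pos[i] = val, pos[i][:]
                if val < g_val:
                    g_val, g_pos = val, pos[i][:]
    return g_pos, g_val


# Hypothetical toy data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [2.0 * x + 1.0 for x in xs]
weights, final_mse = pso(lambda p: mse(p, xs, ys), dim=2)
print(weights, final_mse)
```

For a multi-layer network the same loop applies with `dim` set to the total number of weights and biases and `loss` replaced by the network's training error.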

Analysis of Temperature Effects on Microbial Growth Parameters and Estimation of Food Shelf Life with Confidence Band

  • Park, Jin-Pyo;Lee, Dong-Sun
    • Preventive Nutrition and Food Science / v.13 no.2 / pp.104-111 / 2008
  • As a way to account for the variability of the primary model parameters in the secondary modeling of microbial growth, three different regression approaches were compared in determining the confidence interval of the temperature-dependent primary model parameters and the estimated microbial growth during storage: bootstrapped regression with all the individual primary model parameter values; bootstrapped regression with average values at each temperature; and simple regression with regression lines of 2.5% and 97.5% percentile values. Temperature dependences of converted parameters (log $q_o$, ${\mu}_{max}^{1/2}$, log $N_{max}$) of hypothetical initial physiological state, maximum specific growth rate, and maximum cell density in Baranyi's model were subjected to the regression by quadratic, linear, and linear function, respectively. With an advantage of extracting the primary model parameters instantaneously at any temperature by using mathematical functions, regression lines of 2.5% and 97.5% percentile values were capable of accounting for variation in experimental data of microbial growth under constant and fluctuating temperature conditions.
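
The first of the three approaches, bootstrapped regression over individual parameter values, can be sketched as follows. This is a minimal case-resampling bootstrap of a straight-line fit with a 2.5% to 97.5% percentile band at one temperature. The temperatures and ${\mu}_{max}^{1/2}$ values are hypothetical, and the resampling scheme and number of replicates are assumptions rather than the authors' settings.

```python
import random


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def percentile(sorted_vals, q):
    """Percentile (q in [0, 100]) by linear interpolation on sorted values."""
    pos = (len(sorted_vals) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)


def bootstrap_band(xs, ys, x_new, n_boot=1000, seed=0):
    """2.5% and 97.5% percentiles of the fitted value at x_new."""
    rng = random.Random(seed)
    n = len(xs)
    preds = []
    while len(preds) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]  # resample cases with replacement
        bx = [xs[i] for i in idx]
        if len(set(bx)) < 2:  # a line cannot be fitted to a single x value
            continue
        by = [ys[i] for i in idx]
        slope, intercept = fit_line(bx, by)
        preds.append(slope * x_new + intercept)
    preds.sort()
    return percentile(preds, 2.5), percentile(preds, 97.5)


# Hypothetical replicate values of sqrt(mu_max) at four storage temperatures.
temps = [5.0, 5.0, 10.0, 10.0, 15.0, 15.0, 20.0, 20.0]
sqrt_mu = [0.10, 0.12, 0.20, 0.19, 0.29, 0.31, 0.40, 0.38]
lower, upper = bootstrap_band(temps, sqrt_mu, x_new=12.0)
print(lower, upper)
```

Repeating the call over a grid of temperatures traces out the confidence band for the secondary model.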

Cloud Removal Using Gaussian Process Regression for Optical Image Reconstruction

  • Park, Soyeon;Park, No-Wook
    • Korean Journal of Remote Sensing / v.38 no.4 / pp.327-341 / 2022
  • Cloud removal is often required to construct time-series sets of optical images for environmental monitoring. In regression-based cloud removal, the selection of an appropriate regression model and the impact analysis of the input images significantly affect the prediction performance. This study evaluates the potential of Gaussian process (GP) regression for cloud removal and also analyzes the effects of cloud-free optical images and spectral bands on prediction performance. Unlike other machine learning-based regression models, GP regression provides uncertainty information and automatically optimizes hyperparameters. An experiment using Sentinel-2 multi-spectral images was conducted for cloud removal in the two agricultural regions. The prediction performance of GP regression was compared with that of random forest (RF) regression. Various combinations of input images and multi-spectral bands were considered for quantitative evaluations. The experimental results showed that using multi-temporal images with multi-spectral bands as inputs achieved the best prediction accuracy. Highly correlated adjacent multi-spectral bands and temporally correlated multi-temporal images resulted in an improved prediction accuracy. The prediction performance of GP regression was significantly improved in predicting the near-infrared band compared to that of RF regression. Estimating the distribution function of input data in GP regression could reflect the variations in the considered spectral band with a broader range. In particular, GP regression was superior to RF regression for reproducing structural patterns at both sites in terms of structural similarity. In addition, uncertainty information provided by GP regression showed a reasonable similarity to prediction errors for some sub-areas, indicating that uncertainty estimates may be used to measure the prediction result quality. 
These findings suggest that GP regression could be beneficial for cloud removal and optical image reconstruction. In addition, the impact analysis results of the input images provide guidelines for selecting optimal images for regression-based cloud removal.

Receiver Operating Characteristic (ROC) Curves Using Neural Network in Classification

  • Lee, Jea-Young;Lee, Yong-Won
    • Journal of the Korean Data and Information Science Society / v.15 no.4 / pp.911-920 / 2004
  • We construct receiver operating characteristic (ROC) curves using neural networks with a logistic function. The models are shown to arise from the classification of normal (nondiseased) and abnormal (diseased) groups in medical research. A few goodness-of-fit test statistics based on normality curves are discussed, and the performance of the neural networks with a logistic function is evaluated.

Retrieval of Land Surface Temperature based on High Resolution Landsat 8 Satellite Data (고해상도 Landsat 8 위성자료기반의 지표면 온도 산출)

  • Jee, Joon-Bum;Kim, Bu-Yo;Zo, Il-Sung;Lee, Kyu-Tae;Choi, Young-Jean
    • Korean Journal of Remote Sensing / v.32 no.2 / pp.171-183 / 2016
  • Land surface temperature (LST) was retrieved from Landsat 8 data measured from 2013 to 2014 and corrected using surface temperature observed on the ground. LST maps were calculated using a linear regression function between the raw Landsat 8 LST and the ground surface temperature. Seasonal and annual LST maps were produced by averaging LST over each season and over the year, respectively. Higher LSTs are distributed over urban industrial and commercial areas, whereas lower LSTs are found over the surrounding rural areas, sea, rivers, and high-altitude mountains in and around Seoul. To correct the LST, a linear regression function was calculated between the Landsat 8 LST and the ground surface temperature observed at 3 Korea Meteorological Administration (KMA) synoptic stations (Seoul (ID: 108), Incheon (ID: 112), and Suwon (ID: 119)) in Seoul and the surrounding area. The slope of the regression function is 0.78 with all data and 0.88 under clear sky, that is, excluding 5 cloudy pixels. The original Landsat 8 LST has a correlation coefficient of 0.88 and a root mean square error (RMSE) of $5.33^{\circ}C$. After correction, the LST has a correlation coefficient of 0.98 and an RMSE of $2.34^{\circ}C$, and the slope of the regression equation improves to 0.95. The seasonal and annual LST maps clearly distinguish urban from rural areas and commercial from industrial regions. As a result, the Landsat 8 LST is closer to the actual state when corrected using surface temperature observed on the ground.
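
The correction step described above amounts to fitting a straight line from satellite LST to ground temperature and applying it to the satellite values. The sketch below shows that step with ordinary least squares and compares RMSE before and after. The temperature pairs are hypothetical placeholders, not the KMA station data, and the helper names are illustrative.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for ys ~ slope * xs + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def rmse(predicted, observed):
    """Root mean square error between two equal-length sequences."""
    n = len(observed)
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(predicted, observed)) / n)


# Hypothetical paired values in degrees Celsius: raw satellite LST vs. ground observation.
satellite = [12.0, 18.5, 25.0, 31.0, 36.5]
ground = [10.5, 15.0, 20.5, 24.0, 29.5]

slope, intercept = fit_line(satellite, ground)
corrected = [slope * s + intercept for s in satellite]

print(f"slope={slope:.2f}, intercept={intercept:.2f}")
print(f"RMSE before={rmse(satellite, ground):.2f}, after={rmse(corrected, ground):.2f}")
```

On the fitting data, the least-squares line can never have a larger RMSE than the uncorrected values, because leaving the values unchanged (slope 1, intercept 0) is one of the candidate lines.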

Typology of ROII Patterns on Cluster Analysis in Korean Enterprises

  • Kim, Young Sun;Kwon, Oh Jun;Kim, Ki Sik;Rhee, Kyung Yong
    • Safety and Health at Work / v.3 no.4 / pp.278-286 / 2012
  • Objectives: The authors investigated the pattern of the rate of occupational injuries and illnesses (ROII) at the enterprise level in order to build a network for exchanging experience and knowledge, which would contribute to workers' safety and health through the safety climate of the workplace. Methods: Occupational accidents were analyzed at the level of the manufacturing work site. A two-step clustering process was applied to the ROII patterns from 2001 to 2009. The ROII patterns were first categorized based on regression analysis and then further divided according to subtle changes using Mahalanobis distance and Ward's linkage. Results: The first clustering of ROII through regression analysis showed 5 different functions: 29 work sites of the linear function, 50 sites of the quadratic function, 95 sites of the logarithm function, 62 sites of the exponential function, and 54 sites of the sine function. Fourteen clusters were created in the second clustering. There were 3 clusters in each function categorized in the first clustering except for the sine function. Each cluster consisted of work sites with similar ROII patterns, which had unique characteristics. Conclusion: The five different patterns of ROII suggest that tailored management activities should be applied to every work site. Based on these differences, the authors selected exemplary work sites and built a network to help the work sites share information on safety climate and accident prevention measures. The causes of the different ROII patterns, the building of the network, and this management model should be evaluated in future research.