• Title/Summary/Keyword: regression function

Robust Nonparametric Regression Method using Rank Transformation

  • Park, Dongryeon
    • Communications for Statistical Applications and Methods / v.7 no.2 / pp.575-583 / 2000
  • Consider the problem of estimating a regression function from data contaminated by a long-tailed error distribution. A linear smoother is a locally weighted average of the responses, so it is not robust against outliers. The kernel M-smoother and lowess attain robustness by down-weighting outliers. However, both require iteration to compute the robustness weights, and as Wang and Scott (1994) pointed out, this requirement is not a desirable property. In this article, we propose a robust nonparametric regression method that does not require iteration. Robustness can be achieved not only by down-weighting outliers but also by transforming them. The rank transformation is a simple procedure in which the data are replaced by their corresponding ranks. Iman and Conover (1979) showed that the rank transformation is a robust and powerful procedure in linear regression. In this paper, we show that the rank transformation can also be applied to nonparametric regression to achieve robustness.

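The abstract above describes replacing data by their ranks before smoothing. The following is a minimal sketch of that general idea, not the author's procedure: it assumes the response is rank-transformed and then smoothed with a Gaussian-kernel (Nadaraya-Watson) average. The function names, the bandwidth, and the sample data are hypothetical, and the fitted value stays on the rank scale because the abstract does not say how the estimate is mapped back.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def kernel_smooth(x, y, x0, bandwidth):
    """Nadaraya-Watson estimate at x0 with a Gaussian kernel (no iteration)."""
    weights = [math.exp(-0.5 * ((xi - x0) / bandwidth) ** 2) for xi in x]
    return sum(w * yi for w, yi in zip(weights, y)) / sum(weights)

# Hypothetical data: the response 50.0 at x = 3 is an outlier.
x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [0.1, 1.2, 1.9, 50.0, 4.1, 5.0]

y_ranks = average_ranks(y)
fit_raw = kernel_smooth(x, y, 3.0, bandwidth=1.0)         # pulled up by the outlier
fit_rank = kernel_smooth(x, y_ranks, 3.0, bandwidth=1.0)  # bounded by the rank range
```

Because ranks are bounded by the sample size, a single extreme response can shift the rank-scale smooth only by a limited amount, which is the intuition behind robustness through transformation rather than down-weighting.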

Regularity of Maximum Likelihood Estimation for ARCH Regression Model with Lagged Dependent Variables

  • Hwang, Sun Y.
    • Journal of the Korean Statistical Society / v.29 no.1 / pp.9-16 / 2000
  • This article addresses the problem of maximum likelihood estimation in ARCH regression with lagged dependent variables. Asymptotic topics for the model, such as a uniform expansion of the likelihood function and the construction of a class of MLEs, are discussed, and the regularity property of the MLE is obtained. The error process here is possibly non-Gaussian.


Factors Related to Sexual Function in Men with Rectal Cancer (직장암 남성의 성기능 관련 요인)

  • Woo, Sang Jun;Lee, Eun Sook;Kim, Hyeong Rok;Kim, Chang Hyun
    • The Journal of Korean Society for School & Community Health Education / v.20 no.3 / pp.91-100 / 2019
  • Purpose: The purpose of this study was to investigate the sexual function of male patients with rectal cancer and to analyze the factors related to sexual function. Methods: This study included 71 male patients undergoing outpatient treatment after surgery at C University Hospital, Chonnam, Korea from April 1 to September 1, 2014. The sexual function of males with colorectal cancer was measured using the Korean translation of the International Index of Erectile Function (IIEF). Data analysis was performed using t-test, ANOVA, and regression analysis. The study was IRB approved. Results: The sexual function index score of the subjects was 33.28±19.47 points. Regression analysis showed that sexual function increased with longer duration after operation (p=.001), higher location of cancer (p=.007), and younger age (p=.013). The explanatory power (adj. R²) of the analysis model was 0.186. Conclusion: Sexual function of males with rectal cancer differed according to duration after operation, location of cancer, and age. These results can therefore be used as basic data for appropriate education and counseling by age, time, and type of treatment to improve the sexual function of men with rectal cancer.

The association of perfluoroalkyl substances (PFAS) exposure and kidney function in Korean adolescents using data from Korean National Environmental Health Survey (KoNEHS) cycle 4 (2018-2020): a cross-sectional study

  • Jisuk Yun;Eun-Chul Jang;Soon-Chan Kwon;Young-Sun Min;Yong-Jin Lee
    • Annals of Occupational and Environmental Medicine / v.35 / pp.5.1-5.14 / 2023
  • Background: Perfluoroalkyl substances (PFAS) are chemicals widely used in various everyday products. Because of their uniquely strong binding force, PFAS have a very long half-life, so bioaccumulation and toxicity to the human body are long-standing concerns. In particular, effects on kidney function have recently emerged as a concern, yet no epidemiological study has examined the effect of PFAS on kidney function in Korea. From 2018 to 2020, the Korean National Environmental Health Survey (KoNEHS) cycle 4 conducted the first epidemiological investigation of PFAS blood concentrations in Korea. Based on these data, the relationship between PFAS blood concentration and kidney function was analyzed for adolescents. Methods: We investigated 5 types of PFAS and their total blood concentration in 811 middle and high school students living in Korea and included in KoNEHS cycle 4, and examined changes in kidney function in relation to PFAS concentration. After dividing the concentration of each of the 5 PFAS and the total concentration into quartiles, multivariable linear regression was performed to assess the correlation with kidney function. The bedside Schwartz equation was used as an indicator of kidney function. Results: In the multivariable linear regression, a significant decrease in kidney function was confirmed in some or all quartiles as the concentration of each of the 5 PFAS and their total increased. Conclusions: In this cross-sectional study of Korean adolescents based on KoNEHS data, a negative correlation between serum PFAS concentration and kidney function was found. A well-designed longitudinal study and continuous follow-up are necessary.
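Two steps named in the abstract above can be sketched briefly: the bedside Schwartz estimate of kidney function and the split of concentrations into quartiles. The sketch assumes the commonly cited form of the bedside Schwartz equation, eGFR = 0.413 × height (cm) / serum creatinine (mg/dL), in mL/min/1.73 m². The quartile cut points use Python's default `statistics.quantiles` method, which may differ from the rule the study used. Function names and sample values are hypothetical.

```python
import statistics

def schwartz_egfr(height_cm, serum_creatinine_mg_dl):
    """Bedside Schwartz eGFR in mL/min/1.73 m^2 (assumed coefficient 0.413)."""
    return 0.413 * height_cm / serum_creatinine_mg_dl

def quartile_labels(concentrations):
    """Label each value 1..4 by the sample quartile it falls into."""
    cuts = statistics.quantiles(concentrations, n=4)  # three cut points
    # A value's quartile is 1 plus the number of cut points strictly below it.
    return [1 + sum(value > cut for cut in cuts) for value in concentrations]

# Hypothetical adolescent: 160 cm tall, serum creatinine 0.8 mg/dL.
egfr = schwartz_egfr(160.0, 0.8)

# Hypothetical PFAS concentrations (arbitrary units).
labels = quartile_labels([1, 2, 3, 4, 5, 6, 7, 8])
```

In a regression such as the one described, the quartile label would typically enter as a categorical predictor with the lowest quartile as the reference; that coding choice is an assumption here, not something the abstract states.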

A UCP-based Model to Estimate the Software Development Cost (소프트웨어 개발 비용을 추정하기 위한 사용사례 점수 기반 모델)

  • Park, Ju-Seok;Chong, Ki-Won
    • The KIPS Transactions:PartD / v.11D no.1 / pp.163-172 / 2004
  • In software development projects that apply an object-oriented development methodology, the UCP (Use Case Point) is being studied as a method for estimating development effort. Existing research proposes a linear model that calculates development effort by multiplying a constant by the AUCP (Adjusted Use Case Point), which incorporates technical and environmental factors. However, statistical models that estimate development effort from AUCP and UUCP (Unadjusted Use Case Point) have not been studied. We confirm that the linear regression model is inappropriate, since the development period increases tremendously as software size increases. Moreover, during the UCP calculation, applying the TCF (Technical Complexity Factor) and EF (Environmental Factor) can introduce errors in FP. This paper presents a non-linear regression model that does not consider the TCF and EF and that estimates development effort directly from UUCP using an exponential function. The exponential function was selected from among linear, logarithmic, polynomial, power, and exponential models through statistical evaluation of these models.
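An exponential effort model of the kind described above can be written as effort = a · exp(b · UUCP). The sketch below fits a and b by ordinary least squares on log(effort), which is one simple way to do it and not necessarily the authors' fitting procedure. Note that this minimizes error on the log scale, so it can differ from a direct nonlinear least-squares fit. The data and function names are hypothetical.

```python
import math

def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

def fit_exponential(size, effort):
    """Fit effort = a * exp(b * size) through a straight-line fit of log(effort)."""
    intercept, slope = fit_line(size, [math.log(e) for e in effort])
    return math.exp(intercept), slope

# Hypothetical projects: UUCP sizes and efforts generated from a = 2.0, b = 0.01.
uucp = [100.0, 200.0, 300.0, 400.0]
effort = [2.0 * math.exp(0.01 * u) for u in uucp]

a, b = fit_exponential(uucp, effort)
predicted_effort = a * math.exp(b * 250.0)  # estimate for a 250-UUCP project
```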

Mapping the Spatial Distribution of IRG Growth Based on UAV

  • Na, Sang-Il;Park, Chan-Won;Kim, Young-Jin;Lee, Kyung-Do
    • Korean Journal of Soil Science and Fertilizer / v.49 no.5 / pp.495-502 / 2016
  • Italian ryegrass (IRG), known as a high-yielding winter annual forage crop of the highest quality, is grown in the mid-south area of Korea. The objective of this study was to evaluate the use of an unmanned aerial vehicle (UAV) for monitoring IRG growth. UAV imagery was obtained from mid-March to late May in Nonsan, Chungcheongnam-do, and was corrected geometrically and atmospherically to calculate the normalized difference vegetation index (NDVI). We analyzed the relationships between $NDVI_{UAV}$ of IRG and biophysical measurements such as plant height, fresh weight, and dry weight over an entire IRG growth period. $NDVI_{UAV}$ and the growth parameters showed similar trends. Correlation analysis revealed that $NDVI_{UAV}$ was highly correlated with fresh weight (r=0.988), plant height (r=0.925), and dry weight (r=0.853). Given this relationship, the temporal variation of $NDVI_{UAV}$ was significant for interpreting IRG growth. Four regression models, (1) a linear regression function, (2) linear regression through the origin, (3) a power function, and (4) a logistic function, were developed to evaluate the relationship between temporal $NDVI_{UAV}$ and measured IRG growth parameters. Based on the coefficient of determination, the power function predicted growth parameters more accurately than the linear or logistic functions. When $NDVI_{UAV}$ was applied to the power function, the spatial distribution map of IRG growth was in strong agreement with the field measurements in terms of geographical variation and relative numerical values. These results indicate that $NDVI_{UAV}$ can be used as a new tool for monitoring IRG growth.
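The workflow in the abstract above (compute NDVI, fit a power function, judge the fit by the coefficient of determination) can be sketched as follows. NDVI uses its standard definition, (NIR − Red) / (NIR + Red). The power fit y = a · x^b is done by least squares on logarithms, which is an assumed fitting method and requires positive values. All data and function names are hypothetical.

```python
import math

def ndvi(nir, red):
    """Normalized difference vegetation index from NIR and red reflectance."""
    total = nir + red
    if total == 0:
        raise ValueError("NIR + red reflectance must be non-zero")
    return (nir - red) / total

def fit_power(x, y):
    """Fit y = a * x**b by least squares on log(y) versus log(x)."""
    log_x = [math.log(v) for v in x]
    log_y = [math.log(v) for v in y]
    n = len(log_x)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    b = (sum((u - mean_x) * (v - mean_y) for u, v in zip(log_x, log_y))
         / sum((u - mean_x) ** 2 for u in log_x))
    a = math.exp(mean_y - b * mean_x)
    return a, b

def r_squared(observed, predicted):
    """Coefficient of determination of predictions against observations."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1 - ss_res / ss_tot

# Hypothetical NDVI values and fresh weights generated from a = 3.0, b = 2.0.
ndvi_values = [0.2, 0.4, 0.6, 0.8]
fresh_weight = [3.0 * v ** 2 for v in ndvi_values]

a, b = fit_power(ndvi_values, fresh_weight)
predicted = [a * v ** b for v in ndvi_values]
fit_quality = r_squared(fresh_weight, predicted)
```

Applying the fitted function to every pixel of an NDVI raster would give a growth map of the kind the abstract mentions; the raster handling itself is outside this sketch.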

Software Development Effort Estimation Using Function Point (기능점수를 이용한 소프트웨어 개발노력 추정)

  • Lee, Sang-Un;Gang, Jeong-Ho;Park, Jung-Yang
    • The KIPS Transactions:PartD / v.9D no.4 / pp.603-612 / 2002
  • Software measurement has been an active area of software engineering for more than thirty years. There is a large body of research, but still no concrete model for estimating software development effort and cost. To measure the effort and cost of a software project, we need to estimate the size of the software. A number of software metrics have been identified in the literature; the most frequently cited measures are LOC (lines of code) and FPA (function point analysis). The FPA approach has features that overcome the major problems of using LOC as a measure of system size. This paper presents a simple linear regression model that relates software development effort to software size measured in FP. The model is derived from a plot of the relationship between effort and FP. The experimental data were collected from 789 software development projects recently developed under various development environments and development methods. The model is also compared with other regression analysis models. The presented model has the best estimation ability among the software effort estimation models compared.

Development of Empirical Formulas for Storage Function Method (저류함수법의 매개변수 산정식 개발)

  • Choi, Jong-Nam;Ahn, Won-Shik;Kim, Tae-Gyun;Chung, Gun-Hui
    • Journal of the Korean Society of Hazard Mitigation / v.9 no.5 / pp.125-130 / 2009
  • The storage function method, which considers the non-linearity of the relationship between rainfall and runoff, has frequently been used to predict runoff and flood patterns in a basin. However, estimating appropriate parameters for every basin and rainfall event is time-consuming, which calls for an empirical parameter equation applicable in Korea. In this study, multiple regression analysis is used to develop empirical equations that estimate the parameters of the storage function method from basin characteristics. As a result of the regression analysis, the basin area, maximum stream length, and stream slope are considered as the basin characteristics. Collinearity is removed, and a trial-and-error method is used to choose the parameters that best describe the dependent variables in the Han River basin, which is divided into 30 subbasins. The developed equations are validated using rainfall events at the MunMak gauging station and are named the 'Han River equation'. The equations could provide useful information about storage function method parameters for calculating runoff from a basin and predicting river stage.

An Analysis on the First Flush Phenomenon by Stormwater Runoff in Eutrophic Lake Watershed (부영양상태 호수유역의 강우유출수에 의한 초기세척효과 분석)

  • Cho, Jae-Heon;Seo, Hyung-Jun
    • Journal of Environmental Impact Assessment / v.16 no.5 / pp.341-350 / 2007
  • Lake Youngrang is a lagoon whose effluent flows into the East Sea. Because two resort towns and two golf courses are situated in the lake basin, many tourists visit this area. Stormwater runoff surveys were carried out for eight storm events from 2004 to 2005 in the eutrophic lake watershed to provide basic data for diffuse pollution control of the lake. Dimensionless mass-volume curves, which show the distribution of pollutant mass against runoff volume, were used to analyze the first flush phenomenon. The mass-volume curves were fitted with power function and polynomial equation curves. The regression analysis showed that the polynomial equation curves represented the tendency of the first flush better than the power function, and that second-degree polynomial equation curves indicated the strength of the first flush effectively.
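A dimensionless mass-volume curve of the kind used above plots the cumulative fraction of pollutant mass against the cumulative fraction of runoff volume. The sketch below builds such a curve from flow and concentration samples, assuming equal time steps, and fits the power form M = V^b by least squares through the origin on logarithms. Reading b < 1 as a sign of first flush is a common convention assumed here, not a criterion taken from the abstract. The data and function names are hypothetical.

```python
import math

def mass_volume_curve(flows, concentrations):
    """Cumulative volume and mass fractions, assuming equal time steps."""
    loads = [q * c for q, c in zip(flows, concentrations)]
    total_volume = sum(flows)
    total_mass = sum(loads)
    volume_fraction, mass_fraction = [], []
    cum_volume = cum_mass = 0.0
    for q, load in zip(flows, loads):
        cum_volume += q
        cum_mass += load
        volume_fraction.append(cum_volume / total_volume)
        mass_fraction.append(cum_mass / total_mass)
    return volume_fraction, mass_fraction

def fit_power_exponent(volume_fraction, mass_fraction):
    """Least-squares b in M = V**b, from log(M) = b * log(V) through the origin."""
    pairs = [(math.log(v), math.log(m))
             for v, m in zip(volume_fraction, mass_fraction)
             if 0 < v < 1 and 0 < m < 1]  # the end point (1, 1) carries no information
    return sum(lv * lm for lv, lm in pairs) / sum(lv * lv for lv, _ in pairs)

# Hypothetical storm: constant flow, concentration highest at the start.
flows = [1.0, 1.0, 1.0, 1.0]
concentrations = [4.0, 2.0, 1.0, 1.0]

volume_fraction, mass_fraction = mass_volume_curve(flows, concentrations)
b = fit_power_exponent(volume_fraction, mass_fraction)  # below 1 for this storm
```

In this example half of the pollutant mass arrives with the first quarter of the runoff volume, so the curve lies above the diagonal and the fitted exponent is below 1.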

A Generalized M-Estimator in Linear Regression

  • Song, Moon-Sup;Park, Chang-Soon;Nam, Ho-Soo
    • Communications for Statistical Applications and Methods / v.1 no.1 / pp.27-32 / 1994
  • We propose a robust regression estimator which has both a high breakdown point and a bounded influence function. The main contribution of this article is to present a weight function in the generalized M (GM)-estimator. Weighting schemes which control leverage points only, without considering residuals, cannot be efficient, since these schemes inevitably downweight some good leverage points. In this paper we propose a weight function which depends on both design points and residuals, so as not to downweight good leverage points. Some motivating illustrations are also given.
