• Title/Summary/Keyword: 잔차도 (residual plot)

Outlier Detection Using Dynamic Plots (동적 그림을 이용한 이상치 검색)

  • Ahn, Byung-Jin;Seo, Han-Son
    • The Korean Journal of Applied Statistics / v.24 no.5 / pp.979-986 / 2011
  • Linear regression is commonly used to analyze data because of its simplicity and applicability; however, data often contain outliers and influential cases that can distort a statistical analysis. Detecting and examining outliers and influential cases is therefore an important part of data analysis. When multiple outliers are present, masking effects usually occur and make it difficult to identify the true outliers. We propose dynamic plots as a method that is resistant to the masking effect. The procedure helps to find an appropriate basic set from which a dependent outlier detection method can start and then detect the true set of outliers. Examples are given to demonstrate the effectiveness of the suggested idea.

Application of trend surface analysis(TSA) to a precipitation modification study over urban areas in the southern United States of America (미국 남부지역의 도시화로 인한 강수변화 연구에 대한 경향면 분석의 적용)

  • Choi, Young Eun;Henderson, Keith G.
    • Journal of the Korean Geographical Society / v.30 no.4 / pp.333-351 / 1995
  • Trend surface analysis (TSA) was selected to estimate the natural trend in precipitation and to examine urban influences on precipitation over five urban areas (Houston, Dallas, and San Antonio, TX; New Orleans, LA; and Memphis, TN) in the southern United States. TSA was applied to monthly, seasonal, and annual normal precipitation data for the period 1961-1990. Winter and spring show more pronounced trends than summer and fall, and November through March shows more marked trends than April through October, in all study areas except the Houston area. Residual maps for Houston, Dallas, and San Antonio have positive residuals in the city and downwind during summer, indicating that urban enhancement of precipitation in these areas does exist in summer once natural precipitation variation has been removed. Summer residual maps for New Orleans and Memphis show no distinct precipitation increase due to urban effects. The June residual map for New Orleans and the July residual map for Memphis have positive values in the city, but the values are smaller than those for the other cities.
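
As a rough illustration of the idea in this abstract, trend surface analysis can be sketched as a least-squares polynomial fit of a value on map coordinates, with the residuals (observed minus trend) being the quantity that gets mapped. The function name `fit_trend_surface`, the polynomial order, and the synthetic station data below are assumptions for illustration only; none of them are taken from the paper.

```python
import numpy as np

def fit_trend_surface(x, y, z, order=1):
    """Fit a polynomial trend surface z ~ f(x, y) by least squares.

    Returns (coefficients, fitted trend, residuals = z - trend).
    """
    # Design matrix with every term x**i * y**j where i + j <= order
    terms = [(i, j) for i in range(order + 1) for j in range(order + 1 - i)]
    A = np.column_stack([x**i * y**j for i, j in terms])
    coeffs, *_ = np.linalg.lstsq(A, z, rcond=None)
    trend = A @ coeffs
    return coeffs, trend, z - trend

# Synthetic stations (hypothetical data, not from the paper)
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 50)                            # easting
y = rng.uniform(0, 10, 50)                            # northing
z = 100 + 3 * x - 2 * y + rng.normal(0, 0.5, 50)      # regional gradient plus noise
z[0] += 15                                            # local excess at one station

coeffs, trend, resid = fit_trend_surface(x, y, z, order=1)
print("station with largest positive residual:", int(np.argmax(resid)))
```

In this sketch the station given the artificial excess stands out as the largest positive residual once the regional gradient is removed, which is the kind of signal a residual map is meant to expose.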

Modelling Spatial Variation of Housevalue Determinants (주택가격 결정인자의 공간적 다양성 모델링)

  • Kang Youngok
    • Journal of the Korean Geographical Society / v.39 no.6 s.105 / pp.907-921 / 2004
  • Many characteristics, such as dwelling, neighborhood, and accessibility characteristics, affect house value. Much research has sought to identify the value of each characteristic using the hedonic technique. However, that technique is limited in identifying interactions among characteristics and variation in each characteristic's effect across accessibility contexts. This paper implements the Expansion Method research paradigm to model the house value determination process in the city of Seoul. The findings reveal contextual variation in that process. The initial model shows that the estimated house value increases as $F_1$ increases (i.e., more rooms and bathrooms, larger parking space) and/or $F_2$ increases (i.e., more owner-occupied housing units, more apartment housing units) and/or $F_3$ increases (i.e., a higher ratio of households educated to college level or above, location in the 8 school zone, older housing units). However, these relationships drift across their respective contexts. For houses with a negative $F_1$ value, house value does not fluctuate with distance to the city center or subcenters. For houses with a positive $F_1$ value, the closer the house is to the subcenters or to the river, the higher the estimated house value. In areas far from the subcenters, estimated house values do not fluctuate much with the $F_2$ level, whereas in areas close to the subcenters they vary tremendously with the $F_2$ value. The residual analysis reveals that large apartments located in Kangnam, IchongDong, and MokDong are underestimated. This paper contributes to our understanding of the house value determination process by providing an alternative conceptualization to the traditional approach.

Synoptic Analysis of Heavy Rainstorms over Urban Areas in the Southern United States (미국 남부지방 도시호우의 종관적 분석)

  • Youngeun Choi
    • Journal of the Korean Geographical Society / v.33 no.3 / pp.395-409 / 1998
  • The purpose of this paper is to determine the atmospheric conditions in which urban areas affect precipitation processes and to evaluate whether certain weather types show a more apparent urban effect on precipitation modification over five cities in the southern United States. Each heavy rainstorm is classified into one of three synoptic weather types (frontal storm, airmass storm, or tropical disturbance storm). A heavy rainstorm day is defined as a day producing rainfall totals that equal or exceed 2 inches (50.8 mm). Houston, Dallas, and San Antonio show possible urban effects on the rainfall totals and frequencies of heavy rainstorms of the airmass storm type, while New Orleans and Memphis do not reveal any distinct precipitation enhancement in the synoptic analysis. The results of TSA (Trend Surface Analysis) show that frontal and tropical disturbance storm types have stronger climatic gradients than airmass types, and that the patterns of rainfall totals have stronger trends than those of rainfall frequencies for the five cities. The results suggest that airmass type events may well reveal possible precipitation enhancement due to urban effects, since they are less influenced by a strong climatic gradient and provide favorable conditions for the development of urban heat islands. Residual analysis confirms that the rainfall totals and frequencies of heavy rainstorms of the airmass storm type have positive residuals over the city or the major effect area.

Development of Weight Estimation Equation and Weight Table in Pinus densiflora Stand (Kangwon and Central Districts) (소나무(강원지방·중부지방) 중량추정식 및 중량표 개발)

  • Jintaek, Kang;Jongsu, Yim;Chiwung, Go;Sangmin, Sung;Yeongmo, Son
    • Journal of Korean Society of Forest Science / v.111 no.4 / pp.630-643 / 2022
  • This study was conducted to derive fresh weight and dry weight estimation formulas for Pinus densiflora and to prepare a weight table from them. A one-variable formula using only the diameter at breast height (DBH) and a two-variable formula using DBH and height were used to calculate the fresh and dry weights. Each equation was verified using statistics such as the fit index, standard error, and residuals. The optimal equation was evaluated for applicability by calculating weights with the coefficients derived from the statistical verification. $W = bD + cD^2$ was selected as the one-variable equation and $W = aD^bH^c$ as the two-variable equation. The fit index of the former was 0.87-0.92 and that of the latter was 0.94-0.98, both indicating a good fit. A new weight table was prepared using the optimal estimation formula and compared with a previous weight table. The comparison showed that Gangwon pine had higher values in the previous weight table, while pines in the central region had higher values in the newly created weight table.
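
To make the two-variable form concrete, the following is a minimal sketch of fitting $W = aD^bH^c$ and checking it with a fit index and residuals. Two things here are assumptions rather than details from the abstract: the fit is done by ordinary least squares on the log-transformed equation (the paper may have used nonlinear regression instead), and the fit index is taken as $FI = 1 - SSE/SST$. The data are synthetic and the helper names are hypothetical.

```python
import numpy as np

def fit_two_variable(D, H, W):
    """Fit W = a * D**b * H**c by least squares on ln W = ln a + b ln D + c ln H."""
    X = np.column_stack([np.ones_like(D), np.log(D), np.log(H)])
    beta, *_ = np.linalg.lstsq(X, np.log(W), rcond=None)
    return np.exp(beta[0]), beta[1], beta[2]

def fit_index(W, W_hat):
    """Assumed definition: FI = 1 - SSE / SST on the original weight scale."""
    return 1.0 - np.sum((W - W_hat) ** 2) / np.sum((W - W.mean()) ** 2)

# Synthetic sample trees (hypothetical values, not the paper's data)
rng = np.random.default_rng(1)
D = rng.uniform(10, 40, 60)          # DBH in cm
H = rng.uniform(8, 25, 60)           # height in m
W = 0.05 * D**1.9 * H**0.9 * np.exp(rng.normal(0, 0.05, 60))

a, b, c = fit_two_variable(D, H, W)
W_hat = a * D**b * H**c
residuals = W - W_hat                # the values a residual plot would show against W_hat
fi = fit_index(W, W_hat)
print(f"a={a:.4f}, b={b:.3f}, c={c:.3f}, FI={fi:.3f}")
```

Plotting `residuals` against `W_hat` gives the residual plot used to judge whether the errors scatter evenly around zero.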

An analysis of depression of the individuals with disabilities using repeated measurement data (반복 측정 자료를 이용한 장애인 우울에 대한 분석)

  • Hong, Haesun;Huh, Jib
    • Journal of the Korean Data and Information Science Society / v.28 no.5 / pp.1055-1067 / 2017
  • Most previous studies of depression among people with disabilities in Korea have analyzed each individual's repeated measurements under the assumption that they are mutually independent. In this study, Korea Welfare Panel data on people with disabilities, collected in an additional survey every three years, are analyzed with linear mixed models to detect significant explanatory variables. A suitable correlation matrix is considered to account for the dependency among each individual's repeated measurements. The fitted linear mixed model includes a random effect reflecting individual characteristics as well as the fixed effects. The residual plot of the fixed-effect model illustrates the problem that the average residual for each individual does not appear to be near zero. The residual plot and Q-Q plot from the selected final model show that this problem is largely resolved.

Spatial Dependency and Heterogeneity of Adult Diseases : In the Cases of Obesity, Diabetes and High Blood Pressure in the U.S.A. (성인병의 공간적 의존성과 이질성 : 미국의 비만, 당뇨, 고혈압을 사례로)

  • Yang, Byung-Yun;Hwang, Chul-Sue
    • Journal of the Korean association of regional geographers / v.16 no.5 / pp.610-622 / 2010
  • The proportion of overweight and obese individuals in the United States has been increasing continuously. Many obesity studies have concentrated on jurisdictional levels of aggregation, making it very difficult to clearly illustrate at-risk regions. In other words, little research has examined spatial patterns while accounting for spatial dependency and heterogeneity through spatial autocorrelation models. In response, this research analyzes spatial patterns between overweight/obesity and risk factors such as high blood pressure and diabetes. Specifically, Moran's I and Geary's C will be computed as global and local measures. In addition, Ordinary Least Squares (OLS) linear regression and Geographically Weighted Regression will be applied to identify spatial dependency and spatial heterogeneity. Data provided by the Behavioral Risk Factor Surveillance System (BRFSS) give Body-Mass Index (BMI) rates in four categories: underweight, healthy, overweight, and obese. High blood pressure and diabetes rates in the United States will be used as independent variables. We are confident that this research will help decision makers plan obesity prevention.
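
For readers unfamiliar with the global Moran's I mentioned in this abstract, the statistic can be sketched directly from its standard formula, $I = \frac{n}{S_0}\,\frac{\sum_i \sum_j w_{ij} z_i z_j}{\sum_i z_i^2}$, where $z_i$ are deviations from the mean and $S_0$ is the sum of all weights. The four-region layout and binary contiguity weights below are a hypothetical toy example, not the study's data or weighting scheme.

```python
import numpy as np

def morans_i(values, weights):
    """Global Moran's I for values of shape (n,) and spatial weights of shape (n, n)."""
    z = values - values.mean()       # deviations from the mean
    s0 = weights.sum()               # sum of all spatial weights
    n = len(values)
    return (n / s0) * (z @ weights @ z) / (z @ z)

# Four regions in a row; neighbours share a border (hypothetical layout)
W = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)

clustered = np.array([1.0, 1.0, 5.0, 5.0])     # similar values sit next to each other
alternating = np.array([1.0, 5.0, 1.0, 5.0])   # neighbours always differ

print(morans_i(clustered, W))      # about 0.333: positive spatial autocorrelation
print(morans_i(alternating, W))    # -1.0: negative spatial autocorrelation
```

Positive values indicate that similar rates cluster in space, and negative values indicate that neighbouring regions tend to differ.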

Evaluating the Accuracy of Spatial Interpolators for Estimating Land Price (지가 추정을 위한 공간내삽법의 정확성 평가)

  • JUN, Byong-Woon
    • Journal of the Korean Association of Geographic Information Studies / v.20 no.3 / pp.125-140 / 2017
  • Until recently, regression-based and Kriging-based spatial interpolation methods have been widely used to estimate land or housing prices, but less attention has been paid to comparing the performance of these methods. This research therefore applied regression-based and Kriging-based spatial interpolators to estimate land prices in Dalseo-gu, Daegu metropolitan city, and evaluated the accuracy of eight spatial interpolators. OLS, SLM, SEM, and GWR were used as regression-based spatial interpolators, while SK, OK, UK, and CK were employed as Kriging-based spatial interpolators. Global accuracy was evaluated statistically by RMSE, adjusted RMSE, and COD. Relative accuracy was compared visually using a three-dimensional residual error map and a scatterplot. The statistical and visual analyses indicate that GWR, which reflects spatial non-stationarity, was a relatively more accurate predictor of land prices in the study area than the SAR and Kriging-based spatial interpolators, which consider spatial dependence. The findings will contribute to follow-up research analyzing urban spatial structure with land prices.

Growth Curve Estimation of Stand Volume by Major Species and Forest Type on Actual Forest in Korea (주요 수종 및 임상별 현실림의 재적생장량 곡선 추정)

  • Yoon, Jun-Hyuck;Bae, Eun-Ji;Son, Yeong-Mo
    • Journal of Korean Society of Forest Science / v.110 no.4 / pp.648-657 / 2021
  • This study was conducted to estimate volume growth by forest type and major species using the national forest resource inventory, and to predict the final age of maturity by deriving the mean annual increment (MAI) and the current annual increment (CAI). We estimated volume growth using the Chapman-Richards model. Among the volume estimation equations by forest type, coniferous forests exhibited the highest growth. According to the estimation formulas for the major species, Larix kaempferi shows the highest growth among coniferous species and Quercus mongolica among broad-leaved species. The fitness index of these formulas was generally low, for example 0.32 for L. kaempferi and 0.21 for Quercus variabilis. In the residual analysis, which indicates the applicability of the volume estimation formulas, the formulas tended to underestimate at about 30 years or more, but most residuals were evenly distributed around zero. These estimation formulas therefore pose no difficulty for estimating the volume of actual forest species in Korea. Among coniferous species, the age of maximum MAI was 34 years for P. densiflora, 35 years for L. kaempferi, and 31 years for P. rigida. Among broad-leaved species, it was 32 years for Q. variabilis, 30 years for Q. acutissima, and 29 years for Q. mongolica. We calculated MAI and CAI to detect the point at which the two curves intersect. This point was defined as the maximum volume harvesting age. The results revealed no significant difference from the current standard cutting age in public and private forests recommended by the Korea Forest Service, supporting the reliability of forestry policy data.
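
The link between the MAI maximum and the MAI/CAI intersection described in this abstract can be made explicit with a short derivation. It assumes a common three-parameter form of the Chapman-Richards model and treats CAI as the continuous derivative of volume; the symbols $a$, $b$, $c$ and this parameterization are assumptions, since the abstract does not state them.

```latex
% Assumed Chapman-Richards form for stand volume V at age t
V(t) = a\left(1 - e^{-bt}\right)^{c}

% Mean and current annual increment (CAI taken as the continuous derivative)
\mathrm{MAI}(t) = \frac{V(t)}{t}, \qquad
\mathrm{CAI}(t) = \frac{dV}{dt} = abc\, e^{-bt}\left(1 - e^{-bt}\right)^{c-1}

% Differentiating MAI with the quotient rule
\frac{d\,\mathrm{MAI}}{dt} = \frac{t\,V'(t) - V(t)}{t^{2}}
                           = \frac{\mathrm{CAI}(t) - \mathrm{MAI}(t)}{t}

% Hence MAI is stationary exactly where the two curves cross
\frac{d\,\mathrm{MAI}}{dt} = 0 \iff \mathrm{CAI}(t) = \mathrm{MAI}(t)
```

Because MAI rises while CAI exceeds it and falls once CAI drops below it, the crossing point is the age at which MAI peaks.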

Development of Weight Estimation Equations and Weight Tables for Larix kaempferi and Pinus rigida Stand (일본잎갈나무와 리기다소나무의 중량추정식 및 중량표 개발)

  • Jintaek Kang;Chiung Ko;Jeongmuk Park;Jongsu Yim;Sun-Jeong Lee;Myoungsoo Won
    • Journal of Korean Society of Forest Science / v.112 no.4 / pp.472-489 / 2023
  • This study was conducted to derive optimal equations for estimating the green and dry weights of Larix kaempferi (Japanese larch) and Pinus rigida (Rigida pine), which are major coniferous tree species in South Korea. The equations were then used to develop weight tables. Table development began with the sampling of 150 L. kaempferi and 90 P. rigida trees distributed nationwide, after which green weights were measured on-site. Samples from each stand were then collected, and their dry weights were measured in a laboratory. Two forms of equation were used to calculate green and dry weights: a one-variable equation that uses only the diameter at breast height (DBH) and a two-variable equation that uses DBH and height. The equations used to estimate the green and dry weights of logs were likewise divided into one- and two-variable equations using DBH. Statistics such as the fitness index (FI), root mean square error, standard error of estimation, and residual plot were used to verify the suitability of the estimation equations. Applicability was examined by calculating weights using the derived optimal equations. The equation $W = bD + cD^2$ was used in cases involving only DBH, whereas the equation $W = aD^bH^c$ was used in cases involving both DBH and height. The FI of $W = bD + cD^2$ was 0.91 and that of $W = aD^bH^c$ was 0.95, both of which are high values. With these estimation formulas, weight tables for the green and dry weights of L. kaempferi and P. rigida were prepared and compared with weight tables created 20 years ago. For both species, the green and dry weights in the new tables were larger.