• Title/Summary/Keyword: multiple regression analysis

A study on bias effect of LASSO regression for model selection criteria (모형 선택 기준들에 대한 LASSO 회귀 모형 편의의 영향 연구)

  • Yu, Donghyeon
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.4
    • /
    • pp.643-656
    • /
    • 2016
  • High dimensional data, in which the number of variables is greater than the number of samples, are frequently encountered in various fields. It is usually necessary to select variables in order to estimate regression coefficients and avoid overfitting in high dimensional data. A penalized regression model performs variable selection and coefficient estimation simultaneously, which makes such models frequently used for high dimensional data. However, the penalized regression model also needs to select the optimal model by choosing a tuning parameter based on a model selection criterion. This study deals with the bias effect of LASSO regression on model selection criteria. We numerically describe the bias effect on the model selection criteria and apply the proposed correction to the identification of biomarkers for lung cancer based on gene expression data.
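The shrinkage bias mentioned in this abstract can be illustrated with a minimal sketch. It assumes the textbook special case of an orthonormal design (X^T X = I) and the objective (1/2)||y - Xb||^2 + lam * ||b||_1, where each LASSO coefficient is the soft-thresholded OLS coefficient. The coefficient values and `lam` below are hypothetical, and this is not the correction proposed in the paper.

```python
def soft_threshold(z: float, lam: float) -> float:
    """LASSO coefficient for one predictor under an orthonormal design."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


# Hypothetical OLS coefficients and tuning parameter, chosen only for illustration.
ols_coefficients = [3.0, -1.5, 0.4]
lam = 1.0

lasso_coefficients = [soft_threshold(b, lam) for b in ols_coefficients]

for ols, lasso in zip(ols_coefficients, lasso_coefficients):
    # Every retained coefficient is pulled toward zero by lam; small ones are dropped.
    print(f"OLS {ols:+.2f} -> LASSO {lasso:+.2f} (shift {lasso - ols:+.2f})")
```

Because retained coefficients are shifted toward zero by `lam`, the fitted residuals are larger than those of an unpenalized refit on the same variables, which is one way a criterion computed from the LASSO fit can be affected.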

Early Prediction of Carcass Yield Grade by Ultrasound in Hanwoo (초음파를 이용한 한우 육량등급의 조기예측)

  • Rhee, Y. J.;Seok, H. K.;Kim, S. J.;Song, Y. H.
    • Journal of Animal Science and Technology
    • /
    • v.45 no.2
    • /
    • pp.327-334
    • /
    • 2003
  • This study was carried out to make early predictions of carcass yield grade. Sixty-six Hanwoo steers were measured by ultrasound for back fat thickness, longissimus muscle area and body weight at 18, 21 and 24 months of age. Carcass evaluation was done after the ultrasound measurement at 24 months of age. Ultrasonic yield grades at 18, 21 and 24 months of age were predicted by regression and decision tree methods. When classified by carcass yield grade, ultrasonic back fat thickness at 18, 21 and 24 months of age differed significantly among carcass yield grades (p<0.05). The prediction accuracy of carcass yield grade by the regression method was 78.8% at 18 months, 86.4% at 21 months and 90.9% at 24 months of age. With the decision tree method, prediction accuracies of 78.8%, 89.4% and 89.4% were obtained at 18, 21 and 24 months of age, respectively.

NAVER Data Lab data-based Assessment of National Awareness Vulnerability of Past Floods over the Korean Peninsula (2011-2018) (NAVER DATA LAB 데이터 기반 과거 한반도 홍수에 대한 대중 인지도 취약성 평가 (2011-2018))

  • Eun Mi Lee;Young Uk Yu;Young hun Jeong;Jong Hun Kam
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.59-59
    • /
    • 2023
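  • Heavy rainfall and floods driven by climate change cause river overflow and inland inundation. Typhoon Hinnamnor in September 2022 caused severe damage, including 10 deaths in Pohang and 1.7 trillion won in property damage. This study compared flood-period rainfall, damage costs, and the population of flooded areas, by administrative district at the si/gun/gu level from 2011 to 2018, with NAVER DATA LAB search-volume data for the term 'flood' (data available from 2016). Regions where flood search volume was low during periods of heavy rainfall or high damage costs were defined as regions with vulnerable public awareness of floods. A correlation analysis between 'flood' search volume and rainfall, damage costs, and the population of flooded areas showed high correlation coefficients of 0.86 and 0.81 for rainfall and population, respectively, whereas damage costs showed a relatively low correlation of 0.52. In the 2016-2018 analysis at the special/metropolitan city level, out of 17 flood events in total, Incheon Metropolitan City and Sejong Special City ranked 2nd and 3rd in damage costs but only 6th and 11th in flood awareness, respectively, and were assessed as regions with vulnerable flood awareness. In the province-level assessment, out of 34 flood events in total, Gangwon-do and Gyeongsangbuk-do were assessed as regions with vulnerable flood awareness, ranking 27th in flood awareness when damage costs ranked 3rd and rainfall ranked 10th. Using multiple linear regression, a model was trained on data from 2016 onward and used to reconstruct estimates of 'flood' search volume for the years before 2016. In the 2011-2015 assessment of special/metropolitan cities, out of 25 flood events in total, Busan Metropolitan City ranked 1st in damage costs and 2nd in rainfall but 6th in flood awareness, and was assessed as a region with vulnerable flood awareness. In the province-level assessment, out of 50 flood events in total, Chungcheongnam-do and Gyeonggi-do ranked 7th in flood awareness when damage costs ranked 3rd, and were assessed as regions with vulnerable flood awareness. By analyzing big data from physical and social systems, this study presents a new view of social vulnerability to floods through a socio-hydrological approach and emphasizes the need for convergence research between the social sciences and water resources.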

Sentiment Analysis for Public Opinion in the Social Network Service (SNS 기반 여론 감성 분석)

  • HA, Sang Hyun;ROH, Tae Hyup
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.1
    • /
    • pp.111-120
    • /
    • 2020
  • As an application of big data and artificial intelligence techniques, this study proposes an atypical language-based sentimental opinion poll methodology, unlike conventional opinion poll methodology. An alternative method for the sentimental classification model based on existing statistical analysis was to collect real-time Twitter data related to parliamentary elections and perform empirical analyses on the Polarity and Intensity of public opinion using attribute-based sensitivity analysis. In order to classify the polarity of words used on individual SNS, the polarity of the new Twitter data was estimated using the learned Lasso and Ridge regression models while extracting independent variables that greatly affect the polarity variables. A social network analysis of the relationships of people with friends on SNS suggested a way to identify peer group sensitivity. Based on what voters expressed on social media, political opinion sensitivity analysis was used to predict party approval rating and measure the accuracy of the predictive model polarity analysis, confirming the applicability of the sensitivity analysis methodology in the political field.

Wheel Load Distribution Factor for Girder Moment and Shear Force of Skew Plate Girder Bridges (판형사교 거더의 휨모멘트와 전단력에 대한 하중분배계수)

  • Seo, Chang-Bum;Song, Jae-Ho
    • Journal of the Korean Society of Hazard Mitigation
    • /
    • v.5 no.1 s.16
    • /
    • pp.33-43
    • /
    • 2005
  • The girder wheel load distribution factors stated in the Korean Bridge Specification and AASHTO Standard Specifications do not account for the effect of skewness of plate girders, and very little research has been conducted on girder wheel load distribution factors. The purpose of the study is to propose load distribution factor formulas for skew plate girder bridges which comprise various parameters through structural analysis. To confirm the validity of the finite element models used in this study, analytic values are compared with the field test results. The results show that span length is not as dominant a parameter as the others. For better load distribution among interior girders, skew-arranged cross beams or bracing are preferable; furthermore, a bracing system is more effective than a cross beam system. By means of regression analysis of the analytic results, wheel load distribution factor formulas are proposed and compared with current codes.

Girder Wheel Load Distribution Factor of Skew Plate Girder Bridges (강판형 사교의 거더분배계수에 관한 연구)

  • Seo, Chang-Bum;Song, Jae-Ho
    • Journal of the Korea institute for structural maintenance and inspection
    • /
    • v.9 no.1
    • /
    • pp.293-303
    • /
    • 2005
  • The girder wheel load distribution factors stated in the Korean Bridge Specification and AASHTO Standard Specifications do not account for the effect of skewness of plate girders, and very little research has been conducted on girder wheel load distribution factors. The purpose of the study is to propose load distribution factor formulas for skew plate girder bridges which comprise various parameters through structural analysis. To confirm the validity of the finite element models used in this study, analytic values are compared with the field test results. The results show that span length is not as dominant a parameter as the others. For better load distribution among interior girders, skew-arranged cross beams or bracing are preferable; furthermore, a bracing system is more effective than a cross beam system. By means of regression analysis of the analytic results, wheel load distribution factor formulas are proposed and compared with current codes.

A Study on the Influence of Consumers Functional Recognition on Their Switching Behaviors, using Food Providers' Web Sites (외식기업 온라인 웹사이트를 이용하는 소비자들의 기능별 지각 수준이 전환 행동에 미치는 영향)

  • Choi, Eun-Joo
    • Culinary science and hospitality research
    • /
    • v.16 no.2
    • /
    • pp.31-48
    • /
    • 2010
  • The purpose of this study was to examine the influence of food service web site users' extent of functional recognition on switching behaviors. For this, a survey of web site users was carried out. As for analytic methods, frequency analysis was used to examine respondents' demographic features. In addition, simple regression analysis and multiple regression analysis were used to look into the influence of functional recognition of food providers' web sites on switching behaviors. Study findings are as follows: all the functional variables, namely the entertainment, advertisement & public relations, communication and purchase decision-making functions, have a significant influence on users' switching behaviors. When users' extent of recognition of food providers' online web sites is high, their switching behavior is also high. In particular, the following items have the greatest influence on users' switching behavior patterns: in the entertainment function, (1) it is easy to search on the web site; in the advertisement function, (2) the image of the restaurant can easily be recognized; in the communication function, (3) the image of new products can be seen with ease; and in the purchase decision-making function, (4) web sites are easily accessible.

Regression models on flood damage records by rainfall characteristics for regional flood damage estimates (지역별 홍수피해추정을 위한 강우특성에 대한 홍수피해자료의 회귀모형)

  • Lim, Yeon Taek;Choi, Hyun Il
    • Journal of Wetlands Research
    • /
    • v.22 no.4
    • /
    • pp.302-311
    • /
    • 2020
  • Structural strategies alone are limited in coping with flood damage because both the frequency and intensity of floods are increasing due to climate change. Therefore, collecting and analyzing historical flood damage records for future flood damage assessments is a necessary element of nonstructural countermeasures. In order to estimate flood damage costs in Gyeongsangbuk-do, where severe flood damage occurs frequently due to geographical and climatic effects, this paper performs regression analysis of flood damage records over the past 20 years (1999-2018) on rainfall characteristics, which are one of the major causes of flood damage. This paper then examines the relationship between terrain features and rainfall characteristics in the regional regression functions, and also estimates the flood damage risk for 100-year rainfall by using the regional regression functions presented for the 22 administrative districts in Gyeongsangbuk-do excluding Ulleung-gun. The flood damage assessment shows that relatively high damage risk is estimated for county areas adjacent to the eastern coast of Gyeongsangbuk-do. The regional damage estimate functions in this paper are expected to be used as a nonstructural countermeasure to estimate flood damage risk from design or forecast rainfall data.

Wheel Load Distribution of Simply Supported Reinforced Concrete Slab Bridge (철근콘크리트 단순 슬래브 교량의 윤하중분포폭에 관한 연구)

  • 오병환;신호상;한승환
    • Magazine of the Korea Concrete Institute
    • /
    • v.10 no.3
    • /
    • pp.125-134
    • /
    • 1998
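  • A recent series of failure tests on reinforced concrete slab bridges shows that, even when a bridge has deteriorated, its load-carrying capacity is greater than the design load. Among the various causes of this high load-carrying capacity, this study investigates the load distribution behavior of slab bridges, which is expected to have the greatest influence. The main variables affecting the wheel load distribution width of reinforced concrete slab bridges are span length, bridge width, edge beams, load type, and support conditions. According to the results of this study, the current wheel load distribution width is underestimated or overestimated depending on span length and bridge width. Wheel load distribution widths of reinforced concrete slab bridges were derived through comprehensive finite element analysis of each of these variables, and prediction and design formulas for the wheel load distribution width of slab bridges were proposed through nonlinear regression analysis of the results. The wheel load distribution width formula proposed in this study is expected to be very useful for more accurate design and rational evaluation of the load-carrying capacity of reinforced concrete slab bridges.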

An Investigation on the Heat Efficiency of Hot Air Heater (온풍난방기 열효율조사 연구)

  • Kim, Yeong-Jung;Yu, Yeong-Seon;Gang, Geum-Chun;Baek, Lee;Yun, Jin-Ha
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • 2002.02a
    • /
    • pp.133-138
    • /
    • 2002
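  • For 20 hot air heaters, the carbon dioxide concentration in the exhaust gas, the hot-air temperature difference, the exhaust gas temperature, and the heat efficiency were surveyed and analyzed by years of use. The main results are as follows. (a) Because too few units were sampled for each length of use, it is difficult to say that the carbon dioxide concentration in the exhaust gas and the hot-air temperature difference differed greatly by years of use; more units would be needed for an accurate survey. (b) The exhaust gas temperature was judged to rise with years of use. This is considered to result from insufficient heat exchange between the intake air and the combustion heat in the heat exchanger, or from aging of the burner nozzle, the burner blower, and the heater blower. The exhaust gas temperature as a function of years of use could be expressed by the regression equation $Y = 79.032\ln(X) + 116.66$ ($R^2 = 0.6784$). (c) The heat efficiency of the hot air heaters tended to decrease with years of use and could be expressed by the regression equation $Y = 95.167X^{-0.054}$ ($R^2 = 0.5696$).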
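The two fitted curves reported in this abstract can be evaluated with a minimal sketch. The function names are hypothetical, X is taken to be years of use as in the abstract, and the units of Y are not stated in the abstract, so none are assumed here.

```python
import math


def exhaust_gas_temperature(years_in_use: float) -> float:
    """Fitted curve from the abstract: Y = 79.032 ln(X) + 116.66 (R^2 = 0.6784)."""
    return 79.032 * math.log(years_in_use) + 116.66


def heat_efficiency(years_in_use: float) -> float:
    """Fitted curve from the abstract: Y = 95.167 X^(-0.054) (R^2 = 0.5696)."""
    return 95.167 * years_in_use ** -0.054


# Illustrative ages only; the abstract does not list the surveyed years of use.
for years in (1, 5, 10):
    temperature = exhaust_gas_temperature(years)
    efficiency = heat_efficiency(years)
    print(f"{years:>2} years: exhaust gas {temperature:.1f}, efficiency {efficiency:.1f}")
```

Both curves are only defined for X > 0, and with R^2 values of about 0.68 and 0.57 they describe a trend across the 20 surveyed heaters and are not precise predictions for a single unit.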
