• Title/Summary/Keyword: 음이항 회귀 분석

Search Result 91, Processing Time 0.022 seconds

Estimation of the Effects of Daily Walking Hours and Days on the Mental Health of Urban Residents - The Case in Seoul - (주거지역 가로환경 및 일상 걷기가 정신 건강에 미치는 영향 - 서울시 대상으로 -)

  • Koo, Bonyu;Baek, Seungjoo;Yoon, Heeyeun
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.52 no.1
    • /
    • pp.87-100
    • /
    • 2024
  • This study aimed to investigate the impact of the quality of the street environment in residential areas on the mental health of urban residents, considering the frequency of street use. Using a zero-inflated negative binomial regression model, the study analyzed the influence of walking frequency and the street environment on depressive symptoms of urban residents. The research focused on Seoul, South Korea, in 2017, with depressive symptoms as the dependent variable and street environment variables, walking variables, and individual characteristics as independent variables. Additionally, the study explores the interaction effect of street greenery and walking frequency to analyze the synergistic impacts of walking in green spaces on mental health. The findings indicate that a higher ratio of street green areas is associated with fewer depressive symptoms. Increased walking frequency is linked to a reduction in depressive symptoms or a weaker manifestation of such symptoms. The interaction effect confirms that more frequent walking in green spaces is associated with weaker depressive symptoms. Lower ratios of visual complexity are correlated with reduced depressive symptoms. This study contributes to addressing urban residents' mental health issues at the community level by emphasizing the importance of the street green environment in residential areas.

Analysis of Accident Characteristics and Improvement Strategies of Flash Signal-operated Intersection in Seoul (서울시 점멸신호 운영에 따른 교통사고 분석 및 개선방안에 관한 연구)

  • Kim, Seung-Jun;Park, Byung-Jung;Lee, Jin-Hak;Kim, Ok-Sun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.13 no.6
    • /
    • pp.54-63
    • /
    • 2014
  • Traffic accident frequency and severity level in Korea are known to be very serious. Especially the number of pedestrian fatalities was much worse and 1.6 time higher than the OECD average. According to the National Police Agency, the flash signals are reported to have many safety benefits as well as travel time reduction, which is opposed to the foreign studies. With this background of expanding the flash signal, this research aims to investigate the overall impact of the flash signal operation on safety, investigating and comparing the accident occurrence on the flash signal and the full signal intersections. For doing this accident prediction models for both flash and full signal intersections were estimated using independent variables (geometric features and traffic volume) and 3-year (2011-2013) accident data collected in Seoul. Considering the rare and random nature of accident occurrence and overdispersion (variance > mean) of the data, the negative binomial regression model was applied. As a result, installing wider crosswalk and increasing the number of pedestrian push buttons seemed to increase the safety of the flash signal intersections. In addition, the result showed that the average accident occurrence at the flash signal intersections was higher than at the full signal-operated intersections, 9% higher with everything else the same.

A Study on Developing Crash Prediction Model for Urban Intersections Considering Random Effects (임의효과를 고려한 도심지 교차로 교통사고모형 개발에 관한 연구)

  • Lee, Sang Hyuk;Park, Min Ho;Woo, Yong Han
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.14 no.1
    • /
    • pp.85-93
    • /
    • 2015
  • Previous studies have estimated crash prediction models with the fixed effect model which assumes the fixed value of coefficients without considering characteristics of each intersections. However the fixed effect model would estimate under estimation of the standard error resulted in over estimation of t-value. In order to overcome these shortcomings, the random effect model can be used with considering heterogeneity of AADT, geometric information and unobserved factors. In this study, data collections from 89 intersections in Daejeon and estimates of crash prediction models were conducted using the random and fixed effect negative binomial regression model for comparison and analysis of two models. As a result of model estimates, AADT, speed limits, number of lanes, exclusive right turn pockets and front traffic signal were found to be significant. For comparing statistical significance of two models, the random effect model could be better statistical significance with -1537.802 of log-likelihood at convergence comparing with -1691.327 for the fixed effect model. Also likelihood ration value was computed as 0.279 for the random effect model and 0.207 for the fixed effect model. This mean that the random effect model can be improved for statistical significance of models comparing with the fixed effect model.

The Effects of Sentiment and Readability on Useful Votes for Customer Reviews with Count Type Review Usefulness Index (온라인 리뷰의 감성과 독해 용이성이 리뷰 유용성에 미치는 영향: 가산형 리뷰 유용성 정보 활용)

  • Cruz, Ruth Angelie;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.43-61
    • /
    • 2016
  • Customer reviews help potential customers make purchasing decisions. However, the prevalence of reviews on websites push the customer to sift through them and change the focus from a mere search to identifying which of the available reviews are valuable and useful for the purchasing decision at hand. To identify useful reviews, websites have developed different mechanisms to give customers options when evaluating existing reviews. Websites allow users to rate the usefulness of a customer review as helpful or not. Amazon.com uses a ratio-type helpfulness, while Yelp.com uses a count-type usefulness index. This usefulness index provides helpful reviews to future potential purchasers. This study investigated the effects of sentiment and readability on useful votes for customer reviews. Similar studies on the relationship between sentiment and readability have focused on the ratio-type usefulness index utilized by websites such as Amazon.com. In this study, Yelp.com's count-type usefulness index for restaurant reviews was used to investigate the relationship between sentiment/readability and usefulness votes. Yelp.com's online customer reviews for stores in the beverage and food categories were used for the analysis. In total, 170,294 reviews containing information on a store's reputation and popularity were used. The control variables were the review length, store reputation, and popularity; the independent variables were the sentiment and readability, while the dependent variable was the number of helpful votes. The review rating is the moderating variable for the review sentiment and readability. The length is the number of characters in a review. The popularity is the number of reviews for a store, and the reputation is the general average rating of all reviews for a store. The readability of a review was calculated with the Coleman-Liau index. The sentiment is a positivity score for the review as calculated by SentiWordNet. The review rating is a preference score selected from 1 to 5 (stars) by the review author. The dependent variable (i.e., usefulness votes) used in this study is a count variable. Therefore, the Poisson regression model, which is commonly used to account for the discrete and nonnegative nature of count data, was applied in the analyses. The increase in helpful votes was assumed to follow a Poisson distribution. Because the Poisson model assumes an equal mean and variance and the data were over-dispersed, a negative binomial distribution model that allows for over-dispersion of the count variable was used for the estimation. Zero-inflated negative binomial regression was used to model count variables with excessive zeros and over-dispersed count outcome variables. With this model, the excess zeros were assumed to be generated through a separate process from the count values and therefore should be modeled as independently as possible. The results showed that positive sentiment had a negative effect on gaining useful votes for positive reviews but no significant effect on negative reviews. Poor readability had a negative effect on gaining useful votes and was not moderated by the review star ratings. These findings yield considerable managerial implications. The results are helpful for online websites when analyzing their review guidelines and identifying useful reviews for their business. Based on this study, positive reviews are not necessarily helpful; therefore, restaurants should consider which type of positive review is helpful for their business. Second, this study is beneficial for businesses and website designers in creating review mechanisms to know which type of reviews to highlight on their websites and which type of reviews can be beneficial to the business. Moreover, this study highlights the review systems employed by websites to allow their customers to post rating reviews.

Neighborhood Environment Associated with Physical Activity among Rural Adults: Applying Zero-Inflated Negative Binominal Regression Modeling (영과잉 음이항 회귀모형을 적용한 농촌지역 성인 신체활동의 지역사회환경 요인 분석)

  • Kim, Bongjeong
    • Journal of Korean Public Health Nursing
    • /
    • v.29 no.3
    • /
    • pp.488-502
    • /
    • 2015
  • Purpose: This study was conducted to determine the neighborhood environmental factors associated with physical activity among adults living in rural communities. Methods: A cross-sectional descriptive survey was conducted with a convenience sample of 201 adults living in three Ri in Y-city, Gyeonggi-do. Data were collected from face-to-face interview by trained interviewers and were analyzed using a zero-inflated negative binominal regression model. Results: Participants reported engaged in moderate or vigorous physical activity was 76.1%; 10.5% of participants reported that they met moderate physical activity recommendations and 14.5% of participants reported that they met vigorous physical activity recommendations. Zero-inflated negative binominal regression analysis showed association of increasing days of physical activity with social cohesion (${\beta}=.130$, p=.005), social network (${\beta}=-.096$, p=.003), and safety for crime (${\beta}=-.151$, p=.036), and no days of physical activity was associated with no attainment of education and marginally associated with increasing BMI. Conclusion: Neighborhood environmental factors including social cohesion, social network, and crime for safety were significantly associated with physical activity of rural adults. Community health nurses should expand an approach for individual behavior change to incorporate rural adults' specific neighborhood environmental factors into physical activity interventions.

Analysis of Neighborhood Environmental Factors Affecting Bicycle Accidents and Accidental Severity in Seoul, Korea (서울시 자전거 교통사고와 사고 심각도에 영향을 미치는 근린환경 요인 분석)

  • Hwang, Sun-Geun;Lee, Sugie
    • Journal of Korea Planning Association
    • /
    • v.53 no.7
    • /
    • pp.49-66
    • /
    • 2018
  • The purpose of this study is to analyze neighborhood environmental factors affecting bicycle accidents and accidental severity in Seoul, Korea. The use of bicycles has increased rapidly as daily transportation means in recent years. As a result, bicycle accidents are also steadily increasing. Using Traffic Accident Analysis System (TAAS) data from 2015 to 2017, this study uses negative binomial regression analysis to identify neighborhood environmental factors affecting bicycle accidents and accidential severity. The main results are as follows. First, bicycle accidents are more likely to occur in commercial and mixed land use areas where pedestrians, bicycle and vehicles are moving together. Second, bicycle accidents are positively associated with road structures such as four-way intersection. In contrast, three-way intersection is negatively associated with serious bicycle accidents. The density of speed hump or street tree is negatively associated with bicycle accidents and accidential severity. This finding indicates the effect of speed limit or street trees on bicycle safety. Fourth, bicycle infrastructures are also important factors affecting bicycle accidents and accidential severity. Bicycle-exclusive roads or bicycle-pedestrian mixed roads are positively associated with bicycle accidents and accidential severity. Finally, this study suggests policy implications to improve bicycle safety.

Heat-Wave Data Analysis based on the Zero-Inflated Regression Models (영-과잉 회귀모형을 활용한 폭염자료분석)

  • Kim, Seong Tae;Park, Man Sik
    • Journal of the Korean Data Analysis Society
    • /
    • v.20 no.6
    • /
    • pp.2829-2840
    • /
    • 2018
  • The random variable with an arbitrary value or more is called semi-continuous variable or zero-inflated one in case that its boundary value is more frequently observed than expected. This means the boundary value is likely to be practically observed more than it should be theoretically under certain probability distribution. When the distribution considered is continuous, the variable is defined as semi-continuous and when one of discrete distribution is assumed for the variable, we regard it as zero-inflated. In this study, we introduce the two-part model, which consists of one part for modelling the binary response and the other part for modelling the variable greater than the boundary value. Especially, the zero-inflated regression models are explained by using Poisson distribution and negative binomial distribution. In real data analysis, we employ the zero-inflated regression models to estimate the number of days under extreme heat-wave circumstances during the last 10 years in South Korea. Based on the estimation results, we create prediction maps for the estimated number of days under heat-wave advisory and heat-wave warning by using the universal kriging, which is one of the spatial prediction methods.

Determinants of Inventor Productivity: An Empirical Result from Panel Regressions Using Network Characteristics (발명자 생산성 결정요인: 네트워크 특성을 이용한 패널회귀분석결과)

  • Choo, Kineung
    • Journal of Technology Innovation
    • /
    • v.25 no.3
    • /
    • pp.83-113
    • /
    • 2017
  • This paper constructs panel data of inventors listed on patents applied for the KIPO during 1991-2005 and analyzes the effects of network characteristics on inventor productivity. The findings are as follows: ⅰ) Strong ties within a network have positive effects on inventor productivity. ⅱ) An inventor with high centrality shows high producitivity. ⅲ) Technological diversity of a network enhances inventor productivity. ⅳ) An inventor belonging to a network of good quality shows higher productivity. ⅴ) Network size is positively related with inventor producitvity. ⅵ) A lone inventor shows the highest productivity among types of inventors, and a co-inventor with the experience of standalone invention is more productive compared to an inventor with only the experience of co-invention. ⅶ) The productivity effects of network variables differ across regions. ⅷ) Differences among regions do not decrease though geographical boundaries become less important.

리뷰어 평점 이력이 리뷰 조작에 대한 인식 및 리뷰 유용성에 미치는 영향: 여행플랫폼을 중심으로

  • Jang, Mun-Gyeong;Lee, Sae-Rom;Baek, Hyeon-Mi
    • 한국벤처창업학회:학술대회논문집
    • /
    • 2022.11a
    • /
    • pp.181-185
    • /
    • 2022
  • 고객들은 조작된 온라인 리뷰가 범람하는 가운데 진정성과 가치를 지닌 리뷰를 보고자한다. 귀인 이론(Attribution theory)의 관점에서, 사람들은 리뷰어의 과거 평가 이력을 바탕으로 리뷰가 진정성 있는지를 판단하는 경향이 있다. 이러한 배경에서 본 연구의 목적은 리뷰어의 과거 평점 이력이 조작된 리뷰로 인식하는 것에 어떠한 영향을 미치며, 최종적으로 리뷰 유용성이 어떠한 영향을 미치는지 알아보는 것이다. 제안된 가설을 검증하기 위해 2차 데이터 분석(연구1)과 실험(연구2)을 수행했으며, 두 연구는 일관된 결과를 보여준다. 연구 1은 리뷰어의 과거 평가 이력이 리뷰 유용성에 미치는 영향을 분석하였다. 귀인이론에 근거하면, 사람들은 리뷰를 다른 목적을 가지고 작성되었다고 인식할 경우에 리뷰가 조작되었다고 생각하고, 그 리뷰가 물건이나 서비스의 진정한 가치를 평가하지 않았다고 간주한다. 따라서 해당 리뷰는 유용성이 낮게 평가되는 경향이 있다. 2차 데이터를 분석하기 위해 우리는 Python을 이용한 웹 스크레이퍼를 개발하여 TripAdvisor(TripAdvisor.com)에서 호텔 정보, 리뷰, 리뷰 정보 등의 연구 데이터를 수집하였다. 수집한 890명 리뷰어에 대한 100,621개의 리뷰를 분석하기 위해 음이항 회귀 분석을 수행하였다. 분석 결과, 평균 평점을 낮게 주는 리뷰어의 경우에 리뷰 유용성에 유의미한 영향을 미치지 않는 것으로 나타났다. 사람들은 극단적인 평점을 거의 주지 않는 리뷰어가 작성한 리뷰가 더 도움이 된다고 평가했다. 연구 2는 리뷰어의 과거 평점 이력을 기준으로 리뷰가 조작되었다고 평가하는 사람들의 인식 프로세스를 실험하였다. 실험 결과, 사람들은 리뷰어의 과거 평점 이력이 평균적으로 평점을 낮게 주는 경우에는 리뷰가 의심스럽다고 판단하지 않는 것으로 나타났다. 그리고 사람들은 리뷰어가 대부분 극단적인 평점을 주는 이력이 있다면 해당 리뷰어가 작성한 리뷰가 의심스럽다고 판단하는 것으로 나타났다. 연구2는 사람들이 리뷰어의 과거 평점 이력을 바탕으로 리뷰가 조작되었는지 또는 리뷰가 도움이 되는지 판단하는 경향이 있음을 보여준다. 본 연구는 귀인이론을 바탕으로 리뷰어의 과거 평점 이력이 리뷰 조작성에 대한 인식과 리뷰 유용성에 미치는 영향을 분석하여, 해당 연구분야에 새로운 관점을 추가한 기여점이 있다.

  • PDF

Visualization analysis using R Shiny (R의 Shiny를 이용한 시각화 분석 활용 사례)

  • Na, Jonghwa;Hwang, Eunji
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.6
    • /
    • pp.1279-1290
    • /
    • 2017
  • R's {shiny} package provides an environment for creating web applications with only R scripts. Shiny does not require knowledge of a separate web programming language and its development is very easy and straightforward. In addition, Shiny has a variety of extensibility, and its functions are expanding day by day. Therefore, the presentation of high-quality results is an excellent tool for R-based analysts. In this paper, we present actual cases of large data analysis using Shiny. First, geological anomaly zone is extracted by analyzing topographical data expressed in the form of contour lines by analysis related to spatial data. Next, we will construct a model to predict major diseases by 16 cities and provinces nationwide using weather, environment, and social media information. In this process, we want to show that Shiny is very effective for data visualization and analysis.