• Title/Summary/Keyword: statistic technique

Search Result 121, Processing Time 0.021 seconds

Study on Accident Prediction Models in Urban Railway Casualty Accidents Using Logistic Regression Analysis Model (로지스틱회귀분석 모델을 활용한 도시철도 사상사고 사고예측모형 개발에 대한 연구)

  • Jin, Soo-Bong;Lee, Jong-Woo
    • Journal of the Korean Society for Railway
    • /
    • v.20 no.4
    • /
    • pp.482-490
    • /
    • 2017
  • This study is a railway accident investigation statistic study with the purpose of prediction and classification of accident severity. Linear regression models have some difficulties in classifying accident severity, but a logistic regression model can be used to overcome the weaknesses of linear regression models. The logistic regression model is applied to escalator (E/S) accidents in all stations on 5~8 lines of the Seoul Metro, using data mining techniques such as logistic regression analysis. The forecasting variables of E/S accidents in urban railway stations are considered, such as passenger age, drinking, overall situation, behavior, and handrail grip. In the overall accuracy analysis, the logistic regression accuracy is explained 76.7%. According to the results of this analysis, it has been confirmed that the accuracy and the level of significance of the logistic regression analysis make it a useful data mining technique to establish an accident severity prediction model for urban railway casualty accidents.

Classification of ratings in online reviews (온라인 리뷰에서 평점의 분류)

  • Choi, Dongjun;Choi, Hosik;Park, Changyi
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.4
    • /
    • pp.845-854
    • /
    • 2016
  • Sentiment analysis or opinion mining is a technique of text mining employed to identify subjective information or opinions of an individual from documents in blogs, reviews, articles, or social networks. In the literature, only a problem of binary classification of ratings based on review texts in an online review. However, because there can be positive or negative reviews as well as neutral reviews, a multi-class classification will be more appropriate than the binary classification. To this end, we consider the multi-class classification of ratings based on review texts. In the preprocessing stage, we extract words related with ratings using chi-square statistic. Then the extracted words are used as input variables to multi-class classifiers such as support vector machines and proportional odds model to compare their predictive performances.

Development of Patient Transfer Techniques based on Postural-stability Principles for the Care Helpers in Nursing Homes and Evaluation of Effectiveness (자세안정성 원리에 기반한 환자이동기술 개발 및 효과검정)

  • Ma, Ryewon;Jung, Dukyoo
    • Journal of Korean Academy of Nursing
    • /
    • v.46 no.1
    • /
    • pp.39-49
    • /
    • 2016
  • Purpose: This study was done to develop a postural-stability patient transfer technique for care helpers in nursing homes and to evaluate its effectiveness. Methods: Four types of patient transfer techniques (Lifting towards the head board of the bed, turning to the lateral position, sitting upright on the bed, transferring from wheel chair to bed) were practiced in accordance with the following three methods; Care helpers habitually used transfer methods (Method 1), patient transfer methods according to care helper standard textbooks (Method 2), and a method developed by the author ensuring postural-stability (Method 3). The care helpers' muscle activity and four joint angles were measured. The collected data were analyzed using the program SPSS Statistic 21.0. To differentiate the muscle activity and joint angle, the Friedman test was executed and the post-hoc analysis was conducted using the Wilcoxon Signed Rank test. Results: Muscle activity was significantly lower during Method 3 compared to Methods 1 and 2. In addition, the joint angle was significantly lower for the knee and shoulder joint angle while performing Method 3 compared to Methods 1 and 2. Discussion: Findings indicate that using postural-stability patient transfer techniques can contribute to the prevention of musculoskeletal disease which care helpers suffer from due to physically demanding patient care in nursing homes.

A Researh for Consumer Dissatisfaction and Institutional Improvement of The Overseas Direct Purchase using Exploratory Data Analysis (탐색적 자료 분석(EDA) 기법을 활용한 온라인 해외직접구매에 대한 소비자 불만족 및 제도 개선 방안 연구)

  • Park, Seongwoo;Kang, Juyoung
    • The Journal of Bigdata
    • /
    • v.5 no.1
    • /
    • pp.41-54
    • /
    • 2020
  • With the recent expansion of Internet channels and the development of financial technology and information and communication technology, direct overseas purchases have expanded. Although direct overseas purchases dominate consumers in terms of price and scarcity by providing relatively low-priced products and products that are difficult to obtain in Korea, there is a higher chance of consumer dissatisfaction in terms of delivery, product, A/S and refund than domestic purchases. Therefore, this study analyzed consumer dissatisfaction caused by active overseas direct purchase and studied ways to improve problems with overseas direct purchase. As a research method, Several statistical data were collected from the Korea Consumer Agency(KCA), the Korea Customs Service(KCS) and the Korea International Trade Association(KITA) and analyzed using the Exploratory Data Analysis Technique (EDA). The analysis confirmed that consumers were not well aware of information about direct overseas purchases and that the type or degree of consumer complaints varied depending on the type of purchase. Therefore, this study suggests a direction for the revitalization of overseas direct purchases by using EDA to identify the overall status of overseas direct purchases and consumer dissatisfaction and to improve them.

Returns to Investment on Research and Extension in Korean Horticulture (원예부문 연구 및 지도 사업의 투자효과 분석)

  • Kang, Kyeong-Ha;Lee, Min-Soo;Choe, Young-Chan
    • Journal of Agricultural Extension & Community Development
    • /
    • v.7 no.2
    • /
    • pp.257-277
    • /
    • 2000
  • The objectives of this study are to investigate the relationship between the growth of the horticultural sector and horticultural research and extension and to examine the socioeconomic returns to investment on research and extension in Korean horticulture. Data for horticultural production values, producer price indices and research and extension budgets for horticultural sector from 1965 to 1998 are collected from various sources. Multi-variate time series analysis technique with vector auto-regression model and Akino-Hayami Formula were employed for the analysis. This study finds (1) horticultural production responds about seven years later to the horticultural research investment shock. the magnitude of the impacts increases to a peak in seventeen years from the initial expenditures and then declines slowly thereafter until twenty years. and this peak gives a tip that horticultural research impact lasts much longer than grain's or agriculture's: (2) the social surplus from research investment benefits more to the consumer rather than to the horticultural producer: (3) B/C ratios in horticultural research are quite high with the range of 9 to 55 from 1965 to 1998. but these have been decreased since the early 1990s: (4) the socioeconomic returns to horticultural research is quite high with 56 percents of internal rate of return. It remains to be analyzed returns to investment on extension in horticulture because of no statistic significance in this study.

  • PDF

Food therapy analysis of the primary ailments from the 『ShikLyoChanYo(食療纂要)』 (『식료찬요(食療纂要)』에 기재(記載)된 7개 병증(病證)의 식약요법(食藥療法)에 관한 소고(小考))

  • Yeo, Min-Kyung;Yin, Lin;Hwang, Su-Jung;Lee, Byung-Wook;Kim, Ki-Wook
    • The Journal of Korean Medical History
    • /
    • v.27 no.1
    • /
    • pp.61-76
    • /
    • 2014
  • The "ShikLyoChanYo", written in 1460 by JunSoonYi (全循義), master court doctor in JoSeon (朝鮮) Dynasty, is the very first specialty publication of Korean dietary treatment existing today. Both Chinese and Korean scholars have assumed that this book had been lost long time ago. In November 2003, however, a Korean philologist found a version of the book, Yangyang (襄陽, a district name in Korea), and this book has attracted a lot of interest of Korean traditional medical science and agricultural science since then. This paper is to dissert the document of food therapy from the book with profound document study and statistical analysis in the fields of traditional Chinese medicine and traditional Korean medicine on dietetics. It completes the study of the application of all the dietetic treatments according to symptoms of diseases and all the plants and medication applied to cure chronic conditions that are clinically examined for the purpose of food therapy. A general survey on sundry records related to this food therapy of the "ShikLyoChanYo" has been done to make this dissertation and it carried out a statistic analysis of all the dietetic mixing technique of all plants and medication. Among other symptoms of illnesses from the book, there are 7 frequently addressed ailments chosen from the aspect of food therapy - a stroke, a disease diagnosed by thirst, a serious cough, an ache resulting from numbness, a disease relating to stomach, blurry vision and weak hearing, and a drinking related disease. This part is to discuss these illnesses and how to cure them with food based on its characteristics and rules of application.

Big Data Processing and Performance Improvement for Ship Trajectory using MapReduce Technique

  • Kim, Kwang-Il;Kim, Joo-Sung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.10
    • /
    • pp.65-70
    • /
    • 2019
  • In recently, ship trajectory data consisting of ship position, speed, course, and so on can be obtained from the Automatic Identification System device with which all ships should be equipped. These data are gathered more than 2GB every day at a crowed sea port and used for analysis of ship traffic statistic and patterns. In this study, we propose a method to process ship trajectory data efficiently with distributed computing resources using MapReduce algorithm. In data preprocessing phase, ship dynamic and static data are integrated into target dataset and filtered out ship trajectory that is not of interest. In mapping phase, we convert ship's position to Geohash code, and assign Geohash and ship MMSI to key and value. In reducing phase, key-value pairs are sorted according to the same key value and counted the ship traffic number in a grid cell. To evaluate the proposed method, we implemented it and compared it with IALA waterway risk assessment program(IWRAP) in their performance. The data processing performance improve 1 to 4 times that of the existing ship trajectory analysis program.

An Analytical Study on Automatic Classification of Domestic Journal articles Using Random Forest (랜덤포레스트를 이용한 국내 학술지 논문의 자동분류에 관한 연구)

  • Kim, Pan Jun
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.2
    • /
    • pp.57-77
    • /
    • 2019
  • Random Forest (RF), a representative ensemble technique, was applied to automatic classification of journal articles in the field of library and information science. Especially, I performed various experiments on the main factors such as tree number, feature selection, and learning set size in terms of classification performance that automatically assigns class labels to domestic journals. Through this, I explored ways to optimize the performance of random forests (RF) for imbalanced datasets in real environments. Consequently, for the automatic classification of domestic journal articles, Random Forest (RF) can be expected to have the best classification performance when using tree number interval 100~1000(C), small feature set (10%) based on chi-square statistic (CHI), and most learning sets (9-10 years).

Bilataral Trade Balance between Korea and Her Trading Partners: Using Panel Approach (한국의 무역상대국간 무역수지와 환율간의 장기관계분석: 패널분석의 적용)

  • Kim, Joung-Gu
    • International Area Studies Review
    • /
    • v.14 no.1
    • /
    • pp.185-202
    • /
    • 2010
  • While it is often assumed that a country's trade balance will improve in the long-run if its currency is allowed to depreciate, this is not necessarily the case for specific industry. This paper is to examine the long-run relationships between trade balance and real exchange rate using bilateral data of SITC 10 Industry Classification for Korea vis-${\grave{a}}$-vis her trading partners Indonesia, India, China, Japan on a quarterly basis over the period of 1999Q1 to 2008Q4. I applied the recent panel cointegration technique to reduce the small sample problems and improving power performance of the relevant estimation and inference procedures. The results reveal evidence of the Marshall-Lerner Condition in Indonesia 2 industries, India 5 industries, Japanese 4 industries, Chinese 6 industries. Whole group's cointegration statistic of India, China, Japan was supported Marshall-Lerner Condition but Indonesia was rejected.

Evaluating flexural strength of concrete with steel fibre by using machine learning techniques

  • Sharma, Nitisha;Thakur, Mohindra S.;Upadhya, Ankita;Sihag, Parveen
    • Composite Materials and Engineering
    • /
    • v.3 no.3
    • /
    • pp.201-220
    • /
    • 2021
  • In this study, potential of three machine learning techniques i.e., M5P, Support vector machines and Gaussian processes were evaluated to find the best algorithm for the prediction of flexural strength of concrete mix with steel fibre. The study comprises the comparison of results obtained from above-said techniques for given dataset. The dataset consists of 124 observations from past research studies and this dataset is randomly divided into two subsets namely training and testing datasets with (70-30)% proportion by weight. Cement, fine aggregates, coarse aggregates, water, super plasticizer/ high-range water reducer, steel fibre, fibre length and curing days were taken as input parameters whereas flexural strength of the concrete mix was taken as the output parameter. Performance of the techniques was checked by statistic evaluation parameters. Results show that the Gaussian process technique works better than other techniques with its minimum error bandwidth. Statistical analysis shows that the Gaussian process predicts better results with higher coefficient of correlation value (0.9138) and minimum mean absolute error (1.2954) and Root mean square error value (1.9672). Sensitivity analysis proves that steel fibre is the significant parameter among other parameters to predict the flexural strength of concrete mix. According to the shape of the fibre, the mixed type performs better for this data than the hooked shape of the steel fibre, which has a higher CC of 0.9649, which shows that the shape of fibers do effect the flexural strength of the concrete. However, the intricacy of the mixed fibres needs further investigations. For future mixes, the most favorable range for the increase in flexural strength of concrete mix found to be (1-3)%.