• Title/Summary/Keyword: Accuracy improvement

Search Result 2,373, Processing Time 0.038 seconds

Improving Performance of Recommendation Systems Using Topic Modeling (사용자 관심 이슈 분석을 통한 추천시스템 성능 향상 방안)

  • Choi, Seongi;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.3
    • /
    • pp.101-116
    • /
    • 2015
  • Recently, due to the development of smart devices and social media, vast amounts of information with the various forms were accumulated. Particularly, considerable research efforts are being directed towards analyzing unstructured big data to resolve various social problems. Accordingly, focus of data-driven decision-making is being moved from structured data analysis to unstructured one. Also, in the field of recommendation system, which is the typical area of data-driven decision-making, the need of using unstructured data has been steadily increased to improve system performance. Approaches to improve the performance of recommendation systems can be found in two aspects- improving algorithms and acquiring useful data with high quality. Traditionally, most efforts to improve the performance of recommendation system were made by the former approach, while the latter approach has not attracted much attention relatively. In this sense, efforts to utilize unstructured data from variable sources are very timely and necessary. Particularly, as the interests of users are directly connected with their needs, identifying the interests of the user through unstructured big data analysis can be a crew for improving performance of recommendation systems. In this sense, this study proposes the methodology of improving recommendation system by measuring interests of the user. Specially, this study proposes the method to quantify interests of the user by analyzing user's internet usage patterns, and to predict user's repurchase based upon the discovered preferences. There are two important modules in this study. The first module predicts repurchase probability of each category through analyzing users' purchase history. We include the first module to our research scope for comparing the accuracy of traditional purchase-based prediction model to our new model presented in the second module. This procedure extracts purchase history of users. The core part of our methodology is in the second module. This module extracts users' interests by analyzing news articles the users have read. The second module constructs a correspondence matrix between topics and news articles by performing topic modeling on real world news articles. And then, the module analyzes users' news access patterns and then constructs a correspondence matrix between articles and users. After that, by merging the results of the previous processes in the second module, we can obtain a correspondence matrix between users and topics. This matrix describes users' interests in a structured manner. Finally, by using the matrix, the second module builds a model for predicting repurchase probability of each category. In this paper, we also provide experimental results of our performance evaluation. The outline of data used our experiments is as follows. We acquired web transaction data of 5,000 panels from a company that is specialized to analyzing ranks of internet sites. At first we extracted 15,000 URLs of news articles published from July 2012 to June 2013 from the original data and we crawled main contents of the news articles. After that we selected 2,615 users who have read at least one of the extracted news articles. Among the 2,615 users, we discovered that the number of target users who purchase at least one items from our target shopping mall 'G' is 359. In the experiments, we analyzed purchase history and news access records of the 359 internet users. From the performance evaluation, we found that our prediction model using both users' interests and purchase history outperforms a prediction model using only users' purchase history from a view point of misclassification ratio. In detail, our model outperformed the traditional one in appliance, beauty, computer, culture, digital, fashion, and sports categories when artificial neural network based models were used. Similarly, our model outperformed the traditional one in beauty, computer, digital, fashion, food, and furniture categories when decision tree based models were used although the improvement is very small.

Development and Analysis of COMS AMV Target Tracking Algorithm using Gaussian Cluster Analysis (가우시안 군집분석을 이용한 천리안 위성의 대기운동벡터 표적추적 알고리듬 개발 및 분석)

  • Oh, Yurim;Kim, Jae Hwan;Park, Hyungmin;Baek, Kanghyun
    • Korean Journal of Remote Sensing
    • /
    • v.31 no.6
    • /
    • pp.531-548
    • /
    • 2015
  • Atmospheric Motion Vector (AMV) from satellite images have shown Slow Speed Bias (SSB) in comparison with rawinsonde. The causes of SSB are originated from tracking, selection, and height assignment error, which is known to be the leading error. However, recent works have shown that height assignment error cannot be fully explained the cause of SSB. This paper attempts a new approach to examine the possibility of SSB reduction of COMS AMV by using a new target tracking algorithm. Tracking error can be caused by averaging of various wind patterns within a target and changing of cloud shape in searching process over time. To overcome this problem, Gaussian Mixture Model (GMM) has been adopted to extract the coldest cluster as target since the shape of such target is less subject to transformation. Then, an image filtering scheme is applied to weigh more on the selected coldest pixels than the other, which makes it easy to track the target. When AMV derived from our algorithm with sum of squared distance method and current COMS are compared with rawindsonde, our products show noticeable improvement over COMS products in mean wind speed by an increase of $2.7ms^{-1}$ and SSB reduction by 29%. However, the statistics regarding the bias show negative impact for mid/low level with our algorithm, and the number of vectors are reduced by 40% relative to COMS. Therefore, further study is required to improve accuracy for mid/low level winds and increase the number of AMV vectors.

A Strategy for Environmental Improvement and Internationalization of the IEODO Ocean Research Station's Radiation Observatory (이어도 종합해양과학기지의 복사관측소 환경 개선 및 국제화 추진 전략)

  • LEE, SANG-HO;Zo, Il-SUNG;LEE, KYU-TAE;KIM, BU-YO;JUNG, HYUN-SEOK;RIM, SE-HUN;BYUN, DO-SEONG;LEE, JU-YEONG
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.22 no.3
    • /
    • pp.118-134
    • /
    • 2017
  • The radiation observation data will be used importantly in research field such as climatology, weather, architecture, agro-livestock and marine science. The Ieodo Ocean Research Station (IORS) is regarded as an ideal observatory because its location can minimize the solar radiation reflection from the surrounding background and also the data produced here can serve as a reference data for radiation observation. This station has the potential to emerge as a significant observatory and join a global radiation observation group such as the Baseline Surface Radiation Network (BSRN), if the surrounding of observatory is improved and be equipped with the essential radiation measuring instruments (pyaranometer and pyrheliometer). IORS has observed the solar radiation using a pyranometer since November 2004 and the data from January 1, 2005 to December 31, 2015 were analyzed in this study. During the period of this study, the daily mean solar radiation observed from IORS decreased to $-3.80W/m^2/year$ due to the variation of the sensor response in addition to the natural environment. Since the yellow sand and fine dust from China are of great interest to scientists around the world, it is necessary to establish a basis of global joint response through the radiation data obtained at the Ieodo as well as at Sinan Gageocho and Ongjin Socheongcho Ocean Research Station. So it is an urgent need to improve the observatory surrounding and the accuracy of the observed data.

Improved Method of License Plate Detection and Recognition using Synthetic Number Plate (인조 번호판을 이용한 자동차 번호인식 성능 향상 기법)

  • Chang, Il-Sik;Park, Gooman
    • Journal of Broadcast Engineering
    • /
    • v.26 no.4
    • /
    • pp.453-462
    • /
    • 2021
  • A lot of license plate data is required for car number recognition. License plate data needs to be balanced from past license plates to the latest license plates. However, it is difficult to obtain data from the actual past license plate to the latest ones. In order to solve this problem, a license plate recognition study through deep learning is being conducted by creating a synthetic license plates. Since the synthetic data have differences from real data, and various data augmentation techniques are used to solve these problems. Existing data augmentation simply used methods such as brightness, rotation, affine transformation, blur, and noise. In this paper, we apply a style transformation method that transforms synthetic data into real-world data styles with data augmentation methods. In addition, real license plate data are noisy when it is captured from a distance and under the dark environment. If we simply recognize characters with input data, chances of misrecognition are high. To improve character recognition, in this paper, we applied the DeblurGANv2 method as a quality improvement method for character recognition, increasing the accuracy of license plate recognition. The method of deep learning for license plate detection and license plate number recognition used YOLO-V5. To determine the performance of the synthetic license plate data, we construct a test set by collecting our own secured license plates. License plate detection without style conversion recorded 0.614 mAP. As a result of applying the style transformation, we confirm that the license plate detection performance was improved by recording 0.679mAP. In addition, the successul detection rate without image enhancement was 0.872, and the detection rate was 0.915 after image enhancement, confirming that the performance improved.

Validation of initial nutrition screening tool for hospitalized patients (입원 환자용 초기 영양검색도구의 타당도 검증)

  • Kim, Hye-Suk;Lee, Seonheui;Kim, Hyesook;Kwon, Oran
    • Journal of Nutrition and Health
    • /
    • v.52 no.4
    • /
    • pp.332-341
    • /
    • 2019
  • Purpose: Poor nutrition in hospitalized patients is closely linked to an increased risk of infection, which can result in complications affecting mortality, as well as increased length of hospital stay and hospital costs. Therefore, adequate nutritional support is essential to manage the nutritional risk status of patients. Nutritional support needs to be preceded by nutrition screening, in which accuracy is crucial, particularly for the initial screening. To perform initial nutrition screening of hospitalized patients, we used the Catholic Kwandong University (CKU) Nutritional Risk Screening (CKUNRS) tool, originally developed at CKU Hospital. To validate CKUNRS against the Patient-Generated Subjective Global Assessment (PG-SGA) tool, which is considered the gold standard for nutritional risk screening, results from both tools were compared. Methods: Nutritional status was evaluated in 686 adult patients admitted to CKU Hospital from May 1 to July 31, 2018 using both CKUNRS and PG-SGA. Collected data were analyzed, and the results compared, to validate CKUNRS as a nutrition screening tool. Results: The comparison of CKUNRS and PG-SGA revealed that the prevalence of nutritional risk on admission was 15.6% (n = 107) with CKUNRS and 44.6% (n = 306) with PG-SGA. The sensitivity and specificity of CKUNRS to evaluate nutritional risk status were 98.7% (96.8 ~ 99.5) and 33.3% (28.1 ~ 39.0), respectively. Thus, the sensitivity was higher, but the specificity lower compared with PG-SGA. Cohen's kappa coefficient was 0.34, indicating valid agreement between the two tools. Conclusion: This study found concordance between CKUNRS and PG-SGA. However, the prevalence of nutritional risk in hospitalized patients was higher when determined by CKUNRS, compared with that by PG-SGA. Accordingly, CKUNRS needs further modification and improvement in terms of screening criteria to promote more effective nutritional support for patients who have been admitted for inpatient care.

Exploring the Factors Influencing on the Accuracy of Self-Reported Responses in Affective Assessment of Science (과학과 자기보고식 정의적 영역 평가의 정확성에 영향을 주는 요소 탐색)

  • Chung, Sue-Im;Shin, Donghee
    • Journal of The Korean Association For Science Education
    • /
    • v.39 no.3
    • /
    • pp.363-377
    • /
    • 2019
  • This study reveals the aspects of subjectivity in the test results in a science-specific aspect when assessing science-related affective characteristic through self-report items. The science-specific response was defined as the response that appear due to student's recognition of nature or characteristics of science when his or her concepts or perceptions about science were attempted to measure. We have searched for cases where science-specific responses especially interfere with the measurement objective or accurate self-reports. The results of the error due to the science-specific factors were derived from the quantitative data of 649 students in the 1st and 2nd grade of high school and the qualitative data of 44 students interviewed. The perspective of science and the characteristics of science that students internalize from everyday life and science learning experiences interact with the items that form the test tool. As a result, it was found that there were obstacles to accurate self-report in three aspects: characteristics of science, personal science experience, and science in tool. In terms of the characteristic of science in relation to the essential aspect of science, students respond to items regardless of the measuring constructs, because of their views and perceived characteristics of science based on subjective recognition. The personal science experience factor representing the learner side consists of student's science motivation, interaction with science experience, and perception of science and life. Finally, from the instrumental point of view, science in tool leads to terminological confusion due to the uncertainty of science concepts and results in a distance from accurate self-report eventually. Implications from the results of the study are as follows: review of inclusion of science-specific factors, precaution to clarify the concept of measurement, check of science specificity factors at the development stage, and efforts to cross the boundaries between everyday science and school science.

Study on improvement of USLE P factor considering topography and cultivation method (지형 및 경작 방법을 반영한 범용토양유실량 산정공식 보전관리 인자 개선 연구)

  • Sung, Yunsoo;Lee, Gwanjae;Lee, Gwanjae;Han, Jeongho;Kim, Jonggun;Lim, Kyoung Jae;Kim, Ki Sung
    • Journal of Wetlands Research
    • /
    • v.21 no.2
    • /
    • pp.163-172
    • /
    • 2019
  • The USLE P factor is a factor that varies depending on how croplands are managed and cultivated. Previous studies tend to overestimate the amount of soil loss because the factor was estimated from the slope of the watershed rather than the estimate of each cultivated land. In addition, the accuracy of estimating the soil loss is decreasing due to the fact that the factor is calculated without considering various conditions of cultivated land defined by Wishmeier and Smith. In order to overcome these problems, the Ministry of Environment (MOE) has proposed to establish the topsoil notification and calculate the P factor according to the cultivation methods (e.g., tillage system, support practice). However, it is required to apply the conditions proposed in the United States to domestic circumstances as it is causing uncertainties. Thus, this study selected the watersheds where soil loss was serious (Haean, Jaun, Banbyeoncheon), measured the actual slopes and slope lengths, and examined the crop, tillage systems, and support practice for each cultivated land. The P factors were recalculated considering the actual conditions of cultivated land and compared to the factors proposed by the previous studies (MOE). As the result of the study, the P factors calculated based on the previous studies were 0.8 ~ 1.0 in three watersheds. On the other hand, it is confirmed that there is a significant difference between the factors notified by MOE and estimated by reflecting the topography and cultivation methods in this study. Therefore, it is considered that the research for developing the cultivation conditions to calculate the P factor suitable for the domestic environment should be continuously carried out.

A Study on Particulate Matter Forecasting Improvement by using Asian Dust Emissions in East Asia (황사배출량을 적용한 동아시아 미세먼지 예보 개선 연구)

  • Choi, Daeryun;Yun, Huiyoung;Chang, Limseok;Lee, Jaebum;Lee, Younghee;Myoung, Jisu;Kim, Taehee;Koo, Younseo
    • Journal of the Korean Society of Urban Environment
    • /
    • v.18 no.4
    • /
    • pp.531-546
    • /
    • 2018
  • Air quality forecasting system with Asian dust emissions was developed in East Asia, and $PM_{10}$ forecasting performance of chemical transport model with Asian dust emissions was validated and evaluated. The chemical transport model (CTM) with Asian dust emission was found to supplement $PM_{10}$ concentrations that had been under-estimated in China regions and improved statistics for performance of CTM, although the model were overestimated during some periods in China. In Korea, the prediction model adequately simulated inflow of Asian dust events on February 22~24 and March 16~17, but the model is found to be overestimated during no Asian dust event periods on April. However, the model supplemented $PM_{10}$ concentrations, which was underestimated in most regions in Korea and the statistics for performance of the models were improved. The $PM_{10}$ forecasting performance of air quality forecasting model with Asian dust emissions tends to improve POD (Probability of Detection) compared to basic model without Asian dust emissions, but A (Accuracy) has shown similar or decreased, and FAR (False Alarms) have increased during 2017.Therefore, the developed air quality forecasting model with Asian dust emission was not proposed as a representative $PM_{10}$ forecast model in South Korea.

Very short-term rainfall prediction based on radar image learning using deep neural network (심층신경망을 이용한 레이더 영상 학습 기반 초단시간 강우예측)

  • Yoon, Seongsim;Park, Heeseong;Shin, Hongjoon
    • Journal of Korea Water Resources Association
    • /
    • v.53 no.12
    • /
    • pp.1159-1172
    • /
    • 2020
  • This study applied deep convolution neural network based on U-Net and SegNet using long period weather radar data to very short-term rainfall prediction. And the results were compared and evaluated with the translation model. For training and validation of deep neural network, Mt. Gwanak and Mt. Gwangdeoksan radar data were collected from 2010 to 2016 and converted to a gray-scale image file in an HDF5 format with a 1km spatial resolution. The deep neural network model was trained to predict precipitation after 10 minutes by using the four consecutive radar image data, and the recursive method of repeating forecasts was applied to carry out lead time 60 minutes with the pretrained deep neural network model. To evaluate the performance of deep neural network prediction model, 24 rain cases in 2017 were forecast for rainfall up to 60 minutes in advance. As a result of evaluating the predicted performance by calculating the mean absolute error (MAE) and critical success index (CSI) at the threshold of 0.1, 1, and 5 mm/hr, the deep neural network model showed better performance in the case of rainfall threshold of 0.1, 1 mm/hr in terms of MAE, and showed better performance than the translation model for lead time 50 minutes in terms of CSI. In particular, although the deep neural network prediction model performed generally better than the translation model for weak rainfall of 5 mm/hr or less, the deep neural network prediction model had limitations in predicting distinct precipitation characteristics of high intensity as a result of the evaluation of threshold of 5 mm/hr. The longer lead time, the spatial smoothness increase with lead time thereby reducing the accuracy of rainfall prediction The translation model turned out to be superior in predicting the exceedance of higher intensity thresholds (> 5 mm/hr) because it preserves distinct precipitation characteristics, but the rainfall position tends to shift incorrectly. This study are expected to be helpful for the improvement of radar rainfall prediction model using deep neural networks in the future. In addition, the massive weather radar data established in this study will be provided through open repositories for future use in subsequent studies.

Temperature and Solar Radiation Prediction Performance of High-resolution KMAPP Model in Agricultural Areas: Clear Sky Case Studies in Cheorwon and Jeonbuk Province (고해상도 규모상세화모델 KMAPP의 농업지역 기온 및 일사량 예측 성능: 맑은 날 철원 및 전북 사례 연구)

  • Shin, Seoleun;Lee, Seung-Jae;Noh, Ilseok;Kim, Soo-Hyun;So, Yun-Young;Lee, Seoyeon;Min, Byung Hoon;Kim, Kyu Rang
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.22 no.4
    • /
    • pp.312-326
    • /
    • 2020
  • Generation of weather forecasts at 100 m resolution through a statistical downscaling process was implemented by Korea Meteorological Administration Post- Processing (KMAPP) system. The KMAPP data started to be used in various industries such as hydrologic, agricultural, and renewable energy, sports, etc. Cheorwon area and Jeonbuk area have horizontal planes in a relatively wide range in Korea, where there are many complex mountainous areas. Cheorwon, which has a large number of in-situ and remotely sensed phenological data over large-scale rice paddy cultivation areas, is considered as an appropriate area for verifying KMAPP prediction performance in agricultural areas. In this study, the performance of predicting KMAPP temperature changes according to ecological changes in agricultural areas in Cheorwon was compared and verified using KMA and National Center for AgroMeteorology (NCAM) observations. Also, during the heat wave in Jeonbuk Province, solar radiation forecast was verified using Automated Synoptic Observing System (ASOS) data to review the usefulness of KMAPP forecast data as input data for application models such as livestock heat stress models. Although there is a limit to the need for more cases to be collected and selected, the improvement in post-harvest temperature forecasting performance in agricultural areas over ordinary residential areas has led to indirect guesses of the biophysical and phenological effects on forecasting accuracy. In the case of solar radiation prediction, it is expected that KMAPP data will be used in the application model as detailed regional forecast data, as it tends to be consistent with observed values, although errors are inevitable due to human activity in agricultural land and data unit conversion.