• Title/Summary/Keyword: 데이터 품질진단

Search Result 89, Processing Time 0.023 seconds

A Study on the Evaluation of the Work-Net, a Web-based Public Employment Information System (웹 기반 공공고용정보시스템 워크넷(Work-Net)평가에 관한 연구)

  • Kim, Soon-Won
    • Journal of the Korean Society for information Management
    • /
    • v.20 no.2
    • /
    • pp.93-112
    • /
    • 2003
  • A public information system is being expanded, along with the advance of information technology, to strengthen national competitiveness and provide people with better services. And there also is a growin need for the better performance of that system, as a tremendous amount of public financial resources is invested in that. To address that need, it's required to make an evaluation of its efficiency on a regular basis to identify its problems and make it work better. The purpose of this study was, accordinglu, to examine the quality of data and services provided by the Work-Net, a Web-based public employment information system. The subjects in this study were 102 users of it, and the system was evaluated in terms of content, accuracy, timeliness, display format, ease of use and customer support. For data analysis, t-test and one-way ANOVA were implemented to find out the general characteristics of the users, and to see Whether or not their view was different according to the type of information they searched for. The findings of this study are expected to lay some foundation for intensifying the efficiency of the public and private employment information systems.

Steel Plate Faults Diagnosis with S-MTS (S-MTS를 이용한 강판의 표면 결함 진단)

  • Kim, Joon-Young;Cha, Jae-Min;Shin, Junguk;Yeom, Choongsub
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.1
    • /
    • pp.47-67
    • /
    • 2017
  • Steel plate faults is one of important factors to affect the quality and price of the steel plates. So far many steelmakers generally have used visual inspection method that could be based on an inspector's intuition or experience. Specifically, the inspector checks the steel plate faults by looking the surface of the steel plates. However, the accuracy of this method is critically low that it can cause errors above 30% in judgment. Therefore, accurate steel plate faults diagnosis system has been continuously required in the industry. In order to meet the needs, this study proposed a new steel plate faults diagnosis system using Simultaneous MTS (S-MTS), which is an advanced Mahalanobis Taguchi System (MTS) algorithm, to classify various surface defects of the steel plates. MTS has generally been used to solve binary classification problems in various fields, but MTS was not used for multiclass classification due to its low accuracy. The reason is that only one mahalanobis space is established in the MTS. In contrast, S-MTS is suitable for multi-class classification. That is, S-MTS establishes individual mahalanobis space for each class. 'Simultaneous' implies comparing mahalanobis distances at the same time. The proposed steel plate faults diagnosis system was developed in four main stages. In the first stage, after various reference groups and related variables are defined, data of the steel plate faults is collected and used to establish the individual mahalanobis space per the reference groups and construct the full measurement scale. In the second stage, the mahalanobis distances of test groups is calculated based on the established mahalanobis spaces of the reference groups. Then, appropriateness of the spaces is verified by examining the separability of the mahalanobis diatances. In the third stage, orthogonal arrays and Signal-to-Noise (SN) ratio of dynamic type are applied for variable optimization. Also, Overall SN ratio gain is derived from the SN ratio and SN ratio gain. If the derived overall SN ratio gain is negative, it means that the variable should be removed. However, the variable with the positive gain may be considered as worth keeping. Finally, in the fourth stage, the measurement scale that is composed of selected useful variables is reconstructed. Next, an experimental test should be implemented to verify the ability of multi-class classification and thus the accuracy of the classification is acquired. If the accuracy is acceptable, this diagnosis system can be used for future applications. Also, this study compared the accuracy of the proposed steel plate faults diagnosis system with that of other popular classification algorithms including Decision Tree, Multi Perception Neural Network (MLPNN), Logistic Regression (LR), Support Vector Machine (SVM), Tree Bagger Random Forest, Grid Search (GS), Genetic Algorithm (GA) and Particle Swarm Optimization (PSO). The steel plates faults dataset used in the study is taken from the University of California at Irvine (UCI) machine learning repository. As a result, the proposed steel plate faults diagnosis system based on S-MTS shows 90.79% of classification accuracy. The accuracy of the proposed diagnosis system is 6-27% higher than MLPNN, LR, GS, GA and PSO. Based on the fact that the accuracy of commercial systems is only about 75-80%, it means that the proposed system has enough classification performance to be applied in the industry. In addition, the proposed system can reduce the number of measurement sensors that are installed in the fields because of variable optimization process. These results show that the proposed system not only can have a good ability on the steel plate faults diagnosis but also reduce operation and maintenance cost. For our future work, it will be applied in the fields to validate actual effectiveness of the proposed system and plan to improve the accuracy based on the results.

A Study on Maturity Model of Information Integration System (정보연계 시스템의 성숙도 모델에 관한 연구)

  • Ha, Hyodong;Lee, Ook
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.8
    • /
    • pp.570-578
    • /
    • 2019
  • In this era of big data, a variety of government organizations are trying to create new added value via Information Integration. Therefore, several projects related to government agencies' information sharing have activated system connection/integration. The risk factors of system operation, however, have increased as the volume of Information Integration System grows. The interference in information sharing is predicted to affect the operation of the agencies, and the issue will grow even worse with massive impact on civil society when the agency operation is interrupted due to system failures in terms of infrastructure, software, data quality, and security. Diverse studies related to the maintenance of Information System have been conducted, but there is currently no evaluation framework for the operational system of Information Integration between various government agencies. In this respect, this study distinguishes each of the Information System components, Data, IT, People, Process, systematizes with Plan-Do-See, and finally presents a maturity model for Information Integration. Nine derived processes were analyzed through interview and questionnaires from Information Integration System officials, further suggesting maturity stage applying CMMI. This model allows diagnosis of the maturity level of an Information Integration System, and is expected to be utilized as resource for improving organizational processes.

Customer Voices in Telehealth: Constructing Positioning Maps from App Reviews (고객 리뷰를 통한 모바일 앱 서비스 포지셔닝 분석: 비대면 진료 앱을 중심으로)

  • Minjae Kim;Hong Joo Lee
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.4
    • /
    • pp.69-90
    • /
    • 2023
  • The purpose of this study is to evaluate the service attributes and consumer reactions of telemedicine apps in South Korea and visualize their differentiation by constructing positioning maps. We crawled 23,219 user reviews of 6 major telemedicine apps in Korea from the Google Play store. Topics were derived by BERTopic modeling, and sentiment scores for each topic were calculated through KoBERT sentiment analysis. As a result, five service characteristics in the application attribute category and three in the medical service category were derived. Based on this, a two-dimensional positioning map was constructed through principal component analysis. This study proposes an objective service evaluation method based on text mining, which has implications. In sum, this study combines empirical statistical methods and text mining techniques based on user review texts of telemedicine apps. It presents a system of service attribute elicitation, sentiment analysis, and product positioning. This can serve as an effective way to objectively diagnose the service quality and consumer responses of telemedicine applications.

A Study on the Design of the Grid-Cell Assessment System for the Optimal Location of Offshore Wind Farms (해상풍력발전단지의 최적 위치 선정을 위한 Grid-cell 평가 시스템 개념 설계)

  • Lee, Bo-Kyeong;Cho, Ik-Soon;Kim, Dae-Hae
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.24 no.7
    • /
    • pp.848-857
    • /
    • 2018
  • Recently, around the world, active development of new renewable energy sources including solar power, waves, and fuel cells, etc. has taken place. Particularly, floating offshore wind farms have been developed for saving costs through large scale production, using high-quality wind power and minimizing noise damage in the ocean area. The development of floating wind farms requires an evaluation of the Maritime Safety Audit Scheme under the Maritime Safety Act in Korea. Floating wind farms shall be assessed by applying the line and area concept for systematic development, management and utilization of specified sea water. The development of appropriate evaluation methods and standards is also required. In this study, proper standards for marine traffic surveys and assessments were established and a systemic treatment was studied for assessing marine spatial area. First, a marine traffic data collector using AIS or radar was designed to conduct marine traffic surveys. In addition, assessment methods were proposed such as historical tracks, traffic density and marine traffic pattern analysis applying the line and area concept. Marine traffic density can be evaluated by spatial and temporal means, with an adjusted grid-cell scale. Marine traffic pattern analysis was proposed for assessing ship movement patterns for transit or work in sea areas. Finally, conceptual design of a Marine Traffic and Safety Assessment Solution (MaTSAS) was competed that can be analyzed automatically to collect and assess the marine traffic data. It could be possible to minimize inaccurate estimation due to human errors such as data omission or misprints through automated and systematic collection, analysis and retrieval of marine traffic data. This study could provides reliable assessment results, reflecting the line and area concept, according to sea area usage.

Design and Implementation of Produce Farming Field-Oriented Smart Pest Information Retrieval System based on Mobile for u-Farm (u-Farm을 위한 모바일 기반의 농작물 재배 현장 중심형 스마트 병해충 정보검색 시스템 설계 및 구현)

  • Kang, Ju-Hee;Jung, Se-Hoon;Nor, Sun-Sik;So, Won-Ho;Sim, Chun-Bo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.10 no.10
    • /
    • pp.1145-1156
    • /
    • 2015
  • There is a shortage of mobile application systems readily applicable to the field of crop cultivation in relation to diseases and insect pests directly connected to the quality of crops. Most of system have been devoted to diseases and insect pests that would offer full predictions and basic information about diseases and insect pests currently. But for lack of the instant diagnostic functions seriously and the field of crop cultivation, we design and implement a crop cultivation field-oriented smart diseases and insect pests information retrieval system based on mobile for u-Farm. The proposed system had such advantages as providing information about diseases and insect pests in the field of crop cultivation and allowing the users to check the information with their smart-phones real-time based on the Lucene, a search library useful for the specialized analysis of images, and JSON data structure. And it was designed based on object-oriented modeling to increase its expandability and reusability. It was capable of search based on such image characteristic information as colors as well as the meta-information of crops and meta-information-based texts. The system was full of great merits including the implementation of u-Farm, the real-time check, and management of crop yields and diseases and insect pests by both the farmers and cultivation field managers.

A Study on the Activation of Pet Plant Kit Industry - Catering to the Demands of Industry Professionals - (반려식물 키트 산업의 활성화 방안에 관한 연구 - 산업 종사자의 수요를 중심으로 -)

  • Roh, Hoi-Eun;Lim, Chae-Jun;Lee, Min-Ji;Jo, Jang-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.52 no.3
    • /
    • pp.46-58
    • /
    • 2024
  • The purpose of this study is to understand the current status of the pet plant kit industry and determine the priorities for support policies to revitalize the industry. SWOT analysis assessed the industry's current state, and the Analytic Hierarchy Process (AHP) was used with industry professionals to prioritize support policies. The SWOT analysis results indicated that SO strategies involve leveraging government support policies to enhance marketing and developing eco-friendly DIY products. WO strategies include launching advertising campaigns to increase market recognition and establishing strategic partnerships to expand distribution. ST strategies focus on strengthening price competitiveness and proposing unique values, while WT strategies involve improving production processes and enhancing product quality based on consumer feedback. The AHP analysis identified 3 top-level and 12 sub-level evaluation items, with data collected from 17 expert surveys. The results showed the 'entry phase' (0.482), 'activation phase' (0.397), and 'advanced phase' (0.121) were prioritized, with 'organizing seminars' (0.181) as the most crucial subcategory and 'support for kit development' (0.020) as the least. The pet plant kit industry is in its early stages, and appropriate policy incubation can help activate the garden industry. This study provides foundational information on the industry's needs for activation.

Estimation of Fresh Weight and Leaf Area Index of Soybean (Glycine max) Using Multi-year Spectral Data (다년도 분광 데이터를 이용한 콩의 생체중, 엽면적 지수 추정)

  • Jang, Si-Hyeong;Ryu, Chan-Seok;Kang, Ye-Seong;Park, Jun-Woo;Kim, Tae-Yang;Kang, Kyung-Suk;Park, Min-Jun;Baek, Hyun-Chan;Park, Yu-hyeon;Kang, Dong-woo;Zou, Kunyan;Kim, Min-Cheol;Kwon, Yeon-Ju;Han, Seung-ah;Jun, Tae-Hwan
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.23 no.4
    • /
    • pp.329-339
    • /
    • 2021
  • Soybeans (Glycine max), one of major upland crops, require precise management of environmental conditions, such as temperature, water, and soil, during cultivation since they are sensitive to environmental changes. Application of spectral technologies that measure the physiological state of crops remotely has great potential for improving quality and productivity of the soybean by estimating yields, physiological stresses, and diseases. In this study, we developed and validated a soybean growth prediction model using multispectral imagery. We conducted a linear regression analysis between vegetation indices and soybean growth data (fresh weight and LAI) obtained at Miryang fields. The linear regression model was validated at Goesan fields. It was found that the model based on green ratio vegetation index (GRVI) had the greatest performance in prediction of fresh weight at the calibration stage (R2=0.74, RMSE=246 g/m2, RE=34.2%). In the validation stage, RMSE and RE of the model were 392 g/m2 and 32%, respectively. The errors of the model differed by cropping system, For example, RMSE and RE of model in single crop fields were 315 g/m2 and 26%, respectively. On the other hand, the model had greater values of RMSE (381 g/m2) and RE (31%) in double crop fields. As a result of developing models for predicting a fresh weight into two years (2018+2020) with similar accumulated temperature (AT) in three years and a single year (2019) that was different from that AT, the prediction performance of a single year model was better than a two years model. Consequently, compared with those models divided by AT and a three years model, RMSE of a single crop fields were improved by about 29.1%. However, those of double crop fields decreased by about 19.6%. When environmental factors are used along with, spectral data, the reliability of soybean growth prediction can be achieved various environmental conditions.

A COVID-19 Diagnosis Model based on Various Transformations of Cough Sounds (기침 소리의 다양한 변환을 통한 코로나19 진단 모델)

  • Minkyung Kim;Gunwoo Kim;Keunho Choi
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.57-78
    • /
    • 2023
  • COVID-19, which started in Wuhan, China in November 2019, spread beyond China in 2020 and spread worldwide in March 2020. It is important to prevent a highly contagious virus like COVID-19 in advance and to actively treat it when confirmed, but it is more important to identify the confirmed fact quickly and prevent its spread since it is a virus that spreads quickly. However, PCR test to check for infection is costly and time consuming, and self-kit test is also easy to access, but the cost of the kit is not easy to receive every time. Therefore, if it is possible to determine whether or not a person is positive for COVID-19 based on the sound of a cough so that anyone can use it easily, anyone can easily check whether or not they are confirmed at anytime, anywhere, and it can have great economic advantages. In this study, an experiment was conducted on a method to identify whether or not COVID-19 was confirmed based on a cough sound. Cough sound features were extracted through MFCC, Mel-Spectrogram, and spectral contrast. For the quality of cough sound, noisy data was deleted through SNR, and only the cough sound was extracted from the voice file through chunk. Since the objective is COVID-19 positive and negative classification, learning was performed through XGBoost, LightGBM, and FCNN algorithms, which are often used for classification, and the results were compared. Additionally, we conducted a comparative experiment on the performance of the model using multidimensional vectors obtained by converting cough sounds into both images and vectors. The experimental results showed that the LightGBM model utilizing features obtained by converting basic information about health status and cough sounds into multidimensional vectors through MFCC, Mel-Spectogram, Spectral contrast, and Spectrogram achieved the highest accuracy of 0.74.