• Title/Summary/Keyword: 데이터품질관리

Search Result 855, Processing Time 0.339 seconds

A Study on the Profiling of Collect Site for the Effective Reputation Analysis (효과적인 평판분석을 위한 수집사이트 프로파일링에 관한 연구)

  • Song, Eun-Jee;Kang, Min-Sik
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.05a
    • /
    • pp.617-618
    • /
    • 2014
  • 본 논문에서는 보다 정확하고 효과적인 평판분석을 위하여 서비스 산업별 타겟으로 하는 수집사이트를 프로파일링 하는 방법을 제안한다. 먼저 각 서비스에 특화된 타겟 사이트를 추출하고 등록하고 각 서비스에 관련한 정보 및 의견 공유 게시판과 지식인 추천/질문 등 지식 공유 사이트를 추출한다. 또한 업종별 주요 사이트를 선택하고 등록하여 유효 데이터 수집한다. 이를 통해 실시간 수집 데이터의 활용 기술을 이용하여 수집원 프로파일링을 통한 미디어별 수집 주기 산정하고 수집 엔진의 유연한 확장성을 활용한 실시간 수집 제반 기술 확대할 수 있다. 또한 지속적인 수집원 변경관리를 수행한다. 즉, 신규 생성, 변경, 삭제되는 사이트에 대한 변경관리를 수행하고 지속적인 수집량 모니터링을 통한 수집여부를 점검하며 수집 필터링 규칙에 대한 튜닝으로 데이터 품질 확보하도록 한다.

  • PDF

Improvement of the Electronic Customs System to Improve the Quality of Export and Import Declaration Data (수출입 신고데이터 품질제고를 위한 전자통관 시스템 개선)

  • Jo, Hang-Jin;Park, Koo-Rack;Lee, Jang-Sik
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.07a
    • /
    • pp.429-430
    • /
    • 2021
  • 본 논문에서는 우리나라 수출입 등 통관의 전반적인 것을 담당하는 전자통관시스템의 문제점을 발굴하고 개선점을 찾아 보완하여 보다 진화된 양질의 전자통관시스템으로 개발하는 것을 목표로 하고 있다. 전자통관시스템의 문제점을 중심으로 구분하면 통관업무, 사전검증시스템, 데이터정제시스템으로 볼 수 있다. 각 부분별로 문제점을 분석한 결과 오류 데이터 전송, 자가진단 점검기능 부재, 경험치로 관리, 사후관리 체계 부재, 오류분석 기능 미활용으로 나왔으며, 이런 문제점을 해결하기 위해 개선방안에 대해 면밀히 검토하여 맞춤형 대책을 마련하여 처음 신고인이 양질의 정보를 정확히 입력할 수 있는 시스템 구축부터 검증시스템을 통해 정확한 정보를 정제하는 중간단계를 거쳐 마지막으로 최상의 결과를 도출 및 제공하기까지 시스템을 향상시켜 이용객에게 더욱 정확한 처리결과 제공 및 진화된 국가행정시스템을 구축함으로써 국가경쟁력을 강화할 수 있다.

  • PDF

A Study of the Workflow and the Metadata for Web Records Archiving (웹 기록물 아카이빙을 위한 워크플로우 및 메타데이터 연구)

  • Seung-Jun Cha;Dong-Suk Chun;Kyu-Chul Lee
    • Annual Conference of KIPS
    • /
    • 2008.11a
    • /
    • pp.1379-1382
    • /
    • 2008
  • 웹은 급속하게 변화하는 현대사회에서 정부와 시민들의 주요 의사소통의 채널이 되고 있다. 웹에서 유통되는 정보량이 급증하면서 정보원으로서의 웹에 대한 의존도가 크게 높아졌을 뿐만 아니라 전적으로 웹에만 존재하는 정보자원도 증가하고 있다. 중요한 가치를 지닌 웹사이트는 짧은 수명주기와 수집, 보존, 활용에 대한 방안이 없어 소멸되고 있는 실정이다. 이러한 문제를 해결하기 위해 웹 기록물 아카이빙을 위한 기반기술로 워크플로우 및 메타데이터 정의가 필요하다. 따라서 본 논문에서는 웹 기록물을 아카이빙하기 위해 선별, 수집, 품질관리 및 목록화, 보존, 저장으로 구성되는 워크플로우 및 장기 보존과 검색에 필수적인 메타데이터를 정의하였다. 이러한 연구 개발 및 적용을 통해 사라져 가는 중요한 자원인 웹 기록물을 후대에 중요한 기록물 자원으로 저장 및 관리할 수 있게 될 것이다.

Application of a REID-Based Monitoring System for the Concrete Pour Process (RFID를 응용한 콘크리트 타설 모니터링 시스템의 적용방안)

  • Moon, Sung-Woo;Hong, Seung-Moon
    • Korean Journal of Construction Engineering and Management
    • /
    • v.8 no.3
    • /
    • pp.142-149
    • /
    • 2007
  • A ubiquitous environment in construction should be developed integrating hardware and software systems. The objective of this paper is to study the feasibility of applying the RFID technology to the concrete pour process, and improve the effectiveness of data exchange A pilot system of u-CPS (Ubiquitous Concrete Pour System) has been developed to test the feasibility. The pilot can automatically generate the data for concrete pour work such as departure time, arrival time, concrete pour time. Construction managers can keep track of the progress of concrete pour work using the information. A case study was done for a building construction using the pilot system, the result of which demonstrated that the RFID-base system can help improve the effectiveness of data communication during the concrete pour process.

A Prediction Model for Coating Thickness Based on PLS Model and Variable Selection (부분최소자승법과 변수선택을 이용한 코팅두께 예측모델 개발)

  • Lee, Hye-Seon;Lee, Young-Rok;Jun, Chi-Hyuck;Hong, Jae-Hwa
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.2
    • /
    • pp.295-304
    • /
    • 2010
  • Coating thickness is one of target variables in quality control process in steel industry. To predict coating thickness and to control quality of anti-fingerprint steel coils, ultraviolet-visible spectra are measured. We propose a variable-interval selection procedure based on the variable importance in projection in partial least square model. Using the proposed variable interval selection method, prediction performance gets better in the reduced model than the full model with full spectra absorbance. It is also shown that the first differencing as a data preprocessing technique does work well for the prediction of coating thickness.

Assessment of Missing Data Estimation with Rain Radar (강우레이더를 활용한 강수량 결측 보정에 관한 연구)

  • Kim, Tae Hyung;Lee, Jong-Hyeon;Lee, Yeong-Gon;Jang, Seung-Yeong;Choe, Gyu-Hyeon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2018.05a
    • /
    • pp.310-310
    • /
    • 2018
  • Generally, precipitation measurement were conducted with various authrities. Among these, the MOLIT conduct the hydrological survey for the water resource management such as flood and low-flow forecasting, drought countermeasure, streamflow management. There is totally 424 observatory were existed and each precipitation measurement were obtained and quality assuranced with 10-min interval. It could be arranged or estimated with nearby observatory and radar reflectivity when the total amount of precipitation are existed. The objective of the study is therefore to suggest the method to estimate missing data with rain radar reflectivity. To validate suggested method, 50 observartory were obtained, and the efficiency were analyzed with estimated and observed precipitation. As the result of the study, the suggested method has reliability, and can be used as a method for quality assurance.

  • PDF

A biometric information collecting system for biomedical big data analysis (생체 의학 빅 데이터 분석을 위한 생체 정보 수집 시스템)

  • Lim, Damsub;Hong, Sunhag;Ku, Mino;Min, Dugki
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.10a
    • /
    • pp.513-516
    • /
    • 2013
  • In this paper, we present an information collecting system in medical information management domain. Our proposed system performs a systemized process, consisting of collection, transmission, and management, to develop intelligent medical information system and medical big data processing system. Our information collecting system consists of low-power biomedical sensors, biomedical information collecting devices, and storage systems. Currently, almost biomedical information of patients is collected manually by employees like nurses and medical doctors. Therefore, collected biometric data can be error-pronoun data. Since there is a lack to make big data of medical information, it is difficult to enhance the quality of medical services and researches. Accordingly, through our proposed system, we can overcome the problems like error-pronoun biometric data. In addition, we can extremely extend the area of collectable biometric data. Furthermore, using this system, we are able to make a real-time biomedical analysis system, like a real-time patient diagnosis system, and establish a strategy to against future medical markets changing rapidly.

  • PDF

Cooperative Video Streaming and Active Node Buffer Management Technique in Hybrid CDN/P2P Architecture

  • Lee, Jun Pyo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.11
    • /
    • pp.11-19
    • /
    • 2019
  • Recently, hybrid CDN/P2P video streaming architecture is specially designed and deployed to achieve the scalability of P2P networks and the desired low delay and high throughput of CDNs. In this paper, we propose a cooperative video streaming and active node buffer management technique in hybrid CDN/P2P architecture. The key idea of this streaming strategy is to minimize network latency such as jitter and packet loss and to maximize the QoS(quality of service) by effectively and efficiently utilizing the information sharing of file location in CDN's proxy server which is an end node located close to a user and P2P network. Through simulation, we show that the proposed cooperative video streaming and active node buffer management technique based on CDN and P2P network improves the performance of realtime video streaming compared to previous methods.

Automatic Classification of Academic Articles Using BERT Model Based on Deep Learning (딥러닝 기반의 BERT 모델을 활용한 학술 문헌 자동분류)

  • Kim, In hu;Kim, Seong hee
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.3
    • /
    • pp.293-310
    • /
    • 2022
  • In this study, we analyzed the performance of the BERT-based document classification model by automatically classifying documents in the field of library and information science based on the KoBERT. For this purpose, abstract data of 5,357 papers in 7 journals in the field of library and information science were analyzed and evaluated for any difference in the performance of automatic classification according to the size of the learned data. As performance evaluation scales, precision, recall, and F scale were used. As a result of the evaluation, subject areas with large amounts of data and high quality showed a high level of performance with an F scale of 90% or more. On the other hand, if the data quality was low, the similarity with other subject areas was high, and there were few features that were clearly distinguished thematically, a meaningful high-level performance evaluation could not be derived. This study is expected to be used as basic data to suggest the possibility of using a pre-trained learning model to automatically classify the academic documents.

콘텐츠라인

  • Korea Database Promotion Center
    • Digital Contents
    • /
    • no.10 s.149
    • /
    • pp.93-100
    • /
    • 2005
  • IT 엑스포부산 U도시부산가능성확인…국제행사초석다져/2005 데이터베이스 그랜드 컨퍼런스 "데이터고도화, 품질관리부터시작하세요!”/오마 테스트페스트 넘버10 국제모바일표준화시험대회한국서개최/ETRI 게임 핵심기술 발표회 모든단말기동시연동게임기술개발/한국게임운영자협회 설립 발족식 KGMA, “게임마스터위상제고에전력”

  • PDF