• Title/Summary/Keyword: 데이터부족문제

Search Result 550, Processing Time 0.036 seconds

Exploring Time Series Data Information Extraction and Regression using DTW based kNN (DTW 거리 기반 kNN을 활용한 시계열 데이터 정보 추출 및 회귀 예측)

  • Hyeonjun Yang;Chaeguk Lim;Woohyuk Jung;Jihwan Woo
    • Information Systems Review
    • /
    • v.26 no.2
    • /
    • pp.83-93
    • /
    • 2024
  • This study proposes a preprocessing methodology based on Dynamic Time Warping (DTW) and k-Nearest Neighbors (kNN) to effectively represent time series data for predicting the completion quality of electroplating baths. The proposed DTW-based kNN preprocessing approach was applied to various regression models and compared. The results demonstrated a performance improvement of up to 43% in maximum RMSE and 24% in MAE compared to traditional decision tree models. Notably, when integrated with neural network-based regression models, the performance improvements were pronounced. The combined structure of the proposed preprocessing method and regression models appears suitable for situations with long time series data and limited data samples, reducing the risk of overfitting and enabling reasonable predictions even with scarce data. However, as the number of data samples increases, the computational load of the DTW and kNN algorithms also increases, indicating a need for future research to improve computational efficiency.

A Hybrid Blockchain-Based E-Voting System with BaaS (BaaS를 이용한 하이브리드 블록체인 기반 전자투표 시스템)

  • Kang Myung Joe;Kim Mi Hui
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.12 no.8
    • /
    • pp.253-262
    • /
    • 2023
  • E-voting is a concept that includes actions such as kiosk voting at a designated place and internet voting at an unspecified place, and has emerged to alleviate the problem of consuming a lot of resources and costs when conducting offline voting. Using E-voting has many advantages over existing voting systems, such as increased efficiency in voting and ballot counting, reduced costs, increased voting rate, and reduced errors. However, centralized E-voting has not received attention in public elections and voting on corporate agendas because the results of voting cannot be trusted due to concerns about data forgery and modulation and hacking by others. In order to solve this problem, recently, by designing an E-voting system using blockchain, research has been actively conducted to supplement concepts lacking in existing E-voting, such as increasing the reliability of voting information and securing transparency. In this paper, we proposed an electronic voting system that introduced hybrid blockchain that uses public and private blockchains in convergence. A hybrid blockchain can solve the problem of slow transaction processing speed, expensive fee by using a private blockchain, and can supplement for the lack of transparency and data integrity of transactions through a public blockchain. In addition, the proposed system is implemented as BaaS to ensure the ease of type conversion and scalability of blockchain and to provide powerful computing power. BaaS is an abbreviation of Blockchain as a Service, which is one of the cloud computing technologies and means a service that provides a blockchain platform ans software through the internet. In this paper, in order to evaluate the feasibility, the proposed system and domestic and foreign electronic voting-related studies are compared and analyzed in terms of blockchain type, anonymity, verification process, smart contract, performance, and scalability.

Small-business Counseling: Impact for Applying Triple Helix (소상공인 경영 컨설팅: 트리플 힐릭스 적용의 효과)

  • Kim, Taekyung
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.11 no.2
    • /
    • pp.183-195
    • /
    • 2016
  • Although we have faced substantial public interests on the issues of small-business, the lack of effective solutions corresponding to them should be worried. This paper introduces findings from the application of Triple Helix to counseling activities for small-business stores with diagnosing problems and suggesting alternatives. The findings of this paper are based on counseling projects conducted by Gyunggi Small-medium Business Corporation, universities in the region, and multiple small-business stores together from June to November in 2015. The application of Triple Helix was positive for increasing the effectiveness of counseling, and this key finding was obtained from action research cycles. It was also confirmed that cooperation from three different entities including company, university and government institute was beneficial in increasing problem identification capabilities by students and providing opportunities for testing knowledge and skills, which means Triple Helix application to small-business helps management education to be more practical. Contributions of this study supplies substantial insights to academic audience, and practitioners can learn positive effects of Triple Helix and potential issues for implementation in the context of small-business counseling.

  • PDF

A Design of Semantic Contents Search System for Multimedia Ontology (멀티미디어 온톨로지 기반의 의미론적 콘텐츠 검색 시스템 설계)

  • Hwang, Chi-Gon;Moon, Seok-Jae;Lee, Daesung;Yoon, Chang-Pyo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.05a
    • /
    • pp.245-248
    • /
    • 2013
  • With the development of multimedia and network technology, the production of multimedia contents is rapidly increasing. Meanwhile, the technology to search and use the contents is still insufficient. There are standards for multimedia contents to address the problem, but they cannot fully support diverse multimedia data types or ensure their interoperability. In this paper, an ontology-based content search system is proposed to ensure the interoperability of multimedia contents. The ontology is configured by presenting the rules for it using the schema structure of the multimedia description scheme (MDS) of MPEG-7. Based on this ontology, the association of the multimedia data is expanded to design an access system that allows semantic search.

  • PDF

Attention-based word correlation analysis system for big data analysis (빅데이터 분석을 위한 어텐션 기반의 단어 연관관계 분석 시스템)

  • Chi-Gon, Hwang;Chang-Pyo, Yoon;Soo-Wook, Lee
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.27 no.1
    • /
    • pp.41-46
    • /
    • 2023
  • Recently, big data analysis can use various techniques according to the development of machine learning. Big data collected in reality lacks an automated refining technique for the same or similar terms based on semantic analysis of the relationship between words. Since most of the big data is described in general sentences, it is difficult to understand the meaning and terms of the sentences. To solve these problems, it is necessary to understand the morphological analysis and meaning of sentences. Accordingly, NLP, a technique for analyzing natural language, can understand the word's relationship and sentences. Among the NLP techniques, the transformer has been proposed as a way to solve the disadvantages of RNN by using self-attention composed of an encoder-decoder structure of seq2seq. In this paper, transformers are used as a way to form associations between words in order to understand the words and phrases of sentences extracted from big data.

Questionnaire Survey on the Occurrence Time and Cause of Defect in Remodeled Apartment (아파트 리모델링 공사의 하자발생 시기 및 원인에 관한 연구)

  • Park, Sun-Gyu
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.12
    • /
    • pp.596-603
    • /
    • 2015
  • Rapid economic growth occurred in Korean since the 1970s. This economic growth caused rapid urbanization and an increase in population in city. As a results, there was a housing shortage problem in korea. The government and construction companies have been continuously building residential houses such as apartments, mansions, others. Apartments mostly consist of residential houses and are generally made from reinforced concrete structure. Reinforced concrete structures including apartments need to be renovated, because they deteriorate with respect to time. However, there is no available data or information regarding the cost or the period of time that is needed for renovates of these apartment. We are especially short of information on the defect data of remodeling construction. Therefore, the purpose of this paper is to provide fundamental data about the cause of defect in remodeled apartments as well as the appropriate time to execute renovation through a questionnaire survey with apartment residents as the participants.

A study on evapotranspiration using Terra MODIS images and soil water deficit index (Terra MODIS 위성영상과 토양수분 부족지수를 이용한 증발산량 산정 연구)

  • Jinuk Kim;Yonggwan Lee;Jeehun Chung;Jiwan Lee;Seongjoon Kim
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.119-119
    • /
    • 2023
  • 본 연구에서는 Terra MODIS(MODerate resolution Imaging Spectroradiometer) 위성영상과 토양수분 부족지수(Soil Water Deficit Index, SWDI)를 이용하여 2012년부터 2022년까지 한반도 전국의 1km 공간 증발산량을 산정하였다. 공간 증발산량을 산정하기 위한 과정은 크게 두 가지로 구분된다. 첫 번째로 MODIS의 LST(Land Surface Temperature), NDVI(Normalized Difference Vegetation Index), 선행강우 및 무강우 누적일수를 이용해 1 km 공간 토양수분을 산정하였다. 농촌진흥청 토양수분관측망 자료 중 토지피복, 토양 속성을 고려하여 선정된 70개소 토양수분 실측데이터와 비교한 결과 지점별 평균 R2 0.63~0.90으로 유의미한 상관성을 나타내었다. 산정된 공간 토양수분은 생장저해수분점과 초기위조점의 관계를 이용한 SWDI로 변환하였다. 두 번째로 순 복사량, 기온 및 NDVI의 적은 수문인자를 통해 증발산량 산정이 가능한 MS-PT(Modified Satellite-based Priestley-Taylor) 모형을 기반으로 계절별 식생과 토양수분 상태를 고려하여 1 km 공간 증발산량을 산정하였다. MS-PT 모형에서 가정한 대기 증발 변수 Diurnal temperature (DT)와 지표 수분의 상관성 문제를 해결하기 위해 DT를 SWDI로 적용하였다. 모형 결과의 검증을 위해 국내 플럭스 타워 (설마천, 청미천, 덕유산) 증발산량 관측자료와의 결정계수(Coefficient of determination, R2), RMSE(Root Mean Square Error) 및 IOA(Index of Agreement)를 산정하였다. 본 연구의 결과로 생산되는 국내 증발산량의 시, 공간적 변동성은 증발산량을 통한 수문학적 가뭄지수 및 급성 가뭄을 파악하는데 활용될 수 있을 것으로 판단된다.

  • PDF

Temporal Analysis of Agricultural Reservoir Water Surface Area using Remote Sensing and CNN (위성영상 및 CNN을 활용한 소규모 농업용 저수지의 수표면적 시계열 분석)

  • Yang, Mi-Hye;Nam, Won-Ho;Lee, Hee-Jin;Kim, Taegon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.118-118
    • /
    • 2021
  • 최근 지구 온난화 현상으로 인한 기후변화로 이상기후 현상이 발생하고 있으며 이로 인해 장기적으로 폭염의 빈도 및 강도 상승에 따른 가뭄 피해 우려가 증가하고 있다. 농업 가뭄은 강수량 부족, 토양 수분 부족, 저수량 부족 등 농업분야에 영향을 주는 인자들과 관련되어 있어 농작물 생육 및 수확량 감소를 야기한다. 우리나라는 논농사가 주를 이루고 있어 국내 농업 가뭄은 주수원공인 농업용 저수지의 가용저수용량으로 판단 가능하다. 따라서 안정적인 농업용수 공급을 위해 수리시설물의 모니터링, 공급량 등의 분석이 이루어져야 하며, 농업 가뭄에 대비하기 위해 농업용 저수지의 가용저수용량 파악이 필요하다. 수자원 분야에서 지점자료의 시·공간적 한계점을 보완하기 위해 인공위성 자료를 활용한 연구가 활발히 이루어지고 있으며, 본 연구에서는 위성영상 자료 및 딥러닝 기반 알고리즘을 적용하여 농업용 저수지 수표면 탐지 및 시계열 분석을 목적으로 한다. 위성영상 자료는 5일 주기 및 10 m 공간해상도를 가진 Sentinel-2 위성영상 자료를 활용하고자 하였으며, 딥러닝에 적용하기 위하여 100장 이상의 영상 이미지를 구축하였다. 딥러닝 기반 알고리즘으로는 Convolutional Neural Network (CNN)을 활용하였으며, CNN은 주로 이미지 분류나 객체 검출 문제를 해결하기 위해 제안된 모델로 최근 픽셀 단위로 분류가 가능한 알고리즘이 개발되어 높은 정확도의 수표면 탐지가 가능할 것으로 판단된다. 따라서 본 연구에서는 CNN 기반 수표면 탐지 알고리즘을 개발하여 Sentinel-2 영상 기준 경기도 안성시를 대상으로 소규모 농업용 저수지의 수표면적에 대한 시계열 데이터를 분석하고자 한다.

  • PDF

Analysis and Modeling of Essential Concepts and Process for Peer-Reviewing Data Paper (데이터논문 동료심사를 위한 핵심 개념 분석과 프로세스 모델링)

  • Sungsoo Ahn;Sung-Nam Cho;Youngim Jung
    • Journal of Korean Library and Information Science Society
    • /
    • v.54 no.3
    • /
    • pp.321-346
    • /
    • 2023
  • A data paper describing research data helps credit researchers producing the data while helping other researchers verify previous research and start new research by reusing the data. Publishing a data paper and depositing data to a public data repository are increasing with these benefits. A domestic academic society that plans to publish data papers faces challenges, including timely acquiring tremendous knowledge concerning data paper structures and templates, peer review policy and process, and trustworthy data repositories, as a data paper has different characteristics, unlike a research paper. However, the need for more research and information concerning the critical elements of data paper and the peer-review process makes it difficult to operate for data paper review and publication. To address these issues, we propose essential concepts of the data paper and the data paper peer-review, including the process model of the peer-review with in-depth analysis of five data journals' data paper templates, articles, and other guides worldwide. Academic societies intending to publish or add data papers as a new type of paper may establish policies and define a peer-review process by adopting the proposed conceptual models, effectively streamlining the preparation of data paper publication.

A Transfer Learning Method for Solving Imbalance Data of Abusive Sentence Classification (욕설문장 분류의 불균형 데이터 해결을 위한 전이학습 방법)

  • Seo, Suin;Cho, Sung-Bae
    • Journal of KIISE
    • /
    • v.44 no.12
    • /
    • pp.1275-1281
    • /
    • 2017
  • The supervised learning approach is suitable for classification of insulting sentences, but pre-decided training sentences are necessary. Since a Character-level Convolution Neural Network is robust for each character, so is appropriate for classifying abusive sentences, however, has a drawback that demanding a lot of training sentences. In this paper, we propose transfer learning method that reusing the trained filters in the real classification process after the filters get the characteristics of offensive words by generated abusive/normal pair of sentences. We got higher performances of the classifier by decreasing the effects of data shortage and class imbalance. We executed experiments and evaluations for three datasets and got higher F1-score of character-level CNN classifier when applying transfer learning in all datasets.