• 제목/요약/키워드: Data inconsistency

검색결과 228건 처리시간 0.021초

다차원 개념 계층을 지원하는 공간 데이터 큐브의 점진적 일괄 갱신 기법 (Incremental Batch Update of Spatial Data Cube with Multi-dimensional Concept Hierarchies)

  • 옥근형;이동욱;유병섭;이재동;배해영
    • 한국멀티미디어학회논문지
    • /
    • 제9권11호
    • /
    • pp.1395-1409
    • /
    • 2006
  • 공간 데이터 웨어하우스에서는 OLAP(On-Line Analytical Processing) 연산을 제공하기 위해 다차원 데이터를 공간 데이터 큐브의 형태로 관리한다. 개념 계층을 지원하는 공간 데이터 큐브의 크기는 삽입되는 데이터에 비해 방대하기 때문에 구축된 큐브의 구조를 최대한 유지하면서 새로 삽입되는 데이터를 반영시킬 수 있는 점진적 갱신 기법이 연구되어 왔다. 하지만 접두 및 접미의 중복을 제거하여 데이터를 압축 저장하는 큐브에서는 병합된 경로 간의 충돌로 인해 큐브 갱신 시 갱신 내용과 상관없는 셀까지 동시에 갱신되어 갱신이상 현상이 발생한다. 본 논문에서는 공간 데이터 큐브의 점진적 일괄 갱신 기법을 제안한다. 제안 기법은 갱신에 필요한 노드 복사본을 관리하는 자료 구조 및 재귀 탐색을 이용하여, 경로 간의 충돌이 발생할 경우 해당 노드의 복사본을 생성한 후 이를 갱신함으로써 갱신이상 현상을 방지한다. 이를 통해 다차원 개념 계층이 포함된 공간 데이터 큐브를 효율적으로 갱신할 수 있다. 성능 평가를 통해 기존 갱신 기법에 비해 제안 기법의 갱신 속도가 향상되었음을 보인다.

  • PDF

F_MixBERT: Sentiment Analysis Model using Focal Loss for Imbalanced E-commerce Reviews

  • Fengqian Pang;Xi Chen;Letong Li;Xin Xu;Zhiqiang Xing
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제18권2호
    • /
    • pp.263-283
    • /
    • 2024
  • Users' comments after online shopping are critical to product reputation and business improvement. These comments, sometimes known as e-commerce reviews, influence other customers' purchasing decisions. To confront large amounts of e-commerce reviews, automatic analysis based on machine learning and deep learning draws more and more attention. A core task therein is sentiment analysis. However, the e-commerce reviews exhibit the following characteristics: (1) inconsistency between comment content and the star rating; (2) a large number of unlabeled data, i.e., comments without a star rating, and (3) the data imbalance caused by the sparse negative comments. This paper employs Bidirectional Encoder Representation from Transformers (BERT), one of the best natural language processing models, as the base model. According to the above data characteristics, we propose the F_MixBERT framework, to more effectively use inconsistently low-quality and unlabeled data and resolve the problem of data imbalance. In the framework, the proposed MixBERT incorporates the MixMatch approach into BERT's high-dimensional vectors to train the unlabeled and low-quality data with generated pseudo labels. Meanwhile, data imbalance is resolved by Focal loss, which penalizes the contribution of large-scale data and easily-identifiable data to total loss. Comparative experiments demonstrate that the proposed framework outperforms BERT and MixBERT for sentiment analysis of e-commerce comments.

A State-of-the-Art Review on Debonding Failures of FRP Laminates Externally Adhered to Concrete

  • Kang, Thomas H.K.;Howell, Joe;Kim, Sang-Hee;Lee, Dong-Joo
    • International Journal of Concrete Structures and Materials
    • /
    • 제6권2호
    • /
    • pp.123-134
    • /
    • 2012
  • There is significant concern in the engineering community regarding the safety and effectiveness of fiber-reinforced polymer (FRP) strengthening of RC structures because of the potential for brittle debonding failures. In this paper, previous research programs conducted by other researchers were reviewed in terms of the debonding failure of FRP laminates externally attached to concrete. This review article also discusses the influences on bond strength and failure modes as well as the existing experimental research and developed equations. Based on the review, several important conclusions were re-emphasized, including the finding that the bond transfer strength is proportional to the concrete compressive strength; that there is a certain bond development length that has to be exceeded; and that thinner adhesive layers in fact lower the chances of a concrete-adhesive interface failure. It is also found that there exist uncertainty and inaccuracy in the available models when compared with the experimental data and inconsistency among the models. This demonstrates the need for continuing research and compilation of data on the topic of FRP's bond strength.

한국의 중소 제조업체 노동력 부족의 개념과 측정 (Alternative Labor Shortage Statistical Measures for Small and Medium Enterprises in Korea)

  • 설동훈
    • 한국인구학
    • /
    • 제27권1호
    • /
    • pp.121-146
    • /
    • 2004
  • 한국의 중소 제조업 노동력 부족 실태는 노동부의 <노동력수요동향조사>와 중소기업청의 <중소기업인력실태조사> 결과를 통해 파악할 수 있다. 그러나 이 두 기관에서 조사한 자료의 개념과 측정도구의 불일치가 매우 심해 인력부족실태를 정확히 파악하는 데 심각한 어려움이 있다. 본 연구는 한국의 '인력부족'의 개념 정의와 측정 및 조사방법에서 혼란이 발생하고 있음을 밝혀낸 후, 대안적인 통계 지표들을 제시하여 그 혼동을 최소화할 수 있는 방안을 제시하고 있다.

SSR (Simple Sector Remapper) the fault tolerant FTL algorithm for NAND flash memory

  • Lee, Gui-Young;Kim, Bumsoo;Kim, Shin-han;Byungsoo Jung
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 ITC-CSCC -2
    • /
    • pp.932-935
    • /
    • 2002
  • In this paper, we introduce new FTL(Flash Translation Layer) driver algorithm that tolerate the power off errors. FTL driver is the software that provide the block device interface to the upper layer software such as file systems or application programs that using the flash memory as a block device interfaced storage. Usually, the flash memory is used as the storage devices of the mobile system due to its low power consumption and small form factor. In mobile system, the state of the power supplement is not stable, because it using the small sized battery that has limited capacity. So, a sudden power off failure can be occurred when we read or write the data on the flash memory. During the write operation, power off failure may introduce the incomplete write operation. Incomplete write operation denotes the inconsistency of the data in flash memory. To provide the stable storage facility with flash memory in mobile system, FTL should provide the fault tolerance against the power off failure. SSR (Simple Sector Remapper) is a fault tolerant FTL driver that provides block device interface and also provides tolerance against power off errors.

  • PDF

현대 니트패션에 나타난 해체주의 특성 (Characteristics of deconstruction expressed in the contemporary knit fashion)

  • 이윤미
    • 복식문화연구
    • /
    • 제26권4호
    • /
    • pp.583-597
    • /
    • 2018
  • The purpose of this study is to classify and analyze the deconstruction phenomena expressed in contemporary knit fashion design, and to analyze the inner meaning of deconstruction based on certain characteristics. As a method of study, literature data for theoretical backgrounds, prior studies, and internet data were analyzed. The scope of this study was restricted to knitwear published in the world's four major collections (Milan, Paris, New York and London) from 2014 F/W to 2018 S/S. Based on prior studies, four concepts of deconstruction were derived: "$Diff{\acute{e}}reance$", "Intertextuality", "Intermeaning of Meaning", "Dis De Phenomenon". The results of the study were as follows: first, "$Diff{\acute{e}}reance$" refers to a transcendence of time and space. These expressions are discursive, unrealistic, and convey freedom through intent that deviates from rules and norms. Second, "Intertextuality" indicates a mixture of different texts, such as styles, materials, and items. These expressions deliver novelty with amusement, and can be entertaining depending on audience expectations. Third, "Intermeaning of Meaning" is accidental category - depending on how the wearer wears the clothing. -; accordingly, free and spontaneous creativity is an emerging trend in fashion. Fourth, the clothing was expressed in deformed and distorted form by the construction and destruction of the structure, a technique we describe as the "Dis De Phenomenon". In this concept, the sense of free design of young emotion appears along with the sense of purity and shock due to intentional inconsistency.

Image Tracking Algorithm using Template Matching and PSNF-m

  • Bae, Jong-Sue;Song, Taek-Lyul
    • International Journal of Control, Automation, and Systems
    • /
    • 제6권3호
    • /
    • pp.413-423
    • /
    • 2008
  • The template matching method is used as a simple method to track objects or patterns that we want to search for in the input image data from image sensors. It recognizes a segment with the highest correlation as a target. The concept of this method is similar to that of SNF (Strongest Neighbor Filter) that regards the measurement with the highest signal intensity as target-originated among other measurements. The SNF assumes that the strongest neighbor (SN) measurement in the validation gate originates from the target of interest and the SNF utilizes the SN in the update step of a standard Kalman filter (SKF). The SNF is widely used along with the nearest neighbor filter (NNF), due to computational simplicity in spite of its inconsistency of handling the SN as if it is the true target. Probabilistic Strongest Neighbor Filter for m validated measurements (PSNF-m) accounts for the probability that the SN in the validation gate originates from the target while the SNF assumes at any time that the SN measurement is target-originated. It is known that the PSNF-m is superior to the SNF in performance at a cost of increased computational load. In this paper, we suggest an image tracking algorithm that combines the template matching and the PSNF-m to estimate the states of a tracked target. Computer simulation results are included to demonstrate the performance of the proposed algorithm in comparison with other algorithms.

측정의 본성에 대한 초등학생들의 인식론적 견해 (Elementary Students' Epistemological Views on the Nature of Scientific Measurement)

  • 양찬호;이지현;김영훈;노태희
    • 한국초등과학교육학회지:초등과학교육
    • /
    • 제30권4호
    • /
    • pp.430-441
    • /
    • 2011
  • We investigated the elementary students' epistemological views on the nature of scientific measurement. The Views About Scientific Measurement (Ibrahim, 2005) was administered to 117 sixth graders. The analyses of the results indicated that there was an inconsistency in their epistemological views depending on the contexts of the measurement. They also had some difficulties in understanding a distribution of the data, which is needed to understand the necessity of repeating measurements, choosing a best representative value, and comparing data sets. They were found to have some naive views on scientific measurement which influenced negatively for fostering modern epistemological views on the nature of scientific measurement. The results suggest that the nature of scientific measurement should be emphasized explicitly in the national curriculum, and an effective method which improves elementary students' epistemological views on the nature of scientific measurement also be developed.

투석기간에 따른 투석 환자의 불확실성 요인 (Factors Influencing Uncertainty in Dialysis Patient by Duration of Dialysis)

  • 윤수정;이영희
    • 성인간호학회지
    • /
    • 제24권6호
    • /
    • pp.597-606
    • /
    • 2012
  • Purpose: This study was to describe the uncertainty, depression, physical symptom, and family support among patients undergoing dialysis. Further, the factors that impact uncertainty were also examined. Methods: A convenience sample of 145 patients who received dialysis was selected. A descriptive correlation study was conducted. Data were collected using structured questionnaires and the collected data were analyzed using descriptive statistics and multiple regression analysis. Results: The patient who received more than five years of dialysis reported higher levels on inconsistency of uncertainty than patient with less than five years. These latter patients' reported uncertainty was positively correlated with depression, whereas, patients family support was correlated with uncertainty. The group's uncertainty with less than five years of dialysis explained about 13% of the variance. In contrast, variables of education level, family support, and monthly income were predictors of uncertainty and explained 33% of the variation. Conclusion: These results can provide for nursing intervention to facilitate reduction of uncertainty. To provide dialysis period-sensitive nursing intervention for uncertainty among dialysis patient, depression should be considered below five years. While factors such as education level, family support, and monthly income should be taken into account over five years.

분산 이기종 환경에서의 메시지미들웨어(MOM) 시스템 통합방안 연구 (Methods to System Integration in Distributed Heterogeneous Environments)

  • 김종배;송재영;류성열
    • 디지털콘텐츠학회 논문지
    • /
    • 제6권3호
    • /
    • pp.163-168
    • /
    • 2005
  • 전산구조와 기술이 분산환경으로 옮겨가고 있고 인수합병 및 프로세스 아웃소싱 둥의 증가로 인해 혹은 조직내의 다양한 시스템이 신규 개발 또는 증설됨에 따라 이기종 플랫폼간의 상호연계미비, 유지보수의 난이함, 데이터 중복성과 일관성 결여 등의 문제점들이 발생되면서 EAI환경의 도입에 대한 요구가 증가하고 있으나, 많은 비용과 솔루션 선정의 어려움으로 그 적용이 쉽지 않다. 따라서 본 연구에서는 분산시스템 환경에서 이기종 간의 데이터 및 응용프로그램 통합을 위한 효율적인 대안으로 메시지미들웨어를 적용한 시스템통합방안을 제시하였다. 본 논문에서 밝힌 시스템간의 메시지 미들웨어를 이용한 데이터 통합방안은 비용과 성능 측면에서 소규모 시스템간의 인터페이스를 구축할 수 있는 효율적 대안이 될 것으로 기대한다.

  • PDF