• 제목/요약/키워드: meta information

검색결과 1,265건 처리시간 0.027초

다중 클래스 데이터셋의 메타특징이 판별 알고리즘의 성능에 미치는 영향 연구 (The Effect of Meta-Features of Multiclass Datasets on the Performance of Classification Algorithms)

  • 김정훈;김민용;권오병
    • 지능정보연구
    • /
    • 제26권1호
    • /
    • pp.23-45
    • /
    • 2020
  • 기업의 경쟁력 확보를 위해 판별 알고리즘을 활용한 의사결정 역량제고가 필요하다. 하지만 대부분 특정 문제영역에는 적합한 판별 알고리즘이 어떤 것인지에 대한 지식은 많지 않아 대부분 시행착오 형식으로 최적 알고리즘을 탐색한다. 즉, 데이터셋의 특성에 따라 어떠한 분류알고리즘을 채택하는 것이 적합한지를 판단하는 것은 전문성과 노력이 소요되는 과업이었다. 이는 메타특징(Meta-Feature)으로 불리는 데이터셋의 특성과 판별 알고리즘 성능과의 연관성에 대한 연구가 아직 충분히 이루어지지 않았기 때문이며, 더구나 다중 클래스(Multi-Class)의 특성을 반영하는 메타특징에 대한 연구 또한 거의 이루어진 바 없다. 이에 본 연구의 목적은 다중 클래스 데이터셋의 메타특징이 판별 알고리즘의 성능에 유의한 영향을 미치는지에 대한 실증 분석을 하는 것이다. 이를 위해 본 연구에서는 다중 클래스 데이터셋의 메타특징을 데이터셋의 구조와 데이터셋의 복잡도라는 두 요인으로 분류하고, 그 안에서 총 7가지 대표 메타특징을 선택하였다. 또한, 본 연구에서는 기존 연구에서 사용하던 IR(Imbalanced Ratio) 대신 시장집중도 측정 지표인 허핀달-허쉬만 지수(Herfindahl-Hirschman Index, HHI)를 메타특징에 포함하였으며, 역ReLU 실루엣 점수(Reverse ReLU Silhouette Score)도 새롭게 제안하였다. UCI Machine Learning Repository에서 제공하는 복수의 벤치마크 데이터셋으로 다양한 변환 데이터셋을 생성한 후에 대표적인 여러 판별 알고리즘에 적용하여 성능 비교 및 가설 검증을 수행하였다. 그 결과 대부분의 메타특징과 판별 성능 사이의 유의한 관련성이 확인되었으며, 일부 예외적인 부분에 대한 고찰을 하였다. 본 연구의 실험 결과는 향후 메타특징에 따른 분류알고리즘 추천 시스템에 활용할 것이다.

가상공동체에서 지식탐색을 통한 지식공유에 관한 연구 (Understanding Knowledge Sharing in Virtual Communities through Knowledge Seeking Behavior)

  • 김재경
    • 한국IT서비스학회지
    • /
    • 제13권1호
    • /
    • pp.71-86
    • /
    • 2014
  • This study investigated knowledge browsing behavior as the factor affecting the increase of knowledge sharing intention. To conduct this study in the specific context of knowledge seeking and sharing behavior of virtual community members, literature on knowledge seeking behavior, meta-knowledge, and knowledge sharing intention was reviewed. Structural Equation Modeling was conducted to analyze survey data to test the research model of this study. The result showed that knowledge browsing have positive effects on creating of virtual community members' subject knowledge and meta-knowledge, which, in turn, affected positively their knowledge sharing intention. One of the main contributions of this study is that knowledge seeking behavior influence one's knowledge sharing intention in a virtual community. Organization managers should consider knowledge seeking behavior as not only a self-interested, consuming activity, but also a productive one through its function of constructing subject knowledge and meta-knowledge.

산업용 음성 DB를 위한 XML 기반 메타데이터 (XML Based Meta-data Specification for Industrial Speech Databases)

  • 주영희;홍기형
    • 대한음성학회지:말소리
    • /
    • 제55권
    • /
    • pp.77-91
    • /
    • 2005
  • In this paper, we propose an XML based meta-data specification for industrial speech databases. Building speech databases is very time-consuming and expensive. Recently, by the government supports, huge amount of speech corpus has been collected as speech databases. However, the formats and meta-data for speech databases are different depending on the constructing institutions. In order to advance the reusability and portability of speech databases, a standard representation scheme should be adopted by all speech database construction institutions. ETRI proposed a XML based annotation scheme [51 for speech databases, but the scheme has too simple and flat modeling structure, and may cause duplicated information. In order to overcome such disadvantages in this previous scheme, we first define the speech database more formally and then identify object appearing in speech databases. We then design the data model for speech databases in an object-oriented way. Based on the designed data model, we develop the meta-data specification for industrial speech databases.

  • PDF

Importance of Meta-Analysis and Practical Obstacles in Oncological and Epidemiological Studies: Statistics Very Close but Also Far!

  • Tanriverdi, Ozgur;Yeniceri, Nese
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제16권3호
    • /
    • pp.1303-1306
    • /
    • 2015
  • Studies of epidemiological and prognostic factors are very important for oncology practice. There is a rapidly increasing amount of research and resultant knowledge in the scientific literature. This means that health professionals have major challenges in accessing relevant information and they increasingly require best available evidence to make their clinical decisions. Meta-analyses of prognostic and other epidemiological factors are very practical statistical approaches to define clinically important parameters. However, they also feature many obstacles in terms of data collection, standardization of results from multiple centers, bias, and commentary for intepretation. In this paper, the obstacles of meta-analysis are briefly reviewed, and potential problems with this statistical method are discussed.

Meta Analysis of Usability Experimental Research Using New Bi-Clustering Algorithm

  • Kim, Kyung-A;Hwang, Won-Il
    • 응용통계연구
    • /
    • 제21권6호
    • /
    • pp.1007-1014
    • /
    • 2008
  • Usability evaluation(UE) experiments are conducted to provide UE practitioners with guidelines for better outcomes. In UE research, significant quantities of empirical results have been accumulated in the past decades. While those results have been anticipated to integrate for producing generalized guidelines, traditional meta-analysis has limitations to combine UE empirical results that often show considerable heterogeneity. In this study, a new data mining method called weighted bi-clustering(WBC) was proposed to partition heterogeneous studies into homogeneous subsets. We applied the WBC to UE empirical results and identified two homogeneous subsets, each of which can be meta-analyzed. In addition, interactions between experimental conditions and UE methods were hypothesized based on the resulting partition and some interactions were confirmed via statistical tests.

인터넷상의 메타탐색엔진의 검색효율성 비교연구 (The study on the retrieval effectiveness of meta-search engine on the internet)

  • 김성희
    • 한국도서관정보학회지
    • /
    • 제27권
    • /
    • pp.457-483
    • /
    • 1997
  • This study was intended to compare the effectiveness of the Savvy search and Metacrawler in terms of the total number of relevant documents retrieved, precision, recall, and the number of deadlines. In addition, this study measured whether the Meta-search engine and general web search engines retrieved different web documents. As a result, Savvy search produced a higher precision and recall as compared with motacrawler search engine while the metacrawler had lower deadlines ration than savvy search, Also, Meta search engine was more effective than the general web search engine, The results show that the hybrid methodology of integrating a variety of web search engines can help solve retrieval effectiveness problems on the Internet.

  • PDF

이식성을 위한 메타데이터 기반의 CDSS 구축 (Implementation of Meta Data-based Clinical Decision Support System for the Portability)

  • 이상영;이윤현;이윤석
    • 디지털산업정보학회논문지
    • /
    • 제8권1호
    • /
    • pp.221-229
    • /
    • 2012
  • A model for expressing meta data syntax in the eXtensible Markup Language(XML) was developed to increase the portability of the Arden Syntax in medical treatment. In this model that is Arden syntax uses two syntax checking mechanisms, first an XML validation process, and second, a syntax check using an XSL style sheet. Two hundred seventy-seven examples of MLMs were transformed into MLMs in ArdenML and validated against the schema and style sheet. Both the original MLMs and reverse-parsed MLMs in ArdenML were checked using a Arden Syntax checker. The textual versions of MLMs were successfully transformed into XML documents using the model, and the reverse-parse yielded the original text version of MLMs.

산업체 수요중심 커리큘럼을 위한 메타모델 설계 기법 (Meta-Model Design Technique for Industrial Demand-Driven Curriculum)

  • 조은숙;박수희;장준오;노은하
    • 디지털산업정보학회논문지
    • /
    • 제7권4호
    • /
    • pp.169-181
    • /
    • 2011
  • The cooperation between universities and IT industry in producing IT manpower of quality is urgently called for to create the effective labor pool of supply and finally balance its supply and demand. Korean Government launched a program where industrial demand-driven curriculums are developed and applied to universities. This paper proposes a design technique of meta-modeling demand-driven curriculums and courses, based on the 3D software space and the software development process. This technique is proven to result in extensibility, flexibility and quality improvement in software design. Therefore, we expect that the proposed technique makes curriculums and courses possible to be continuously improved in many aspects.

A HGLM framework for Meta-Analysis of Clinical Trials with Binary Outcomes

  • Ha, Il-Do
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권4호
    • /
    • pp.1429-1440
    • /
    • 2008
  • In a meta-analysis combining the results from different clinical trials, it is important to consider the possible heterogeneity in outcomes between trials. Such variations can be regarded as random effects. Thus, random-effect models such as HGLMs (hierarchical generalized linear models) are very useful. In this paper, we propose a HGLM framework for analyzing the binominal response data which may have variations in the odds-ratios between clinical trials. We also present the prediction intervals for random effects which are in practice useful to investigate the heterogeneity of the trial effects. The proposed method is illustrated with a real-data set on 22 trials about respiratory tract infections. We further demonstrate that an appropriate HGLM can be confirmed via model-selection criteria.

  • PDF

입원환자를 대상으로한 근거기반 임상진료지침 추출에 관한 연구 (A Study for Evidence Based Clinical Pathway Extraction using Data of Inpatient)

  • 배인호;박한나;김용욱
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2013년도 춘계학술발표대회
    • /
    • pp.833-834
    • /
    • 2013
  • 진료데이터는 진료를 보면서 축척된 데이터로서 다양한 병명들에 대한 의사들의 진료행위를 추적해 볼 수 있는 유용한 정보가 될 수 있으며, 진료에 재활용함으로써 환자들에 대한 진료행위를 표준화하는데 사용될 수 있다. 본 연구에서는 다양한 상황에서 환자를 진료한 근거자료인 진료데이터를 이용하여 병원에서 활용 가능한 임상진료데이터를 추출하기 위한 방법에 대한 연구를 진행하였다.