• Title/Summary/Keyword: Data Heterogeneity

Search Result 614, Processing Time 0.026 seconds

A Study on the Method for Solving Data Heterogeneity in the Integrated Information System (통합 정보시스템에서의 데이터 이질성 해결 방안에 관한 연구)

  • Park, Seong-Jin;Park, Sung-Kong;Park, Hwa-Gyoo
    • Journal of Information Technology Services
    • /
    • v.7 no.4
    • /
    • pp.87-99
    • /
    • 2008
  • As the technologies for telecommunication have been evolving, more enhanced information services and integrated information systems have been introduced, which can manage a variety of information from the heterogeneous systems. The major obstacle for the integrated information systems is the integrating heterogeneous databases in the systems and the heterogeneity problems can be classified into the structural and data heterogeneities. However, the previous researches have mainly highlighted into the solving structural heterogeneity problems. This paper identifies the data heterogeneity problems for multi-database schema integrations and proposes a new solving method. We analyze the semantics equivalence in data values based on the functional dependency, primary and candidate keys, and present a procedural solution of data heterogeneity in the perspective of the concept of attribute equivalence, integration key and conceptual integration table.

Data Exchange between Cadastre and Physical Planning by Database Coupling

  • Kim, Kam-Rae;Choi, Won-Jun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.25 no.1
    • /
    • pp.69-75
    • /
    • 2007
  • The information in physical planning field shows the socio-economic potentials of land resources while cadastral data does the physical and legal realities of the land. The two domains commonly deal with land information but have different views. Cadastre has to evolved to the multi-purpose ones which provide value-added information and support a wide spectrum of decision makers by mixing their own information with other spatial/non-spatial databases. In this context, the demands of data exchange between the two domains is growing up but this cannot be done without resolving the heterogeneity between the two information applications. Both of either discipline sees the reality within its own scope, which means each has a unique way to abstract real world phenomena to the database. The heterogeneity problem emerges when an GIS is autonomously and independently established. It causes considerable communication difficulties since heterogeneity of representations forms unique data semantics for each database. The semantic heterogeneity obviously creates an obstacle to data exchange but, at the same time, it can be a key to solve the problems too. Therefore, the study focuses on facilitating data sharing between the fields of cadastre and physical planning by resolving the semantic heterogeneity. The core job is developing a conversion mechanism of cadastral data into the information for the physical planning by DB coupling techniques.

A Message Conversion System based on MDR for Resolving Metadata Heterogeneity (메타데이타 이질성 해결을 위한 MDR 기반의 메시지 변환 시스템)

  • 김진관;김중일;정동원;백두권
    • Journal of KIISE:Databases
    • /
    • v.31 no.3
    • /
    • pp.232-242
    • /
    • 2004
  • Metadata is a general notion of data about data to improve data sharing and exchanging by definitely describing meaning and representation of data. However, metadata has been created in various ways and It caused another kind of heterogeneity problem named metadata heterogeneity problem. Recently, the research on metadata gateway approach that allows metadata heterogeneity is being more actively progressed. However, the existing commercialized systems that have been implemented with the metadata gateway approach are dependent on a metadata schema. In this paper, we propose a message conversion system which separates the mapping information from the mapping rules between heterogeneous metadata schemas. The proposed system dynamically manages standardized data elements by applying ISO/IEC l1179. Therefore, the proposed system provides the set of standard data elements to create consistently metadata of new databases and provides a fundamental resolution to the metadata heterogeneity problem.

A spatial heterogeneity mixed model with skew-elliptical distributions

  • Farzammehr, Mohadeseh Alsadat;McLachlan, Geoffrey J.
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.373-391
    • /
    • 2022
  • The distribution of observations in most econometric studies with spatial heterogeneity is skewed. Usually, a single transformation of the data is used to approximate normality and to model the transformed data with a normal assumption. This assumption is however not always appropriate due to the fact that panel data often exhibit non-normal characteristics. In this work, the normality assumption is relaxed in spatial mixed models, allowing for spatial heterogeneity. An inference procedure based on Bayesian mixed modeling is carried out with a multivariate skew-elliptical distribution, which includes the skew-t, skew-normal, student-t, and normal distributions as special cases. The methodology is illustrated through a simulation study and according to the empirical literature, we fit our models to non-life insurance consumption observed between 1998 and 2002 across a spatial panel of 103 Italian provinces in order to determine its determinants. Analyzing the posterior distribution of some parameters and comparing various model comparison criteria indicate the proposed model to be superior to conventional ones.

Inference for heterogeneity of treatment eect in multi-center clinical trial

  • Ha, Il-Do
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.3
    • /
    • pp.605-612
    • /
    • 2011
  • In multi-center randomized clinical trial the treatment eect may be changed over centers. It is thus important to investigate the heterogeneity in treatment eect between centers. For this, uncorrelated random-eect models assuming independence between random-eect terms have been often used, which may be a strong assumption. In this paper we propose a correlated frailty modelling approach of investigating such heterogeneity using the hierarchical-likelihood method when the outcome is time-to-event. In particular, we show how to construct a proper prediction interval for frailty, which explores graphically the potential heterogeneity for a treatment-by-center interaction term. The proposed method is illustrated via numerical studies based on data from the design of a multi-center clinical trial.

Firm Heterogeneity and Location Choice: The Case of South Korean Manufacturing Multinationals

  • Han, Jae-Joon;Lee, Hongshik;Lee, Insu
    • East Asian Economic Review
    • /
    • v.16 no.4
    • /
    • pp.315-331
    • /
    • 2012
  • Previous studies of location choice have focused on country-level data more than firm-level data and been more concerned with host countries' distinctive features than with firm heterogeneity. Therefore, they do not answer the question of who will go where in terms of location choice. To analyze the role of firm heterogeneity in determining location choice, we develop a theoretical model and analyze data on 3,644 Korean manufacturing multinationals operating in 87 countries between 1982 and 2006. The results of our conditional logit analysis indicate that not only host country characteristics but also firm heterogeneous factors such as productivity, labor intensity, and size have considerable influence on the decision of where to locate FDI.

  • PDF

Identification of ERBB pathway-activated cells in triple-negative breast cancer

  • Cho, Soo Young
    • Genomics & Informatics
    • /
    • v.17 no.1
    • /
    • pp.3.1-3.4
    • /
    • 2019
  • Intratumor heterogeneity within a single tumor mass is one of the hallmarks of malignancy and has been reported in various tumor types. The molecular characterization of intratumor heterogeneity in breast cancer is a significant challenge for effective treatment. Using single-cell RNA sequencing (RNA-seq) data from a public resource, an ERBB pathway activated triple-negative cell population was identified. The differential expression of three subtyping marker genes (ERBB2, ESR1, and PGR) was not changed in the bulk RNA-seq data, but the single-cell transcriptomes showed intratumor heterogeneity. This result shows that ERBB signaling is activated using an indirect route and that the molecular subtype is changed on a single-cell level. Our data propose a different view on breast cancer subtypes, clarifying much confusion in this field and contributing to precision medicine.

The Design of Data Grid Wrapper for Integrated Retrieve based on XMDR (XMDR 기반의 통합 검색을 위한 데이터 그리드 Wrapper 설계)

  • Hwang, Chi-Gon;Jung, Kye-Dong;Choi, Young-Keun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.5
    • /
    • pp.921-929
    • /
    • 2008
  • Recently, many researches have been conducted to solve data heterogeneity as a way for data integration. The elements of the system that we suggest are an XMDR wrapper and XMDR Repository. XMDR wrapper solves the heterogeneity of the existing system by creating the interface based on the standard information of XMDR, and performing the inter-conversion between global XMDR query and local query using mapping data on standard information and local schema. XMDR Repository are composed of XMDR which manages the mapping data on standard information and local schema, and of Proxy DB which saves the accomplished results. With XMDR wrapper and XMDR Repository, users can use the same interface, and they need not conduct repeated queries since XMDR wrapper not only solves the heterogeneity of the schema using the meta-semantic ontology of XMDR, but also considers the heterogeneity accompanying the meaning of the value through instance semantic ontology. Therefore, in this paper we suggest the grid wrapper for the solution of data heterogeneity and efficient data integration.

Beta-Meta: a meta-analysis application considering heterogeneity among genome-wide association studies

  • Gyungbu Kim;Yoonsuk Lee;Jeong Ho Park;Dongmin Kim;Wonseok Lee
    • Genomics & Informatics
    • /
    • v.20 no.4
    • /
    • pp.49.1-49.7
    • /
    • 2022
  • Many packages for a meta-analysis of genome-wide association studies (GWAS) have been developed to discover genetic variants. Although variations across studies must be considered, there are not many currently-accessible packages that estimate between-study heterogeneity. Thus, we propose a python based application called Beta-Meta which can easily process a meta-analysis by automatically selecting between a fixed effects and a random effects model based on heterogeneity. Beta-Meta implements flexible input data manipulation to allow multiple meta-analyses of different genotype-phenotype associations in a single process. It provides a step-by-step meta-analysis of GWAS for each association in the following order: heterogeneity test, two different calculations of an effect size and a p-value based on heterogeneity, and the Benjamini-Hochberg p-value adjustment. These methods enable users to validate the results of individual studies with greater statistical power and better estimation precision. We elaborate on these and illustrate them with examples from several studies of infertility-related disorders.

Genetic heterogeneity of liver cancer stem cells

  • Minjeong Kim;Kwang-Woo Jo;Hyojin Kim;Myoung-Eun Han;Sae-Ock Oh
    • Anatomy and Cell Biology
    • /
    • v.56 no.1
    • /
    • pp.94-108
    • /
    • 2023
  • Cancer cell heterogeneity is a serious problem in the control of tumor progression because it can cause chemoresistance and metastasis. Heterogeneity can be generated by various mechanisms, including genetic evolution of cancer cells, cancer stem cells (CSCs), and niche heterogeneity. Because the genetic heterogeneity of CSCs has been poorly characterized, the genetic mutation status of CSCs was examined using Exome-Seq and RNA-Seq data of liver cancer. Here we show that different surface markers for liver cancer stem cells (LCSCs) showed a unique propensity for genetic mutations. Cluster of differentiation 133 (CD133)-positive cells showed frequent mutations in the IRF2, BAP1, and ERBB3 genes. However, leucine-rich repeat-containing G protein-coupled receptor 5-positive cells showed frequent mutations in the CTNNB1, RELN, and ROBO1 genes. In addition, some genetic mutations were frequently observed irrespective of the surface markers for LCSCs. BAP1 mutations was frequently observed in CD133-, CD24-, CD13-, CD90-, epithelial cell adhesion molecule-, or keratin 19-positive LCSCs. ASXL2, ERBB3, IRF2, TLX3, CPS1, and NFATC2 mutations were observed in more than three types of LCSCs, suggesting that common mechanisms for the development of these LCSCs. The present study provides genetic heterogeneity depending on the surface markers for LCSCs. The genetic heterogeneity of LCSCs should be considered in the development of LCSC-targeting therapeutics.