• Title/Summary/Keyword: Data Lineage

Search Result 80, Processing Time 0.028 seconds

A Study on the Hierarchical Expression of Human Cell Lineage (인간 세포 Lineage 의 계층적 표현에 관한 연구)

  • Park, JaeSoon;Kwon, Seong Gyu;Oh, Ji Won;Lee, JongHyuk
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.11a
    • /
    • pp.663-664
    • /
    • 2020
  • 차세대 염기서열 분석 기술은 성능과 비용 면에서 매우 향상되어 한 개체 내 여러 세포의 유전자 분석이 가능한 수준이다. 한 개체 내 여러 조직 세포의 유전자는 모두 동일하지 않기 때문에 여러 조직 세포의 Lineage 를 계층적으로 표현하고 이를 조직 세포 간 변이 정도를 파악하는 데 활용한다면 암 돌연변이 발생 등을 미리 예측할 수 있다. 본 논문은 한 개체 내 여러 조직 간 변이를 관찰하기 위해 변이 검출 데이터를 계층적 군집 방법을 이용해 분석하고 이를 시각화 하는 방법을 제안한다. 실제의 8 개 조직 세포의 유전자를 분석하고 변이를 검출하여 Dendrogram 그래프로 시각화 하였다.

Acceleration of X-chromosome gene order evolution in the cattle lineage

  • Park, Woncheoul;Oh, Hee-Seok;Kim, Heebal
    • BMB Reports
    • /
    • v.46 no.6
    • /
    • pp.310-315
    • /
    • 2013
  • The gene order on the X chromosome of eutherians is generally highly conserved, although an increase in the rate of rearrangement has been reported in the rodent lineage. Conservation of the X chromosome is thought to be caused by selection related to maintenance of dosage compensation. However, we herein reveal that the cattle (Btau4.0) lineage has experienced a strong increase in the rate of X-chromosome rearrangement, much stronger than that previously reported for rodents. We also show that this increase is not matched by a similar increase on the autosomes and cannot be explained by assembly errors. Furthermore, we compared the difference in two cattle genome assemblies: Btau4.0 and Btau6.0 (Bos taurus UMD3.1). The results showed a discrepancy between Btau4.0 and Btau6.0 cattle assembly version data, and we believe that Btau6.0 cattle assembly version data are not more reliable than Btau4.0.

New distribution record of northern lineage plant of Stellaria filicaulis(Caryophyllaceae) from South Korea

  • Dong-Pil Jin;Chae Eun Lim;Sunhee Sim;Jin Dong Lee;Inbae Lee;Kwuidong Jung;Jung-Hyun Kim
    • Journal of Species Research
    • /
    • v.12 no.4
    • /
    • pp.299-306
    • /
    • 2023
  • A northern lineage plant, Stellaria filicaulis (Caryophyllaceae), was newly found in Yeoncheon-gun, Gyeonggi-do of South Korea. This species is distributed in China, Japan, Korea, Mongolia, and Russia. On the Korean Peninsula, St. filicaulis, however, has been known to grow in North Korea. Species identification was confirmed using morphological characteristics and DNA sequence data, while comparing with materials obtained from herbarium specimens. Stellaria filicaulis is distinguished from St. longifolia by having smooth surface of stem, petals about twice longer than sepals. On the neighbor-joining tree, St. filicaulis formed a clade, and the species is closely related to St. longifolia of the Parviflorae clade. Details of the morphological characters, the type specimens, voucher specimens data, and photographs of St. filicaulis in South Korea are presented. In addition, it is likely that a new habitat will be found by plant biodiversity field surveys through the middle part of the Korean Peninsula. Further research is needed to determine its population size, distribution, and threats, as well as identify appropriate locations for conservation collection of germplasm.

Molecular Phylogeny and Geography of Korean Medaka Fish (Oryzias latipes)

  • Kang, Tae-Wook;Lee, Eun-Hye;Kim, Moo-Sang;Paik, Sang-Gi;Kim, Sang-Soo;Kim, Chang-Bae
    • Molecules and Cells
    • /
    • v.20 no.1
    • /
    • pp.151-156
    • /
    • 2005
  • The phylogeny and geography of the medaka (Oryzias latipes) populations of Korea were investigated by analyzing sequence data for the mitochondrial control region. From the 41 haplotypes including 25 Korean haplotypes detected in 64 Korean specimens and data for the Japanese and Chinese populations, phylogenetic and nested clade analyses were executed to examine the phylogeny of haplogroups and the relation of the genetic architecture of the haplotypes to the historical geography of the Korean medaka fish. The analyses suggest that there are two very distinct lineages of Korean medaka, and that these result from reproductive isolation mechanisms due to geographic barriers. The southeastern lineage has experienced recent range expansion to the western region. The northwestern lineage, sister to Chinese populations, showed evidence of internal range expansion with shared haplotypes.

Members of Ectocarpus siliculosus F-box Family Are Subjected to Differential Selective Forces

  • Mahmood, Niaz;Moosa, Mahdi Muhammad;Matin, S. Abdul;Khan, Haseena
    • Interdisciplinary Bio Central
    • /
    • v.4 no.1
    • /
    • pp.1.1-1.7
    • /
    • 2012
  • Background: The F-box proteins represent one of the largest families of proteins in eukaryotes. Apart from being a component of the ubiquitin (Ub)/26 S proteasome pathways, their regulatory roles in other cellular and developmental pathways have also been reported. One interesting feature of the genes encoding the proteins of this particular family is their variable selection patterns across different lineages. This resulted in the presence of lineage specific F-box proteins across different species. Findings: In this study, 48 non-redundant F-box proteins in E. siliculosus have been identified by a homology based approach and classified into three classes based on their variable C-terminal domains. A greater number of the F-box proteins have domains similar to the ones identified in other species. On the other hand, when the proteins having unknown or no C-terminal domain (as predicted by InterProScan) were analyzed, it was found that some of them have the polyglutamine repeats. To gain evolutionary insights on the genes encoding the F-box proteins, their selection patterns were analyzed and a strong positive selection was observed which indicated the adaptation potential of the members of this family. Moreover, four lineage specific F-box genes were found in E. siliculosus with no identified homolog in any other species. Conclusions: This study describes a genome wide in silico analysis of the F-box proteins in E. siliculosus which sheds light on their evolutionary patterns. The results presented in this study provide a strong foundation to select candidate sequences for future functional analysis.

Evaluation of the Usability of Mobile RPG Game In-App Payment Service User : Focused on the Lineage M (모바일 RPG 게임의 인 앱 결제 서비스 이용자에 대한 사용성 평가 : '리니지M'을 중심으로)

  • Kim, Seung-Eon;Kim, Youngsik
    • Journal of Korea Game Society
    • /
    • v.18 no.3
    • /
    • pp.27-38
    • /
    • 2018
  • This paper is designed to propose the contents necessary for updating and developing future mobile RPG game developers through a usability evaluation of Users using In-App Payment Services at Lineage M. This paper conducted a usability assessment for quantitative assessment of users who use in-app payment services for mobile game 'Lineage M'. The tool for usability assessment was surveyed in a questionnaire designed based on 'The User Experience Honeycomb' defined by Peter Morville. These statistical results were then divided into lateral analyses. Through this, it is expected that game developers and planners will be able to find data to help develop new mobile RPG games.

Techniques to Guarantee Real-Time Fault Recovery in Spark Streaming Based Cloud System (Spark Streaming 기반 클라우드 시스템에서 실시간 고장 복구를 지원하기 위한 기법들)

  • Kim, Jungho;Park, Daedong;Kim, Sangwook;Moon, Yongshik;Hong, Seongsoo
    • Journal of KIISE
    • /
    • v.44 no.5
    • /
    • pp.460-468
    • /
    • 2017
  • In a real-time cloud environment, the data analysis framework plays a pivotal role. Spark Streaming meets most real-time requirements among existing frameworks. However, the framework does not meet the second scale real-time fault recovery requirement. Spark Streaming fault recovery time increases in proportion to the transformation history length called lineage. This is because it recovers the last state data based on the cumulative lineage recorded during normal operation. Therefore, fault recovery time is not bounded within a limited time. In addition, it is impossible to achieve a second-scale fault recovery time because it costs tens of seconds to read initial state data from fault-tolerant storage. In this paper, we propose two techniques to solve the problems mentioned above. We apply the proposed techniques to Spark Streaming 1.6.2. Experimental results show that the fault recovery time is bounded and the average fault recovery time is reduced by up to 41.57%.

DIRECT, MATERNAL AND CYTOPLASMIC GENETIC EFFECTS ON DAILY GAIN FROM BIRTH TO 45 DAYS OF BEEF CALVES

  • Shimada, K.;Willham, R.L.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.5 no.3
    • /
    • pp.567-570
    • /
    • 1992
  • Variance components were estimated for calf daily gain from birth to 45 days of age in small (S), medium (M) and large (L) lines of beef cattle. Analyses involved records collected on 682 (S), 510 (M) and 228 (L) calves in Iowa, USA from 1978 to 1986. Cytoplasmic lines were determined based on the foundation female in the maternal lineage of each animal. Data were analyzed separately by size line using a derivative-free restricted maximum likelihood procedure under an animal model including additive direct (a), additive maternal (m), cytoplasmic lineage effects and covariance (a, m). The heritabilities for direct and maternal, and the cytoplasmic effects, were 0.13, 0.35 and 0.00 for S, 0.14, 0.32 and 0.00 for M, and 0.05, 0.33 and 0.03 for L. Genetic correlations (a, m) for S, M and L were -0.33, -0.57 and -1.00, respectively. The maternal genetic effect was the most important for calf growth between birth and 45 dyas of age and cytoplasmic variances were not important in any line.

A Semiotics Framework for Analyzing Data Provenance Research

  • Ram, Sudha;Liu, Jun
    • Journal of Computing Science and Engineering
    • /
    • v.2 no.3
    • /
    • pp.221-248
    • /
    • 2008
  • Data provenance is the background knowledge that enables a piece of data to be interpreted and used correctly within context. The importance of tracking provenance is widely recognized, as witnessed by significant research in various areas including e-science, homeland security, and data warehousing and business intelligence. In order to further advance the research on data provenance, however, one must first understand the research that has been conducted to date and identify specific topics that merit further investigation. In this work, we develop a framework based on semiotics theory to assist in analyzing and comparing existing provenance research at the conceptual level. We provide a detailed review of data provenance research and compare and contrast the research based on d semiotics framework. We conclude with an identification of challenges that will drive future research in this field.

Contemporary review on the bifurcating autoregressive models : Overview and perspectives

  • Hwang, S.Y.
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1137-1149
    • /
    • 2014
  • Since the bifurcating autoregressive (BAR) model was developed by Cowan and Staudte (1986) to analyze cell lineage data, a lot of research has been directed to BAR and its generalizations. Based mainly on the author's works, this paper is concerned with a contemporary review on the BAR in terms of an overview and perspectives. Specifically, bifurcating structure is extended to multi-cast tree and to branching tree structure. The AR(1) time series model of Cowan and Staudte (1986) is generalized to tree structured random processes. Branching correlations between individuals sharing the same parent are introduced and discussed. Various methods for estimating parameters and related asymptotics are also reviewed. Consequently, the paper aims to give a contemporary overview on the BAR model, providing some perspectives to the future works in this area.