• Title/Summary/Keyword: Genome-wide

Search Result 695, Processing Time 0.031 seconds

A local search algorithm for predicting epistatic interactions of SNPs (복합 질환 관련 SNP 상호작용 예측을 위한 국소탐색 알고리즘)

  • Hong, Won-Pyo;Wee, Kyubum
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.11a
    • /
    • pp.1395-1398
    • /
    • 2010
  • 최근 GWAS(Genome-wide association study)로 인해 수십만 개의 SNP들이 사용 가능하게 되었다. 그러나 SNP 정보의 양이 방대하여 모든 SNP 조합을 검토하는 방식은 계산 비용이 클 뿐 아니라 오버피팅의 위험이 따른다. 본 논문에서는 필터링 기반 알고리즘인 SNPHarvester의 속도를 개선하고 평가함수를 상호정보량으로 대체하여 실험한다. 기존 SNPHarvester와 비교해 속도면에서 50%가 향상되었고 평가함수 면에서는 기존 SNPHarvester와 동일한 성능을 보였다.

Frequent Changes of 3' UTR Sequences in the Genes Expressed During Hematopoietic Differentiation Implicates the Importance of 3' UTR in Regulation of Gene Function (조혈세포의 분화과정에서 발현되는 유전자의 3‘ UTR 염기서열의 변화가 유전자 기능의 조절에 미치는 영향에 대한 연구)

  • Lee Sanggyu
    • YAKHAK HOEJI
    • /
    • v.49 no.3
    • /
    • pp.205-211
    • /
    • 2005
  • The 3' UTR (3' untranslated region) plays important roles in controlling gene expression through regulating 3' polyadenylation, mRNA export, subcellular localization, translational efficiency, and mRNA stability. Changes in the 3' UTR sequence in an expressed transcript can result in functional changes of the genes that are expressed in pathological conditions compared with those genes expressed in normal physiologic conditions. A genome-wide survey of 3' UTR variation was performed for the genes expressed during hematopoietic differentiation from CD34+ stem/progenitor cells to CD 15 + myeloid progenitor cells. Wide-spread differential usage of the 3' UTR was observed from the genes expressed during this cellular transition. This study implies that the 3' UTR can be a highly coordinated region for post-transcriptional regulation of the function of expressed genes.

A Genome-Wide Study of Moyamoya-Type Cerebrovascular Disease in the Korean Population

  • Joo, Sung-Pil;Kim, Tae-Sun;Lee, Il-Kwon;Kim, Joon-Tae;Park, Man-Seok;Cho, Ki-Hyun
    • Journal of Korean Neurosurgical Society
    • /
    • v.50 no.6
    • /
    • pp.486-491
    • /
    • 2011
  • Objective : Structural genetic variation, including copy-number variation (CNV), constitutes a substantial fraction of total genetic variability, and the importance of structural variants in modulating susceptibility is increasingly being recognized. CNV can change biological function and contribute to pathophysiological conditions of human disease. Its relationship with common, complex human disease in particular is not fully understood. Here, we searched the human genome to identify copy number variants that predispose to moya-moya type cerebrovascular disease. Methods : We retrospectively analyzed patients who had unilateral or bilateral steno-occlusive lesions at the cerebral artery from March, 2007, to September, 2009. For the 20 subjects, including patients with moyamoya type pathologies and three normal healthy controls, we divided the subjects into 4 groups : typical moyamoya (n=6), unilateral moyamoya (n=9), progression unilateral to typical moyamoya (n=2) and non-moyamoya (n=3). Fragmented DNA was hybridized on Human610Quad v1.0 DNA analysis BeadChips (Illumina). Data analysis was performed with GenomeStudio v2009.1, Genotyping 1.1.9, cnvPartition_v2.3.4 software. Overall call rates were more than 99.8%. Results : In total, 1258 CNVs were identified across the whole genome. The average number of CNV was 45.55 per subject (CNV region was 45.4). The gain/loss of CNV was 52/249, having 4.7 fold higher frequencies in loss calls. The total CNV size was 904,657,868, and average size was 993,038. The largest portion of CNVs (613 calls) were 1M-10M in length. Interestingly, significant association between unilateral moyamoya disease (MMD) and progression of unilateral to typical moyamoya was observed. Conclusion : Significant association between unilateral MMD and progression of unilateral to typical moyamoya was observed. The finding was confirmed again with clustering analysis. These data demonstrate that certain CNV associate with moyamoya-type cerebrovascular disease.

Whole-Genome Resequencing Analysis of Hanwoo and Yanbian Cattle to Identify Genome-Wide SNPs and Signatures of Selection

  • Choi, Jung-Woo;Choi, Bong-Hwan;Lee, Seung-Hwan;Lee, Seung-Soo;Kim, Hyeong-Cheol;Yu, Dayeong;Chung, Won-Hyong;Lee, Kyung-Tai;Chai, Han-Ha;Cho, Yong-Min;Lim, Dajeong
    • Molecules and Cells
    • /
    • v.38 no.5
    • /
    • pp.466-473
    • /
    • 2015
  • Over the last 30 years, Hanwoo has been selectively bred to improve economically important traits. Hanwoo is currently the representative Korean native beef cattle breed, and it is believed that it shared an ancestor with a Chinese breed, Yanbian cattle, until the last century. However, these two breeds have experienced different selection pressures during recent decades. Here, we whole-genome sequenced 10 animals each of Hanwoo and Yanbian cattle (20 total) using the Illumina HiSeq 2000 sequencer. A total of approximately 3.12 and 3.07 billion sequence reads were mapped to the bovine reference sequence assembly (UMD 3.1) at an average of approximately 10.71- and 10.53-fold coverage for Hanwoo and Yanbian cattle, respectively. A total of 17,936,399 single nucleotide polymorphisms (SNPs) were yielded, of which 22.3% were found to be novel. By annotating the SNPs, we further retrieved numerous nonsynonymous SNPs that may be associated with traits of interest in cattle. Furthermore, we performed whole-genome screening to detect signatures of selection throughout the genome. We located several promising selective sweeps that are potentially responsible for economically important traits in cattle; the PPP1R12A gene is an example of a gene that potentially affects intramuscular fat content. These discoveries provide valuable genomic information regarding potential genomic markers that could predict traits of interest for breeding programs of these cattle breeds.

Comparison of prediction accuracy for genomic estimated breeding value using the reference pig population of single-breed and admixed-breed

  • Lee, Soo Hyun;Seo, Dongwon;Lee, Doo Ho;Kang, Ji Min;Kim, Yeong Kuk;Lee, Kyung Tai;Kim, Tae Hun;Choi, Bong Hwan;Lee, Seung Hwan
    • Journal of Animal Science and Technology
    • /
    • v.62 no.4
    • /
    • pp.438-448
    • /
    • 2020
  • This study was performed to increase the accuracy of genomic estimated breeding value (GEBV) predictions for domestic pigs using single-breed and admixed reference populations (single-breed of Berkshire pigs [BS] with cross breed of Korean native pigs and Landrace pigs [CB]). The principal component analysis (PCA), linkage disequilibrium (LD), and genome-wide association study (GWAS) were performed to analyze the population structure prior to genomic prediction. Reference and test population data sets were randomly sampled 10 times each and precision accuracy was analyzed according to the size of the reference population (100, 200, 300, or 400 animals). For the BS population, prediction accuracy was higher for all economically important traits with larger reference population size. Prediction accuracy was ranged from -0.05 to 0.003, for all traits except carcass weight (CWT), when CB was used as the reference population and BS as the test. The accuracy of CB for backfat thickness (BF) and shear force (SF) using admixed population as reference increased with reference population size, while the results for CWT and muscle pH at 24 hours after slaughter (pH) were equivocal with respect to the relationship between accuracy and reference population size, although overall accuracy was similar to that using the BS as the reference.

Client-Server System Architecture for Inferring Large-Scale Genetic Interaction Networks (대규모 유전자 상호작용 네트워크 추론을 위한 클라이언트-서버 시스템 구조)

  • Kim, Yeong-Hun;Lee, Pil-Hyeon;Lee, Do-Heon
    • Bioinformatics and Biosystems
    • /
    • v.1 no.1
    • /
    • pp.38-45
    • /
    • 2006
  • We present a client-server system architecture for inferring genetic interaction networks based on Bayesian networks. It is typical to take tens of hours when genome-wide large-scale genetic interaction networks are inferred in the form of Bayesian networks. To deal with this situation, batch-style distributed system architectures are preferable to interactive standalone architectures. Thus, we have implemented a loosely coupled client-server system for network inference and user interface. The network inference consists of two stages. Firstly, the proposed method divides a whole gene set into overlapped modules, based on biological annotations and expression data together. Secondly, it infers Bayesian networks for each module, and integrates the learned subnetworks to a global network through common genes across the modules.

  • PDF

Elucidation of Copper and Asparagine Transport Systems in Saccharomyces cerevisiae KNU5377 Through Genome-Wide Transcriptional Analysis

  • KIM IL-SUP;YUN HAE SUN;SHIMISU HISAYO;KITAGAWA EMIKO;IWAHASHI HITOSHI;JIN INGNYOL
    • Journal of Microbiology and Biotechnology
    • /
    • v.15 no.6
    • /
    • pp.1240-1249
    • /
    • 2005
  • Saccharomyces cerevisiae KNU5377 has potential as an industrial strain that can ferment wasted paper for fuel ethanol at $40^{\circ}C$ [15, 16]. To understand the characteristics of the strain, genome-wide expression was performed using DNA microarray technology. We compared the homology of the DNA microarray between genomic DNAs of S. cerevisiae KNU5377 and a control strain, S. cerevisiae S288C. Approximately $97\%$ of the genes in S. cerevisiae KNU5377 were identified with those of the reference strain. YHR053c (CUP1), YLR155c (ASP3), and YDR038c (ENA5) showed lower homology than those of S. cerevisiae S288C. In particular, the differences in the regions of YHR053c and YLR155c were confirmed by Southern hybridization, but did not with that of the region of YDR038c. The expression level of mRNA in S. cerevisiae KNU5377 and S288C was also compared: the 550 ORFs of S. cerevisiae KNU5377 showed more than two-fold higher intensity than those of S. cerevisiae S288C. Among the 550 ORFs, 59 ORFs belonged to the groups of ribosomal proteins and mitochondrial ribosomal proteins, and 200 ORFs belonged to the group of cellular organization. DIP5 and GAP1 were the most highly expressed genes. These results suggest that upregulated DIP5 and GAP 1 might take the place of ASP3 and, additionally, the sensitivity against copper might be contributable to the lowest expression level of copper-binding metallothioneins encoded by CUP 1a (YHR053c) and CUP1b (YHR055c) in S. cerevisiae KNU5377.