• Title/Summary/Keyword: Whole genome

Search Result 582, Processing Time 0.025 seconds

High-Resolution Microarrays for Mapping Promoter Binding sites and Copy Number Variation in the Human Genome

  • Albert Thomas
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2006.02a
    • /
    • pp.125-126
    • /
    • 2006
  • NimbleGen has developed strategies to use its high-density oligonucleotide microarray platform (385,000 probes per array) to map both promoter binding sites and copy number variation at very high-resolution in the human genome. Here we describe a genome-wide map of active promoters determined by experimentally locating the sites of transcription imitation complex binding throughout the human genome using microarrays combined with chromatin immunoprecipitation. This map defines 10,567 active promoters corresponding to 6,763 known genes and at least 1,196 un-annotated transcriptional units. Microarray-based comparative genomic hybridisation (CGH) is animportant research tool for investigating chromosomal aberrations frequently associated with complex diseases such as cancer, neuropsychiatric disorders, and congenital developmental disorders. NimbleGen array CGH is an ultra-high resolution (0.5-50 Kb) oligo array platform that can be used to detect amplifications and deletions and map the associated breakpoints on the whole-genome level or with custom fine-tiling arrays. For whole-genome array CGH, probes are tiled through genic and intergenic regions with a median probe spacing of 6 Kb, which provides a comprehensive, unbiased analysis of the genome.

  • PDF

Five Computer Simulation Studies of Whole-Genome Fragment Assembly: The Case of Assembling Zymomonas mobilis ZM4 Sequences

  • Jung, Cholhee;Choi, Jin-Young;Park, Hyun Seck;Seo, Jeong-Sun
    • Genomics & Informatics
    • /
    • v.2 no.4
    • /
    • pp.184-190
    • /
    • 2004
  • An approach for genome analysis based on assembly of fragments of DNA from the whole genome can be applied to obtain the complete nucleotide sequence of the genome of Zymomonas mobilis. However, the problem of fragment assembly raise thorny computational issues. Computer simulation studies of sequence assembly usually show some abnormal assemblage of artificial sequences containing repetitive or duplicated regions, and suggest methods to correct those abnormalities. In this paper, we describe five simulation studies which had been performed previous to the actual genome assembly process of Zymomonas mobilis ZM4.

misMM: An Integrated Pipeline for Misassembly Detection Using Genotyping-by-Sequencing and Its Validation with BAC End Library Sequences and Gene Synteny

  • Ko, Young-Joon;Kim, Jung Sun;Kim, Sangsoo
    • Genomics & Informatics
    • /
    • v.15 no.4
    • /
    • pp.128-135
    • /
    • 2017
  • As next-generation sequencing technologies have advanced, enormous amounts of whole-genome sequence information in various species have been released. However, it is still difficult to assemble the whole genome precisely, due to inherent limitations of short-read sequencing technologies. In particular, the complexities of plants are incomparable to those of microorganisms or animals because of whole-genome duplications, repeat insertions, and Numt insertions, etc. In this study, we describe a new method for detecting misassembly sequence regions of Brassica rapa with genotyping-by-sequencing, followed by MadMapper clustering. The misassembly candidate regions were cross-checked with BAC clone paired-ends library sequences that have been mapped to the reference genome. The results were further verified with gene synteny relations between Brassica rapa and Arabidopsis thaliana. We conclude that this method will help detect misassembly regions and be applicable to incompletely assembled reference genomes from a variety of species.

Genomic Variations of Rice Regenerants from Tissue Culture Revealed by Whole Genome Re-Sequencing

  • Qin, Yang;Shin, Kong-Sik;Woo, Hee-Jong;Lim, Myung-Ho
    • Plant Breeding and Biotechnology
    • /
    • v.6 no.4
    • /
    • pp.426-433
    • /
    • 2018
  • Plant tissue culture is a technique that has invariably been used for various purposes such as obtaining transgenic plants for crop improvement or functional analysis of genes. However, this process can be associated with a variety of genetic and epigenetic instabilities in regenerated plants, termed as somaclonal variation. In this study, we investigated mutation spectrum, chromosomal distributions of nucleotide substitution types of single-nucleotide polymorphisms (SNPs) and insertions/deletions (InDels) by whole genome re-sequencing between Dongjin and Nipponbare along with regenerated plants of Dongjin from different induction periods. Results indicated that molecular spectrum of mutations in regenerated rice against Dongjin genome ranged from $9.14{\times}10^{-5}$ to $1.37{\times}10^{-4}$ during one- to three-month callus inductions, while natural mutation rate between Dongjin and Nipponbare genomes was $6.97{\times}10^{-4}$. Non-random chromosome distribution of SNP and InDel was observed in both regenerants and Dongjin genomes, with the highest densities on chromosome 11. The transition to transversion ratio was 2.25 in common SNPs of regenerants against Dongjin genome with the highest C/T transition frequency, which was similar to that of Dongjin against Nipponbare genome.

Genome analysis of Bacteroides sp. CACC 737 isolated from feline for its potential application

  • Kim, Jung-Ae;Jung, Min Young;Kim, Dae-Hyuk;Kim, Yangseon
    • Journal of Animal Science and Technology
    • /
    • v.62 no.6
    • /
    • pp.952-955
    • /
    • 2020
  • Bacteroides sp. CACC 737 was isolated from a feline, and its potential probiotic properties were characterized using functional genome analysis. Whole-genome sequencing was performed using the PacBio RSII and Illumina HiSeq platforms. The complete genome of strain CACC 737 contained 4.6 Mb, with a guanine (G) + cytosine (C) content of 45.8%, six cryptic plasmids, and extracellular polysaccharide gene as unique features. The strain was beneficial to animal health when consumed as feed, for example, for ameliorating immunological dysfunctions and metabolic disorders. The genome information adds to the comprehensive understanding of Bacteroides sp. and suggests potential animal-related industrial applications for this strain.

Whole Genome Sequence of Streptomyces sp. from Novel Marine Actinomycetes

  • Hyeon Kyeong Lee;Heung-Soon Park;Eung-Soo Kim;Si-Sun Choi
    • Microbiology and Biotechnology Letters
    • /
    • v.51 no.3
    • /
    • pp.325-327
    • /
    • 2023
  • This paper presents the complete genome sequence of a novel marine actinomycete, Streptomyces sp. MMBL 11-1. The genome of Streptomyces sp. MMBL 11-1 was obtained through next-generation sequencing using the PacBio Sequel system and Illumina platform provided by Macrogen, Korea. The assembled genome consists of five contigs, with a total length of 8,496,900 bp and a G+C content of 71.6%. The genome harbors multiple biosynthetic gene clusters (BGCs) associated with producing microbial natural products (MNPs). The comprehensive genomic information of this type of strain will serve as a valuable resource for identifying other marine actinomycetes strains.

Whole Genome Analysis of Human Papillomavirus Type 16 Multiple Infection in Cervical Cancer Patients

  • Chansaenroj, Jira;Theamboonlers, Apiradee;Junyangdikul, Pairoj;Swangvaree, Sukumarn;Karalak, Anant;Poovorawan, Yong
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.2
    • /
    • pp.599-606
    • /
    • 2012
  • The characterization of the whole genome of human papillomavirus type 16 (HPV16) from cervical cancer specimens with multiple infections in comparison with single infection samples as the oncogenic potential of the virus may differ. Cervical carcinoma specimens positive for HPV16 by PCR and INNO-LiPA were randomly selected for whole genome characterization. Two HPV16 single infection and six HPV16 multiple infection specimens were subjected to whole genome analysis by using conserved primers and subsequent sequencing. All HPV16 whole genomes from single infection samples clustered in the European (E) lineage while all multiple infection specimens belonged to the non-European lineage. The variations in nucleotide sequences in E6, E7, E2, L1 and Long control region (LCR) were evaluated. In the E6 region, amino acid changes at L83V were related to increased cancer progression. An amino acid variation N29S within the E7 oncoprotein significantly associated with severity of lesion was also discovered. In all three domains of the E2 gene non synonymous mutations were found. The L1 region showed various mutations which may be related to conformation changes of viral epitopes. Some transcription factor binding sites in the LCR region correlated to virulence were shown on GRE/1, TEF-1, YY14 and Oct-1. HPV16 European variant prone to single infection may harbor a major variation at L83V which significantly increases the risk for developing cervical carcinoma. HPV16 non-European variants prone to multiple infections may require many polymorphisms to enhance the risk of cervical cancer development.

Whole Genome Resequencing of Heugu (Korean Black Cattle) for the Genome-Wide SNP Discovery

  • Choi, Jung-Woo;Chung, Won-Hyong;Lee, Kyung-Tai;Choi, Jae-Won;Jung, Kyoung-Sub;Cho, Yongmin;Kim, Namshin;Kim, Tae-Hun
    • Food Science of Animal Resources
    • /
    • v.33 no.6
    • /
    • pp.715-722
    • /
    • 2013
  • Heugu (Korea Black Cattle) is one of the indigenous cattle breeds in Korea; however there has been severe lack of genomic studies on the breed. In this study, we report the first whole genome resequencing of Heugu at higher sequence coverage using Illumina HiSeq 2000 platform. More than 153.6 Giga base pairs sequence was obtained, of which 97% of the reads were mapped to the bovine reference sequence assembly (UMD 3.1). The number of non-redundantly mapped sequence reads corresponds to approximately 28.9-fold coverage across the genome. From these data, we identified a total of over six million single nucleotide polymorphisms (SNPs), of which 29.4% were found to be novel using the single nucleotide polymorphism database build 137. Extensive annotation was performed on all the detected SNPs, showing that most of SNPs were located in intergenic regions (70.7%), which is well corresponded with previous studies. Of the total SNPs, we identified substantial numbers of non-synonymous SNPs (13,979) in 5,999 genes, which could potentially affect meat quality traits in cattle. These results provide genome-wide SNPs that can serve as useful genetic tools and as candidates in searches for phenotype-altering DNA difference implicated with meat quality traits in cattle. The importance of this study can be further pronounced with the first whole genome sequencing of the valuable local genetic resource to be used in further genomic comparison studies with diverse cattle breeds.

DNA Fragment Assembly

  • 박근수
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2002.06a
    • /
    • pp.105-121
    • /
    • 2002
  • 최근 인간 지놈(genome)의 DNA가 밝혀져서 많은 관심을 받았는데, 이를 수행하는 방법을 소개한다. Human Genome Project에서 채택한 BAC-to-BAC 방식과 Celera 회사에서 채택한 whole genome shotgun 방식을 설명한다. 또한 두 방식에서 공히 fragment assembly 프로그램을 사용하는데, 이 프로그램의 개요를 설명한다.

  • PDF

Multi-omics techniques for the genetic and epigenetic analysis of rare diseases

  • Yeonsong Choi;David Whee-Young Choi;Semin Lee
    • Journal of Genetic Medicine
    • /
    • v.20 no.1
    • /
    • pp.1-5
    • /
    • 2023
  • Until now, rare disease studies have mainly been carried out by detecting simple variants such as single nucleotide substitutions and short insertions and deletions in protein-coding regions of disease-associated gene panels using diagnostic next-generation sequencing in association with patient phenotypes. However, several recent studies reported that the detection rate hardly exceeds 50% even when whole-exome sequencing is applied. Therefore, the necessity of introducing whole-genome sequencing is emerging to discover more diverse genomic variants and examine their association with rare diseases. When no diagnosis is provided by whole-genome sequencing, additional omics techniques such as RNA-seq also can be considered to further interrogate causal variants. This paper will introduce a description of these multi-omics techniques and their applications in rare disease studies.