• Title/Summary/Keyword: Shotgun sequencing

Search Result 25, Processing Time 0.033 seconds

Exploring the Microbial Community and Functional Characteristics of the Livestock Feces Using the Whole Metagenome Shotgun Sequencing

  • Hyeri Kim;Eun Sol Kim;Jin Ho Cho;Minho Song;Jae Hyoung Cho;Sheena Kim;Gi Beom Keum;Jinok Kwak;Hyunok Doo;Sriniwas Pandey;Seung-Hwan Park;Ju Huck Lee;Hyunjung Jung;Tai Young Hur;Jae-Kyung Kim;Kwang Kyo Oh;Hyeun Bum Kim;Ju-Hoon Lee
    • Journal of Microbiology and Biotechnology
    • /
    • v.33 no.1
    • /
    • pp.51-60
    • /
    • 2023
  • The foodborne illness is the important public health concerns, and the livestock feces are known to be one of the major reservoirs of foodborne pathogens. Also, it was reported that 45.5% of foodborne illness outbreaks have been associated with the animal products contaminated with the livestock feces. In addition, it has been known that the persistence of a pathogens depends on many potential virulent factors including the various virulent genes. Therefore, the first step to understanding the public health risk of livestock feces is to identify and describe microbial communities and potential virulent genes that contribute to bacterial pathogenicity. We used the whole metagenome shotgun sequencing to evaluate the prevalence of foodborne pathogens and to characterize the virulence associated genes in pig and chicken feces. Our data showed that the relative abundance of potential foodborne pathogens, such as Bacillus cereus was higher in chickens than pigs at the species level while the relative abundance of foodborne pathogens including Campylobacter coli was only detected in pigs. Also, the microbial functional characteristics of livestock feces revealed that the gene families related to "Biofilm formation and quorum sensing" were highly enriched in pigs than chicken. Moreover, the variety of gene families associated with "Resistance to antibiotics and toxic compounds" were detected in both animals. These results will help us to prepare the scientific action plans to improve awareness and understanding of the public health risks of livestock feces.

A New Approach to Fragment Assembly in DNA Sequencing

  • Pevzner, Pavel-A.;Tang, Haixu;Waterman, Micheal-S.
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2001.08a
    • /
    • pp.11-35
    • /
    • 2001
  • For the last twenty years fragment assembly in DNA sequencing followed the "overlap - layout - consensus"paradigm that is used in all currently available assembly tools. Although this approach proved to be useful in assembling clones, it faces difficulties in genomic shotgun assembly: the existing algorithms make assembly errors and are often unable to resolve repeats even in prokaryotic genomes. Biologists are well-aware of these errors and are forced to carry additional experiments to verify the assembled contigs. We abandon the classical “overlap - layout - consensus”approach in favor of a new Eulerian Superpath approach that, for the first time, resolves the problem of repeats in fragment assembly. Our main result is the reduction of the fragment assembly to a variation of the classical Eulerian path problem. This reduction opens new possibilities for repeat resolution and allows one to generate error-free solutions of the large-scale fragment assemble problems. The major improvement of EULER over other algorithms is that it resolves all repeats except long perfect repeats that are theoretically impossible to resolve without additional experiments.

  • PDF

The strategy and current status of Brassica rapa genome project (배추 유전체 염기서열 해독 전략과 현황)

  • Mun, Jeong-Hwan;Kwon, Soo-Jin;Park, Beom-Seok
    • Journal of Plant Biotechnology
    • /
    • v.37 no.2
    • /
    • pp.153-165
    • /
    • 2010
  • Brassica rapa is considered an ideal candidate to act as a reference species for Brassica genomic studies. Among the three basic Brassica species, B. rapa (AA genome) has the smallest genome (529 Mbp), compared to B. nigra (BB genome, 632 Mbp) and B. oleracea (CC genome, 696 Mbp). There is also a large collection of available cultivars of B. rapa, as well as a broad array of B. rapa genomic resources available. Under international consensus, various genomic studies on B. rapa have been conducted, including the construction of a physical map based on 22.5X genome coverage, end sequencing of 146,000 BACs, sequencing of >150,000 expressed sequence tags, and successful phase 2 shotgun sequencing of 589 euchromatic region-tiling BACs based on comparative positioning with the Arabidopsis genome. These sequenced BACs mapped onto the B. rapa genome provide beginning points for genome sequencing of each chromosome. Applying this strategy, all of the 10 chromosomes of B. rapa have been assigned to the sequencing centers in seven countries, Korea, UK, China, India, Canada, Australia, and Japan. The two longest chromosomes, A3 and A9, have been sequenced except for several gaps, by NAAS in Korea. Meanwhile a China group, including IVF and BGI, performed whole genome sequencing with Illumina system. These Sanger and NGS sequence data will be integrated to assemble a draft sequence of B. rapa. The imminent B. rapa genome sequence offers novel insights into the organization and evolution of the Brassica genome. In parallel, the transfer of knowledge from B. rapa to other Brassica crops would be expected.

Cloning and Overexpression of the Cdd Gene Encoding Cytidine Deaminase from Salmonella typhimurium

  • Lee, Sang-Mahn
    • Korean Journal of Environmental Biology
    • /
    • v.21 no.1
    • /
    • pp.56-59
    • /
    • 2003
  • The Salmonella typhimurium cdd gene encoding cytidine deaminase (cyti-dine/2'-deoxycytidine aminohydrolase; EC 3.5.4.5.) was isolated through shotgun clon-ing by complementation of the E. coli odd mutation. By subsequent deletion and sub-cloning from the original 3.7 Kb of EcoRI insert (pSAMI), the precise region of the cdd structural gene is located around the BglII site in the middle part of 1.7 Kb of NruI/PvuI segment. The 1.7 Kb containing odd gene wag subcloned to the pUC18 vector and the nucleotide sequence of the cdd gene was determined. When the putative ribosorne-binding site (Shine-Dalgarno sequence) and initiation codon were predicted to be GAGG at the position 459 and ATG at the position 470, respectively, there was an open reading frame of 885 nucleotides, encoding an 294 amino acid protein. The cdd gene expression in E. coli JF611/pSAMI was amplified about 50 fold compared to that of the wild type. The cdd gene expression was maintained in the stationary phase after rea-ching the peak in the late logarithmic phase.

Development and characterization of eleven microsatellite markers for a popular pet stag beetle, Dorcus hopei (Coleoptera, Lucanidae) using paired-end Illumina shotgun sequencing

  • Han, Taeman;Kim, Seung-Hyun;Park, In Gyun;Park, Haechul
    • International Journal of Industrial Entomology and Biomaterials
    • /
    • v.35 no.2
    • /
    • pp.97-99
    • /
    • 2017
  • Eleven polymorphic microsatellite loci were developed and characterized for Dorcus hopei in this study. The number of alleles varied from 2 to 21. The observed heterozygosity and expected heterozygosity ranged from 0.1058 to 0.9744 and 0.0997 to 0.8941, respectively. Two loci showed low polymorphism, while the rest were highly polymorphic. Six loci deviated from Hardy-Weinberg Equilibrium. The set of markers will provide effective tools for examining the population genetic structures and be helpful for managing wild population in D. hopei.

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

  • Lim, Jong-Sung;Choi, Beom-Soon;Lee, Jeong-Soo;Shin, Chan-Seok;Yang, Tae-Jin;Rhee, Jae-Sung;Lee, Jae-Seong;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • v.10 no.1
    • /
    • pp.1-8
    • /
    • 2012
  • Recently, the technologies of DNA sequence variation and gene expression profiling have been used widely as approaches in the expertise of genome biology and genetics. The application to genome study has been particularly developed with the introduction of the nextgeneration DNA sequencer (NGS) Roche/454 and Illumina/ Solexa systems, along with bioinformation analysis technologies of whole-genome $de$ $novo$ assembly, expression profiling, DNA variation discovery, and genotyping. Both massive whole-genome shotgun paired-end sequencing and mate paired-end sequencing data are important steps for constructing $de$ $novo$ assembly of novel genome sequencing data. It is necessary to have DNA sequence information from a multiplatform NGS with at least $2{\times}$ and $30{\times}$ depth sequence of genome coverage using Roche/454 and Illumina/Solexa, respectively, for effective an way of de novo assembly. Massive shortlength reading data from the Illumina/Solexa system is enough to discover DNA variation, resulting in reducing the cost of DNA sequencing. Whole-genome expression profile data are useful to approach genome system biology with quantification of expressed RNAs from a wholegenome transcriptome, depending on the tissue samples. The hybrid mRNA sequences from Rohce/454 and Illumina/Solexa are more powerful to find novel genes through $de$ $novo$ assembly in any whole-genome sequenced species. The $20{\times}$ and $50{\times}$ coverage of the estimated transcriptome sequences using Roche/454 and Illumina/Solexa, respectively, is effective to create novel expressed reference sequences. However, only an average $30{\times}$ coverage of a transcriptome with short read sequences of Illumina/Solexa is enough to check expression quantification, compared to the reference expressed sequence tag sequence.

Metagenome Analysis of Protein Domain Collocation within Cellulase Genes of Goat Rumen Microbes

  • Lim, SooYeon;Seo, Jaehyun;Choi, Hyunbong;Yoon, Duhak;Nam, Jungrye;Kim, Heebal;Cho, Seoae;Chang, Jongsoo
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.26 no.8
    • /
    • pp.1144-1151
    • /
    • 2013
  • In this study, protein domains with cellulase activity in goat rumen microbes were investigated using metagenomic and bioinformatic analyses. After the complete genome of goat rumen microbes was obtained using a shotgun sequencing method, 217,892,109 pair reads were filtered, including only those with 70% identity, 100-bp matches, and thresholds below $E^{-10}$ using METAIDBA. These filtered contigs were assembled and annotated using blastN against the NCBI nucleotide database. As a result, a microbial community structure with 1431 species was analyzed, among which Prevotella ruminicola 23 bacteria and Butyrivibrio proteoclasticus B316 were the dominant groups. In parallel, 201 sequences related with cellulase activities (EC.3.2.1.4) were obtained through blast searches using the enzyme.dat file provided by the NCBI database. After translating the nucleotide sequence into a protein sequence using Interproscan, 28 protein domains with cellulase activity were identified using the HMMER package with threshold E values below $10^{-5}$. Cellulase activity protein domain profiling showed that the major protein domains such as lipase GDSL, cellulase, and Glyco hydro 10 were present in bacterial species with strong cellulase activities. Furthermore, correlation plots clearly displayed the strong positive correlation between some protein domain groups, which was indicative of microbial adaption in the goat rumen based on feeding habits. This is the first metagenomic analysis of cellulase activity protein domains using bioinformatics from the goat rumen.

Analysis of sequence alignment Tools on polymorphic genomes (다염기변이 유전체에 대한 서열 정렬 툴 분석)

  • Kim, Yoo-Sun;Kim, Jong-Hyun;Yeo, Yun-Ku;Kim, Woo-Cheol;Park, Sang-Hyun
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2008.06c
    • /
    • pp.217-221
    • /
    • 2008
  • 생명공학 기술의 발달로 지놈 프로젝트를 통해 인간 초파리 등 여러 종의 유전체 정보가 밝혀 졌다. 그러나 Post-Genome 연구에 있어서 매우 중요한 생물체인 멍게(Ciona intestinalis)와 성게(Strongylocentrotus purpuratus)의 유전체 서열은 현재 공개되어 있으나 염기서열의 연속성(continuity)에는 심각한 문제점이 존재하고 있다. 이들은 염기서열에 변이가 많은 다염기변이 유전체(polymorphic genomes)로 그 특성이 반영되지 않은 전통적인 Whole Genome Shotgun Sequencing(WGSS)방법을 사용였기 때문이다. 이와 같은 다염기변이 유전체 서열 분석은 시스템 생물학이나 비교 유전체학 등의 후발 연구에 기초가 되므로 매우 중요하다. 본 논문에서는 다염기변이 유전체에 대해 알아보고 서열 조립 알고리즘의 기본이 되는 서열 정렬 툴들 중 가장 많이 사용되는 FASTA, BLAST, BLAT에 대해 분석하여 봄으로써 다염기변이 유전체에 적합한 서열 조립 전략 수립을 위해 고려해야 하는 사항들을 논의해 본다.

  • PDF

Identification of Antibiotic Resistance Genes in Orofacial Abscesses Using a Metagenomics-based Approach: A Pilot Study

  • Yeeun Lee;Joo-Young Park;Youngnim Choi
    • Journal of Korean Dental Science
    • /
    • v.16 no.1
    • /
    • pp.35-46
    • /
    • 2023
  • Purpose: Culture-based methods for microbiological diagnosis and antibiotic susceptibility tests have limitations in the management of orofacial infections. We aimed to profile pus microbiota and identify antibiotic resistance genes (ARGs) using a culture-independent approach. Materials and Methods: Genomic DNA samples extracted from the pus specimens of two patients with orofacial abscesses were subjected to shotgun sequencing on the NovaSeq system. Taxonomic profiling and prediction of ARGs were performed directly from the metagenomic raw reads. Result: Taxonomic profiling revealed obligate anaerobic polymicrobial communities associated with infections of odontogenic origins: the microbial community of Patient 1 consisted of one predominant species (Prevotella oris 74.6%) with 27 minor species, while the sample from Patient 2 contained 3 abundant species (Porphyromonas endodontalis 33.0%; P. oris 31.6%; and Prevotella koreensis 13.4%) with five minor species. A total of 150 and 136 putative ARGs were predicted in the metagenome of each pus sample. The coverage of most predicted ARGs was less than 10%, and only the CfxA2 gene identified in Patient 1 was covered 100%. ARG analysis of the seven assembled genome/metagenome datasets of P. oris revealed that strain C735 carried the CfxA2 gene. Conclusion: A metagenomics-based approach is useful to profile predominantly anaerobic polymicrobial communities but needs further verification for reliable ARG detection.

Development of Microsatellite Markers using BAC clone Sequencing on Porcine Chromosome 6q28 - 6q32 (돼지 6번 염색체(6q28 - 6q32)의 BAC clone 염기서열 분석에 의한 Microsatellite Markers 개발)

  • Chang, K.W.;Lee, K.T.;Park, E.W.;Choi, B.H.;Kim, T.H.;Cheong, I.C.;Oh, S.J.
    • Journal of Animal Science and Technology
    • /
    • v.46 no.3
    • /
    • pp.301-306
    • /
    • 2004
  • This study was conducted to develop new markers at the region that was related to QTL affecting intramuscular fat and backfat thickness on chromosome 6q28 - 6q32 in pigs. Dozens of repeated sequences were founded using shotgun sequencing of several BAC clones corresponding to that region, of which five new microstellite markers that identified polymorphism were discovered. The mean number of alleles at each locus observed 2.13(KP0290F2), 4.63(KP0248Cll), 7.38(KP1231C91), 2.75(KPI23IC92) and 6.2S(KP1231C93) in 8 breeds(Landrace, Korean native pig, Duroc, Yorkshire, Berkshire, Wuzhishan pig, Xiang pig, Min pig). The average estimated heterozygosity values at each locus varied from 0.2100(KP0290F2) to 0.8304(KPI23IC91) in all populations. In other hand, the average allele of all loci WlL'I within range of 0.4517(Berkshire) and 0.6957 (Yorkshire). Of these markers, KP0248C11, KP1231C91 and KP1231C93 were identified to have optimal number of alleles, high heterozygosity values and low standard deviation values. Especially, KPI23IC91 and KPI231C93 might be considered as a useful marker for genetic mapping and diversity study.