• Title/Summary/Keyword: illumina sequencing

Search Result 153, Processing Time 0.027 seconds

Bioinformatic Suggestions on MiSeq-Based Microbial Community Analysis

  • Unno, Tatsuya
    • Journal of Microbiology and Biotechnology
    • /
    • v.25 no.6
    • /
    • pp.765-770
    • /
    • 2015
  • Recent sequencing technology development has revolutionized fields of microbial ecology. MiSeq-based microbial community analysis allows us to sequence more than a few hundred samples at a time, which is far more cost-effective than pyrosequencing. The approach, however, has not been preferably used owing to computational difficulties of processing huge amounts of data as well as known Illumina-derived artefact problems with amplicon sequencing. The choice of assembly software to take advantage of paired-end sequencing and methods to remove Illumina artefacts sequences are discussed. The protocol we suggest not only removed erroneous reads, but also dramatically reduced computational workload, which allows even a typical desktop computer to process a huge amount of sequence data generated with Illumina sequencers. We also developed a Web interface (http://biotech.jejunu.ac.kr/ ~abl/16s/) that allows users to conduct fastq-merging and mothur batch creation. The study presented here should provide technical advantages and supports in applying MiSeq-based microbial community analysis.

Comparison of the MGISEQ-2000 and Illumina HiSeq 4000 sequencing platforms for RNA sequencing

  • Jeon, Sol A;Park, Jong Lyul;Kim, Jong-Hwan;Kim, Jeong Hwan;Kim, Yong Sung;Kim, Jin Cheon;Kim, Seon-Young
    • Genomics & Informatics
    • /
    • v.17 no.3
    • /
    • pp.32.1-32.6
    • /
    • 2019
  • Currently, Illumina sequencers are the globally leading sequencing platform in the next-generation sequencing market. Recently, MGI Tech launched a series of new sequencers, including the MGISEQ-2000, which promise to deliver high-quality sequencing data faster and at lower prices than Illumina's sequencers. In this study, we compared the performance of two major sequencers (MGISEQ-2000 and HiSeq 4000) to test whether the MGISEQ-2000 sequencer delivers high-quality sequence data as suggested. We performed RNA sequencing of four human colon cancer samples with the two platforms, and compared the sequencing quality and expression values. The data produced from the MGISEQ-2000 and HiSeq 4000 showed high concordance, with Pearson correlation coefficients ranging from 0.98 to 0.99. Various quality control (QC) analyses showed that the MGISEQ-2000 data fulfilled the required QC measures. Our study suggests that the performance of the MGISEQ-2000 is comparable to that of the HiSeq 4000 and that the MGISEQ-2000 can be a useful platform for sequencing.

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

  • Lim, Jong-Sung;Choi, Beom-Soon;Lee, Jeong-Soo;Shin, Chan-Seok;Yang, Tae-Jin;Rhee, Jae-Sung;Lee, Jae-Seong;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • v.10 no.1
    • /
    • pp.1-8
    • /
    • 2012
  • Recently, the technologies of DNA sequence variation and gene expression profiling have been used widely as approaches in the expertise of genome biology and genetics. The application to genome study has been particularly developed with the introduction of the nextgeneration DNA sequencer (NGS) Roche/454 and Illumina/ Solexa systems, along with bioinformation analysis technologies of whole-genome $de$ $novo$ assembly, expression profiling, DNA variation discovery, and genotyping. Both massive whole-genome shotgun paired-end sequencing and mate paired-end sequencing data are important steps for constructing $de$ $novo$ assembly of novel genome sequencing data. It is necessary to have DNA sequence information from a multiplatform NGS with at least $2{\times}$ and $30{\times}$ depth sequence of genome coverage using Roche/454 and Illumina/Solexa, respectively, for effective an way of de novo assembly. Massive shortlength reading data from the Illumina/Solexa system is enough to discover DNA variation, resulting in reducing the cost of DNA sequencing. Whole-genome expression profile data are useful to approach genome system biology with quantification of expressed RNAs from a wholegenome transcriptome, depending on the tissue samples. The hybrid mRNA sequences from Rohce/454 and Illumina/Solexa are more powerful to find novel genes through $de$ $novo$ assembly in any whole-genome sequenced species. The $20{\times}$ and $50{\times}$ coverage of the estimated transcriptome sequences using Roche/454 and Illumina/Solexa, respectively, is effective to create novel expressed reference sequences. However, only an average $30{\times}$ coverage of a transcriptome with short read sequences of Illumina/Solexa is enough to check expression quantification, compared to the reference expressed sequence tag sequence.

Whole genome sequencing based noninvasive prenatal test

  • Cho, Eun-Hae
    • Journal of Genetic Medicine
    • /
    • v.12 no.2
    • /
    • pp.61-65
    • /
    • 2015
  • Whole genome sequencing (WGS)-based noninvasive prenatal test (NIPT) is the first method applied in the clinical setting out of various NIPT techniques. Several companies, such as Sequenom, BGI, and Illumina offer WGS-based NIPT, each with different technical and bioinformatic approaches. Sequenom, BGI, and Illumina utilize z-, t-, and L-scores, as well as normalized chromosome values, respectively, for trisomy detection. Their outstanding performance has been demonstrated in clinical studies of more than 100,000 pregnancies. The sensitivity and specificity for detection of trisomies 13, 18, and 21 were above 98%, as reported by all three companies. Unlike other techniques, WGS-based NIPT can detect other trisomies as well as clinically significant segmental duplications/deletions within a chromosome, which could expand the scope of NIPT. Incorrect results could be due to low fetal fraction, fetoplacental mosaicism, confined placental mosaicism or maternal copy number variation (CNV). Among those, maternal CNV is a significant contributor of false positive results and therefore genome wide scanning plays an important role in preventing the occurrence of false positives. In this article, the bioinformatic techniques and clinical performance of three major companies are comprehensively reviewed.

Freeze-drying feces reduces illumina-derived artefacts on 16S rRNA-based microbial community analysis (Illumina를 이용한16S rRNA 기반 미생물생태분석에서 분변의 동결건조에 의한 인공적인 시퀀스 생성 감소효과)

  • Kim, Jungman;Unno, Tatsuya
    • Journal of Applied Biological Chemistry
    • /
    • v.59 no.4
    • /
    • pp.299-304
    • /
    • 2016
  • When used for amplicon sequencing, Illumina platforms produce more than hundreds of sequence artefacts, which affects operational taxonomic units based analyses such as differential abundance and network analyses. Nevertheless it has become a major tool for fecal microbial community analysis. In addition, results from sequence-based fecal microbial community analysis vary depending on conditions of samples (i.e., freshness, time of storage and quantity). We investigated if freeze-drying samples could improve quality of sequence data. Our results showed reduced number of possible artefacts while maintaining overall microbial community structure. Therefore, freeze-drying feces prior to DNA extraction is recommended for Illumina-based microbial community analysis.

Characterization of the Biodiversity of the Spoilage Microbiota in Chicken Meat Using Next Generation Sequencing and Culture Dependent Approach

  • Lee, Hee Soo;Kwon, Mirae;Heo, Sunhak;Kim, Min Gon;Kim, Geun-Bae
    • Food Science of Animal Resources
    • /
    • v.37 no.4
    • /
    • pp.535-541
    • /
    • 2017
  • This study investigated the psychrotrophic bacteria isolated from chicken meat to characterize their microbial composition during refrigerated storage. The bacterial community was identified by the Illumina MiSeq method based on bacterial DNA extracted from spoiled chicken meat. Molecular identification of the isolated psychrotrophic bacteria was carried out using 16S rDNA sequencing and their putrefactive potential was investigated by the growth at low temperature as well as their proteolytic activities in chicken meat. From the Illumina sequencing, a total of 187,671 reads were obtained from 12 chicken samples. Regardless of the type of chicken meat (i.e., whole meat and chicken breast) and storage temperatures ($4^{\circ}C$ and $10^{\circ}C$), Pseudomonas weihenstephanensis and Pseudomonas congelans were the most prominent bacterial species. Serratia spp. and Acinetobacter spp. were prominent in chicken breast and whole chicken meat, respectively. The 118 isolated strains of psychrotrophic bacteria comprised Pseudomonas spp. (58.48%), Serratia spp. (10.17%), and Morganella spp. (6.78%). All isolates grew well at $10^{\circ}C$ and they induced different proteolytic activities depending on the species and strains. Parallel analysis of the next generation sequencing and culture dependent approach provides in-depth information on the biodiversity of the spoilage microbiota in chicken meat. Further study is needed to develop better preservation methods against these spoilage bacteria.

Validation of fetus aneuploidy in 221 Korean clinical samples using noninvasive chromosome examination: Clinical laboratory improvement amendments-certified noninvasive prenatal test

  • Kim, Min-Jeong;Kwon, Chang Hyuk;Kim, Dong-In;Im, Hee Su;Park, Sungil;Kim, Ji Ho;Bae, Jin-Sik;Lee, Myunghee;Lee, Min Seob
    • Journal of Genetic Medicine
    • /
    • v.12 no.2
    • /
    • pp.79-84
    • /
    • 2015
  • Purpose: We developed and validated a fetal trisomy detection method for use as a noninvasive prenatal test (NIPT) including a Clinical Laboratory Improvement Amendments (CLIA)-certified bioinformatics pipeline on a cloud-based computing system using both Illumina and Life Technology sequencing platforms for 221 Korean clinical samples. We determined the necessary proportions of the fetal fraction in the cell-free DNA (cfDNA) sample for NIPT of trisomies 13, 18, and 21 through a limit of quantification (LOQ) test. Materials and Methods: Next-generation sequencing libraries from 221 clinical samples and three positive controls were generated using Illumina and Life Technology chemistries. Sequencing results were uploaded to a cloud and mapped on the human reference genome (GRCh37/hg19) using bioinformatics tools. Based on Z-scores calculated by normalization of the mapped read counts, final aneuploidy reports were automatically generated for fetal aneuploidy determination. Results: We identified in total 29 aneuploid samples, and additional analytical methods performed to confirm the results showed that one of these was a false-positive. The LOQ test showed that the proportion of fetal fraction in the cfDNA sample would affect the interpretation of the aneuploidy results. Conclusion: Noninvasive chromosome examination (NICE), a CLIA-certified NIPT with a cloud-based bioinformatics platform, showed unambiguous success in fetus aneuploidy detection.

Study on Microbial Community Succession and Protein Hydrolysis of Donkey Meat during Refrigerated Storage Based on Illumina NOVA Sequencing Technology

  • Wei, Zixiang;Chu, Ruidong;Li, Lanjie;Zhang, Jingjing;Zhang, Huachen;Pan, Xiaohong;Dong, Yifan;Liu, Guiqin
    • Food Science of Animal Resources
    • /
    • v.41 no.4
    • /
    • pp.701-714
    • /
    • 2021
  • In this study, the microbial community succession and the protein hydrolysis of donkey meat during refrigerated (4℃) storage were investigated. 16S rDNA sequencing method was used to analyze the bacteria community structure and succession in the level of genome. Meanwhile, the volatile base nitrogen (TVB-N) was measured to evaluate the degradation level of protein. After sorting out the sequencing results, 1,274,604 clean data were obtained, which were clustered into 2,064 into operational taxonomic units (OTUs), annotated to 32 phyla and 527 genus. With the prolonging of storage time, the composition of microorganism changed greatly. At the same time, the diversity and richness of microorganism decreased and then increased. During the whole storage period, Proteobacteria was the dominant phyla, and the Photobacterium, Pseudompnas, and Acinetobacter were the dominant genus. According to correlation analysis, it was found that the abundance of these dominant bacteria was significantly positively correlated with the variation of TVB-N. And Pseudomonas might play an important role in the production of TVB-N during refrigerated storage of donkey meat. The predicted metabolic pathways, based on PICRUSt analysis, indicated that amino metabolism in refrigerated donkey meat was the main metabolic pathways. This study provides insight into the process involved in refrigerated donkey meat spoilage, which provides a foundation for the development of antibacterial preservative for donkey meat.

Toward Complete Bacterial Genome Sequencing Through the Combined Use of Multiple Next-Generation Sequencing Platforms

  • Jeong, Haeyoung;Lee, Dae-Hee;Ryu, Choong-Min;Park, Seung-Hwan
    • Journal of Microbiology and Biotechnology
    • /
    • v.26 no.1
    • /
    • pp.207-212
    • /
    • 2016
  • PacBio's long-read sequencing technologies can be successfully used for a complete bacterial genome assembly using recently developed non-hybrid assemblers in the absence of second-generation, high-quality short reads. However, standardized procedures that take into account multiple pre-existing second-generation sequencing platforms are scarce. In addition to Illumina HiSeq and Ion Torrent PGM-based genome sequencing results derived from previous studies, we generated further sequencing data, including from the PacBio RS II platform, and applied various bioinformatics tools to obtain complete genome assemblies for five bacterial strains. Our approach revealed that the hierarchical genome assembly process (HGAP) non-hybrid assembler resulted in nearly complete assemblies at a moderate coverage of ~75x, but that different versions produced non-compatible results requiring post processing. The other two platforms further improved the PacBio assembly through scaffolding and a final error correction.

A Universal Analysis Pipeline for Hybrid Capture-Based Targeted Sequencing Data with Unique Molecular Indexes

  • Kim, Min-Jung;Kim, Si-Cho;Kim, Young-Joon
    • Genomics & Informatics
    • /
    • v.16 no.4
    • /
    • pp.29.1-29.5
    • /
    • 2018
  • Hybrid capture-based targeted sequencing is being used increasingly for genomic variant profiling in tumor patients. Unique molecular index (UMI) technology has recently been developed and helps to increase the accuracy of variant calling by minimizing polymerase chain reaction biases and sequencing errors. However, UMI-adopted targeted sequencing data analysis is slightly different from the methods for other types of omics data, and its pipeline for variant calling is still being optimized in various study groups for their own purposes. Due to this provincial usage of tools, our group built an analysis pipeline for global application to many studies of targeted sequencing generated with different methods. First, we generated hybrid capture-based data using genomic DNA extracted from tumor tissues of colorectal cancer patients. Sequencing libraries were prepared and pooled together, and an 8-plexed capture library was processed to the enrichment step before 150-bp paired-end sequencing with Illumina HiSeq series. For the analysis, we evaluated several published tools. We focused mainly on the compatibility of the input and output of each tool. Finally, our laboratory built an analysis pipeline specialized for UMI-adopted data. Through this pipeline, we were able to estimate even on-target rates and filtered consensus reads for more accurate variant calling. These results suggest the potential of our analysis pipeline in the precise examination of the quality and efficiency of conducted experiments.