• Title, Summary, Keyword: Transcriptome

Search Result 273, Processing Time 0.057 seconds

Analysis of Whole Transcriptome Sequencing Data: Workflow and Software

  • Yang, In Seok;Kim, Sangwoo
    • Genomics & Informatics
    • /
    • v.13 no.4
    • /
    • pp.119-125
    • /
    • 2015
  • RNA is a polymeric molecule implicated in various biological processes, such as the coding, decoding, regulation, and expression of genes. Numerous studies have examined RNA features using whole transcriptome sequencing (RNA-seq) approaches. RNA-seq is a powerful technique for characterizing and quantifying the transcriptome and accelerates the development of bioinformatics software. In this review, we introduce routine RNA-seq workflow together with related software, focusing particularly on transcriptome reconstruction and expression quantification.

De novo Genome Assembly and Single Nucleotide Variations for Soybean Mosaic Virus Using Soybean Seed Transcriptome Data

  • Jo, Yeonhwa;Choi, Hoseong;Bae, Miah;Kim, Sang-Min;Kim, Sun-Lim;Lee, Bong Choon;Cho, Won Kyong;Kim, Kook-Hyung
    • The Plant Pathology Journal
    • /
    • v.33 no.5
    • /
    • pp.478-487
    • /
    • 2017
  • Soybean is the most important legume crop in the world. Several diseases in soybean lead to serious yield losses in major soybean-producing countries. Moreover, soybean can be infected by diverse viruses. Recently, we carried out a large-scale screening to identify viruses infecting soybean using available soybean transcriptome data. Of the screened transcriptomes, a soybean transcriptome for soybean seed development analysis contains several virus-associated sequences. In this study, we identified five viruses, including soybean mosaic virus (SMV), infecting soybean by de novo transcriptome assembly followed by blast search. We assembled a nearly complete consensus genome sequence of SMV China using transcriptome data. Based on phylogenetic analysis, the consensus genome sequence of SMV China was closely related to SMV isolates from South Korea. We examined single nucleotide variations (SNVs) for SMVs in the soybean seed transcriptome revealing 780 SNVs, which were evenly distributed on the SMV genome. Four SNVs, C-U, U-C, A-G, and G-A, were frequently identified. This result demonstrated the quasispecies variation of the SMV genome. Taken together, this study carried out bioinformatics analyses to identify viruses using soybean transcriptome data. In addition, we demonstrated the application of soybean transcriptome data for virus genome assembly and SNV analysis.

Evaluation and interpretation of transcriptome data underlying heterogeneous chronic obstructive pulmonary disease

  • Ham, Seokjin;Oh, Yeon-Mok;Roh, Tae-Young
    • Genomics & Informatics
    • /
    • v.17 no.1
    • /
    • pp.2.1-2.12
    • /
    • 2019
  • Chronic obstructive pulmonary disease (COPD) is a type of progressive lung disease, featured by airflow obstruction. Recently, a comprehensive analysis of the transcriptome in lung tissue of COPD patients was performed, but the heterogeneity of the sample was not seriously considered in characterizing the mechanistic dysregulation of COPD. Here, we established a new transcriptome analysis pipeline using a deconvolution process to reduce the heterogeneity and clearly identified that these transcriptome data originated from the mild or moderate stage of COPD patients. Differentially expressed or co-expressed genes in the protein interaction subnetworks were linked with mitochondrial dysfunction and the immune response, as expected. Computational protein localization prediction revealed that 19 proteins showing changes in subcellular localization were mostly related to mitochondria, suggesting that mislocalization of mitochondria-targeting proteins plays an important role in COPD pathology. Our extensive evaluation of COPD transcriptome data could provide guidelines for analyzing heterogeneous gene expression profiles and classifying potential candidate genes that are responsible for the pathogenesis of COPD.

A Study on Transcriptome Analysis Using de novo RNA-sequencing to Compare Ginseng Roots Cultivated in Different Environments

  • Yang, Byung Wook
    • Proceedings of the Plant Resources Society of Korea Conference
    • /
    • /
    • pp.5-5
    • /
    • 2018
  • Ginseng (Panax ginseng C.A. Meyer), one of the most widely used medicinal plants in traditional oriental medicine, is used for the treatment of various diseases. It has been classified according to its cultivation environment, such as field cultivated ginseng (FCG) and mountain cultivated ginseng (MCG). However, little is known about differences in gene expression in ginseng roots between field cultivated and mountain cultivated ginseng. In order to investigate the whole transcriptome landscape of ginseng, we employed High-Throughput sequencing technologies using the Illumina HiSeqTM2500 system, and generated a large amount of sequenced transcriptome from ginseng roots. Approximately 77 million and 87 million high-quality reads were produced in the FCG and MCG roots transcriptome analyses, respectively, and we obtained 256,032 assembled unigenes with an average length of 1,171 bp by de novo assembly methods. Functional annotations of the unigenes were performed using sequence similarity comparisons against the following databases: the non-redundant nucleotide database, the InterPro domains database, the Gene Ontology Consortium database, and the Kyoto Encyclopedia of Genes and Genomes pathway database. A total of 4,207 unigenes were assigned to specific metabolic pathways, and all of the known enzymes involved in starch and sucrose metabolism pathways were also identified in the KEGG library. This study indicated that alpha-glucan phosphorylase 1, putative pectinesterase/pectinesterase inhibitor 17, beta-amylase, and alpha-glucan phosphorylase isozyme H might be important factors involved in starch and sucrose metabolism between FCG and MCG in different environments.

  • PDF

Pathway Retrieval for Transcriptome Analysis using Fuzzy Filtering Technique andWeb Service

  • Lee, Kyung-Mi;Lee, Keon-Myung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.12 no.2
    • /
    • pp.167-172
    • /
    • 2012
  • In biology the advent of the high-throughput technology for sequencing, probing, or screening has produced huge volume of data which could not be manually handled. Biologists have resorted to software tools in order to effectively handle them. This paper introduces a bioinformatics tool to help biologists find potentially interesting pathway maps from a transcriptome data set in which the expression levels of genes are described for both case and control samples. The tool accepts a transcriptome data set, and then selects and categorizes some of genes into four classes using a fuzzy filtering technique where classes are defined by membership functions. It collects and edits the pathway maps related to those selected genes without analyst' intervention. It invokes a sequence of web service functions from KEGG, which an online pathway database system, in order to retrieve related information, locate pathway maps, and manipulate them. It maintains all retrieved pathway maps in a local database and presents them to the analysts with graphical user interface. The tool has been successfully used in identifying target genes for further analysis in transcriptome study of human cytomegalovirous. The tool is very helpful in that it can considerably save analysts' time and efforts by collecting and presenting the pathway maps that contain some interesting genes, once a transcriptome data set is just given.

Transcriptome analysis for the production of recombinant protein in Escherichia coli using DNA microarray

  • Heo, Won-Jae;Yun, Seong-Ho;Lee, Sang-Yeop
    • 한국생물공학회:학술대회논문집
    • /
    • /
    • pp.745-746
    • /
    • 2001
  • Transcriptome analysis was performed for the production of recombinant protein in E. coli using DNA microarray containing 2,850 genes including all functionally known and putative ones. Changes in transcriptome were analyzed qualitatively and quantitatively to provide their physiological and metabolic meanings.

  • PDF

Combined analysis of transcriptome and proteome for high cell density cultivation of Escherichia coli

  • Yun, Seong-Ho;Han, Mi-Jeong;Im, Geun-Bae;Lee, Sang-Yeop
    • 한국생물공학회:학술대회논문집
    • /
    • /
    • pp.845-848
    • /
    • 2001
  • For understanding physiology and metabolism under various culture conditions, combined analysis of transcriptome and proteome is attractable way. We have manufactured DNA microarray containing 2,850 genes including all functionally known and putative ones. In this study, we report analysis of transcriptome and proteome during the high cell density culture of E. coli by using DNA microarray and 2-DE. Fed-batch fermentation of E. coli was carried out by exponential feeding of nutrients until the maximum cell density reached 74 g dry cell weight/L (g DCW/L). Changes in transcriptome and proteome during the HCDC are analyzed qualitatively and quantitatively to provide their physiological and metabolic meanings.

  • PDF

Human Transcriptome and Chromatin Modifications: An ENCODE Perspective

  • Shen, Li;Choi, Inchan;Nestler, Eric J.;Won, Kyoung-Jae
    • Genomics & Informatics
    • /
    • v.11 no.2
    • /
    • pp.60-67
    • /
    • 2013
  • A decade-long project, led by several international research groups, called the Encyclopedia of DNA Elements (ENCODE), recently released an unprecedented amount of data. The ambitious project covers transcriptome, cistrome, epigenome, and interactome data from more than 1,600 sets of experiments in human. To make use of this valuable resource, it is important to understand the information it represents and the techniques that were used to generate these data. In this review, we introduce the data that ENCODE generated, summarize the observations from the data analysis, and revisit a computational approach that ENCODE used to predict gene expression, with a focus on the human transcriptome and its association with chromatin modifications.

Sequencing and Characterization of Divergent Marbling Levels in the Beef Cattle (Longissimus dorsi Muscle) Transcriptome

  • Chen, Dong;Li, Wufeng;Du, Min;Wu, Meng;Cao, Binghai
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.28 no.2
    • /
    • pp.158-165
    • /
    • 2015
  • Marbling is an important trait regarding the quality of beef. Analysis of beef cattle transcriptome and its expression profile data are essential to extend the genetic information resources and would support further studies on beef cattle. RNA sequencing was performed in beef cattle using the Illumina High-Seq2000 platform. Approximately 251.58 million clean reads were generated from a high marbling (H) group and low marbling (L) group. Approximately 80.12% of the 19,994 bovine genes (protein coding) were detected in all samples, and 749 genes exhibited differential expression between the H and L groups based on fold change (>1.5-fold, p<0.05). Multiple gene ontology terms and biological pathways were found significantly enriched among the differentially expressed genes. The transcriptome data will facilitate future functional studies on marbling formation in beef cattle and may be applied to improve breeding programs for cattle and closely related mammals.

Transcriptome profiling and comparative analysis of Panax ginseng adventitious roots

  • Jayakodi, Murukarthick;Lee, Sang-Choon;Park, Hyun-Seung;Jang, Woojong;Lee, Yun Sun;Choi, Beom-Soon;Nah, Gyoung Ju;Kim, Do-Soon;Natesan, Senthil;Sun, Chao;Yang, Tae-Jin
    • Journal of Ginseng Research
    • /
    • v.38 no.4
    • /
    • pp.278-288
    • /
    • 2014
  • Background: Panax ginseng Meyer is a traditional medicinal plant famous for its strong therapeutic effects and serves as an important herbal medicine. To understand and manipulate genes involved in secondary metabolic pathways including ginsenosides, transcriptome profiling of P. ginseng is essential. Methods: RNA-seq analysis of adventitious roots of two P. ginseng cultivars, Chunpoong (CP) and Cheongsun (CS), was performed using the Illumina HiSeq platform. After transcripts were assembled, expression profiling was performed. Results: Assemblies were generated from ~85 million and ~77 million high-quality reads from CP and CS cultivars, respectively. A total of 35,527 and 27,716 transcripts were obtained from the CP and CS assemblies, respectively. Annotation of the transcriptomes showed that approximately 90% of the transcripts had significant matches in public databases.We identified several candidate genes involved in ginsenoside biosynthesis. In addition, a large number of transcripts (17%) with different gene ontology designations were uniquely detected in adventitious roots compared to normal ginseng roots. Conclusion: This study will provide a comprehensive insight into the transcriptome of ginseng adventitious roots, and a way for successful transcriptome analysis and profiling of resource plants with less genomic information. The transcriptome profiling data generated in this study are available in our newly created adventitious root transcriptome database (http://im-crop.snu.ac.kr/transdb/index.php) for public use.