• Title/Summary/Keyword: Gene list analysis

Search Result 38, Processing Time 0.028 seconds

Meta-analysis of Gene Expression Data Identifies Causal Genes for Prostate Cancer

  • Wang, Xiang-Yang;Hao, Jian-Wei;Zhou, Rui-Jin;Zhang, Xiang-Sheng;Yan, Tian-Zhong;Ding, De-Gang;Shan, Lei
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.14 no.1
    • /
    • pp.457-461
    • /
    • 2013
  • Prostate cancer is a leading cause of death in male populations across the globe. With the advent of gene expression arrays, many microarray studies have been conducted in prostate cancer, but the results have varied across different studies. To better understand the genetic and biologic mechanisms of prostate cancer, we conducted a meta-analysis of two studies on prostate cancer. Eight key genes were identified to be differentially expressed with progression. After gene co-expression analysis based on data from the GEO database, we obtained a co-expressed gene list which included 725 genes. Gene Ontology analysis revealed that these genes are involved in actin filament-based processes, locomotion and cell morphogenesis. Further analysis of the gene list should provide important clues for developing new prognostic markers and therapeutic targets.

Applying a modified AUC to gene ranking

  • Yu, Wenbao;Chang, Yuan-Chin Ivan;Park, Eunsik
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.3
    • /
    • pp.307-319
    • /
    • 2018
  • High-throughput technologies enable the simultaneous evaluation of thousands of genes that could discriminate different subclasses of complex diseases. Ranking genes according to differential expression is an important screening step for follow-up analysis. Many statistical measures have been proposed for this purpose. A good ranked list should provide a stable rank (at least for top-ranked gene), and the top ranked genes should have a high power in differentiating different disease status. However, there is a lack of emphasis in the literature on ranking genes based on these two criteria simultaneously. To achieve the above two criteria simultaneously, we proposed to apply a previously reported metric, the modified area under the receiver operating characteristic cure, to gene ranking. The proposed ranking method is found to be promising in leading to a stable ranking list and good prediction performances of top ranked genes. The findings are illustrated through studies on both synthesized data and real microarray gene expression data. The proposed method is recommended for ranking genes or other biomarkers for high-dimensional omics studies.

CaGe: A Web-Based Cancer Gene Annotation System for Cancer Genomics

  • Park, Young-Kyu;Kang, Tae-Wook;Baek, Su-Jin;Kim, Kwon-Il;Kim, Seon-Young;Lee, Do-Heon;Kim, Yong-Sung
    • Genomics & Informatics
    • /
    • v.10 no.1
    • /
    • pp.33-39
    • /
    • 2012
  • High-throughput genomic technologies (HGTs), including next-generation DNA sequencing (NGS), microarray, and serial analysis of gene expression (SAGE), have become effective experimental tools for cancer genomics to identify cancer-associated somatic genomic alterations and genes. The main hurdle in cancer genomics is to identify the real causative mutations or genes out of many candidates from an HGT-based cancer genomic analysis. One useful approach is to refer to known cancer genes and associated information. The list of known cancer genes can be used to determine candidates of cancer driver mutations, while cancer gene-related information, including gene expression, protein-protein interaction, and pathways, can be useful for scoring novel candidates. Some cancer gene or mutation databases exist for this purpose, but few specialized tools exist for an automated analysis of a long gene list from an HGT-based cancer genomic analysis. This report presents a new web-accessible bioinformatic tool, called CaGe, a cancer genome annotation system for the assessment of candidates of cancer genes from HGT-based cancer genomics. The tool provides users with information on cancer-related genes, mutations, pathways, and associated annotations through annotation and browsing functions. With this tool, researchers can classify their candidate genes from cancer genome studies into either previously reported or novel categories of cancer genes and gain insight into underlying carcinogenic mechanisms through a pathway analysis. We show the usefulness of CaGe by assessing its performance in annotating somatic mutations from a published small cell lung cancer study.

BINGO: Biological Interpretation Through Statistically and Graph-theoretically Navigating Gene $Ontology^{TM}$

  • Lee, Sung-Geun;Yang, Jae-Seong;Chung, Il-Kyung;Kim, Yang-Seok
    • Molecular & Cellular Toxicology
    • /
    • v.1 no.4
    • /
    • pp.281-283
    • /
    • 2005
  • Extraction of biologically meaningful data and their validation are very important for toxicogenomics study because it deals with huge amount of heterogeneous data. BINGO is an annotation mining tool for biological interpretation of gene groups. Several statistical modeling approaches using Gene Ontology (GO) have been employed in many programs for that purpose. The statistical methodologies are useful in investigating the most significant GO attributes in a gene group, but the coherence of the resultant GO attributes over the entire group is rarely assessed. BINGO complements the statistical methods with graph-theoretic measures using the GO directed acyclic graph (DAG) structure. In addition, BINGO visualizes the consistency of a gene group more intuitively with a group-based GO subgraph. The input group can be any interesting list of genes or gene products regardless of its generation process if the group is built under a functional congruency hypothesis such as gene clusters from DNA microarray analysis.

The Implement of System on Microarry Classification Using Combination of Signigicant Gene Selection Method (정보력 있는 유전자 선택 방법 조합을 이용한 마이크로어레이 분류 시스템 구현)

  • Park, Su-Young;Jung, Chai-Yeoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.2
    • /
    • pp.315-320
    • /
    • 2008
  • Nowadays, a lot of related data obtained from these research could be given a new present meaning to accomplish the original purpose of the whole research as a human genome project. In such a thread, construction of gene expression analysis system and a basis rank analysis system is being watched newly. Recently, being identified fact that particular sub-class of tumor be related with particular chromosome, microarray started to be used in diagnosis field by doing cancer classification and predication based on gene expression information. In this thesis, we used cDNA microarrays of 3840 genes obtained from neuronal differentiation experiment of cortical stem cells on white mouse with cancer, created system that can extract informative gene list through normalization separately and proposed combination method for selecting more significant genes. And possibility of proposed system and method is verified through experiment. That result is that PC-ED combination represent 98.74% accurate and 0.04% MSE, which show that it improve classification performance than case to experiment after generating gene list using single similarity scale.

GoBean: a Java GUI application for visual exploration of GO term enrichments

  • Lee, Sang-Hyuk;Cha, Ji-Young;Kim, Hyeon-Jin;Yu, Ung-Sik
    • BMB Reports
    • /
    • v.45 no.2
    • /
    • pp.120-125
    • /
    • 2012
  • We have developed a biologist-friendly, Java GUI application (GoBean) for GO term enrichment analysis. It was designed to be a comprehensive and flexible GUI tool for GO term enrichment analysis, combining the merits of other programs and incorporating extensive graphic exploration of enrichment results. An intuitive user interface with multiple panels allows for extensive visual scrutiny of analysis results. The program includes many essential and useful features, such as enrichment analysis algorithms, multiple test correction methods, and versatile filtering of enriched GO terms for more focused analyses. A unique graphic interface reflecting the GO tree structure was devised to facilitate comparisons of multiple GO analysis results, which can provide valuable insights for biological interpretation. Additional features to enhance user convenience include built in ID conversion, evidence code-based gene-GO association filtering, set operations of gene lists and enriched GO terms, and user -provided data files. It is available at http://neon.gachon.ac.kr/GoBean/.

Comparative Statistic Module (CSM) for Significant Gene Selection

  • Kim, Young-Jin;Kim, Hyo-Mi;Kim, Sang-Bae;Park, Chan;Kimm, Kuchan;Koh, InSong
    • Genomics & Informatics
    • /
    • v.2 no.4
    • /
    • pp.180-183
    • /
    • 2004
  • Comparative Statistic Module(CSM) provides more reliable list of significant genes to genomics researchers by offering the commonly selected genes and a method of choice by calculating the rank of each statistical test based on the average ranking of common genes across the five statistical methods, i.e. t-test, Kruskal-Wallis (Wilcoxon signed rank) test, SAM, two sample multiple test, and Empirical Bayesian test. This statistical analysis module is implemented in Perl, and R languages.

Identification of novel potential drugs and miRNAs biomarkers in lung cancer based on gene co-expression network analysis

  • Sara Hajipour;Sayed Mostafa Hosseini;Shiva Irani;Mahmood Tavallaie
    • Genomics & Informatics
    • /
    • v.21 no.3
    • /
    • pp.38.1-38.8
    • /
    • 2023
  • Non-small cell lung cancer (NSCLC) is an important cause of cancer-associated deaths worldwide. Therefore, the exact molecular mechanisms of NSCLC are unidentified. The present investigation aims to identify the miRNAs with predictive value in NSCLC. The two datasets were downloaded from the Gene Expression Omnibus (GEO) database. Differentially expressed miRNAs (DEmiRNA) and mRNAs (DEmRNA) were selected from the normalized data. Next, miRNA-mRNA interactions were determined. Then, co-expression network analysis was completed using the WGCNA package in R software. The co-expression network between DEmiRNAs and DEmRNAs was calculated to prioritize the miRNAs. Next, the enrichment analysis was performed for DEmiRNA and DEmRNA. Finally, the drug-gene interaction network was constructed by importing the gene list to dgidb database. A total of 3,033 differentially expressed genes and 58 DEmiRNA were recognized from two datasets. The co-expression network analysis was utilized to build a gene co- expression network. Next, four modules were selected based on the Zsummary score. In the next step, a bipartite miRNA-gene network was constructed and hub miRNAs (let-7a-2-3p, let-7d-5p, let-7b-5p, let-7a-5p, and let-7b-3p) were selected. Finally, a drug-gene network was constructed while SUNITINIB, MEDROXYPROGESTERONE ACETATE, DOFETILIDE, HALOPERIDOL, and CALCITRIOL drugs were recognized as a beneficial drug in NSCLC. The hub miRNAs and repurposed drugs may act a vital role in NSCLC progression and treatment, respectively; however, these results must validate in further clinical and experimental assessments.

Normative Issues in Next Generation Sequencing Gene Testing

  • Na-Kyoung Kim
    • Development and Reproduction
    • /
    • v.27 no.1
    • /
    • pp.47-56
    • /
    • 2023
  • Despite the commercialization of Next generation sequencing (NGS) gene testing, only a few studies have addressed the various ethical and legal problems associated with NGS testing in Korea Here, we reviewed the normative issues that emerged at each stage of the wet analysis and bioinformatics analysis of NGS gene testing. In particular, it was in mind to apply various international guidelines and the principles of bioethics to actual clinical practice. Considering the characteristics of NGS testing, wet analysis of additional testing can be justified if presumptive consent is recognized. Furthermore, the medical relationship between diseases needs to be established and it should be clear that the patient would have given consent if the patient had been aware of the correlation between genes. At the stage of bioinformatics analysis, the question of unsolicited findings arises. In case of unsolicited and relevant findings, according to American College of Medical Genetics and Genomics (ACMG), a recognized relationship between genes and diseases needs to be established. In case of unsolicited and not-relevant findings, it is almost impossible to determine whether knowing or not knowing the findings is more beneficial to the patient. However, it seems to be certain that the psychological harm an individual may suffer from such information is likely to be greater if the disease is severe and if there is no cure. The list of genes for which the ACMG guidelines impose reporting obligations is a good reference for judgment.

An integrated Bayesian network framework for reconstructing representative genetic regulatory networks.

  • Lee, Phil-Hyoun;Lee, Do-Heon;Lee, Kwang-Hyung
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2003.10a
    • /
    • pp.164-169
    • /
    • 2003
  • In this paper, we propose the integrated Bayesian network framework to reconstruct genetic regulatory networks from genome expression data. The proposed model overcomes the dimensionality problem of multivariate analysis by building coherent sub-networks from confined gene clusters and combining these networks via intermediary points. Gene Shaving algorithm is used to cluster genes that share a common function or co-regulation. Retrieved clusters incorporate prior biological knowledge such as Gene Ontology, pathway, and protein protein interaction information for extracting other related genes. With these extended gene list, system builds genetic sub-networks using Bayesian network with MDL score and Sparse Candidate algorithm. Identifying functional modules of genes is done by not only microarray data itself but also well-proved biological knowledge. This integrated approach can improve there liability of a network in that false relations due to the lack of data can be reduced. Another advantage is the decreased computational complexity by constrained gene sets. To evaluate the proposed system, S. Cerevisiae cell cycle data [1] is applied. The result analysis presents new hypotheses about novel genetic interactions as well as typical relationships known by previous researches [2].

  • PDF