• 제목/요약/키워드: Microarray data analysis

검색결과 323건 처리시간 0.023초

Exploration of Molecular Mechanisms of Diffuse Large B-cell Lymphoma Development Using a Microarray

  • Zhang, Zong-Xin;Shen, Cui-Fen;Zou, Wei-Hua;Shou, Li-Hong;Zhang, Hui-Ying;Jin, Wen-Jun
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제14권3호
    • /
    • pp.1731-1735
    • /
    • 2013
  • Objective: We aimed to identify key genes, pathways and function modules in the development of diffuse large B-cell lymphoma (DLBCL) with microarray data and interaction network analysis. Methods: Microarray data sets for 7 DLBCL samples and 7 normal controls was downloaded from the Gene Expression Omnibus (GEO) database and differentially expressed genes (DEGs) were identified with Student's t-test. KEGG functional enrichment analysis was performed to uncover their biological functions. Three global networks were established for immune system, signaling molecules and interactions and cancer genes. The DEGs were compared with the networks to observe their distributions and determine important key genes, pathways and modules. Results: A total of 945 DEGs were obtained, 272 up-regulated and 673 down-regulated. KEGG analysis revealed that two groups of pathways were significantly enriched: immune function and signaling molecules and interactions. Following interaction network analysis further confirmed the association of DEGs in immune system, signaling molecules and interactions and cancer genes. Conclusions: Our study could systemically characterize gene expression changes in DLBCL with microarray technology. A range of key genes, pathways and function modules were revealed. Utility in diagnosis and treatment may be expected with further focused research.

Statistical Analysis about Ability to Mouse Embryonic Stem Cell Differentiation using cDNA Microarray

  • Choi, Hang-Suk;Kim, Sung-Ju;Lee, Young-Jin;Cha, Kyung-Joon;Kim, Chul-Geun
    • Journal of the Korean Data and Information Science Society
    • /
    • 제16권4호
    • /
    • pp.951-958
    • /
    • 2005
  • As a foundation study of stem cell applied research, it is necessary to identify the large gene expression through cDNA microarray to understand principles of the level of molecular about cell function. In this paper, we investigated the gene expression through the K-means clustering method and path analysis with genes related to pluripoteny and differentiation in an mouse early stage embryonic development process and embryonic stem cell differentiation. We find a few biological phenomenon through this study. Also, we realize that this process provides functional relationship of unknown genes.

  • PDF

Finding associations between genes by time-series microarray sequential patterns analysis

  • Nam, Ho-Jung;Lee, Do-Heon
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.161-164
    • /
    • 2005
  • Data mining techniques can be applied to identify patterns of interest in the gene expression data. One goal in mining gene expression data is to determine how the expression of any particular gene might affect the expression of other genes. To find relationships between different genes, association rules have been applied to gene expression data set [1]. A notable limitation of association rule mining method is that only the association in a single profile experiment can be detected. It cannot be used to find rules across different condition profiles or different time point profile experiments. However, with the appearance of time-series microarray data, it became possible to analyze the temporal relationship between genes. In this paper, we analyze the time-series microarray gene expression data to extract the sequential patterns which are similar to the association rules between genes among different time points in the yeast cell cycle. The sequential patterns found in our work can catch the associations between different genes which express or repress at diverse time points. We have applied sequential pattern mining method to time-series microarray gene expression data and discovered a number of sequential patterns from two groups of genes (test, control) and more sequential patterns have been discovered from test group (same CO term group) than from the control group (different GO term group). This result can be a support for the potential of sequential patterns which is capable of catching the biologically meaningful association between genes.

  • PDF

Monitoring of Gene Regulations Using Average Rank in DNA Microarray: Implementation of R

  • Park, Chang-Soon
    • Journal of the Korean Data and Information Science Society
    • /
    • 제18권4호
    • /
    • pp.1005-1021
    • /
    • 2007
  • Traditional procedures for DNA microarray data analysis are to preprocess and normalize the gene expression data, and then to analyze the normalized data using statistical tests. Drawbacks of the traditional methods are: genuine biological signal may be unwillingly eliminated together with artifacts, the limited number of arrays per gene make statistical tests difficult to use the normality assumption or nonparametric method, and genes are tested independently without consideration of interrelationships among genes. A novel method using average rank in each array is proposed to eliminate such drawbacks. This average rank method monitors differentially regulated genes among genetically different groups and the selected genes are somewhat different from those selected by traditional P-value method. Addition of genes selected by the average rank method to the traditional method will provide better understanding of genetic differences of groups.

  • PDF

프마이크로어레이 데이터의 유전자 집합 및 대사 경로 분석 (Gene Set and Pathway Analysis of Microarray Data)

  • 김선영
    • 유전체소식지
    • /
    • 제6권1호
    • /
    • pp.29-33
    • /
    • 2006
  • Gene set analysis is a new concept and method. to analyze and interpret microarray gene expression data and tries to extract biological meaning from gene expression data at gene set level rather than at gene level. Compared with methods which select a few tens or hundreds of genes before gene ontology and pathway analysis, gene set analysis identifies important gene ontology terms and pathways more consistently and performs well even in gene expression data sets with minimal or moderate gene expression changes. Moreover, gene set analysis is useful for comparing multiple gene expression data sets dealing with similar biological questions. This review briefly summarizes the rationale behind the gene set analysis and introduces several algorithms and tools now available for gene set analysis.

  • PDF

서로 다른 플랫폼의 마이크로어레이 연구 통합 분석 (Cross Platform Data Analysis in Microarray Experiment)

  • 이장미;이선호
    • 응용통계연구
    • /
    • 제26권2호
    • /
    • pp.307-319
    • /
    • 2013
  • 마이크로어레이 실험의 특성상 표본의 수가 많지 않는 단점을 보완하고 분석 결과를 일반화하기 위하여 공개 저장소에 축적된 자료 중에 연구 목적이 동일한 여러 연구들을 통합하여 분석하려는 시도가 활발하다. 그러나 실험에서 사용한 플랫폼이 서로 다른 경우에는 유전자 관찰값의 분포가 달라지기 때문에 통합이 어렵고 최상의 통합 방법이 제시되어 있지 않다. 본 논문에서는 순위 기반 중위수, 분위수 이산화와 표준화를 각각 이용하여 변환한 자료값을 직접 합치거나 메타분석을 하여 연구 결과를 합치는 방법을 알아 보았다. 또한 GEO에서 다운받은 실제 자료들을 이용하여 네 가지 방법의 장단점과 효과를 비교하였고 서로 다른 연구 자료를 통합하는 것의 영향을 알아보았다.

약동학적 파라미터를 이용한 시간경로 마이크로어레이 자료의 군집분석 (Clustering of Time-Course Microarray Data Using Pharmacokinetic Parameter)

  • 이효정;김별아;박미라
    • 응용통계연구
    • /
    • 제24권4호
    • /
    • pp.623-631
    • /
    • 2011
  • 시간경로 마이크로어레이 자료 분석의 주요 목적 중의 하나는 유전자들의 시간에 따른 발현수준의 변화를 고려함으로써 발현패턴에 기초한 유전자들의 그룹을 찾기 위한 것으로, 군집분석을 위한 다양한 알고리즘들이 제안되었다. 본 연구에서 시간경로 마이크로어레이 자료에 대한 군집분석을 위해 두 약물제제 간 생물학적 동등성을 평가하기 위한 약동학 시험에서 사용되는 약동학적 파라미터 값에 기초한 군집분석을 제안하였으며 이를 실제 데이터 및 모의실험 자료에 적용하여 유용성을 검토하였다.

Supervised Model for Identifying Differentially Expressed Genes in DNA Microarray Gene Expression Dataset Using Biological Pathway Information

  • Chung, Tae Su;Kim, Keewon;Kim, Ju Han
    • Genomics & Informatics
    • /
    • 제3권1호
    • /
    • pp.30-34
    • /
    • 2005
  • Microarray technology makes it possible to measure the expressions of tens of thousands of genes simultaneously under various experimental conditions. Identifying differentially expressed genes in each single experimental condition is one of the most common first steps in microarray gene expression data analysis. Reasonable choices of thresholds for determining differentially expressed genes are used for the next-stap-analysis with suitable statistical significances. We present a supervised model for identifying DEGs using pathway information based on the global connectivity structure. Pathway information can be regarded as a collection of biological knowledge, thus we are trying to determine the optimal threshold so that the consequential connectivity structure can be the most compatible with the existing pathway information. The significant feature of our model is that it uses established knowledge as a reference to determine the direction of analyzing microarray dataset. In the most of previous work, only intrinsic information in the miroarray is used for the identifying DEGs. We hope that our proposed method could contribute to construct biologically meaningful structure from microarray datasets.

Improved Statistical Testing of Two-class Microarrays with a Robust Statistical Approach

  • Oh, Hee-Seok;Jang, Dong-Ik;Oh, Seung-Yoon;Kim, Hee-Bal
    • Interdisciplinary Bio Central
    • /
    • 제2권2호
    • /
    • pp.4.1-4.6
    • /
    • 2010
  • The most common type of microarray experiment has a simple design using microarray data obtained from two different groups or conditions. A typical method to identify differentially expressed genes (DEGs) between two conditions is the conventional Student's t-test. The t-test is based on the simple estimation of the population variance for a gene using the sample variance of its expression levels. Although empirical Bayes approach improves on the t-statistic by not giving a high rank to genes only because they have a small sample variance, the basic assumption for this is same as the ordinary t-test which is the equality of variances across experimental groups. The t-test and empirical Bayes approach suffer from low statistical power because of the assumption of normal and unimodal distributions for the microarray data analysis. We propose a method to address these problems that is robust to outliers or skewed data, while maintaining the advantages of the classical t-test or modified t-statistics. The resulting data transformation to fit the normality assumption increases the statistical power for identifying DEGs using these statistics.

마이크로어레이 기반 miRNA 모듈 분석을 위한 하이퍼망 분류 기법 (Hypernetwork Classifiers for Microarray-Based miRNA Module Analysis)

  • 김선;김수진;장병탁
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제35권6호
    • /
    • pp.347-356
    • /
    • 2008
  • 마이크로어레이는 분자 생물학 실험에 있어 중요한 도구로 사용되고 있으며, 마이크로어레이 데이타 분석을 위한 다양한 계산학적 방법이 개발되어 왔다. 그러나, 기존 분석방법은 주어진 조건에 영향을 주는 개별 유전자를 추출하는 데 강한 방면, 유전자 간의 복합작용에 의한 영향을 분석하기 힘들다는 단점을 가지고 있다. 하이퍼망 모델은 생물학적인 네트워크 작용을 모방한 구조이며, 계산과정에서 요소간의 복합작용을 직접 고려하기 때문에 기존 방법에서 다루기 힘들었던 요소간 상호작용 분석이 가능하다는 장점을 가진다. 본 논문에서는 마이크로어레이 데이타를 기반으로 microRNA(miRNA) 프로파일 분석을 위한 하이퍼망 분류 기법을 소개한다. 하이퍼망 분류기는 miRNA 쌍을 기본 요소로 하여 진화 과정을 통해 miRNA 분류 데이타를 학습한다. 학습된 하이퍼망으로부터 유의하다.고 판단되는 miRNA 모듈을 쉽게 추출할 수 있으며, 사용자는 추출된 모듈의 유치미성을 직접 판단할 수 있다. 하이퍼망 분류기는 암 관련 miRNA 발현 데이타 분류 실험을 통해 91.46%의 정확도를 보임으로써 기존 기계학습 방법에 비해 뛰어난 성능을 보여주었으며, 하이퍼망 분석을 통해 생물학적으로 유의한 miRNA 모듈을 찾을 수 있음을 확인하였다.