Search | Korea Science

Gene Set and Pathway Analysis of Microarray Data (프마이크로어레이 데이터의 유전자 집합 및 대사 경로 분석)

Kim Seon-Young
- KOGO NEWS
- /
- v.6 no.1
- /
- pp.29-33
- /
- 2006
Gene set analysis is a new concept and method. to analyze and interpret microarray gene expression data and tries to extract biological meaning from gene expression data at gene set level rather than at gene level. Compared with methods which select a few tens or hundreds of genes before gene ontology and pathway analysis, gene set analysis identifies important gene ontology terms and pathways more consistently and performs well even in gene expression data sets with minimal or moderate gene expression changes. Moreover, gene set analysis is useful for comparing multiple gene expression data sets dealing with similar biological questions. This review briefly summarizes the rationale behind the gene set analysis and introduces several algorithms and tools now available for gene set analysis.
PDF

Statistical Methods for Gene Expression Data

Kim, Choongrak
- Communications for Statistical Applications and Methods
- /
- v.11 no.1
- /
- pp.59-77
- /
- 2004
Since the introduction of DNA microarray, a revolutionary high through-put biological technology, a lot of papers have been published to deal with the analyses of the gene expression data from the microarray. In this paper we review most papers relevant to the cDNA microarray data, classify them in statistical methods' point of view, and present some statistical methods deserving consideration and future study.
https://doi.org/10.5351/CKSS.2004.11.1.059 인용 PDF KSCI

Clustering Approaches to Identifying Gene Expression Patterns from DNA Microarray Data

Do, Jin Hwan;Choi, Dong-Kug
- Molecules and Cells
- /
- v.25 no.2
- /
- pp.279-288
- /
- 2008
The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.
KSCI

Gene Expression study of human chromosomal aneuploid

Lee Su-Man
- Proceedings of the Korean Society for Bioinformatics Conference
- /
- 2006.02a
- /
- pp.98-107
- /
- 2006
Chromosomal copy number changes (aneuploidies) are common in human populations. The extra chromosome can affect gene expression by whole-genome level. By gene expression microarray analysis, we want to find aberrant gene expression due to aneuploidies in Klinefelter (+X) and Down syndrome (+21). We have analyzed the inactivation status of X-linked genes in Klinefelter Syndrome (KS) by using X-linked cDNA microarray and cSNP analysis. We analyzed the expression of 190 X-linked genes by cDNA microarray from the lymphocytes of five KS patients and five females (XX) with normal males (XY) controls. cDNA microarray experiments and cSNP analysis showed the differentially expressed genes were similar between KS and XX cases. To analyze the differential gene expressions in Down Syndrome (DS), Amniotic Fluid (AF)cells were collected from 12 pregnancies at $16{\sim}18$ weeks of gestation in DS (n=6) and normal (n=6) subjects. We also analysis AF cells for a DNA microarray system and compared the chip data with two dimensional protein gel analysis of amniotic fluid. Our data may provide the basis for a more systematic identification of biological markers of fetal DS, thus leading to an improved understanding of pathogenesis for fetal DS.
PDF

Basic Concept of Gene Microarray (Gene Microarray의 기본개념)

Hwang, Seung Yong
- Korean Journal of Biological Psychiatry
- /
- v.8 no.2
- /
- pp.203-207
- /
- 2001
The genome sequencing project has generated and will continue to generate enormous amounts of sequence data including 5 eukaryotic and about 60 prokaryotic genomes. Given this ever-increasing amounts of sequence information, new strategies are necessary to efficiently pursue the next phase of the genome project-the elucidation of gene expression patterns and gene product function on a whole genome scale. In order to assign functional information to the genome sequence, DNA chip(or gene microarray) technology was developed to efficiently identify the differential expression pattern of independent biological samples. DNA chip provides a new tool for genome expression analysis that may revolutionize many aspects of biotechnology including new drug discovery and disease diagnostics.
PDF

Consensus Clustering for Time Course Gene Expression Microarray Data

Kim, Seo-Young;Bae, Jong-Sung
- Communications for Statistical Applications and Methods
- /
- v.12 no.2
- /
- pp.335-348
- /
- 2005
The rapid development of microarray technologies enabled the monitoring of expression levels of thousands of genes simultaneously. Recently, the time course gene expression data are often measured to study dynamic biological systems and gene regulatory networks. For the data, biologists are attempting to group genes based on the temporal pattern of their expression levels. We apply the consensus clustering algorithm to a time course gene expression data in order to infer statistically meaningful information from the measurements. We evaluate each of consensus clustering and existing clustering methods with various validation measures. In this paper, we consider hierarchical clustering and Diana of existing methods, and consensus clustering with hierarchical clustering, Diana and mixed hierachical and Diana methods and evaluate their performances on a real micro array data set and two simulated data sets.
https://doi.org/10.5351/CKSS.2005.12.2.335 인용 PDF KSCI

Statistical Analysis of Gene Expression Data

박태성
- Proceedings of the Korean Society for Bioinformatics Conference
- /
- 2001.10a
- /
- pp.97-115
- /
- 2001
cDNA microarray technology allows the monitoring of expression levels for thousands of genes simultaneously. Many statistical analysis tools become widely applicable to the analysis of cDNA microarray data. In this talk, we consider a two-way ANOVA model to differentiate genes that have high variability and ones that do not. Using this model, we detect genes that have different gene expression profiles among experimental groups. The two-way ANOVA model is illustrated using cDNA microarrays of 3,800 genes obtained in an experiment to search for changes in gene expression profiles during neuronal differentiation of cortical stem cells.
PDF

Poor Correlation Between the New Statistical and the Old Empirical Algorithms for DNA Microarray Analysis

Kim, Ju Han;Kuo, Winston P.;Kong, Sek-Won;Ohno-Machado, Lucila;Kohane, Isaac S.
- Genomics & Informatics
- /
- v.1 no.2
- /
- pp.87-93
- /
- 2003
DNA microarray is currently the most prominent tool for investigating large-scale gene expression data. Different algorithms for measuring gene expression levels from scanned images of microarray experiments may significantly impact the following steps of functional genomic analyses. $Affymetrix^{(R)}$ recently introduced high-density microarrays and new statistical algorithms in Microarray Suit (MAS) version 5.0$^{(R)}$. Very high correlations (0.92 - 0.97) between the new algorithms and the old algorithms (MAS 4.0) across several species and conditions were reported. We found that the column-wise array correlations had a tendency to be much higher than the row-wise gene correlations, which may be much more meaningful in the following higher-order data analyses including clustering and pattern analyses. In this paper, not only the detailed comparison of the two sets of algorithms is illustrated, but the impact of the introducing new algorithms on the further clustering analysis of microarray data and of possible pitfalls in mixing the old and the new algorithms were also described.
PDF KSCI

Screening and Clustering for Time-course Yeast Microarray Gene Expression Data using Gaussian Process Regression (효모 마이크로어레이 유전자 발현데이터에 대한 가우시안 과정 회귀를 이용한 유전자 선별 및 군집화)

Kim, Jaehee;Kim, Taehoun
- The Korean Journal of Applied Statistics
- /
- v.26 no.3
- /
- pp.389-399
- /
- 2013
This article introduces Gaussian process regression and shows its application with time-course microarray gene expression data. Gene screening for yeast cell cycle microarray expression data is accomplished with a ratio of log marginal likelihood that uses Gaussian process regression with a squared exponential covariance kernel function. Gaussian process regression fitting with each gene is done and shown with the nine top ranking genes. With the screened data the Gaussian model-based clustering is done and its silhouette values are calculated for cluster validity.
https://doi.org/10.5351/KJAS.2013.26.3.389 인용 PDF KSCI

Standard-based Integration of Heterogeneous Large-scale DNA Microarray Data for Improving Reusability

Jung, Yong;Seo, Hwa-Jeong;Park, Yu-Rang;Kim, Ji-Hun;Bien, Sang Jay;Kim, Ju-Han
- Genomics & Informatics
- /
- v.9 no.1
- /
- pp.19-27
- /
- 2011
Gene Expression Omnibus (GEO) has kept the largest amount of gene-expression microarray data that have grown exponentially. Microarray data in GEO have been generated in many different formats and often lack standardized annotation and documentation. It is hard to know if preprocessing has been applied to a dataset or not and in what way. Standard-based integration of heterogeneous data formats and metadata is necessary for comprehensive data query, analysis and mining. We attempted to integrate the heterogeneous microarray data in GEO based on Minimum Information About a Microarray Experiment (MIAME) standard. We unified the data fields of GEO Data table and mapped the attributes of GEO metadata into MIAME elements. We also discriminated non-preprocessed raw datasets from others and processed ones by using a two-step classification method. Most of the procedures were developed as semi-automated algorithms with some degree of text mining techniques. We localized 2,967 Platforms, 4,867 Series and 103,590 Samples with covering 279 organisms, integrated them into a standard-based relational schema and developed a comprehensive query interface to extract. Our tool, GEOQuest is available at http://www.snubi.org/software/GEOQuest/.
https://doi.org/10.5808/GI.2011.9.1.019 인용 PDF KSCI

Search Result 315, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)