Quality Control Usage in High-Density Microarrays Reveals Differential Gene Expression Profiles in Ovarian Cancer

  • Villegas-Ruiz, Vanessa (Experimental Oncology Laboratory, Research Department, National Institute of Pediatrics, Institute of Ophthalmology, "Conde de Valenciana") ;
  • Moreno, Jose (Research Direction, Juarez Hospital of Mexico) ;
  • Jacome-Lopez, Karina (Experimental Oncology Laboratory, Research Department, National Institute of Pediatrics, Institute of Ophthalmology, "Conde de Valenciana") ;
  • Zentella-Dehesa, Alejandro (Medicine Genomic and Environmental Toxicology Department, Biomedical Research Institute, UNAM) ;
  • Juarez-Mendez, Sergio (Experimental Oncology Laboratory, Research Department, National Institute of Pediatrics, Institute of Ophthalmology, "Conde de Valenciana")
  • Published : 2016.05.01


There are several existing reports of microarray chip use for assessment of altered gene expression in different diseases. In fact, there have been over 1.5 million assays of this kind performed over the last twenty years, which have influenced clinical and translational research studies. The most commonly used DNA microarray platforms are Affymetrix GeneChip and Quality Control Software along with their GeneChip Probe Arrays. These chips are created using several quality controls to confirm the success of each assay, but their actual impact on gene expression profiles had not been previously analyzed until the appearance of several bioinformatics tools for this purpose. We here performed a data mining analysis, in this case specifically focused on ovarian cancer, as well as healthy ovarian tissue and ovarian cell lines, in order to confirm quality control results and associated variation in gene expression profiles. The microarray data used in our research were downloaded from ArrayExpress and Gene Expression Omnibus (GEO) and analyzed with Expression Console Software using RMA, MAS5 and Plier algorithms. The gene expression profiles were obtained using Partek Genomics Suite v6.6 and data were visualized using principal component analysis, heat map, and Venn diagrams. Microarray quality control analysis showed that roughly 40% of the microarray files were false negative, demonstrating over- and under-estimation of expressed genes. Additionally, we confirmed the results performing second analysis using independent samples. About 70% of the significant expressed genes were correlated in both analyses. These results demonstrate the importance of appropriate microarray processing to obtain a reliable gene expression profile.


High density microarrays;gene expression profiles;quality control;ovarian cancer


Supported by : National Institute of Pediatrics


  1. Bicaku E, Xiong Y, Marchion DC, et al (2012). In vitro analysis of ovarian cancer response to cisplatin, carboplatin, and paclitaxel identifies common pathways that are also associated with overall patient survival. Br J Cancer, 106, 1967-75.
  2. Brazma A, Parkinson H, Sarkans U, et al (2003). ArrayExpress--a public repository for microarray gene expression data at the EBI. Nucleic Acids Res, 31, 68-71.
  3. Burgoon LD, Eckel-Passow JE, Gennings C, et al (2005). Protocols for the assurance of microarray data quality and process control. Nucleic Acids Res, 33, e172.
  4. Cai SY, Yang T, Chen Y, et al (2015). Gene expression profiling of ovarian carcinomas and prognostic analysis of outcome. J Ovarian Res, 8, 50.
  5. Chang C, Wang J, Zhao C, et al (2011). Maximizing biomarker discovery by minimizing gene signatures. BMC Genomics, 12 Suppl 5, S6.
  6. Dinalankara W, Bravo HC (2015). Gene expression signatures based on variability can robustly predict tumor progression and prognosis. Cancer Inform, 14, 71-81.
  7. Edgar R, Domrachev M, Lash AE (2002). Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res, 30, 207-10.
  8. Hanahan D, Weinberg RA (2000). The hallmarks of cancer. Cell, 100, 57-70.
  9. Hanahan D, Weinberg RA (2011). Hallmarks of cancer: the next generation. Cell, 144, 646-74.
  10. Heyn H, Esteller M (2012). DNA methylation profiling in the clinic: applications and challenges. Nat Rev Genet, 13, 679-92.
  11. Horikawa N, Baba T, Matsumura N, et al (2015). Genomic profile predicts the efficacy of neoadjuvant chemotherapy for cervical cancer patients. BMC Cancer, 15, 739.
  12. Irizarry RA, Bolstad BM, Collin F, et al (2003). Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res, 31, e15.
  13. Jemal A, Siegel R, Ward E, et al (2008). Cancer statistics, 2008. CA Cancer J Clin, 58, 71-96.
  14. Juarez-Mendez S, Zentella-Dehesa A, Villegas-Ruiz V, et al (2013). Splice variants of zinc finger protein 695 mRNA associated to ovarian cancer. J Ovarian Res, 6, 61.
  15. Knudsen S, Hother C, Gronbaek K, et al (2015). Development and blind clinical validation of a microRNA based predictor of response to treatment with R-CHO(E)P in DLBCL. PLoS One, 10, e0115538.
  16. Kolesnikov N, Hastings E, Keays M, et al (2015). ArrayExpress update--simplifying data submissions. Nucleic Acids Res, 43, D1113-6.
  17. Li MH, Fu SB, Xiao HS (2015). Genome-wide analysis of microRNA and mRNA expression signatures in cancer. Acta Pharmacol Sin, 36, 1200-11.
  18. Ma XJ, Dahiya S, Richardson E, et al (2009). Gene expression profiling of the tumor microenvironment during breast cancer progression. Breast Cancer Res, 11, R7.
  19. McCall MN, Murakami PN, Lukk M, et al (2011). Assessing affymetrix GeneChip microarray quality. BMC Bioinformatics, 12, 137.
  20. Panteris E, Swift S, Payne A, et al (2007). Mining pathway signatures from microarray data and relevant biological knowledge. J Biomed Inform, 40, 698-706.
  21. Parkinson H, Sarkans U, Shojatalab M, et al (2005). ArrayExpress--a public repository for microarray gene expression data at the EBI. Nucleic Acids Res, 33, D553-5.
  22. Pepper SD, Saunders EK, Edwards LE, et al (2007). The utility of MAS5 expression summary and detection call algorithms. BMC Bioinformatics, 8, 273.
  23. Placa JR, Bueno Rde B, Pinheiro DG, et al (2015). Gene expression analysis of laryngeal squamous cell carcinoma. Genom Data, 5, 9-12.
  24. Quackenbush J (2001). Computational analysis of microarray data. Nat Rev Genet, 2, 418-27.
  25. Rocca-Serra P, Brazma A, Parkinson H, et al (2003). ArrayExpress: a public database of gene expression data at EBI. C R Biol, 326, 1075-8.
  26. Rong G, Kang H, Wang Y, et al (2013). Candidate markers that associate with chemotherapy resistance in breast cancer through the study on Taxotere-induced damage to tumor microenvironment and gene expression profiling of carcinoma-associated fibroblasts (CAFs). PLoS One, 8, e70960.
  27. Seo J, Gordish-Dressman H, Hoffman EP (2006). An interactive power analysis tool for microarray hypothesis testing and generation. Bioinformatics, 22, 808-14.
  28. Sonachalam M, Shen J, Huang H, et al (2012). Systems biology approach to identify gene network signatures for colorectal cancer. Front Genet, 3, 80.
  29. Sun QL, Zhao CP, Wang TY, et al (2015). Expression profile analysis of long non-coding RNA associated with vincristine resistance in colon cancer cells by next-generation sequencing. Gene, 572, 79-86.
  30. Zakharkin SO, Kim K, Mehta T, et al (2005). Sources of variation in Affymetrix microarray experiments. BMC Bioinformatics, 6, 214.