Computational Analysis of Neighboring Genes on Arabidopsis thaliana Chromosomes 4 and 5: Their Genomic Association as Functional Subunits

  • Goh, Sung-Ho (Laboratory of Plant Genomics Center, Korea Research Institute of Bioscience and Biotechnology)
  • Kim, Tae-Hyung (Laboratory of Plant Genomics Center, Korea Research Institute of Bioscience and Biotechnology)
  • Kim, Jee-Hyub (Laboratory of Plant Genomics Center, Korea Research Institute of Bioscience and Biotechnology)
  • Nam, DouGu (Laboratory of Plant Genomics Center, Korea Research Institute of Bioscience and Biotechnology)
  • Choi, Doil (Laboratory of Plant Genomics Center, Korea Research Institute of Bioscience and Biotechnology)
  • Hur, Cheol-Goo (Laboratory of Plant Genomics Center, Korea Research Institute of Bioscience and Biotechnology)
  • Published : 2003.09.01

Abstract

In bacteria, genes related to specific events or pathways are frequently located close to one another in the genome, as in the structures known as operons, whereas eukaryotic genes appear to be independent of their neighbors and dispersed randomly throughout the genome. Although such cases are rare, the discovery of structures similar to prokaryotic operons in the nematode genome, and of the clustering of housekeeping genes in the human genome, led us to assess the genomic association of genes as functional subunits. We evaluated the genomic association of neighboring genes on chromosomes 4 and 5 of Arabidopsis thaliana, both with and without consideration of scaffold/matrix-attached region (S/MAR) loci. The observed numbers of functionally identical bigrams and trigrams were significantly higher than expected, and these results were verified statistically by calculating p-values against weighted random distributions. The observed frequencies of functionally identical bigrams and trigrams were much higher on chromosome 4 than on chromosome 5, but within each chromosome the frequencies with and without consideration of S/MARs were similar. These results suggest a genomic association among functionally related neighboring genes in Arabidopsis thaliana.
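
The counting and significance test summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes that each gene carries a single functional category label, that a "functionally identical" bigram or trigram is a run of two or three adjacent genes sharing that label, and that the "weighted random distribution" can be approximated by drawing labels at random with probability proportional to their observed frequency. The function names, the toy gene list, and the add-one correction are all hypothetical choices made for illustration, and S/MAR boundaries are not modeled.

```python
import random
from collections import Counter


def count_identical_ngrams(labels, n):
    """Count windows of n consecutive genes that all share one functional label."""
    return sum(
        1
        for i in range(len(labels) - n + 1)
        if len(set(labels[i:i + n])) == 1
    )


def empirical_p_value(labels, n, trials=1000, seed=0):
    """Estimate P(random count >= observed) under frequency-weighted random labels.

    Assumption: the null model draws each gene's label independently, weighted
    by how often that label occurs in the observed data.
    """
    rng = random.Random(seed)
    observed = count_identical_ngrams(labels, n)
    freq = Counter(labels)
    categories = list(freq)
    weights = [freq[c] for c in categories]
    hits = 0
    for _ in range(trials):
        simulated = rng.choices(categories, weights=weights, k=len(labels))
        if count_identical_ngrams(simulated, n) >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from an exact zero.
    return observed, (hits + 1) / (trials + 1)


# Hypothetical toy chromosome: one functional category per gene, in genomic order.
genes = ["metabolism"] * 4 + ["transport", "signaling"] * 3 + ["transcription"] * 3
observed_bigrams, p_bigram = empirical_p_value(genes, n=2)
observed_trigrams, p_trigram = empirical_p_value(genes, n=3)
```

In a real analysis the label list would come from the annotated gene order of a chromosome, and a small p-value would indicate more functionally identical neighbors than the frequency-weighted null model produces by chance.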
