New Approach to Predict microRNA Gene by using data Compression technique

  • Kim, Dae-Won (Department of Functional Genomics, University of Science and Technology, Genome Research Center, Korea Research Institute of Bioscience & Biotechnology) ;
  • Yang, Joshua SungWoo (Department of Functional Genomics, University of Science and Technology, National Genome Information Center, Korean Research Institutes of Bioscience & Biotechnology) ;
  • Kim, Pan-Jun (Department of Physics, Korea Advanced Institute of Science and Technology) ;
  • Chu, In-Sun (Department of Functional Genomics, University of Science and Technology, National Genome Information Center, Korean Research Institutes of Bioscience & Biotechnology) ;
  • Jeong, Ha-Woong (Department of Physics, Korea Advanced Institute of Science and Technology) ;
  • Park, Hong-Seog (Department of Functional Genomics, University of Science and Technology, Genome Research Center, Korea Research Institute of Bioscience & Biotechnology)
  • Published : 2005.09.22

Abstract

Over the past few years, the complex and subtle roles of microRNA (miRNA) in gene regulation have been increasingly appreciated. Computational approaches have played one of important roles in identifying miRNAs from plant and animals, as well as in predicting their putative gene target. We present a new approach of comprehensive analysis of the evolutionarily conserved element scores and applied data compression technique to detect putative miRNA genes. We used the evolutionarily conserved elements [19] (see more detail on method and material) to calculate for base-by-base along the candidate pre-miRNA gene region by detecting common conserved pattern from target sequence. We applied the data compression technique [20] to detect unknown miRNA genes. This zipping method devises, without loss of generality with respect to the nature of the character strings, a method to measure the similarity between the strings under consideration [20]. Our experience to using our new computational method for detecting miRNA gene identification (or miRNA gene prediction) has been stratified and we were able to find 28 putative miRNA genes.

Keywords