• Title/Summary/Keyword: EST clustering


Structural analysis of expressed sequence tags in immature seed of Oryza sativa L. (벼 미숙종자의 발현유전자 구조특성분석)

  • Yoon, Ung-Han;Lee, Gang-Seob;Lee, Jung-Sook;Hahn, Jang-Ho;Kim, Chang-Kug;Kikuchi, Shoshi;Satoh, Kouji;Kim, Jin-A;Lee, Jeong-Hwa;Lee, Tae-Ho;Kim, Yong-Hwan
    • Journal of Plant Biotechnology / v.36 no.2 / pp.130-136 / 2009
  • Rice (Oryza sativa) is the most important staple crop in Korea. With its small genome size of 389 Mb, rice is a model plant for genome research. We analyzed expressed sequence tag (EST) clones from immature seeds of rice (cv. Ilpum) at 20 days after heading. The 25,668 EST clones were clustered using the SeqMan program, and 7,509 were selected as unique clones. We compared the 7,509 unique clones with the KOME database, which includes 32,127 rice full-length cDNA (FL-cDNA) clones: 4,990 clones were homologous and 2,519 clones were non-homologous to FL-cDNA clones. In addition, we mapped the 7,509 cDNA clones against the TIGR rice pseudomolecules, version 5: 7,347 clones matched the pseudomolecules significantly, while 162 clones remained unmapped. To assign orthologous groups, we further analyzed the 7,509 EST clones from immature seeds using the NCBI Clusters of Orthologous Groups (COG) database. Of these, 4,968 clones were categorized into information storage and processing, cellular processes and signaling, metabolism, and poorly characterized genes, with 799 (14.89%), 1,536 (28.3%), 1,148 (21.2%) and 1,936 (35.7%) clones in these four categories, respectively.
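
The clone counts in the abstract above can be cross-checked with a few lines of arithmetic. The sketch below only restates the reported numbers; the variable names and the `percent` helper are illustrative assumptions, not part of the study.

```python
# Counts as reported in the abstract above.
total_ests = 25_668
unique_clones = 7_509
homologous, non_homologous = 4_990, 2_519   # versus KOME FL-cDNA clones
mapped, unmapped = 7_347, 162               # versus TIGR pseudomolecules v5

def percent(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

# Both partitions should add up to the number of unique clones.
assert homologous + non_homologous == unique_clones
assert mapped + unmapped == unique_clones

summary = {
    "unique_of_total": percent(unique_clones, total_ests),
    "homologous_of_unique": percent(homologous, unique_clones),
    "mapped_of_unique": percent(mapped, unique_clones),
}
print(summary)
```

Under these numbers, roughly 29% of the sequenced clones are unique, about two thirds of the unique clones have an FL-cDNA homolog, and nearly 98% map to the pseudomolecules.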

Analysis of Expressed Sequence Tags from the Wood-Decaying Fungus Fomitopsis palustris and Identification of Potential Genes Involved in the Decay Process

  • Karim, Nurul;Shibuya, Hajime;Kikuchi, Taisei
    • Journal of Microbiology and Biotechnology / v.21 no.4 / pp.347-358 / 2011
  • Fomitopsis palustris, a brown-rot basidiomycete, causes the most destructive type of decay in wooden structures. Despite its great economic importance, very little information is available at the molecular level regarding its complex decay process. To address this, we generated over 3,000 expressed sequence tags (ESTs) from a cDNA library constructed from F. palustris. Clustering of 3,095 high-quality ESTs resulted in a set of 1,403 putative unigenes comprising 485 contigs and 918 singlets. Homology searches based on BlastX analysis revealed that 78% of the F. palustris unigenes had a significant match to proteins deposited in the nonredundant databases. A subset of F. palustris unigenes showed similarity to carbohydrate-active enzymes (CAZymes), including a range of glycosyl hydrolase (GH) family proteins. Some of these CAZyme-encoding genes were previously undescribed for F. palustris but are predicted to have potential roles in the biodegradation of wood. Among them, we identified and characterized a gene (FpCel45A) encoding a GH family 45 endoglucanase. We also provide a functional classification of 473 (34%) of the F. palustris unigenes using the Gene Ontology hierarchy. The annotated EST data sets and related analysis may be useful in providing an initial insight into the genetic background of F. palustris.
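
As a rough illustration of how clustering splits ESTs into multi-member clusters (contigs) and singlets, here is a toy single-linkage sketch that joins reads sharing an exact k-mer. It is a simplified stand-in and not the assembler used in the study: the function names, the `k=8` default, and the example reads are assumptions, and real tools also handle reverse complements, sequencing errors, and overlap quality.

```python
from collections import defaultdict

def cluster_by_shared_kmers(seqs, k=8):
    """Group sequences that share at least one exact k-mer (single linkage)."""
    parent = list(range(len(seqs)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    first_seen = {}  # k-mer -> index of the first sequence containing it
    for idx, seq in enumerate(seqs):
        for pos in range(len(seq) - k + 1):
            kmer = seq[pos:pos + k]
            if kmer in first_seen:
                parent[find(idx)] = find(first_seen[kmer])
            else:
                first_seen[kmer] = idx

    groups = defaultdict(list)
    for idx in range(len(seqs)):
        groups[find(idx)].append(idx)
    return list(groups.values())

def summarize(clusters):
    """Count multi-member clusters (contigs) and single-member clusters (singlets)."""
    contigs = sum(1 for c in clusters if len(c) > 1)
    singlets = sum(1 for c in clusters if len(c) == 1)
    return {"contigs": contigs, "singlets": singlets, "unigenes": contigs + singlets}

reads = [
    "ATGGCGTACGTTAGCCTAGG",
    "CGTACGTTAGCCTAGGATTC",  # overlaps the first read
    "TTTTGGGGCCCCAAAATTGG",  # unrelated read
]
clusters = cluster_by_shared_kmers(reads)
print(summarize(clusters))
```

The unigene count is simply contigs plus singlets, which matches the bookkeeping in the abstract (485 + 918 = 1,403).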

Construction of web-based Database for Haliotis SNP (웹기반 전복류 (Haliotis) SNP 데이터베이스 구축)

  • Jeong, Ji-Eun;Lee, Jae-Bong;Kang, Se-Won;Baek, Moon-Ki;Han, Yeon-Soo;Choi, Tae-Jin;Kang, Jung-Ha;Lee, Yong-Seok
    • The Korean Journal of Malacology / v.26 no.2 / pp.185-188 / 2010
  • A web-based SNP database for the genus Haliotis was built on an Intel Server Platform ZSS130 with dual Xeon 3.2 GHz CPUs running a Linux-based (CentOS) operating system. Haliotis-related sequences (2,830 nucleotide sequences and 9,102 EST sequences) were downloaded through the NCBI taxonomy browser. To eliminate vector sequences, we performed a vector-masking step using the cross_match software with a vector sequence database. In addition, poly-A tails were removed using the trimest program from the EMBOSS package. The processed sequences were clustered and assembled with the TGICL package (TIGR tools), which incorporates the CAP3 software. A web-based interface (Haliotis SNP Database, http://www.haliotis.or.kr) was developed to enable optimal use of the clustered assemblies. The Clustering Res. menu shows the contig sequences from the clustering, the alignment results, and the sequences in each cluster. The BLAST menu allows any sequence to be compared with the Haliotis-related sequences. The search menu has its own search engine, so all of the information in the database can be searched by gene name, accession number and/or species name. Taken together, the web-based SNP database for Haliotis will be valuable for developing Haliotis SNPs in the future.
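
The poly-A removal step can be pictured with a minimal sketch. This is not the EMBOSS program itself, only a simplified regular-expression stand-in that strips an exact 3' poly-A run or 5' poly-T run; the function name and the `min_run` threshold are assumptions made for illustration.

```python
import re

def trim_poly_a(seq, min_run=4):
    """Remove a 3' poly-A run or a 5' poly-T run of at least min_run bases."""
    # Trailing A's (poly-A tail on the sense strand).
    seq = re.sub(r"A{%d,}$" % min_run, "", seq, flags=re.IGNORECASE)
    # Leading T's (the same tail seen on the reverse strand).
    seq = re.sub(r"^T{%d,}" % min_run, "", seq, flags=re.IGNORECASE)
    return seq

print(trim_poly_a("ATGCCGTAAAAAAAA"))
print(trim_poly_a("TTTTTTGGCATC"))
```

Runs shorter than `min_run` are left untouched, so a sequence that merely ends in a few A's is not truncated.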

Analysis of Expressed Sequence Tags from the Red Alga Griffithsia okiensis

  • Lee, Hyoung-Seok;Lee, Hong-Kum;An, Gyn-Heung;Lee, Yoo-Kyung
    • Journal of Microbiology / v.45 no.6 / pp.541-546 / 2007
  • Red algae are distributed globally, and the group contains several commercially important species. Griffithsia okiensis is one of the most extensively studied red algal species. In this study, we conducted expressed sequence tag (EST) analysis and synonymous codon usage analysis using cultured G. okiensis samples. A total of 1,104 cDNA clones were sequenced from a cDNA library made from samples collected at Dolsan Island, on the southern coast of Korea. Clustering analysis of these sequences identified 1,048 unigene clusters consisting of 36 consensus and 1,012 singleton sequences. BLASTX searches generated 532 significant hits (E-value < $10^{-4}$), and through further Gene Ontology analysis we constructed a functional classification of 434 unigenes. Our codon usage analysis showed that unigene clusters with more than three ESTs had higher GC contents (76.5%) at the third position of the codons than the singletons. Also, the majority of the optimal codons of G. okiensis and Chondrus crispus, which belong to Florideophycidae, were G-ending, whereas those of Porphyra yezoensis, which belongs to Bangiophycidae, were G-ending. An orthologous gene search against the P. yezoensis EST database identified 39 unigenes expressed in both rhodophytes, with putative functions in structural proteins, protein degradation, signal transduction, stress response, and physiological processes. Although the experiments were conducted on a limited scale, this study provides a material basis for the development of microarrays for gene expression studies, as well as useful information for the comparative genomic analysis of red algae.
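
The GC content at the third codon position (often called GC3) mentioned above can be computed as in the following minimal sketch. It assumes an in-frame coding sequence; the function name and the example sequence are illustrative and are not taken from the study.

```python
def gc3_content(cds):
    """Fraction of codons whose third base is G or C (in-frame CDS assumed)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # ignore a trailing partial codon
    third_bases = cds[2:usable:3]      # every third base, starting at index 2
    if not third_bases:
        raise ValueError("sequence shorter than one codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)

# Codons: ATG GCC AAG TAA -> third bases G, C, G, A
print(gc3_content("ATGGCCAAGTAA"))
```

Averaging this value separately over multi-EST clusters and over singletons gives the kind of comparison the abstract reports.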

Platform of Hot Pepper Defense Genomics: Isolation of Pathogen Responsive Genes in Hot Pepper (Capsicum annuum L.) Non-Host Resistance Against Soybean Pustule Pathogen (Xanthomonas axonopodis pv. glycines)

  • Lee, Sang-Hyeob;Park, Do-Il
    • The Plant Pathology Journal / v.20 no.1 / pp.46-51 / 2004
  • Host resistance is usually parasite-specific, restricted to particular pathogen races, and commonly expressed against specific pathogen genotypes. In contrast, resistance shown by an entire plant species to a species of pathogen is known as non-host resistance. Therefore, non-host resistance is the more common and broad form of disease resistance exhibited by plants. As a first step toward understanding the mechanism of non-host plant defense, expressed sequence tags (ESTs) were generated from a hot pepper leaf cDNA library constructed from combined leaves collected at different time points after inoculation with the non-host soybean pustule pathogen (Xanthomonas axonopodis pv. glycines; Xag). To increase gene diversity, ESTs were also generated from cDNA libraries constructed from anthers and flower buds. Among a total of 10,061 ESTs, 8,525 were of sufficient quality for further analysis. Clustering analysis revealed that 55% of all ESTs (4,685) occurred only once. BLASTX analysis revealed that 74% of the ESTs had significant sequence similarity to known proteins in the NCBI nr database. In addition, 1,265 ESTs were tentatively identified as full-length cDNAs. Functional classification of the ESTs derived from pathogen-infected pepper leaves revealed that about 25% were disease- or defense-related genes. Furthermore, 323 (7%) ESTs were tentatively identified as unique to hot pepper. This study represents the first analysis of sequence data from the hot pepper plant species. Although we focused on genes related to the plant defense response, our data will be useful for future comparative studies.

Analysis of germinating seed stage expressed sequence tags in Oryza sativa L. (벼 발아종자 발현유전자의 발현특성분석)

  • Yoon, Ung-Han;Lee, Gang-Seob;Kim, Chang-Kug;Lee, Jung-Sook;Hahn, Jang-Ho;Yun, Doh-Won;Ji, Hyeon-So;Lee, Tae-Ho;Lee, Jeong-Hwa;Park, Sung-Han;Kim, Gun-Wook;Seo, Mi-Suk;Kim, Yong-Hwan
    • Journal of Plant Biotechnology / v.36 no.3 / pp.281-288 / 2009
  • Seed germination is an important stage in which many genes are expressed to regulate energy metabolism, starch degradation and cell division as the seed leaves dormancy. For the functional analysis of seed germination mechanisms, we analyzed rice cDNA clones (Oryza sativa cultivar Ilpum) obtained from seeds imbibed for 48 hours. A total of 18,101 expressed sequence tags (ESTs) were clustered using the SeqMan program, and 8,836 clones were identified as unique clones. From the full-length cDNA analysis, we identified a chitinase gene specifically expressed during seed germination and an amylase gene involved in starch degradation, and several genes were registered in NCBI GenBank. To analyze the genes commonly expressed in immature and germinating seeds, 25,668 immature-seed ESTs and 18,101 germinating-seed ESTs were clustered using the SeqMan program, and 2,514 clones were identified as commonly expressed unigenes. Among them, alpha-globulin and alcohol dehydrogenase I were presumed to be LEA genes expressed only in the immature and germinating seed stages. To assign orthologous groups, we further analyzed the 8,836 EST clones from germinating seeds using the NCBI Clusters of Orthologous Groups (COG) database. Of these, 5,076 clones were categorized into information storage and processing, cellular processes and signaling, metabolism, and poorly characterized genes, with 783 (14.29%), 1,484 (27%), 1,363 (24.8%) and 1,869 (34%) clones in these four categories, respectively.

Molecular cloning and expression pattern of Metallothionein Gene from the left-handed shell, Physa acuta (왼돌아물달팽이 (Physa acuta) 의 Metallothionein 유전자 클로닝 및 발현양상)

  • Jo, Yong-Hun;Baek, Moon-Ki;Kang, Se-Won;Lee, Jae-Bong;Byun, In-Seon;Choi, Sang-Haeng;Chae, Sung-Hwa;Kang, Jung-Ha;Han, Yeon-Soo;Park, Hong-Seog;Lee, Yong-Seok
    • The Korean Journal of Malacology / v.25 no.3 / pp.223-230 / 2009
  • Metallothioneins (MTs) play a key role in metal homeostasis and detoxification in most living organisms. To study the biological functions and significance of MT in a snail, we cloned and partially characterized the MT gene from the left-handed snail, Physa acuta, which has been regarded as a potential biomonitoring species for fresh water. The complete PaMT cDNA sequence was identified from the expressed sequence tag (EST) sequencing project of Physa acuta. The 180 bp coding region gives 60 amino acid residues, including the initiation methionine and termination codon. Clustering and phylogenetic analysis of PaMT with other MT amino acid sequences showed that it shares identity with Helix pomatia (60%), Arianta arbustorum (58%), Perna viridis (49%), Mytilus edulis (49%), Bathymodiolus azoricus (49%), Bathymodiolus azoricus (48%) and Bathymodiolus sp. FD-2002 (48%). A time-course analysis of PaMT in P. acuta exposed to cadmium (50 ppb) indicated that PaMT was induced 4-8 hr after exposure. PaMT remains to be developed further as a potential biomarker for contamination of fresh water.
