• Title/Summary/Keyword: representing sequence

Search Result 178, Processing Time 0.03 seconds

A Survey of the Brassica rapa Genome by BAC-End Sequence Analysis and Comparison with Arabidopsis thaliana

  • Hong, Chang Pyo;Plaha, Prikshit;Koo, Dal-Hoe;Yang, Tae-Jin;Choi, Su Ryun;Lee, Young Ki;Uhm, Taesik;Bang, Jae-Wook;Edwards, David;Bancroft, Ian;Park, Beom-Seok;Lee, Jungho;Lim, Yong Pyo
    • Molecules and Cells
    • /
    • v.22 no.3
    • /
    • pp.300-307
    • /
    • 2006
  • Brassica rapa ssp. pekinensis (Chinese cabbage) is an economically important crop and a model plant for studies on polyploidization and phenotypic evolution. To gain an insight into the structure of the B. rapa genome we analyzed 12,017 BAC-end sequences for the presence of transposable elements (TEs), SSRs, centromeric satellite repeats and genes, and similarity to the closely related genome of Arabidopsis thaliana. TEs were estimated to occupy 14% of the genome, with 12.3% of the genome represented by retrotransposons. It was estimated that the B. rapa genome contains 43,000 genes, 1.6 times greater than the genome of A. thaliana. A number of centromeric satellite sequences, representing variations of a 176-bp consensus sequence, were identified. This sequence has undergone rapid evolution within the B. rapa genome and has diverged among the related species of Brassicaceae. A study of SSRs demonstrated a non-random distribution with a greater abundance within predicted intergenic regions. Our results provide an initial characterization of the genome of B. rapa and provide the basis for detailed analysis through whole-genome sequencing.

Genetic diversity and phylogenetic analysis of genus Paeonia based on nuclear ribosomal DNA ITS sequence

  • Sun, Yan-Lin;Hong, Soon-Kwan
    • Journal of Plant Biotechnology
    • /
    • v.38 no.3
    • /
    • pp.234-240
    • /
    • 2011
  • The genus Paeonia belongs to the family Paeoniaceae having significant medicinal and ornamental importance. The present investigation was undertaken with an aim to understand phylogenetic relationships of three Paeonia species (P. lactiflora, P. obovata, and P. suffruticosa) that are widely distributed in China, Korea, and Japan, using nuclear ribosomal DNA (nrDNA) internal transcribed spacer (ITS) sequence and to compare the phylogeny results with investigations reported earlier using existed sequences of the same species. The size variation obtained among sequenced nrDNA ITS region was narrow and ranged from 722 to 726 bp. The highest interspecific genetic distance (GD) was found between P. lactiflora and P. suffruticosa or P. obovata. The phylogram obtained using our nrDNA ITS sequences showed non-congruence with previous hypothesis of the phylogeny between section Paeonia and section Moutan of genus Paeonia. This result was supported by the phylogenetic relations showed in the phylogram constructed with existed sequences in NCBI. The present study suggested that P. obovata belonging to section Paeonia was phylogenetically closer to P. suffruticosa representing section Moutan of genus Paeonia than P. lactiflora belonging to section Paeonia. The main reason of the paraphyly of section Paeonia is thought to be nucleotide additivity directly caused by origin hybridization. This study provides more sequence sources of genus Paeonia, and will help for further studies in intraspecies population, and their phylogentic analysis and molecular evolution.

An Efficient Shape-Feature Computing Method from Boundary Sequences of Arbitrary Shapes (임의 형상의 윤곽선 시퀀스 정보로부터 형상 특징의 효율적인 연산 방법)

  • 김성옥;김동규;김민환
    • Journal of Korea Multimedia Society
    • /
    • v.5 no.3
    • /
    • pp.255-262
    • /
    • 2002
  • A boundary sequence can be a good representation of arbitrary shapes, because it can represent them simply and precisely. However, boundary sequences have not been used as a representation of arbitrary shapes, because the pixel-based shape-features such as area, centroid, orientation, projection and so forth, could not be computed directly from them. In this paper, we show that the shape-features can be easily computed from the boundary sequences by introducing the cross-sections that are defined as vertical (or horizontal) line segments in a shape. A cross-section generation method is proposed, which generates cross-sections of the shape efficiently by tracing the boundary sequence of the shape once. Furthermore, a boundary sequence extraction method is also proposed, which generates a boundary sequence for each shape in a binary image automatically The proposed methods work well even if a shape has holes. Eventually, we show that a boundary sequence can be used effectively for representing arbitrary shapes.

  • PDF

Sequence Diversity in MIC6 Gene among Toxoplasma gondii Isolates from Different Hosts and Geographical Locations

  • Li, Zhong-Yuan;Song, Hui-Qun;Chen, Jia;Zhu, Xing-Quan
    • Parasites, Hosts and Diseases
    • /
    • v.53 no.3
    • /
    • pp.341-344
    • /
    • 2015
  • Toxoplasma gondii is an opportunistic protozoan parasite that can infect almost all warm-blooded animals including humans with a worldwide distribution. Micronemes play an important role in invasion process of T. gondii, associated with the attachment, motility, and host cell recognition. In this research, sequence diversity in microneme protein 6 (MIC6) gene among 16 T. gondii isolates from different hosts and geographical regions and 1 reference strain was examined. The results showed that the sequence of all the examined T. gondii strains was 1,050 bp in length, and their A + T content was between 45.7% and 46.1%. Sequence analysis presented 33 nucleotide mutation positions (0-1.1%), resulting in 23 amino acid substitutions (0-2.3%) aligned with T. gondii RH strain. Moreover, T. gondii strains representing the 3 classical genotypes (Type I, II, and III) were separated into different clusters based on the locus of MIC6 using phylogenetic analyses by Bayesian inference (BI), maximum parsimony (MP), and maximum likelihood (ML), but T. gondii strains belonging to ToxoDB #9 were separated into different clusters. Our results suggested that MIC6 gene is not a suitable marker for T. gondii population genetic studies.

Presence of Diverse Sugarcane Bacilliform Viruses Infecting Sugarcane in China Revealed by Pairwise Sequence Comparisons and Phylogenetic Analysis

  • Ahmad, Kashif;Sun, Sheng-Ren;Chen, Jun-Lu;Huang, Mei-Ting;Fu, Hua-Ying;Gao, San-Ji
    • The Plant Pathology Journal
    • /
    • v.35 no.1
    • /
    • pp.41-50
    • /
    • 2019
  • Sugarcane bacilliform viruses (SCBV), which belong to the genus Badnavirus, family Caulimoviridae, are an important DNA virus complex that infects sugarcane. To explore the genetic diversity of the sugarcane-infecting badnavirus complex in China, we tested 392 sugarcane leaf samples collected from Fujian, Yunnan, and Hainan provinces for the occurrence of SCBV by polymerase chain reaction (PCR) assays using published primers SCBV-F and SCBV-R that target the reverse transcriptase/ribonuclease H (RT/RNase H) regions of the viral genome. A total of 111 PCR-amplified fragments (726 bp) from 63 SCBV-positive samples were cloned and sequenced. A neighbor-joining phylogenetic tree was constructed based on the SCBV sequences from this study and 34 published sequences representing 18 different phylogroups or genotypes (SCBV-A to -R). All SCBV-tested isolates could be classified into 20 SCBV phylogenetic groups from SCBV-A to -T. Of nine SCBV phylogroups reported in this study, two novel phylogroups, SCBV-S and SCBV-T, that share 90.0-93.2% sequence identity and show 0.07-0.11 genetic distance with each other in the RT/RNase H region, are proposed. SCBV-S had 57.6-92.2% sequence identity and 0.09-0.66 genetic distance, while SCBV-T had 58.4-90.0% sequence identity and 0.11-0.63 genetic distance compared with the published SCBV phylogroups. Additionally, two other Badnavirus species, Sugarcane bacilliform MO virus (SCBMOV) and Sugarcane bacilliform IM virus (SCBIMV), which originally clustered in phylogenetic groups SCBV-E and SCBV-F, respectively, are first reported in China. Our findings will help to understand the level of genetic heterogeneity present in the complex of Badnavirus species that infect sugarcane.

Application of Sequence Diagrams to the Reverse Engineering Process of the ESf-ccs

  • Hasan, Md. Mehedi;Elakrat, Mohamed;Mayaka, Joyce;Jung, Jae Cheon
    • Journal of the Korean Society of Systems Engineering
    • /
    • v.15 no.1
    • /
    • pp.1-8
    • /
    • 2019
  • Reverse engineering involves examining a system or component so as to comprehend its structure, functionality, and operation. Creation of a system model in reverse engineering can serve several purposes: test generation, change impact analysis, and the creation of a new or modified system. When attempting to reverse engineering a system, often the most readily accessible information is the system description, which does not readily lend itself to use in Model Based System Engineering (MBSE). Therefore, it is necessary to be able to transform this description into a diagram, which clearly depicts the behavior of the system as well as the interaction between components. This study demonstrates how sequence diagrams can be extracted from the systems description. Using MBSE software, the sequence diagrams for the Engineered Safety Features Component Control System (ESF-CCS) of the Nuclear Power Plant are created. Sequence diagrams are chosen because they are a means of representing the systems behavior and the interaction between components. In addition, from these diagrams, the system's functional requirements can be elicited. These diagrams then serve as the baseline of the reverse engineering process and multiple system views are subsequently be created from them, thus speeding up the development process. In addition, the use of MBSE ensures that any additional information obtained from auxiliary sources can then be input into the system model, ensuring data consistency.

Molecular systematics of Poaceae based on eight chloroplast markers, emphasizing the phylogenetic positions of Korean taxa

  • LEE, Jung-Hoon;KIM, Ki-Joong;KIM, Bo-Yun;KIM, Young-Dong
    • Korean Journal of Plant Taxonomy
    • /
    • v.52 no.3
    • /
    • pp.127-143
    • /
    • 2022
  • This study was conducted to clarify the phylogenetic position and relationships of Korean Poaceae taxa. A total of 438 taxa including 155 accessions of Korean Poaceae (representing 92% and 72% of Korean Poaceous genera and species, respectively) were employed for phylogeny reconstruction. Sequence data of eight chloroplast DNA markers were used for molecular phylogenetic analyses. The resulted phylogeny was mostly concordant with previous phylogenetic hypotheses, especially in terms of subfamilial and tribal relationships. Several taxa-specific indels were detected in the molecular phylogeny, including a 45 bp deletion in rps3 (PACMAD [Panicoideae, Arundinoideae, Chloridoideae, Micrairoideae, Aristidoideae, Danthonioideae] clade), a 15 bp deletion in ndhF (Oryzeae + Phyllorachideae), a 6 bp deletion in trnLF (Poeae s.l.), and two (17 bp and 378 bp) deletions in atpF-H (Pooideae). The Korean Poaceae members were classified into 23 tribes, representing eight subfamilies. The subfamilial and tribal classifications of the Korean taxa were generally congruent with a recently published system, whereas some subtribes and genera were found to be non-monophyletic. The taxa included in the PACMAD clade (especially Andropogoneae) showed very weak and uncertain phylogenetic relationships, presumably to be due to evolutionary radiation and polyploidization. The reconstructed phylogeny can be utilized to update the taxonomic positions of the newly examined grass accessions.

Zero-suppressed ternary decision diagram algorithm for solving noncoherent fault trees in probabilistic safety assessment of nuclear power plants

  • Woo Sik Jung
    • Nuclear Engineering and Technology
    • /
    • v.56 no.6
    • /
    • pp.2092-2098
    • /
    • 2024
  • Probabilistic safety assessment (PSA) plays a critical role in ensuring the safe operation of nuclear power plants. In PSA, event trees are developed to identify accident sequences that could lead to core damage. These event trees are then transformed into a core-damage fault tree, wherein the accident sequences are represented by usual and complemented logic gates representing failed and successful operations of safety systems, respectively. The core damage frequency (CDF) is estimated by calculating the minimal cut sets (MCSs) of the core-damage fault tree. Delete-term approximation (DTA) is commonly employed to approximately solve MCSs representing accident sequence logics from noncoherent core-damage fault trees. However, DTA can lead to an overestimation of CDF, particularly when fault trees contain many nonrare events. To address this issue, the present study introduces a new zero-suppressed ternary decision diagram (ZTDD) algorithm that averts the CDF overestimation caused by DTA. This ZTDD algorithm can optionally calculate MCSs with DTA or prime implicants (PIs) without any approximation from the core-damage fault tree. By calculating PIs, accurate CDF can be calculated. The present study provides a comprehensive explanation of the ZTDD structure, formula of the ZTDD algorithm, ZTDD minimization, probability calculation from ZTDD, strength of the ZTDD algorithm, and ZTDD application results. Results reveal that the ZTDD algorithm is a powerful tool that can quickly and accurately calculate CDF and drastically improve the safety of nuclear power plants.

Construction of Large Library of Protein Fragments Using Inter Alpha-carbon Distance and Binet-Cauchy Distance (내부 알파탄소간 거리와 비네-코시 거리를 사용한 대규모 단백질 조각 라이브러리 구성)

  • Chi, Sang-mun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.19 no.12
    • /
    • pp.3011-3016
    • /
    • 2015
  • Representing protein three-dimensional structure by concatenating a sequence of protein fragments gives an efficient application in analysis, modeling, search, and prediction of protein structures. This paper investigated the effective combination of distance measures, which can exploit large protein structure database, in order to construct a protein fragment library representing native protein structures accurately. Clustering method was used to construct a protein fragment library. Initial clustering stage used inter alpha-carbon distance having low time complexity, and cluster extension stage used the combination of inter alpha-carbon distance, Binet-Cauchy distance, and root mean square deviation. Protein fragment library was constructed by leveraging large protein structure database using the proposed combination of distance measures. This library gives low root mean square deviation in the experiments representing protein structures with protein fragments.

Genomic Distribution of Simple Sequence Repeats in Brassica rapa

  • Hong, Chang Pyo;Piao, Zhong Yun;Kang, Tae Wook;Batley, Jacqueline;Yang, Tae-Jin;Hur, Yoon-Kang;Bhak, Jong;Park, Beom-Seok;Edwards, David;Lim, Yong Pyo
    • Molecules and Cells
    • /
    • v.23 no.3
    • /
    • pp.349-356
    • /
    • 2007
  • Simple Sequence Repeats (SSRs) represent short tandem duplications found within all eukaryotic organisms. To examine the distribution of SSRs in the genome of Brassica rapa ssp. pekinensis, SSRs from different genomic regions representing 17.7 Mb of genomic sequence were surveyed. SSRs appear more abundant in non-coding regions (86.6%) than in coding regions (13.4%). Comparison of SSR densities in different genomic regions demonstrated that SSR density was greatest within the 5'-flanking regions of the predicted genes. The proportion of different repeat motifs varied between genomic regions, with trinucleotide SSRs more prevalent in predicted coding regions, reflecting the codon structure in these regions. SSRs were also preferentially associated with gene-rich regions, with peri-centromeric heterochromatin SSRs mostly associated with retrotransposons. These results indicate that the distribution of SSRs in the genome is non-random. Comparison of SSR abundance between B. rapa and the closely related species Arabidopsis thaliana suggests a greater abundance of SSRs in B. rapa, which may be due to the proposed genome triplication. Our results provide a comprehensive view of SSR genomic distribution and evolution in Brassica for comparison with the sequenced genomes of A. thaliana and Oryza sativa.