• Title/Summary/Keyword: representing sequence


Two Enteropathogenic Escherichia coli Strains Representing Novel Serotypes and Investigation of Their Roles in Adhesion

  • Wang, Jing;Jiao, HongBo;Zhang, XinFeng;Zhang, YuanQing;Sun, Na;Yang, Ying;Wei, Yi;Hu, Bin;Guo, Xi
    • Journal of Microbiology and Biotechnology / v.31 no.9 / pp.1191-1199 / 2021
  • Enteropathogenic Escherichia coli (EPEC), which belongs to the attaching and effacing diarrheagenic E. coli strains, is a major causative agent of life-threatening diarrhea in infants in developing countries. Most EPEC isolates correspond to certain O serotypes; however, many strains are non-typeable. Two EPEC strains, EPEC001 and EPEC080, which could not be serotyped during routine detection, were isolated. In this study, we conducted an in-depth characterization of their putative O-antigen gene clusters (O-AGCs) and also performed mutagenesis of the O-AGCs for functional analysis of O-antigen (OAg) synthesis. Sequence analysis revealed that the occurrence of O-AGCs in EPEC001 and E. coli O132 may be mediated by recombination between them, and EPEC080 and E. coli O2/O50 might acquire each O-AGC from uncommon ancestors. We also showed that OAg-knockout bacteria were highly adhesive in vitro, except for the EPEC001 wzy derivative, whose adherent capability was less than that of its wild-type strain, providing direct evidence that OAg plays a key role in EPEC pathogenesis. Together, we identified two EPEC O serotypes in silico and experimentally, and we also studied the adherent capabilities of their OAgs, which highlighted the fundamental and pathogenic role of OAg in EPEC.

Sequence characterization and polymorphism of melanocortin 1 receptor gene in some goat breeds with different coat color of Mongolia

  • Ganbold, Onolragchaa;Manjula, Prabuddha;Lee, Seung-Hwan;Paek, Woon Kee;Seo, Dongwon;Munkhbayar, Munkhbaatar;Lee, Jun Heon
    • Asian-Australasian Journal of Animal Sciences / v.32 no.7 / pp.939-948 / 2019
  • Objective: The Extension and Agouti loci play a key role in the proportions of eumelanin and pheomelanin that determine coat color in several species, including goats. Mongolian goats exhibit diverse coat color phenotypes. In this study, the melanocortin 1 receptor (MC1R) coding region was investigated in Mongolian goats with different coat colors to ascertain the presence of the extension allele. Methods: A total of 105 goat samples representing three goat breeds were collected for this study from middle Mongolia. A 938 base pair (bp) long coding region of the MC1R gene was sequenced for three different breeds with different coat colors (Gobi Gurwan Saikhan: complete black, Zalaa Jinstiin Tsagaan: complete white, Mongolian native goat: admixture of different coat colors). The genotypes of these goats were obtained by analyzing and comparing the sequencing results. Results: A total of seven haplotypes defined by five substitutions were identified. The five single nucleotide polymorphisms included two synonymous mutations (c.183C>T and c.489G>A) and three missense (non-synonymous) mutations (c.676A>G, c.748T>G, and c.770T>A). Comparison of the genotype frequencies of two common missense mutations using the chi-square ($\chi^2$) test revealed significant differences between coat color groups (p<0.001). A logistic regression analysis additionally suggested a highly significant association between genotypes and the black versus white uniform coat colors. In addition, most investigated goats (60.4%) belonged to the H2 (TGAGT) haplotype. Conclusion: According to the findings obtained in this study on the investigated coat colors, mutations in the MC1R gene may play a crucial role in determining eumelanin and pheomelanin phenotypes. Because coat color phenotypes are complex, more detailed investigation is needed.

Correlation-based and feature-driven mutation signature analyses to identify genetic features associated with DNA mutagenic processes in cancer genomes

  • Jeong, Hye Young;Yoo, Jinseon;Kim, Hyunwoo;Kim, Tae-Min
    • Genomics & Informatics / v.19 no.4 / pp.40.1-40.11 / 2021
  • Mutation signatures represent unique sequence footprints of somatic mutations resulting from specific DNA mutagenic and repair processes. However, their causal associations and the potential utility for genome research remain largely unknown. In this study, we performed PanCancer-scale correlative analyses to identify the genomic features associated with tumor mutation burdens (TMB) and individual mutation signatures. We observed that TMB was correlated with tumor purity, ploidy, and the level of aneuploidy, as well as with the expression of cell proliferation-related genes representing genomic covariates in evaluating TMB. Correlative analyses of mutation signature levels with genes belonging to specific DNA damage-repair processes revealed that deficiencies of NHEJ1 and ALKBH3 may contribute to mutations in the settings of APOBEC cytidine deaminase activation and DNA mismatch repair deficiency, respectively. We further employed a strategy to identify feature-driven, de novo mutation signatures and demonstrated that mutation signatures can be reconstructed using known causal features. Using the strategy, we further identified tumor hypoxia-related mutation signatures similar to the APOBEC-related mutation signatures, suggesting that APOBEC activity mediates hypoxia-related mutational consequences in cancer genomes. Our study advances the mechanistic insights into the TMB and signature-based DNA mutagenic and repair processes in cancer genomes. We also propose that feature-driven mutation signature analysis can further extend the categories of cancer-relevant mutation signatures and their causal relationships.

Analysis of Amyloid Beta 1-16 (Aβ16) Monomer and Dimer Using Electrospray Ionization Mass Spectrometry with Collision-Induced Dissociation

  • Kim, Kyoung Min;Kim, Ho-Tae
    • Mass Spectrometry Letters / v.13 no.4 / pp.177-183 / 2022
  • The monomer and dimer structures of the amyloid fragment Aβ(1-16) sequence formed in H2O were investigated using electrospray ionization mass spectrometry (MS) and tandem MS (MS/MS). Aβ16 monomers and dimers were indicated by signals representing multiple proton adduct forms, [monomer+zH]z+ (=Mz+, z = charge state) and [dimer+zH]z+ (=Dz+), in the MS spectrum. Fragment ions of monomers and dimers were observed using collision-induced dissociation MS/MS. Peptide bond dissociation was mostly observed in the D1-D7 and V11-K16 regions of the MS/MS spectra for the monomer (or dimer), regardless of the monomer (or dimer) charge state. Both covalent and non-covalent bond dissociation processes were indicated by the MS/MS results for the dimers. During the non-covalent bond dissociation process, the D3+ dimer complex was separated into two components: the M1+ and M2+ subunits. During the covalent bond dissociation of the D3+ dimer complex, the b and y fragment ions attached to the monomer, (M+b10-15)z+ and (M+y9-15)z+, were thought to originate from the dissociation of the M2+ monomer component of the (M1++M2+) complex. Two different D3+ complex geometries are proposed, resulting from interactions between the M1+ monomer and two different regions of M2+ (the N-terminus and the C-terminus). Intricate fragmentation patterns were observed in the MS/MS spectrum of the D5+ complex. The complicated nature of the MS/MS spectrum is attributable to the coexistence of two D5+ configurations, (M1++M4+) and (M2++M3+), in the Aβ16 solution.

Probabilistic Distribution and Variability of Geotechnical Properties with Randomness Characteristic (무작위성을 보이는 지반정수의 확률분포 및 변동성)

  • Kim, Dong-Hee;Lee, Ju-Hyoung;Lee, Woo-Jin
    • Journal of the Korean Geotechnical Society / v.25 no.11 / pp.87-103 / 2009
  • Determining a reliable probabilistic distribution model of geotechnical properties requires, in sequence, outlier and randomness tests on the analysis data, parameter estimation for the probabilistic distribution model, and goodness-of-fit tests for the model parameters and the probabilistic distribution model. In this paper, the probabilistic distribution models of geotechnical properties in the Songdo area of Incheon are estimated by this procedure. The coefficient of variation (COV), which represents the variability of geotechnical properties, is also determined for several geotechnical properties. Reliable probabilistic distribution models and COVs of geotechnical properties can be used in probability-based design procedures and for a reasonable choice of design values in deterministic design methods.
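
The coefficient of variation mentioned in this abstract is the standard deviation divided by the mean. The following is a minimal sketch of that ratio, not the authors' procedure; the function name `coefficient_of_variation` and the sample values are hypothetical and chosen only for illustration.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Return the sample standard deviation divided by the mean."""
    m = mean(values)
    if m == 0:
        raise ValueError("COV is undefined when the mean is zero")
    return stdev(values) / m


# Hypothetical measurements of one soil property (not data from the paper).
samples = [18.0, 20.0, 22.0]
print(coefficient_of_variation(samples))
```

A dimensionless ratio like this lets the variability of properties measured in different units be compared directly.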

Composition and functional diversity of bacterial communities during swine carcass decomposition

  • Michelle Miguel;Seon-Ho Kim;Sang-Suk Lee;Yong-Il Cho
    • Animal Bioscience / v.36 no.9 / pp.1453-1464 / 2023
  • Objective: This study investigated the changes in bacterial communities within decomposing swine microcosms, comparing soil with or without intact microbial communities, and under aerobic and anaerobic conditions. Methods: The experimental microcosms consisted of four conditions: UA, unsterilized soil-aerobic condition; SA, sterilized soil-aerobic condition; UAn, unsterilized soil-anaerobic condition; and SAn, sterilized soil-anaerobic condition. The microcosms were prepared by mixing 112.5 g of soil and 37.5 g of ground carcass, which were then placed in sterile containers. The carcass-soil mixture was sampled on days 0, 5, 10, 30, and 60 of decomposition, and the bacterial communities that formed during carcass decomposition were assessed using Illumina MiSeq sequencing of the 16S rRNA gene. Results: A total of 1,687 amplicon sequence variants representing 22 phyla and 805 genera were identified in the microcosms. The Chao1 and Shannon diversity indices varied between microcosms at each period (p<0.05). Metagenomic analysis showed variation in the taxa composition across the burial microcosms during decomposition, with Firmicutes being the dominant phylum, followed by Proteobacteria. At the genus level, Bacillus and Clostridium were the main genera within Firmicutes. Functional prediction revealed that the most abundant Kyoto Encyclopedia of Genes and Genomes metabolic functions were carbohydrate and amino acid metabolism. Conclusion: This study demonstrated a higher bacterial diversity in UA and UAn microcosms than in SA and SAn microcosms. In addition, the taxonomic composition of the microbial community also exhibited changes, highlighting the impact of soil sterilization and oxygen on carcass decomposition. Furthermore, this study provided insights into the microbial communities associated with decomposing swine carcasses in microcosms.
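
The Shannon and Chao1 indices named in this abstract can be computed from a vector of per-taxon read counts. The sketch below uses the textbook definitions (natural-log Shannon entropy and the bias-corrected Chao1 estimator); it is an illustration under those assumptions, not the pipeline used in the study, and the function names and counts are hypothetical.

```python
import math


def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over taxa with nonzero counts."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def chao1_index(counts):
    """Bias-corrected Chao1 richness: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1))."""
    observed = sum(1 for c in counts if c > 0)
    singletons = sum(1 for c in counts if c == 1)  # taxa seen exactly once
    doubletons = sum(1 for c in counts if c == 2)  # taxa seen exactly twice
    return observed + singletons * (singletons - 1) / (2 * (doubletons + 1))


# Hypothetical counts for one sample (not data from the paper).
sample = [1, 1, 2, 5]
print(shannon_index(sample), chao1_index(sample))
```

Shannon weights both richness and evenness, while Chao1 estimates how many taxa were missed, which is why the two are usually reported together.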

A Deep Learning Based Approach to Recognizing Accompanying Status of Smartphone Users Using Multimodal Data (스마트폰 다종 데이터를 활용한 딥러닝 기반의 사용자 동행 상태 인식)

  • Kim, Kilho;Choi, Sangwoo;Chae, Moon-jung;Park, Heewoong;Lee, Jaehong;Park, Jonghun
    • Journal of Intelligence and Information Systems / v.25 no.1 / pp.163-177 / 2019
  • As smartphones have become widely used, human activity recognition (HAR) tasks that recognize the personal activities of smartphone users from multimodal data have been actively studied. The research area is expanding from the recognition of the simple body movements of an individual user to the recognition of low-level and high-level behavior. However, HAR tasks for recognizing interaction behavior with other people, such as whether the user is accompanying or communicating with someone else, have received less attention so far. Previous research on recognizing interaction behavior has usually depended on audio, Bluetooth, and Wi-Fi sensors, which are vulnerable to privacy issues and require much time to collect enough data. In contrast, physical sensors, including accelerometer, magnetic field, and gyroscope sensors, are less vulnerable to privacy issues and can collect a large amount of data within a short time. In this paper, a method for detecting accompanying status with a deep learning model that uses only multimodal physical sensor data, such as accelerometer, magnetic field, and gyroscope data, was proposed. The accompanying status was defined as a redefinition of a part of the user interaction behavior, including whether the user is accompanying an acquaintance at a close distance and whether the user is actively communicating with the acquaintance. A framework based on convolutional neural networks (CNN) and long short-term memory (LSTM) recurrent networks for classifying accompanying and conversation was proposed. First, a data preprocessing method consisting of time synchronization of multimodal data from different physical sensors, data normalization, and sequence data generation was introduced. We applied nearest interpolation to synchronize the time of the data collected from different sensors. 
Normalization was performed for each x, y, z axis value of the sensor data, and the sequence data was generated according to the sliding window method. Then, the sequence data became the input for the CNN, where feature maps representing local dependencies of the original sequence were extracted. The CNN consisted of 3 convolutional layers and did not have a pooling layer, in order to maintain the temporal information of the sequence data. Next, the LSTM recurrent networks received the feature maps, learned long-term dependencies from them, and extracted features. The LSTM recurrent networks consisted of two layers, each with 128 cells. Finally, the extracted features were used for classification by a softmax classifier. The loss function of the model was the cross-entropy function, and the weights of the model were randomly initialized from a normal distribution with a mean of 0 and a standard deviation of 0.1. The model was trained using the adaptive moment estimation (ADAM) optimization algorithm, and the mini-batch size was set to 128. We applied dropout to the input values of the LSTM recurrent networks to prevent overfitting. The initial learning rate was set to 0.001, and it decayed exponentially by a factor of 0.99 at the end of each training epoch. An Android smartphone application was developed and released to collect data. We collected smartphone data from a total of 18 subjects. Using the data, the model classified accompanying and conversation with accuracies of 98.74% and 98.83%, respectively. Both the F1 score and accuracy of the model were higher than those of the majority vote classifier, support vector machine, and deep recurrent neural network. In future research, we will focus on more rigorous multimodal sensor data synchronization methods that minimize the time stamp differences. In addition, we will further study transfer learning methods that enable models tailored to the training data to be transferred to evaluation data that follows a different distribution. 
It is expected that this will yield a model with robust recognition performance against data changes that were not considered during model training.
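
Three of the steps described in this abstract (nearest-neighbor time synchronization, sliding-window sequence generation, and the exponentially decaying learning rate) can be sketched in plain Python. This is an illustration only, not the authors' code: the function names are hypothetical, and the window size and stride are assumptions because the abstract does not state them. Only the initial rate 0.001 and the decay factor 0.99 come from the abstract.

```python
from bisect import bisect_left


def nearest_resample(timestamps, values, target_times):
    """For each target time, take the value whose timestamp is closest.

    `timestamps` must be sorted in ascending order. Ties go to the earlier sample.
    """
    out = []
    for t in target_times:
        i = bisect_left(timestamps, t)
        if i == 0:
            out.append(values[0])
        elif i == len(timestamps):
            out.append(values[-1])
        else:
            before, after = timestamps[i - 1], timestamps[i]
            out.append(values[i] if after - t < t - before else values[i - 1])
    return out


def sliding_windows(sequence, window, stride):
    """Cut a sequence into fixed-length, possibly overlapping windows."""
    return [sequence[s:s + window]
            for s in range(0, len(sequence) - window + 1, stride)]


def learning_rate(epoch, initial=0.001, decay=0.99):
    """Learning rate after `epoch` completed epochs of exponential decay."""
    return initial * decay ** epoch


# Hypothetical readings from one sensor axis, aligned to a shared clock.
aligned = nearest_resample([0.0, 1.0, 2.0], [10, 20, 30], [0.4, 0.6, 2.5])
windows = sliding_windows([1, 2, 3, 4, 5], window=3, stride=1)
print(aligned, windows, learning_rate(2))
```

Each window would then be one input sequence for the CNN; per-axis normalization is omitted here because the abstract does not say which normalization was used.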

Inter Simple Sequence Repeats (ISSR) Marker Analysis of Genetic Diversity in Korean Phasianus colchicus karpowi and Genetic Relationships Among Subspecies of Phasianus spp. (Inter Simple Sequence Repeats (ISSR) 표지자를 이용한 한국꿩의 유전적 다양성 및 아종간의 유연관계 분석)

  • Yoon, Seong-Il
    • Korean Journal of Environmental Biology / v.26 no.2 / pp.66-75 / 2008
  • The level of genetic diversity and the genetic relationships among habitats of the Korean ring-necked pheasant (Phasianus colchicus karpowi) and among subspecies were investigated based on Inter Simple Sequence Repeat (ISSR) markers. Wild and domesticated Korean ring-necked pheasants, hybrids between domesticated Korean ring-necked and foreign subspecies, and four foreign subspecies, namely Chinese ring-necked (P. c. torquatus), Melanistic mutant (P. c. mut. tenebrosus), XL White (P. c. mut) and Southern green (P. c. versicolor), were used for comparison. According to the AMOVA results, 94.08% of the genetic diversity in Korean ring-necked pheasants was allocated among individuals within habitats. The estimate of $\Phi$st, which represents the degree of genetic differentiation among habitats, was 5.9%. Based on the dendrogram reconstructed by UPGMA, the Yangpyung habitat turned out to be distinct from the other habitats among the eight habitats. Interestingly, the domesticated Korean ring-necked pheasants and the hybrid mixture showed a closer genetic relationship with the four foreign subspecies than with the Korean ring-necked pheasant. According to AMOVA, 96.63% of the genetic diversity in the four foreign subspecies was allocated among individuals within subspecies. The estimate of $\Phi$st representing the degree of genetic differentiation among subspecies was 3.4%, which was lower than that among habitats of the Korean ring-necked pheasant. The lower level of genetic difference among the four foreign subspecies showed that these subspecies were genetically closer even though they were morphologically classified into four different subspecies. When seven habitats of the Korean ring-necked pheasant and the four foreign subspecies were divided into Korean and Foreign Pheasant Groups, respectively, more than 17% of the genetic diversity was allocated between groups (about 4% among habitats/subspecies within groups). This observation implied that the Korean ring-necked pheasant is genetically quite different from the four foreign subspecies. 
On the basis of cluster analysis, three foreign subspecies (Chinese ring-necked pheasant, Melanistic mutant pheasant, and XL White pheasant) formed a distinct group with domesticated Korean ring-necked pheasant and hybrid mixture at 98% confidence interval.

Runoff assessment using radar rainfall and precipitation runoff modeling system model (레이더 강수량과 PRMS 모형을 이용한 유출량 평가)

  • Kim, Tae-Jeong;Kim, Sung-Hoon;Lee, Sung-Ho;Kim, Chang-Sung;Kwon, Hyun-Han
    • Journal of Korea Water Resources Association / v.53 no.7 / pp.493-505 / 2020
  • Rainfall-runoff models are generally used to obtain a consistent runoff sequence from long-term ground-gauge precipitation data. The Thiessen polygon method is commonly applied to estimate the mean areal rainfall from ground-gauge precipitation by assigning weights based on the relative areas delineated by the polygons. However, spatial bias is likely to increase when the rain gauge network is sparse. This study aims to generate continuous runoff sequences from the mean areal rainfall obtained from radar rainfall estimates using the PRMS rainfall-runoff model. Here, the systematic error of the radar rainfall is corrected by applying the G/R ratio. The results showed that the runoff estimated using the corrected radar rainfall estimates is largely similar and comparable to that obtained with the Thiessen method. More importantly, the mean areal rainfall obtained from the radar rainfall estimates can be expected to be more desirable than that from the ground gauges in terms of representing rainfall patterns in space, which in turn leads to significant improvement in the estimation of runoff.
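
The two rainfall estimates contrasted in this abstract can be sketched as follows. The Thiessen estimate is an area-weighted mean of gauge values; the G/R ratio is taken here to be the common mean-field form (total gauge rainfall divided by total radar rainfall at the gauge locations), which is an assumption because the abstract does not give the exact formulation. All function names and numbers are hypothetical.

```python
def thiessen_mean_areal_rainfall(gauge_rainfall, polygon_areas):
    """Area-weighted mean of gauge rainfall using Thiessen polygon areas."""
    total_area = sum(polygon_areas)
    if total_area <= 0:
        raise ValueError("polygon areas must sum to a positive number")
    return sum(r * a for r, a in zip(gauge_rainfall, polygon_areas)) / total_area


def gr_ratio(gauge_rainfall, radar_at_gauges):
    """Mean-field bias factor: total gauge rainfall over total radar rainfall."""
    radar_total = sum(radar_at_gauges)
    if radar_total == 0:
        raise ValueError("radar total is zero, so the ratio is undefined")
    return sum(gauge_rainfall) / radar_total


def correct_radar(radar_field, ratio):
    """Scale every radar cell by the G/R ratio."""
    return [r * ratio for r in radar_field]


# Hypothetical values in millimetres and square kilometres.
print(thiessen_mean_areal_rainfall([10.0, 20.0], [1.0, 3.0]))
ratio = gr_ratio([12.0, 8.0], [10.0, 6.0])
print(correct_radar([4.0, 8.0], ratio))
```

A single multiplicative factor removes the systematic under- or overestimation of the radar field while keeping its spatial pattern, which is the property the abstract highlights.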

Context-Weighted Metrics for Example Matching (문맥가중치가 반영된 문장 유사 척도)

  • Kim, Dong-Joo;Kim, Han-Woo
    • Journal of the Institute of Electronics Engineers of Korea CI / v.43 no.6 s.312 / pp.43-51 / 2006
  • This paper proposes a metric for example matching in example-based machine translation for English-Korean machine translation. Our metric, which serves as a similarity measure, is based on the edit-distance algorithm and is used to retrieve the example sentences most similar to a given query. Basically, it makes use of simple information such as the lemma and part-of-speech information of typographically mismatched words. The edit-distance algorithm cannot fully reflect the context of matched word units. In other words, as long as the matched word units are in order, the contribution of a full matching context to similarity is treated as identical to that of a partial matching context for a sequence of words in which mismatched word units intervene. To overcome this drawback, we propose a context-weighting scheme that uses the contiguity information of matched word units to capture the full context. We normalize the metric in order to convert the edit-distance metric, which represents dissimilarity, into a similarity metric, to apply the context-weighted metric to the example matching problem, and to rank examples by similarity. In addition, we generalize previous methods that use some linguistic information into one representative system. To verify the correctness of the proposed context-weighted metric, we carry out an experiment comparing it with the generalized previous methods.
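
The baseline that this abstract builds on, edit distance over word units converted into a similarity score, can be sketched as below. This shows only the plain Levenshtein distance with a length-based normalization, which is an assumed normalization; the paper's context-weighting scheme is not described in enough detail here to reproduce. The function names and sentences are hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or word lists)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        curr = [i]
        for j, y in enumerate(b, 1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Map distance to [0, 1]: 1.0 for identical sequences, 0.0 for no overlap."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


query = "the cat sat".split()
example = "the dog sat".split()
print(edit_distance(query, example), similarity(query, example))
```

Because this score depends only on the number of edits, two examples with the same matched words score the same whether or not those matches are contiguous, which is the limitation the abstract sets out to address.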