• Title/Summary/Keyword: sequence modeling

Search Result 398, Processing Time 0.026 seconds

Homology Modeling and Characterization of Oligoalginate Lyase from the Alginolytic Marine Bacterium Sphingomonas sp. Strain MJ-3 (알긴산을 분해하는 해양미생물인 Sphingomonas sp. MJ-3 균주의 올리고알긴산 분해효소의 상동성 모델링 및 특성연구)

  • Kim, Hee Sook
    • Journal of Life Science
    • /
    • v.25 no.2
    • /
    • pp.121-129
    • /
    • 2015
  • Alginates are found in marine brown seaweeds and in extracellular biofilms secreted by some bacteria. Previously, we reported an oligoalginate lyase from Sphingomonas sp. MJ-3 (MJ3-Oal) that had an exolytic activity and protein sequence homology with endolytic polymannuronate (polyM) lyase in the N-terminal region. In this study, the MJ3-Oal was tested for both exolytic and endolytic activity by homology modeling using the crystal structure of Alg17c from Saccharophagus degradans 2-40T. The tyrosine residue at the $426^{th}$ position, which possibly formed a hydrogen bond with the substrate, was mutated to phenylalanine. The FPLC profiles showed that MJ3-Oal degraded alginate quickly to monomers as a final product through the oligmers, whereas the Tyr426Phe mutant showed only exolytic alginate lyase activity. $^1H$-NMR spectra also showed that MJ3-Oal degraded the endoglycosidic bond of polyM and polyMG (polymannuronate-guluronate) blocks. These results indicate that oligoalginate lyase from Sphingomonas sp. MJ-3 probably catalyzes the degradation of both exo- and endo-glycosidic bonds of alginate.

Molecular Cloning of cDNA Encoding a Putative Eugenol Synthase in Tomato (Solanum lycopersicum 'Micro-Tom') and Prediction of 3D Structure and Physiochemical Properties (토마토 'Micro-Tom' 과실의 eugenol synthase 유전자 클로닝, 단백질의 3차 구조 및 생리화학적 특성 예측)

  • Kang, Seung-Won;Seo, Sang-Gyu;Lee, Tai-Ho;Lee, Gung-Pyo
    • Journal of agriculture & life science
    • /
    • v.46 no.4
    • /
    • pp.9-20
    • /
    • 2012
  • Eugenol is a volatile compound synthesized by eugenol synthase in various plants and belongs to phenylpropene compounds. However, characteristics of eugenol synthase in tomato has not been known. Therefore, we cloned a full length cDNA of a putative eugenol synthase from tomato 'Micro-Tom' using rapid amplification of cDNA ends (RACE) technique and named a clone SlEGS. Open reading frame of SlEGS was 921bp long and its deduced amino acid sequence was 307bp. The BLAST analysis indicated that SlEGS shared high similarity with PhEGS1 (67.1%) and CbEGS2 (69.4%). Amino acid composition of SlEGS was determined by CLC genomics workbench tool and 3D structure of SlEGS was constructed by homology modeling using Swiss-PDB viewer and validated using PROCHECK and ProSA-web tool. In addition, the physiochemical properties of SlEGS was evaluated using ExPASy's ProtParam tool. Molecular weight was 33.93kDa and isoelectric point was 5.85 showing acidic nature. Other properties such as extinction coefficient, instability index, aliphatic index, and grand average hydropathy was also analyzed.

A Topic Modeling-based Recommender System Considering Changes in User Preferences (고객 선호 변화를 고려한 토픽 모델링 기반 추천 시스템)

  • Kang, So Young;Kim, Jae Kyeong;Choi, Il Young;Kang, Chang Dong
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.2
    • /
    • pp.43-56
    • /
    • 2020
  • Recommender systems help users make the best choice among various options. Especially, recommender systems play important roles in internet sites as digital information is generated innumerable every second. Many studies on recommender systems have focused on an accurate recommendation. However, there are some problems to overcome in order for the recommendation system to be commercially successful. First, there is a lack of transparency in the recommender system. That is, users cannot know why products are recommended. Second, the recommender system cannot immediately reflect changes in user preferences. That is, although the preference of the user's product changes over time, the recommender system must rebuild the model to reflect the user's preference. Therefore, in this study, we proposed a recommendation methodology using topic modeling and sequential association rule mining to solve these problems from review data. Product reviews provide useful information for recommendations because product reviews include not only rating of the product but also various contents such as user experiences and emotional state. So, reviews imply user preference for the product. So, topic modeling is useful for explaining why items are recommended to users. In addition, sequential association rule mining is useful for identifying changes in user preferences. The proposed methodology is largely divided into two phases. The first phase is to create user profile based on topic modeling. After extracting topics from user reviews on products, user profile on topics is created. The second phase is to recommend products using sequential rules that appear in buying behaviors of users as time passes. The buying behaviors are derived from a change in the topic of each user. A collaborative filtering-based recommendation system was developed as a benchmark system, and we compared the performance of the proposed methodology with that of the collaborative filtering-based recommendation system using Amazon's review dataset. As evaluation metrics, accuracy, recall, precision, and F1 were used. For topic modeling, collapsed Gibbs sampling was conducted. And we extracted 15 topics. Looking at the main topics, topic 1, top 3, topic 4, topic 7, topic 9, topic 13, topic 14 are related to "comedy shows", "high-teen drama series", "crime investigation drama", "horror theme", "British drama", "medical drama", "science fiction drama", respectively. As a result of comparative analysis, the proposed methodology outperformed the collaborative filtering-based recommendation system. From the results, we found that the time just prior to the recommendation was very important for inferring changes in user preference. Therefore, the proposed methodology not only can secure the transparency of the recommender system but also can reflect the user's preferences that change over time. However, the proposed methodology has some limitations. The proposed methodology cannot recommend product elaborately if the number of products included in the topic is large. In addition, the number of sequential patterns is small because the number of topics is too small. Therefore, future research needs to consider these limitations.

Curvature stroke modeling for the recognition of on-line cursive korean characters (온라인 흘림체 한글 인식을 위한 곡률획 모델링 기법)

  • 전병환;김무영;김창수;박강령;김재희
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.11
    • /
    • pp.140-149
    • /
    • 1996
  • Cursive characters are written on an economical principle to reduce the motion of a pen in the limit of distinction between characters. That is, the pen is not lifted up to move for writing a next stroke, the pen is not moved at all, or connected two strokes chance their shapes to a similar and simple shape which is easy to be written. For these reasons, strokes and korean alphabets are not only easy to be changed, but also difficult to be splitted. In this paper, we propose a curvature stroke modeling method for splitting and matching by using a structural primitive. A curvature stroke is defined as a substroke which does not change its curvanture. Input strokes handwritten in a cursive style are splitted into a sequence of curvature strokes by segmenting the points which change the direction of rotation, which occur a sudden change of direction, and which occur an excessive rotation Each reference of korean alphabets is handwritten in a printed style and is saved as a sequence of curvature strikes which is generated by splitting process. And merging process is used to generate various sequences of curvature strikes for matching. Here, it is also considered that imaginary strokes can be written or omitted. By using a curvature stroke as a unit of recognition, redundant splitting points in input characters are effectively reduced and exact matching is possible by generating a reference curvature stroke, which consists of the parts of adjacent two korean alphasbets, even when the connecting points between korean alphabets are not splitted. The results showed 83.6% as recognition rate of the first candidate and 0.99sec./character (CPU clock:66MHz) as processing time.

  • PDF

Utilizing cell-free DNA to validate targeted disruption of MYO7A in rhesus macaque pre-implantation embryos

  • Junghyun Ryu;Fernanda C. Burch;Emily Mishler;Martha Neuringer;Jon D. Hennebold;Carol Hanna
    • Journal of Animal Reproduction and Biotechnology
    • /
    • v.37 no.4
    • /
    • pp.292-297
    • /
    • 2022
  • Direct injection of CRISPR/Cas9 into zygotes enables the production of genetically modified nonhuman primates (NHPs) essential for modeling specific human diseases, such as Usher syndrome, and for developing novel therapeutic strategies. Usher syndrome is a rare genetic disease that causes loss of hearing, retinal degeneration, and problems with balance, and is attributed to a mutation in MYO7A, a gene that encodes an uncommon myosin motor protein expressed in the inner ear and retinal photoreceptors. To produce an Usher syndrome type 1B (USH1B) rhesus macaque model, we disrupted the MYO7A gene in developing zygotes. Identification of appropriately edited MYO7A embryos for knockout embryo transfer requires sequence analysis of material recovered from a trophectoderm (TE) cell biopsy. However, the TE biopsy procedure is labor intensive and could adversely impact embryo development. Recent studies have reported using cell-free DNA (cfDNA) from embryo culture media to detect aneuploid embryos in human in vitro fertilization (IVF) clinics. The cfDNA is released from the embryo during cell division or cell death, suggesting that cfDNA may be a viable resource for sequence analysis. Moreover, cfDNA collection is not invasive to the embryo and does not require special tools or expertise. We hypothesized that selection of appropriate edited embryos could be performed by analyzing cfDNA for MYO7A editing in embryo culture medium, and that this method would be advantageous for the subsequent generation of genetically modified NHPs. The purpose of this experiment is to determine whether cfDNA can be used to identify the target gene mutation of CRISPR/Cas9 injected embryos. In this study, we were able to obtain and utilize cfDNA to confirm the mutagenesis of MYO7A, but the method will require further optimization to obtain better accuracy before it can replace the TE biopsy approach.

The Study on Efficiency Analysis of 3D Animation Production Process Using Unreal Live Link for Autodesk Maya (언리얼 라이브 링크를 이용한 3D애니메이션 제작 공정의 효율성 분석 연구)

  • Chongsan Kwon;Si-min Kim
    • Journal of Industrial Convergence
    • /
    • v.21 no.9
    • /
    • pp.11-21
    • /
    • 2023
  • There have been many studies to improve the efficiency of the CG production process, but it was not easy to overcome the problem that it was difficult to check the result in the middle of work and it took a lot of time for rendering. However, as the possibility of using Unreal Live Link, which can check the result in real-time, is increasing, expectations for improving the efficiency of the production process are rising. This study analyzed the efficiency of the 3D animation production process using Unreal Live Link. To this end, modeling, rigging, animation, and layout work were done in Maya, and the final output image sequence was rendered in Unreal Engine through Unreal Live Link. And the difference between this production process and the existing production process in which the final output image sequence is rendered in the 3D software itself was compared and analyzed. As a result of the analysis, unlike the traditional 3D animation production process, it was possible to check the final work result in real-time by proceeding with the work through a high-quality viewport screen, and it was found that the efficiency of work was maximized by deriving the final result through real-time screen capture. Recently, the use of game engines in the 3D animation and film industry is gradually increasing, and the efficiency of work is expected to be maximized if Unreal Live Link is used.

Numerical Modeling of Flow Characteristics within the Hyporheic Zones in a Pool-riffle Sequences (여울-소 구조에서 지표수-지하수 혼합대의 흐름 특성 분석에 관한 수치모의 연구)

  • Lee, Du-Han;Kim, Young-Joo;Lee, Sam-Hee
    • Journal of Wetlands Research
    • /
    • v.14 no.1
    • /
    • pp.75-87
    • /
    • 2012
  • Hyporheic zone is a region beneath and alongside a stream, river, or lake bed, where there is mixing of shallow groundwater and surfacewater. Hyporheic exchange controls a variety of physical, biogeochemical and thermal processes, and provides unique ecotones in a aquatic ecosystem. Field and experimental observations, and modeling studies indicate that hyporheic exchange is mainly in response to pressure gradients driven by the geomorphological features of stream beds. In the reach scale of a stream, pool-riffle structures dominate the exchange patterns. Flow over a pool-riffle sequence develops recirculation zones and stagnation points, and this flow structures make irregular pressure gradient which is driving force of the hyporheic exchange. In this study, 3 D hydro-dynamic model solves the Reynolds-averaged Navier-Stokes equations for the surface water and Darcy's Law and the continuity equation for ground water. The two sets of equations are coupled via the pressure distribution along the interface. Simulation results show that recirculation zones and stagnation points in the pool-riffle structures dominantly control the upwelling and downwelling patterns. With decrease of recirculation zones, length of donwelling zone formed in front of riffles is reduced and position of maximum downwelling point moves downward. The numerical simulation could successfully predict the behavior of hyporheic exchange and contribute the field study, river management and restoration.

Implementation of a Kinematic Network-Based Single-Frequency GPS Measurement Model and Its Simulation Tests for Precise Positioning and Attitude Determination of Surveying Vessel (동적네트워크 기반 단일주파수 GPS 관측데이터 모델링을 통한 측량선의 정밀측위 및 자세각결정 알고리즘 구현과 수치실험에 의한 성능분석)

  • Hungkyu, Lee;Siwan, Lyu
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.33 no.2
    • /
    • pp.131-142
    • /
    • 2015
  • In order to support the development of a cost-effective river bathymetric system, this research has focused on modeling GPS observables, which are obtained by array of five single-frequency receivers (i.e., two references and three rovers) to estimate the high accurate kinematic position, and the surveying vessel altitude. Also, by applying all GPS measurements as multiple-baselines with constraining rover baselines, we derived the socalled ‘kinematic network model.’ From the model, the integer-constrained least-squares (LS) for position estimation and the implicit LS for attitude determination were implemented, while a series of simulation tests with respect to the baseline lengths around 2km performed to demonstrate its accuracy analysis. The on-the-fly (OTF) ambiguity resolution tests revealed that ninety-nine percents of time-to-fix-first ambiguity (TTFF) can be decided in less than two seconds, when the positioning accuracy of ambiguity-fixed solutions was assessed as the greater than or equal to one and two centimeters in horizontal and vertical, respectively. Comparing to the GPS-derived attitudes, the achievable accuracy gradually descended in sequence of yaw, pitch and roll due to the antenna geometric configuration. Furthermore, the RMSE values for the baseline lengths of three to six meters were within ±1′for yaw, and less than ±10′and ±20′for pitch and roll, respectively, but those of between six to fifteen meters were less than ±1′for yaw, ±5′for pitch, and ±10′for roll.

Principal Discriminant Variate (PDV) Method for Classification of Multicollinear Data: Application to Diagnosis of Mastitic Cows Using Near-Infrared Spectra of Plasma Samples

  • Jiang, Jian-Hui;Tsenkova, Roumiana;Yu, Ru-Qin;Ozaki, Yukihiro
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1244-1244
    • /
    • 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noises present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i. e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess a good stability such that the prediction variance for unclassified objects can be as small as possible. In other words, an optimal classifier should find a balance between the separability and the stability. This is of special significance for multivariate spectroscopy-based classification where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for handling effectively multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations for the PDV methods are suggested, and an effective computing procedure is proposed for a PDV method. Near-infrared (NIR) spectra of blood plasma samples from mastitic and healthy cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). Results obtained demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from mastitic and healthy cows are clearly discriminated between by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional difference, thereby providing a useful means for spectroscopy-based clinic applications.

  • PDF

PRINCIPAL DISCRIMINANT VARIATE (PDV) METHOD FOR CLASSIFICATION OF MULTICOLLINEAR DATA WITH APPLICATION TO NEAR-INFRARED SPECTRA OF COW PLASMA SAMPLES

  • Jiang, Jian-Hui;Yuqing Wu;Yu, Ru-Qin;Yukihiro Ozaki
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1042-1042
    • /
    • 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noises present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i. e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess a good stability such that the prediction variance for unclassified objects can be as small as possible. In other words, an optimal classifier should find a balance between the separability and the stability. This is of special significance for multivariate spectroscopy-based classification where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for handling effectively multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations for the PDV methods are suggested, and an effective computing procedure is proposed for a PDV method. Near-infrared (NIR) spectra of blood plasma samples from daily monitoring of two Japanese cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). Results obtained demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from two cows are clearly discriminated between by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA md FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional difference.

  • PDF