• Title/Summary/Keyword: score sequence

An end-to-end synthesis method for Korean text-to-speech systems (한국어 text-to-speech(TTS) 시스템을 위한 엔드투엔드 합성 방식 연구)

  • Choi, Yeunju;Jung, Youngmoon;Kim, Younggwan;Suh, Youngjoo;Kim, Hoirin
    • Phonetics and Speech Sciences / v.10 no.1 / pp.39-48 / 2018
  • A typical statistical parametric speech synthesis (text-to-speech, TTS) system consists of separate modules, such as a text analysis module, an acoustic modeling module, and a speech synthesis module. This causes two problems: 1) expert knowledge of each module is required, and 2) errors generated in each module accumulate as they pass through the subsequent modules. An end-to-end TTS system can avoid these problems by synthesizing voice signals directly from an input string. In this study, we implemented an end-to-end Korean TTS system using Google's Tacotron, an end-to-end TTS system based on a sequence-to-sequence model with an attention mechanism. We used 4392 utterances spoken by a Korean female speaker, an amount that corresponds to 37% of the dataset Google used for training Tacotron. Our system obtained a mean opinion score (MOS) of 2.98 and a degradation mean opinion score (DMOS) of 3.25. We discuss the factors that affected training of the system. Experiments demonstrate that the post-processing network needs to be designed with the output language and input characters in mind, and that the maximum value of n for the n-grams modeled by the encoder should be kept small enough for the amount of training data.

Clinical validation of the 3-dimensional double-echo steady-state with water excitation sequence of MR neurography for preoperative facial and lingual nerve identification

  • Kwon, Dohyun;Lee, Chena;Chae, YeonSu;Kwon, Ik Jae;Kim, Soung Min;Lee, Jong-Ho
    • Imaging Science in Dentistry / v.52 no.3 / pp.259-266 / 2022
  • Purpose: This study aimed to evaluate the clinical usefulness of magnetic resonance (MR) neurography using the 3-dimensional double-echo steady-state with water excitation (3D-DESS-WE) sequence for the preoperative delineation of the facial and lingual nerves. Materials and Methods: Patients who underwent MR neurography for a tumor in the parotid gland area or lingual neuropathy from January 2020 to December 2021 were reviewed. Preoperative MR neurography using the 3D-DESS-WE sequence was evaluated. The visibility of the facial nerve and lingual nerve was scored on a 5-point scale, with poor visibility as 1 point and excellent as 5 points. The facial nerve course relative to the tumor was identified as superficial, deep, or encased. This was compared to the actual nerve course identified during surgery. The operative findings in lingual nerve surgery were also described. Results: Ten patients with parotid tumors and 3 patients with lingual neuropathy were included. Among the 10 parotid tumor patients, 8 were diagnosed with benign tumors and 2 with malignant tumors. The median facial nerve visibility score was 4.5 points. The distribution of scores was as follows: 5 points in 5 cases, 4 points in 1 case, 3 points in 2 cases, and 2 points in 2 cases. The lingual nerve continuity score in the affected area was lower than in the unaffected area in all 3 patients. The average visibility score of the lingual nerve was 2.67 on the affected side and 4 on the unaffected side. Conclusion: This study confirmed that the preoperative localization of the facial and lingual nerves using MR neurography with the 3D-DESS-WE sequence was feasible and contributed to surgical planning for the parotid area and lingual nerve.
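
As a quick arithmetic check, the reported median of 4.5 can be reproduced from the score distribution given in the abstract. The snippet below is only a sketch; the variable names are illustrative and not taken from the paper.

```python
from statistics import median

# Facial nerve visibility scores from the abstract: {score: number of cases}
distribution = {5: 5, 4: 1, 3: 2, 2: 2}

# Expand the distribution into one score per patient (10 patients in total)
scores = [score for score, cases in distribution.items() for _ in range(cases)]

# With an even number of cases, the median is the mean of the two middle values
median_score = median(scores)
print(median_score)
```

With ten cases, the two middle values of the sorted scores are 4 and 5, which gives the stated median of 4.5.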

A DNA Sequence Alignment Algorithm Using Quality Information and a Fuzzy Inference Method (품질 정보와 퍼지 추론 기법을 이용한 DNA 염기 서열 배치 알고리즘)

  • Kim, Kwang-Baek
    • Journal of Intelligence and Information Systems / v.13 no.2 / pp.55-68 / 2007
  • DNA sequence alignment algorithms in computational molecular biology have been improved by diverse methods. In this paper, we propose a DNA sequence alignment algorithm that uses quality information and a fuzzy inference method, exploiting the characteristics of DNA sequence fragments and a fuzzy logic system, in order to improve conventional alignment methods that use DNA sequence quality information. In conventional algorithms, DNA sequence alignment scores are calculated by the global sequence alignment algorithm of Needleman and Wunsch, applying the quality information of each DNA fragment. However, because overall DNA sequence quality information is used, errors may arise when calculating alignment scores if the quality at the tips of the DNA fragments is low. The proposed method improves on the conventional quality-based algorithms so that exact DNA sequence alignment can be achieved even when the fragment tips are of low quality. In addition, the mapping score parameters used to calculate the alignment scores are dynamically adjusted by the fuzzy logic system according to the lengths of the DNA fragments and the frequencies of low-quality DNA bases in the fragments. In experiments on real genome data from NCBI (National Center for Biotechnology Information), the proposed method was more efficient than conventional algorithms using quality information in DNA sequence alignment.
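
To make the baseline concrete, here is a minimal sketch of Needleman-Wunsch global alignment scoring with an optional per-base quality weight. The weighting rule (scaling each substitution score by the lower of the two base qualities, taken in the range 0 to 1) and the parameter values are assumptions for illustration. They are not the paper's scoring scheme, and the fuzzy adjustment of parameters is not modeled.

```python
def nw_score(a, b, qa=None, qb=None, match=1.0, mismatch=-1.0, gap=-2.0):
    """Optimal global alignment score of sequences a and b (Needleman-Wunsch).

    qa and qb are optional per-base quality values in [0, 1]. Scaling the
    substitution score by min(quality) is an illustrative assumption.
    """
    qa = qa or [1.0] * len(a)
    qb = qb or [1.0] * len(b)
    rows, cols = len(a) + 1, len(b) + 1

    # dp[i][j] holds the best score for aligning a[:i] with b[:j]
    dp = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        dp[i][0] = i * gap          # a[:i] aligned against gaps only
    for j in range(1, cols):
        dp[0][j] = j * gap          # b[:j] aligned against gaps only

    for i in range(1, rows):
        for j in range(1, cols):
            substitution = match if a[i - 1] == b[j - 1] else mismatch
            weight = min(qa[i - 1], qb[j - 1])
            dp[i][j] = max(
                dp[i - 1][j - 1] + weight * substitution,  # align two bases
                dp[i - 1][j] + gap,                        # gap in b
                dp[i][j - 1] + gap,                        # gap in a
            )
    return dp[-1][-1]


print(nw_score("ACGT", "AGT"))
```

Under this assumed weighting, a low-quality base at a fragment tip contributes less to both matches and mismatches, which is one simple way to limit the effect of unreliable ends.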

Determination of Design Parameters for Automobile Parts Recycling (자동차 부품의 재활용을 위한 설계시의 주요인자 결정)

  • 목학수;문광섭;박홍석;성재현;최흥원
    • Journal of the Korean Society for Precision Engineering / v.20 no.1 / pp.159-171 / 2003
  • In this paper, the same parts of domestic and foreign automobiles, in particular the door trim and bumper, are disassembled for the evaluation of disassemblability. The factors influencing disassembly are determined by classifying the bottleneck processes in the disassembly process. On the basis of the disassembly sequence and the structure of parts and subassemblies, disassemblability is classified into five categories, and the influencing factors related to these five categories are determined. From these relations, a checklist for disassembly evaluation is drawn up and score tables for the checked factors are established. To establish the disassembly score tables, the weighting values of the five categories are calculated from the disassembly test of the automobiles, and then the weighting values of the influencing factors within the five categories are calculated by the AHP (Analytic Hierarchy Process) method. Finally, the weighting values are modified and recalculated based on the disassembly test. Using these weighting values, the scores of the influencing factors are determined, and the score tables are then established based on those scores.
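
The AHP weighting step can be sketched as follows. This example uses the row geometric-mean approximation of AHP priority weights, which is one common variant. The abstract does not say which variant the authors used, and the 3x3 pairwise comparison matrix is hypothetical (the paper works with five categories).

```python
import math


def ahp_weights(pairwise):
    """Approximate AHP priority weights with the row geometric-mean method.

    pairwise[i][j] states how much more important factor i is than factor j,
    so pairwise[j][i] is expected to be its reciprocal.
    """
    n = len(pairwise)
    if any(len(row) != n for row in pairwise):
        raise ValueError("pairwise comparison matrix must be square")
    geo_means = [math.prod(row) ** (1.0 / n) for row in pairwise]
    total = sum(geo_means)
    return [g / total for g in geo_means]   # normalized so weights sum to 1


def weighted_score(weights, factor_scores):
    """Combine per-factor scores into a single weighted score."""
    return sum(w * s for w, s in zip(weights, factor_scores))


# Hypothetical judgments for three influencing factors (not from the paper)
pairwise = [
    [1.0, 2.0, 4.0],
    [0.5, 1.0, 2.0],
    [0.25, 0.5, 1.0],
]
weights = ahp_weights(pairwise)
print(weights)
```

For a perfectly consistent matrix such as this one, the geometric-mean weights coincide with the principal-eigenvector weights, here in the ratio 4 : 2 : 1.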

A Reranking Model for Korean Morphological Analysis Based on Sequence-to-Sequence Model (Sequence-to-Sequence 모델 기반으로 한 한국어 형태소 분석의 재순위화 모델)

  • Choi, Yong-Seok;Lee, Kong Joo
    • KIPS Transactions on Software and Data Engineering / v.7 no.4 / pp.121-128 / 2018
  • A Korean morphological analyzer can adopt a sequence-to-sequence (seq2seq) model, which can generate an output sequence whose length differs from that of the input. In general, a seq2seq-based Korean morphological analyzer takes a syllable-unit sequence as input and outputs a syllable-unit sequence. Syllable-based morphological analysis has the advantage that unknown words can be handled easily, but the disadvantage that morpheme-based information is ignored. In this paper, we propose a reranking model as a post-processor for the seq2seq model that can improve the accuracy of morphological analysis. The seq2seq-based morphological analyzer can generate K results by using beam search. The reranking model exploits morpheme-unit embedding information as well as n-grams of morphemes in order to reorder the K results. The experimental results show that the reranking model improves the F1 score by 1.17% compared with the original seq2seq model.
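
The reranking idea can be illustrated with a deliberately simplified sketch. It reorders K beam-search candidates by interpolating the seq2seq log-probability with a morpheme bigram log-probability. The interpolation, the bigram table, the `unseen` penalty, and the romanized toy morphemes are all assumptions for illustration. The paper's model also uses morpheme embeddings, which are not modeled here.

```python
def rerank(candidates, bigram_logprob, weight=0.5, unseen=-10.0):
    """Reorder beam-search candidates using a morpheme bigram score.

    candidates: list of (morphemes, seq2seq_logprob) pairs.
    bigram_logprob: dict mapping (previous, current) morphemes to a log-prob.
    """
    def lm_score(morphemes):
        padded = ["<s>", *morphemes, "</s>"]
        return sum(
            bigram_logprob.get(pair, unseen)       # penalize unseen bigrams
            for pair in zip(padded, padded[1:])
        )

    def combined(candidate):
        morphemes, seq2seq_logprob = candidate
        return (1 - weight) * seq2seq_logprob + weight * lm_score(morphemes)

    return sorted(candidates, key=combined, reverse=True)


# Toy data: the second candidate has the better seq2seq score, but its
# morpheme sequence is unsupported by the (hypothetical) bigram table.
bigrams = {("<s>", "na"): -0.5, ("na", "neun"): -0.2, ("neun", "</s>"): -0.3}
beam = [(("na", "neun"), -1.0), (("naneun",), -0.8)]
best = rerank(beam, bigrams)[0]
print(best)
```

The example shows why morpheme-level evidence can change the ranking: a candidate that looks slightly better at the syllable level can lose once its morpheme sequence is scored.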

Genetic assessment of BoLA-DRB3 polymorphisms by comparing Bangladesh, Ethiopian, and Korean cattle

  • Mandefro, Ayele;Sisay, Tesfaye;Edea, Zewdu;Uzzaman, Md. Rasel;Kim, Kwan-Suk;Dadi, Hailu
    • Journal of Animal Science and Technology / v.63 no.2 / pp.248-261 / 2021
  • Owing to their major function in pathogen recognition, bovine leukocyte antigens (BoLA) are well established as disease markers for immunological traits in cattle. However, few reports exist on polymorphism of the BoLA gene in zebu cattle breeds based on high-resolution typing methods. Thus, we used a polymerase chain reaction sequence-based typing (PCR-SBT) method to sequence exon 2 of the BoLA class II DRB3 gene from 100 animals: Ethiopian cattle (Boran, n = 13; Sheko, n = 20; Fogera, n = 16; Horro, n = 19), Korean Hanwoo cattle (n = 18), and Bangladesh Red Chittagong zebu (n = 14). Out of the 59 detected alleles, 43 were already deposited in the Immuno Polymorphism Database for major histocompatibility complex (IPD-MHC), while 16 were unique to this study. Assessment of genetic variability at the population and sequence levels, together with genetic distance, showed that the zebu breeds had a gene diversity score greater than 0.752, a nucleotide diversity score greater than 0.152, and a mean number of pairwise differences higher than 14, values very comparable to those reported for other cattle breeds. In the neutrality tests, all the breeds except Hanwoo had an excess number of alleles, as could be expected from a recent population expansion or genetic hitchhiking. However, the observed heterozygosity was not significantly (p < 0.05) higher than the expected heterozygosity. The Hardy-Weinberg equilibrium (HWE) analysis revealed a non-significant excess of heterozygous animals, indicative of plausible over-dominant selection. The pairwise FST values suggested low genetic variation among all the breeds (FST = 0.056; p < 0.05), beyond that rooted in the evolutionary or domestication history of the cattle. No detached clade was observed in the evolutionary divergence study of the BoLA-DRB3 gene, inferred from the phylogenetic tree based on the maximum likelihood model. This investigation indicated clear differences in BoLA-DRB3 gene variability between African and Asian cattle breeds.
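
For readers unfamiliar with the gene diversity statistic, the sketch below computes Nei's unbiased gene diversity (expected heterozygosity) from a list of sampled allele copies. The abstract does not state which estimator or software the authors used, so this is an assumed, standard formulation, and the allele labels are placeholders.

```python
from collections import Counter


def gene_diversity(alleles):
    """Nei's unbiased gene diversity: n / (n - 1) * (1 - sum(p_i ** 2)).

    alleles: one entry per sampled gene copy (two per diploid animal).
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("at least two gene copies are required")
    counts = Counter(alleles)
    homozygosity = sum((count / n) ** 2 for count in counts.values())
    return n / (n - 1) * (1.0 - homozygosity)


# Placeholder allele labels, not real BoLA-DRB3 allele names
sample = ["a1", "a1", "a2", "a2"]
print(gene_diversity(sample))
```

The value is 0 when every copy carries the same allele and approaches 1 as the sample is spread over many equally frequent alleles, which is why scores above 0.75 indicate a highly polymorphic locus.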

A Study on Satisfaction of Clinical Practice of Dental Technology Students - Focused on Daegu region - (치기공과학생의 임상실습만족도에 대한 조사 연구 -대구지역을 중심으로-)

  • Lee, Hwa-Sik;Bae, Bong-Jin;Park, Myoung-Ho
    • Journal of Technologic Dentistry / v.31 no.4 / pp.45-52 / 2009
  • This study analyzes students' satisfaction with clinical practice in order to improve on-site practice, recognizing the importance of clinical practice in departments of dental technology, and to provide basic material for clinical practice. Students studying dental technology at D college in Daegu were surveyed. The results are as follows. 1. The average satisfaction score for the operating method of clinical practice was 3.26. Among the detailed elements, 'credit assignment (10 credits)' scored highest at 3.65, followed by 'execution period (vacation)' at 3.50, 'choice of the clinical practice organization' at 3.25, and 'measures after practice' and 'pre-education', which both scored lowest at 2.98. 2. For the actual clinical practice, 'experience of new equipment and technology' scored highest at 3.64, followed by 'choice of lecturer' at 3.61, 'guidance method' at 3.49, 'appropriateness of contents' at 3.44, 'environment of the practice organization' at 3.36, 'evaluation method' at 3.35, and 'practical use of the evaluation material', which scored lowest at 3.18. 3. The average satisfaction score of participating students after clinical practice was 3.46, higher than both the satisfaction with the college's operating method of clinical practice (3.26) and the satisfaction with the practice organization (3.44), which covers the environment of dental technology organizations and laboratories, the lecturers' guidance method, and evaluation. 4. Regarding improvement of the distribution of the clinical practice evaluation, expressed as 'practice organization : college', '7:3' received the most responses at 35.77%, followed by '6:4' at 25.20%, '8:2' at 22.76%, and '4:6' at 16.26%. 5. Regarding how much the on-site evaluation should be reflected in the clinical practice grade, 50% reflection received the most responses at 32.93%, followed by 60% reflection at 26.83%, 20% reflection at 20.73%, and 80% reflection at 6.10%. For the attendance score, reflecting 50% and reflecting 40% each received 26.98% of responses, reflecting 30% received 25.40%, reflecting 10% received 20.63%, and no reflection received 0%. These results suggest that clinical practice needs further study, given its importance in dental technology education, and that problems in both college and on-site practice need improvement.

SCORE SEQUENCES IN ORIENTED GRAPHS

  • Pirzada, S.;Naikoo, T.A.;Shah, N.A.
    • Journal of applied mathematics & informatics / v.23 no.1_2 / pp.257-268 / 2007
  • An oriented graph is a digraph with no symmetric pairs of directed arcs and without loops. The score of a vertex $v_i$ in an oriented graph $D$ is $a_{v_i}$ (or simply $a_i$) $= n-1+d_{v_i}^+-d_{v_i}^-$, where $d_{v_i}^+$ and $d_{v_i}^-$ are the outdegree and indegree, respectively, of $v_i$, and $n$ is the number of vertices in $D$. In this paper, we give a new proof of Avery's theorem and obtain some stronger inequalities for scores in oriented graphs. We also characterize strongly transitive oriented graphs.
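
The score definition translates directly into code. The following sketch computes the score of every vertex from an arc list. The function name and the 0-based vertex labels are illustrative choices, not notation from the paper.

```python
def oriented_graph_scores(n, arcs):
    """Scores a_i = n - 1 + outdeg(i) - indeg(i) for vertices 0..n-1.

    arcs: iterable of (u, v) pairs meaning an arc from u to v.
    Raises ValueError if the arcs do not form an oriented graph.
    """
    out_deg = [0] * n
    in_deg = [0] * n
    seen = set()
    for u, v in arcs:
        if u == v:
            raise ValueError("loops are not allowed in an oriented graph")
        if (u, v) in seen or (v, u) in seen:
            raise ValueError("repeated or symmetric arcs are not allowed")
        seen.add((u, v))
        out_deg[u] += 1
        in_deg[v] += 1
    return [n - 1 + out_deg[i] - in_deg[i] for i in range(n)]


# Directed path 0 -> 1 -> 2 on three vertices
scores = oriented_graph_scores(3, [(0, 1), (1, 2)])
print(scores)
```

Because every arc adds one to a single outdegree and one to a single indegree, the scores of any oriented graph on n vertices sum to n(n-1), which gives a simple sanity check on a computed score sequence.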

Global Sequence Homology Detection Using Word Conservation Probability

  • Yang, Jae-Seong;Kim, Dae-Kyum;Kim, Jin-Ho;Kim, Sang-Uk
    • Interdisciplinary Bio Central / v.3 no.4 / pp.14.1-14.9 / 2011
  • Protein homology detection is an important issue in comparative genomics. Because of the exponential growth of sequence databases, fast and efficient homology detection tools are urgently needed. Currently, sequence comparison methods using local alignment, such as BLAST, are generally used for homology detection because they give a reasonable measure of sequence similarity. However, these methods have drawbacks in measuring overall sequence similarity, especially when dealing with eukaryotic genomes, which often contain many insertions and duplications in their sequences. Also, these methods do not provide explicit models for speciation, so it is difficult to translate their similarity measures into homology detection. Here, we present a novel method based on the Word Conservation Score (WCS) to address the current limitations of homology detection. Instead of counting each amino acid, we adopted the concept of a 'Word' to compare sequences. WCS measures overall sequence similarity by comparing word contents, which is much faster than BLAST comparisons. Furthermore, the evolutionary distance between homologous sequences can be measured by WCS. Therefore, we expect that sequence comparison with WCS is useful for multiple-species comparisons of large genomes. In performance comparisons on protein structural classifications, our method showed a considerable improvement over BLAST. Our method also found bigger micro-syntenic blocks, which consist of orthologs with conserved gene order. By testing on various datasets, we showed that WCS gives a faster and better overall similarity measure than BLAST.
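
The abstract does not define WCS, so the sketch below is not the authors' score. It only illustrates the general idea of comparing sequences by word content, using a plain Jaccard similarity over overlapping k-letter words. The word length `k` and the function names are assumptions.

```python
def words(sequence, k=3):
    """Set of overlapping k-letter words (k-mers) in a sequence."""
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def word_content_similarity(seq_a, seq_b, k=3):
    """Jaccard similarity of the two word sets. A stand-in, not WCS."""
    words_a, words_b = words(seq_a, k), words(seq_b, k)
    if not words_a and not words_b:
        return 0.0                      # both sequences shorter than k
    return len(words_a & words_b) / len(words_a | words_b)


print(word_content_similarity("ABCD", "BCDE", k=2))
```

Word sets can be built in a single pass over each sequence and compared with set operations, which is why word-based measures avoid the cost of computing an alignment.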

Reinterpretation of the protein identification process for proteomics data

  • Kwon, Kyung-Hoon;Lee, Sang-Kwang;Cho, Kun;Park, Gun-Wook;Kang, Byeong-Soo;Park, Young-Mok
    • Interdisciplinary Bio Central / v.1 no.3 / pp.9.1-9.6 / 2009
  • Introduction: In mass spectrometry-based proteomics, biological samples are analyzed to identify proteins by mass spectrometry and database search. Database search is the process of selecting, from an amino acid sequence database, the best matches to the experimental mass spectra; the protein is identified as the matched sequence. A match score is defined to find matches in the database, and the highest-scoring hit is declared the most probable protein. The search result therefore varies with the score definition. In this study, the differences among the search results of different search engines and different databases were investigated in order to suggest a better way to identify more proteins with higher reliability. Materials and Methods: The protein extract of human mesenchymal stem cells was separated into several bands by one-dimensional electrophoresis. The bands were excised from the one-dimensional gel one by one, digested with trypsin, and analyzed by a mass spectrometer, FT LTQ. The tandem mass (MS/MS) spectra of peptide ions were searched with the X!Tandem, Mascot, and Sequest search engines against the IPI human database and the SwissProt database. The search results were filtered at several threshold probability values using the Trans-Proteomic Pipeline (TPP) of the Institute for Systems Biology, and the output generated by TPP was analyzed. Results and Discussion: For each MS/MS spectrum, the peptide sequences identified under different conditions, such as search engine, threshold probability, and sequence database, were compared. At high threshold probability, the main difference in peptide identification was caused not by the difference in sequence database but by the difference in score. As the threshold probability decreased, the missed peptides appeared. Conversely, at extremely high threshold levels, many true assignments were missed. Conclusion and Prospects: The different identification results of the search engines were mainly caused by their different scoring algorithms. Usually in proteomics, high-scored peptides are selected and low-scored peptides are discarded. Many of them are true negatives. By integrating the search results from different parameters and different search engines, the protein identification process can be improved.
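
One simple way to picture the integration step is to merge per-engine identifications that pass a probability threshold and record which engines support each peptide. The data layout, engine names, spectrum identifiers, and threshold below are hypothetical. They do not reflect the TPP file formats or the authors' actual procedure.

```python
def integrate_results(results, threshold=0.9):
    """Merge peptide identifications from several search engines.

    results: {engine: {spectrum_id: (peptide, probability)}}
    Returns {spectrum_id: {peptide: set of engines that passed the threshold}}.
    """
    merged = {}
    for engine, hits in results.items():
        for spectrum_id, (peptide, probability) in hits.items():
            if probability < threshold:
                continue                 # discard low-confidence hits
            by_peptide = merged.setdefault(spectrum_id, {})
            by_peptide.setdefault(peptide, set()).add(engine)
    return merged


# Hypothetical results for two spectra from two unnamed engines
results = {
    "engine_a": {"s1": ("PEPTIDEK", 0.95), "s2": ("AAAK", 0.50)},
    "engine_b": {"s1": ("PEPTIDEK", 0.92), "s2": ("AAAK", 0.97)},
}
merged = integrate_results(results)
print(merged)
```

In this toy case spectrum `s2` would be lost if only `engine_a` were used, while the merged view keeps it and also shows that `s1` is supported by both engines.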