• Title/Summary/Keyword: Sentence analysis

Search Result 493, Processing Time 0.031 seconds

Korean Sentence Symbol Preprocess System for the Improvement of Speech Synthesis Quality (음성 합성 시스템의 품질 향상을 위한 한국어 문장 기호 전처리 시스템)

  • Lee, Ho-Joon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.2
    • /
    • pp.149-156
    • /
    • 2015
  • In this paper, we propose a Korean sentence symbol preprocessor for a SSML (speech synthesis markup language) supported speech synthesis system in order to improve the quality of the synthesized result. After the analysis of Korean Wikipedia documents, we propose 8 categories for the meaning of sentence symbols and 11 regular expression for the classification of each category. After the development of a Korean sentence symbol preprocess system we archived 56% of precision and 71.45% of recall ratio for 63,000 sentences.

An acoustical analysis of synchronous English speech using automatic intonation contour extraction (영어 동시발화의 자동 억양궤적 추출을 통한 음향 분석)

  • Yi, So Pae
    • Phonetics and Speech Sciences
    • /
    • v.7 no.1
    • /
    • pp.97-105
    • /
    • 2015
  • This research mainly focuses on intonational characteristics of synchronous English speech. Intonation contours were extracted from 1,848 utterances produced in two different speaking modes (solo vs. synchronous) by 28 (12 women and 16 men) native speakers of English. Synchronous speech is found to be slower than solo speech. Women are found to speak slower than men. The effect size of speech rate caused by different speaking modes is greater than gender differences. However, there is no interaction between the two factors (speaking modes vs. gender differences) in terms of speech rate. Analysis of pitch point features has it that synchronous speech has smaller Pt (pitch point movement time), Pr (pitch point pitch range), Ps (pitch point slope) and Pd (pitch point distance) than solo speech. There is no interaction between the two factors (speaking modes vs. gender differences) in terms of pitch point features. Analysis of sentence level features reveals that synchronous speech has smaller Sr (sentence level pitch range), Ss (sentence slope), MaxNr (normalized maximum pitch) and MinNr (normalized minimum pitch) but greater Min (minimum pitch) and Sd (sentence duration) than solo speech. It is also shown that the higher the Mid (median pitch), the MaxNr and the MinNr in solo speaking mode, the more they are reduced in synchronous speaking mode. Max, Min and Mid show greater speaker discriminability than other features.

Comparative Study of Tokenizer Based on Learning for Sentiment Analysis (고객 감성 분석을 위한 학습 기반 토크나이저 비교 연구)

  • Kim, Wonjoon
    • Journal of Korean Society for Quality Management
    • /
    • v.48 no.3
    • /
    • pp.421-431
    • /
    • 2020
  • Purpose: The purpose of this study is to compare and analyze the tokenizer in natural language processing for customer satisfaction in sentiment analysis. Methods: In this study, a supervised learning-based tokenizer Mecab-Ko and an unsupervised learning-based tokenizer SentencePiece were used for comparison. Three algorithms: Naïve Bayes, k-Nearest Neighbor, and Decision Tree were selected to compare the performance of each tokenizer. For performance comparison, three metrics: accuracy, precision, and recall were used in the study. Results: The results of this study are as follows; Through performance evaluation and verification, it was confirmed that SentencePiece shows better classification performance than Mecab-Ko. In order to confirm the robustness of the derived results, independent t-tests were conducted on the evaluation results for the two types of the tokenizer. As a result of the study, it was confirmed that the classification performance of the SentencePiece tokenizer was high in the k-Nearest Neighbor and Decision Tree algorithms. In addition, the Decision Tree showed slightly higher accuracy among the three classification algorithms. Conclusion: The SentencePiece tokenizer can be used to classify and interpret customer sentiment based on online reviews in Korean more accurately. In addition, it seems that it is possible to give a specific meaning to a short word or a jargon, which is often used by users when evaluating products but is not defined in advance.

Research on the Syntactic-Semantic Analysis System on Compound Sentence for Descriptive-type Grading (서술형 문항 채점을 위한 복합문 구문의미분석 시스템에 대한 연구)

  • Kang, WonSeog
    • The Journal of Korean Association of Computer Education
    • /
    • v.21 no.6
    • /
    • pp.105-115
    • /
    • 2018
  • The descriptive-type question is appropriate for deep thinking ability evaluation, but it is not easy to grade. Since, even though same grading criterion, the graders produce different scores, we need the objective evaluation system. However, the system needs the Korean analysis. As the descriptive-type answering is described with the compound sentence, the system has to analyze the compound sentence. This paper develops the Korean syntactic-semantic analysis system for compound sentence and evaluates performance of the system. This system selects the modifiee of the word phrase using syntactic-semantic constraint and semantic dictionary. The 93% accurate rate shows that the system is effective. This system will be utilized in descriptive-type grading and Korean processing.

영한자동번역에서의 두단계 영어 전산문법

  • 최승권
    • Language and Information
    • /
    • v.4 no.1
    • /
    • pp.97-109
    • /
    • 2000
  • Application systems of natural language processing such as machine translation system must deal with actual texts including the full range of linguistic phenomena. But it seems to be impossible that the existing grammar covers completely such actual texts because they include disruptive factors such as long sentences, unexpected sentence patterns and erroneous input to obstruct well-formed analysis of a sentence. In order to solve analysis failure due to the disruptive factors or incorrect selection of correct parse tree among forest parse trees, this paper proposes two-level computational grammar which consists of a constraint-based grammar and an error-tolerant grammar. The constraint-based computational grammar is the grammar that gives us the well-formed analysis of English texts. The error-tolerant computational grammar is the grammar that reconstructs a comprehensible whole sentence structure with partially successful parse trees within failed parsing results.

  • PDF

Anchoring Effect of the Prosecutor's Demand on Sentence: Evidence from Korean Sexual Crime Cases

  • KIM, JUNGWOOK;CHAE, SUBOK
    • KDI Journal of Economic Policy
    • /
    • v.39 no.3
    • /
    • pp.1-18
    • /
    • 2017
  • The anchoring effect can be found when a decision shows cognitive prejudice towards the initial information given. Several studies have argued that such an effect is present even for judges in the courtroom. This paper seeks to find a relationship between judges' decisions on penalty sentences and the sentences recommended by prosecutors. In this study, 2,773 actual court cases are considered in the analysis, and quantile regression is used to show that the sentencing decisions judges make are anchored by the recommendations of prosecutors. However, this reliance on recommendations differs according to the seriousness of the crime committed. Specifically, at the lowest penalty levels, a one-month increase in the prosecutors' sentencing recommendation results in a 0.25-month increase in the judges' sentence, while at the highest sentence level, the judges' sentences increase by 0.78 months under an identical condition. The results of this research indicate the need to create more objective and clear sentencing guidelines in the future in an effort to mitigate the psychological pressure experienced by judges with regard to serious offences or heinous crimes.

Voice Characteristics of Spastic Dysarthria (경직형 마비성 구어장애자의 장애 정도별 음성 특성)

  • Park, Hee-Jung;Shin, Hey-Jung;Jeong, Ok-Ran;Seok, Dong-Il
    • Speech Sciences
    • /
    • v.11 no.4
    • /
    • pp.185-195
    • /
    • 2004
  • Spastic dysarthria is a motor speech disorder produced by a damage to the direct (pyramidal) and indirect (extrapyramidal) pathways of the central nervous system. This study attempted to determine the influences of the severity of spastic dysarthria on the stability of voice. A sustained /a/, /a/ produced at the beginning of a sentence and at the end of the sentence were analyzed. The Praat 4.0.51 was used for analysis. The results showed a significant difference between the severe spastic dysarthric group and normal group matched by age and sex at the beginning and the end of the sentence. In addition, the severe group showed a significant difference between the first /a/ and the second /a/ at the end of sentence.

  • PDF

E-book to sign-language translation program based on morpheme analysis (형태소 분석 기반 전자책 수화 번역 프로그램)

  • Han, Sol-Ee;Kim, Se-A;Hwang, Gyung-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.2
    • /
    • pp.461-467
    • /
    • 2017
  • As the number of smart devices increases, e-book contents and services are proliferating. However, the text based e-book is difficult for a hearing-impairment person to understand. In this paper, we developed an android based application in which we can choose an e-book text file and each sentence is translated to sign-language elements which are shown in videos that are retrieved from the sign-language contents server. We used the korean sentence to sign-language translation algorithm based on the morpheme analysis. The proposed translation algorithm consists of 3 stages. Firstly, some elements in a sentence are removed for typical sign-language usages. Secondly, the tense of the sentence and the expression alteration are applied. Finally, the honorific forms are considered and word positions in the sentence are revised. We also proposed a new method to evaluate the performance of the translation algorithm and demonstrated the superiority of the algorithm through the translation results of 100 reference sentences.

An implementation of parser for special syntax processing in Korea (한국어 특수구문 처리를 위한 파서의 구현)

  • Kim, Jae-Mun;Lee, Sang-Kuk;Lee, Sang-Jo
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.31B no.11
    • /
    • pp.124-135
    • /
    • 1994
  • In this paper, we propose a Korean syntax analysis system for special syntax processing. HPSG, which processes syntatic and semantic analysis unificationally, is chosen for grammar description. Head-driven unidirectional active chart parser, which is efficient in Korean processing, is used for parsing mechanism. The parser of this paper can analyze not only general sentence structure which consists of complement-head, adjunct-head and head-head structure bur also special syntax which consists of auxiliay verb sentence, causative sentence, passive sentence and so on.

  • PDF

A Structure of Passive Constructions in Korean and their meaning 'Potential' (한국어 피동문의 구조와 가능(potential)의 의미 해석 -대조적 관점에서-)

  • Mok, Jung-Soo;Kim, Yeong-Jung
    • Lingua Humanitatis
    • /
    • v.8
    • /
    • pp.369-387
    • /
    • 2006
  • Which syntactic function should we assign to the 'ga-type' constituent which occurs in the morphological passive constructions in Korean, [N0-neun N1-i Vpass-ending]? This problem is very important in two respects. First, a small change of status of the particle 'i/ga' can exert an overall influence on the Korean grammar. Second, the particle '-i/ga' cannot guarantee that 'ga-type' constituents are subject of the sentence, so that the concept of syntactic category should be distinguished from that of syntactic function. This paper claims that the analysis of sentence has long been focused on the structure of proposition, namely the argument structure and that the direction of analysis should be turned to the 'person structure' which can be revealed on the pragmatic level. On the basis of this, this paper suggests that the specific type of the morphological passive constructions in Korean, [N0-neun N1-i Vpass-ending] should be analysed in line with the psych-verb constructions and that the modal meaning 'potential' of the passive constructions is correlated with sentence pattern and 'person structure'.

  • PDF