• Title/Summary/Keyword: speech features

Search Result 648, Processing Time 0.025 seconds

Bi-directional LSTM-CNN-CRF for Korean Named Entity Recognition System with Feature Augmentation (자질 보강과 양방향 LSTM-CNN-CRF 기반의 한국어 개체명 인식 모델)

  • Lee, DongYub;Yu, Wonhee;Lim, HeuiSeok
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.12
    • /
    • pp.55-62
    • /
    • 2017
  • The Named Entity Recognition system is a system that recognizes words or phrases with object names such as personal name (PS), place name (LC), and group name (OG) in the document as corresponding object names. Traditional approaches to named entity recognition include statistical-based models that learn models based on hand-crafted features. Recently, it has been proposed to construct the qualities expressing the sentence using models such as deep-learning based Recurrent Neural Networks (RNN) and long-short term memory (LSTM) to solve the problem of sequence labeling. In this research, to improve the performance of the Korean named entity recognition system, we used a hand-crafted feature, part-of-speech tagging information, and pre-built lexicon information to augment features for representing sentence. Experimental results show that the proposed method improves the performance of Korean named entity recognition system. The results of this study are presented through github for future collaborative research with researchers studying Korean Natural Language Processing (NLP) and named entity recognition system.

School Phonetics and How to Teach Prosody of English in Japan

  • Tsuzuki, Masaki
    • Proceedings of the KSPS conference
    • /
    • 1997.07a
    • /
    • pp.11-25
    • /
    • 1997
  • This presentation will focus on building basic English Prosodic Skills which are very useful and helpful for Japanese learners of English. The focus first will be on recognizing the seven basic nuclear tones, analysing intonation structures, distinguishing intonation patterns and then on the way of improving speaking ability using sufficient verbal contents of intonation (mini-dialogue). My presentation deals mainly with some difficulties which Japanese learners of English have in the field of RP intonation, It is chiefly concerned with identifying, describing and analysing tone-group sequences. It sometimes happens that Japanese learners of English can pronounce isolated bounds correctly and read phonetic symbols sufficiently, bet have difficult problems in carrying out accurate prosodic features. The use of wrong intonation is sometimes the cause of misunderstanding of speaker's attitude, connotation and shades of meaning, etc.. However accurately students can pronounce the nuclear tone or tone-group of English, they have to learn how to connect tone-groups properly for suitable sequences in respect to meaning or implication. We are faced with the complicated theory of RF intonation on the one hand and difficult realization of it on the other. Japanese learners of English have special difficulties in employing "rising tune" and "falling + rising tune". If students are taught pitch movements by indicating dots graphically between two horizontal lines, they can easily understand the whole shape of pitch movements. In this presentation, I illuminate several tone-group sequences which are very useful for Japanese learning English intonation. Among them, four similar Pitch Patterns, such as, (1) (equation omitted)- type, (2) (equation omitted) - type, (3) (equation omitted) - type and (4) (Rising Head) (equation omitted)- type are clarified and other important tone-group sequences aye also highlighted from the point of view of teaching English as a foreign language. The intonation theory, tone marks and technical terms are, in all essentials, those of Intonation of Colloquial English by O'Connor, J. D. and Arnold, G. F., Longman, 2nd ed., 1982. The changes of tone are shown graphically between two horizontal lines representing the ordinary high and low zones of the utterance. A.C.Gimson (1981:314) : The intonation of English has been studied in greater detail and for longer than that of any other language. No definitive analysis, classifying the features of RP intonation, has yet appeared (though that presented by O'Connor and Arnold (1973) provides the most comprehensive and useful account from the foreign learner's point of view).

  • PDF

The semantic structure of the Russian humor in the works of Michael Zadornov (자도르노프 작품 속에 나라난 러시아 유머의 의미군조)

  • 안병팔
    • Lingua Humanitatis
    • /
    • v.6
    • /
    • pp.321-357
    • /
    • 2004
  • In this article the structure of modern Russian humor is analyzed on the basis of some theories: bi-sociation theory (Koestler 1964), semantic script theory of verbal humor, using the concept of semantic presupposition, pragmatic felicity condition (Searle 1969; Levinson 1983) and grammatical rules (Chomsky 1965). Up to now the listed former theories were not examined and less analyzed by the semantic structure in the study of the structure of Russian humor(HcaeBa 1969; 3 $a_{OPHOB}$ 1991; 1992). Kreps (1981), who analyzed the works of Zoschenko, presented 21 types of humor, using the term 'humoreme'(Kpenc 1981, 36-37). These types are the list of the available means of humor that work not in the base of semantic criteria, but in the base of means of literary rhetoric. Kreps presented types of humor means, such as contradiction, antonymic substitution, macaronic speech and correlation of humoremes in the various types of humor. Apart from Kreps, Manakov (MaHaKOB 1986, 61-79) also studied these problems. He also set the system of the basic types of humor. Manakov introduced the linguistic means of humor of some Russian writers: Gogol, Tchechov. The means that Manakov showed with detailed examples, are trope, epithet, comic comparison, comic metaphor, comic periphrasis, euphemism, pun, zeugma, comic toponym, comic onomatopoeia, mania of foreign vocabulary, folk etymology, dialect etc. But these studies don't explain why these means make the works humorous. An, B.p tried to answer this question (안병팔 1997 a; b). An B.p. explains contexts of humor through the Release theory, the Superiority theory and the Incongruity theory. An, B.p. explained the process of deviation from the grammatical norms through morpho-syntactic and lexical means. But in these studies the humor was not analyzed by the semantic criteria. In order to linguistically evaluate various means of humor formation, it is necessary to elicit its deep structure, which makes it possible to research the formation and interpretation of humor. For this purpose this article, being based on the Incongruity theory, defined the structure of humor as negation of presupposition. Of course the former traditional studies also well shared the concept of 'contradiction' and 'contrast' of humor structure, but they didn't explain the structure by semantic differential features. This study, analyzing the works of' Zadornov, M., tried to note that through the negation of semantic presupposition the structure of contradiction is formed with semantic differential features on the semantic, syntactic or lexical dimensions.

  • PDF

A Study on its Formation of the Ulsan Dutbeki Dance: Focusing on Local Features in the Ulsan District. (향토성에 의한 울산덧배기춤의 형상화에 관한 연구)

  • Choi, Heung-Kee
    • (The) Research of the performance art and culture
    • /
    • no.41
    • /
    • pp.187-218
    • /
    • 2020
  • Ulsan Dutbeki is a local dance handed down by the Ulsan people through custom. This study was discussed on the locality of Ulsan Dutbeki. The method of this study is as follows. First of all, the perception of Dutbeki from the perspective of Ulsan's local characteristic. First, Ulsan Dutbeki is based on the local characteristic of the southeastern coastal area of the Korean peninsula. Second, Dutbeki features local characteristics of Ulsan as a military cultural area. Third, in Dutbeki, there is a local culture of Ulsan which was originated from the village Dongjeol and outdoor performances. Next, the researcher perceived Ulsan Dutbeki which had been handed down through custom and approached its shape. The origins of the shape are, firstly, the speech tone and gestures of Ulsan people. Secondly, folk plays related to worshiping martial arts and military training. Thirdly, the characteristics of the Dutbeki dance in coastal areas of Gyeongsangdo. Fourth, local custom displayed at the village festival of Ulsan. Ulsan is a region of Gyeongsang culture area and has similarity with other localities. However, this study limited its comparisons with regard to Dutbeki that were originated from the local characteristics of other regions. The results of this study recognized Ulsan Dutbeki as a local dance in Ulsan area. In other words, this study perceived Dutbeki, which had been an entertaining component of traditional lifestyle, as an intangible cultural heritage and studied the form in every conceivable way from an artistic point of view.

A Study on the Acoustic Characteristics of the Pansori by Voice Signals Analysis (음성신호 분석에 의한 판소리의 음성학적 특징 연구)

  • Kim, HyunSook
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.7
    • /
    • pp.3218-3222
    • /
    • 2013
  • Pansori is our traditional vocal sound, originality and excellence in the art of conversation, gesture general became a globally recognized world intangible heritage. Especially, Pansori as shrews and humorous representation of audience participation with a high degree of artistic value and enjoy the arts throughout all layers to be responsible for the social integration of functions is evaluated. Therefore, in this paper, Pansori five yard target speech signal analysis techniques applied to analyze the Pansori acoustic features of a representation of a society and era correlation extraction studies were performed. Pansori on the five yard spectrogram, pitch, stability and strength analysis for this experiment. Pansori through experimental results Comical story while keeping the audience focused and interested to better reflect the characteristics of energy for the wave of voice and vocal cord tremor change the width of a large, stable and voice with a loud voice, that expresses were analyzed.

SOUND SIMILARITY JUDGMENTS AND PHONOLOGICAL UNITS

  • Yoon, Yeo-Bom
    • Proceedings of the KSPS conference
    • /
    • 1997.07a
    • /
    • pp.142-143
    • /
    • 1997
  • The purpose of this paper is to assess the psychological status of the phoneme, syllable, and various postulated subsyllabic units in Korean by applying the Sound Similarity Judgment (SSJ) task, to compare the results with those in English, and to discuss the advantage and disadvantage of the SSJ task as a tool for linguistic research. In Experiment 1, 30 subjects listened to pairs of 56 eve words which were systematically varied from 'totally different' (e.g., pan-met) to 'identical' (e.g., pan-pan). Subjects were then asked to rate sound similarity of each pair on a 10-point scale. Not very surprisingly, there was a strong correlation between the number of phonemic segments matched and the similarity score provided by the subjects. This result was in accord with the previous results from English (e.g., Vitz & Winkler, 1973; Derwing & Nearey, 1986) and supported the assumption that the phoneme is the basic phonological unit in Korean and English. However, there were sharply contrasting results between the two languages. When the pairs shared two phonemes (e.g., pan-pat; pan-pen; pan-man), the pairs sharing the fIrst two phonemes were judged significantly more similar than the other two types of pairs. Quite to the contrary, in the comparable English experiments, the pairs sharing the last two phonemes were judged significantly more similar than the other two types of pairs. Experiment 2 was designed to conflrm the results of Experiment 1 by controlling the 'degree' of similarity between phonemes. For example, the pair pan-pam can be judged more similar than the pair pan-nan, although both pairs share the same number of phonemes. This could be interpreted either as confirming the result of Experiment 1 or as the fact that /n/ is more similar to /m/ than /p/ is to /n/ in terms of shared number of distinctive features. The results of Experiment 2 supported the former interpretation. Thus, the results of both experiments clearly showed that, although the 'number' of matched phonemes is the important predictor in judging sound similarity of monosyllabic pairs of both languages, the 'position' of the matched phonemes exerts a different influence in judging sound similarity in the two languages. This contrasting set of results may provide interesting implications for the internal structure of the syllable in the two languages.

  • PDF

Noise Robust Text-Independent Speaker Identification for Ubiquitous Robot Companion (지능형 서비스 로봇을 위한 잡음에 강인한 문맥독립 화자식별 시스템)

  • Kim, Sung-Tak;Ji, Mi-Kyoung;Kim, Hoi-Rin;Kim, Hye-Jin;Yoon, Ho-Sub
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.190-194
    • /
    • 2008
  • This paper presents a speaker identification technique which is one of the basic techniques of the ubiquitous robot companion. Though the conventional mel-frequency cepstral coefficients guarantee high performance of speaker identification in clean condition, the performance is degraded dramatically in noise condition. To overcome this problem, we employed the relative autocorrelation sequence mel-frequency cepstral coefficient which is one of the noise robust features. However, there are two problems in relative autocorrelation sequence mel-frequency cepstral coefficient: 1) the limited information problem. 2) the residual noise problem. In this paper, to deal with these drawbacks, we propose a multi-streaming method for the limited information problem and a hybrid method for the residual noise problem. To evaluate proposed methods, noisy speech is used in which air conditioner noise, classic music, and vacuum noise are artificially added. Through experiments, proposed methods provide better performance of speaker identification than the conventional methods.

  • PDF

Analysis of the University Library's Space Program and Design Characteristics with the Concept of 'Cultural Commons' - Focused on the Tama Art University Library - (문화공유지(Cultural Commons) 개념에 의한 대학도서관의 공간프로그램과 디자인방법의 특성 - 타마미술대학 도서관을 중심으로 -)

  • Pyun, Young-Hee;Park, Chan-Il
    • Korean Institute of Interior Design Journal
    • /
    • v.24 no.3
    • /
    • pp.48-58
    • /
    • 2015
  • This study is to conclude a direction for Information Commons, which supports the university library in a new role. The study explains perspectives on the changing role of the university library by examining the approaches, histories, and theories practiced by various researchers on Information Commons. The study aims to discover ways of improving the library space that are dedicated to technology using Information Commons, it also examines ways of creating a unified "library space" that will support learning and access to knowledge and information. The features of Cultural Commons include making improvements to technology-centered space, and providing support to research, freedom of speech, creative approach, public freedom and collaboration, and interaction. The functions of Cultural Commons within the university library are listed: First, it supports programs that will transform the library into a social hub within the university. The space specifically blurs the boundary between the library building and its surroundings, and unifies these spaces to enhance its catalytic role in aiding social interactions and human-centered approach. Second, it supports active participation through cultural programs and provides a fluid and interactive space with virtual resources. Third, it enhances user experience to supports behaviors and activities that involve fixtures and equipment in the space to promote learning. The study notes that, with the emergence of these characteristics, the university library is changing by implementing Cultural Commons for on-campus social space and new learning. Accordingly, this implementation is expected to enhance active acceptance of the library space in the future.

An Improved Homonym Disambiguation Model based on Bayes Theory (Bayes 정리에 기반한 개선된 동형이의어 분별 모텔)

  • 김창환;이왕우
    • Journal of the Korea Computer Industry Society
    • /
    • v.2 no.12
    • /
    • pp.1581-1590
    • /
    • 2001
  • This paper asserted more developmental model of WSD(word sense disambiguation) than J. Hur(2000)'s WSD model. This model suggested an improved statistical homonym disambiguation Model based on Bayes Theory. This paper using semantic information(co-occurrence data) obtained from definitions of part of speech(POS) tagged UMRD-S(Ulsan university Machine Readable Dictionary(Semantic Tagged)). we extracted semantic features in the context as nouns, predicates and adverbs from the definitions in the korean dictionary. In this research, we make an experiment with the accuracy of WSD system about major nine homonym nouns and new seven homonym predicates supplementary. The inner experimental result showed average accuracy of 98.32% with regard to the most Nine homonym nouns and 99.53% for the Seven homonym predicates. An Addition, we save test on Korean Information Base and ETRI's POS tagged corpus. This external experimental result showed average accuracy of 84.42% with regard to the most Nine nouns over unsupervised learning sentences from Korean Information Base and ETRI Corpus, 70.81 % accuracy rate for the Seven predicates from Sejong Project phrase part tagging corpus (3.5 million phrases) too.

  • PDF

Implementation of Iconic Language for the Language Support System of the Language Disorders (언어 장애인의 언어보조 시스템을 위한 아이콘 언어의 구현)

  • Choo Kyo-Nam;Woo Yo-Seob;Min Hong-Ki
    • The KIPS Transactions:PartB
    • /
    • v.13B no.4 s.107
    • /
    • pp.479-488
    • /
    • 2006
  • The iconic language interlace is designed to provide more convenient environments for communication to the target system than the keyboard-based interface. For this work, tendencies and features of vocabulary are analyzed in conversation corpora constructed from the corresponding domains with high degree of utilization, and the meaning and vocabulary system of iconic language are constructed through application of natural language processing methodologies such as morphological, syntactic and semantic analyses. The part of speech and grammatical rules of iconic language are defined in order to make the situation corresponding the icon to the vocabulary and meaning of the Korean language and to communicate through icon sequence. For linguistic ambiguity resolution which may occur in the iconic language and for effective semantic processing, semantic data focused on situation of the iconic language are constructed from the general purpose Korean semantic dictionary and subcategorization dictionary. Based on them, the Korean language generation from the iconic interface in semantic domain is suggested.