• Title/Summary/Keyword: 발화길이 (utterance length)

Search Result 70, Processing Time 0.023 seconds

Semi-automatic Expansion for a Chatting Corpus Based on Similarity Measure Using Utterance Embedding by CNN (합성곱 신경망에 의한 발화 임베딩을 사용한 유사도 측정 기반의 채팅 말뭉치 반자동 확장 방법)

  • An, Jaehyun;Ko, Youngjoong
    • Annual Conference on Human and Language Technology / 2018.10a / pp.95-100 / 2018
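  • High-quality, large-scale chatting corpora are essential for building a good chatting system, but constructing them is costly. This paper therefore proposes a method for semi-automatically expanding a chatting corpus using large volumes of utterance data such as movie subtitles and drama scripts. To expand the corpus, a chatting similarity is computed using a previously built chatting corpus and a similarity measure, and a candidate is judged to be a valid chatting pair if its chatting similarity exceeds a threshold obtained experimentally. To compute chatting similarity effectively for very short chat utterances, the paper proposes generating utterance-level representations from morpheme-level embedding vectors and a convolutional neural network (CNN) model. In experiments, the proposed approach improved precision, recall, and F1 by 5.16%p, 6.09%p, and 5.73%p, respectively, over the baseline utterance representation based on TF, yielding a semi-automatic chatting corpus construction model with 61.28% precision, 53.19% recall, and 56.94% F1.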
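The threshold rule in this abstract can be sketched as follows. This is a minimal illustration of a TF-style baseline only, not the authors' CNN model: it assumes whitespace tokens in place of morphemes, cosine similarity as the similarity measure, single utterances rather than utterance-response pairs, and a hypothetical threshold of 0.5. All function names are invented for this example.

```python
import math
from collections import Counter


def tf_vector(utterance):
    """Term-frequency vector; whitespace tokens stand in for morphemes (assumption)."""
    return Counter(utterance.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def chat_similarity(candidate, seed_corpus):
    """Highest similarity between a candidate and any utterance in the seed corpus."""
    cand = tf_vector(candidate)
    return max((cosine(cand, tf_vector(s)) for s in seed_corpus), default=0.0)


def expand(candidates, seed_corpus, threshold=0.5):
    """Keep candidates whose similarity exceeds the (hypothetical) threshold."""
    return [c for c in candidates if chat_similarity(c, seed_corpus) > threshold]


# Invented toy data: a tiny seed corpus and two subtitle lines.
seed = ["how are you today", "what did you eat"]
subtitles = ["how are you", "the reactor core is unstable"]
selected = expand(subtitles, seed)
```

In practice the threshold would be tuned on held-out data, as the abstract describes, rather than fixed in advance.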


Performance of Korean spontaneous speech recognizers based on an extended phone set derived from acoustic data (음향 데이터로부터 얻은 확장된 음소 단위를 이용한 한국어 자유발화 음성인식기의 성능)

  • Bang, Jeong-Uk;Kim, Sang-Hun;Kwon, Oh-Wook
    • Phonetics and Speech Sciences / v.11 no.3 / pp.39-47 / 2019
  • We propose a method to improve the performance of spontaneous speech recognizers by extending their phone set using speech data. In the proposed method, we first extract variable-length phoneme-level segments from broadcast speech signals and convert them to fixed-length latent vectors using a long short-term memory (LSTM) classifier. We then cluster acoustically similar latent vectors and build a new phone set by choosing the number of clusters with the lowest Davies-Bouldin index. We also update the lexicon of the speech recognizer by choosing, for each word, the pronunciation sequence with the highest conditional probability. To analyze the acoustic characteristics of the new phone set, we visualize its spectral patterns and segment durations. Through speech recognition experiments using a larger training data set than in our previous work, we confirm that the new phone set yields better performance than the conventional phoneme-based and grapheme-based units in both spontaneous speech recognition and read speech recognition.
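The model-selection step (choosing the number of clusters with the lowest Davies-Bouldin index) can be illustrated with a short sketch. The index below follows the standard definition (mean over clusters of the worst-case ratio of summed within-cluster scatter to centroid distance). The 2-D vectors and candidate labelings are invented stand-ins for the LSTM latent vectors and clustering runs, which the abstract does not specify.

```python
import math


def centroid(points):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(points)
    return [sum(p[d] for p in points) / n for d in range(len(points[0]))]


def davies_bouldin(points, labels):
    """Davies-Bouldin index (needs at least two clusters); lower is better."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    cents = {lab: centroid(ps) for lab, ps in clusters.items()}
    # Scatter S_i: mean distance from each member to its cluster centroid.
    scatter = {lab: sum(math.dist(p, cents[lab]) for p in ps) / len(ps)
               for lab, ps in clusters.items()}
    total = 0.0
    for i in clusters:
        # Worst-case similarity of cluster i to any other cluster j.
        total += max((scatter[i] + scatter[j]) / math.dist(cents[i], cents[j])
                     for j in clusters if j != i)
    return total / len(clusters)


# Toy 2-D "latent vectors" forming three tight groups (invented data).
vectors = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0), (20.0, 0.0), (20.0, 1.0)]
# Hypothetical labelings for two cluster counts, e.g. from separate clustering runs.
candidates = {
    2: [0, 0, 0, 0, 1, 1],
    3: [0, 0, 1, 1, 2, 2],
}
scores = {k: davies_bouldin(vectors, labs) for k, labs in candidates.items()}
best_k = min(scores, key=scores.get)
```

Here the three-cluster labeling scores lower than the two-cluster one, so `best_k` is 3.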

Change in lip movement during speech by aging: Based on a double vowel (노화에 따른 발화 시 입술움직임의 변화: 이중모음을 중심으로)

  • Park, Hee-June
    • Phonetics and Speech Sciences / v.13 no.1 / pp.73-79 / 2021
  • This study investigated how lip movement during speech changes with aging. Fifteen elderly women (mean age 69 years) and 15 young women (mean age 22 years) participated. To measure lip movement, the ratio between the minimum and maximum points of movement while pronouncing a double vowel was analyzed in pixel units using image analysis software. For clinical utility, the software was built around an automated algorithm, and its output was compared with manual measurements. The range of lip width and length in the double vowel tasks was smaller for the elderly group than for the young group. A strong positive correlation was found between the manual and automated methods, indicating that both are useful for extracting lip contours. These results show that the range of lip movement during speech decreases as aging progresses. Therefore, monitoring lip performance by simply measuring lip movement before aging progresses, and performing exercises to maintain lip range, can help prevent pronunciation problems caused by aging.

Ignition Temperature Characteristics of Mg-Al Alloys According to Composition Ratio (Mg-Al합금의 조성비율에 따른 발화온도특성)

  • Han, U-Seop;Lee, Geun-Won
    • Proceedings of the Korea Institute of Fire Science and Engineering Conference / 2013.04a / pp.77-77 / 2013
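  • In recent industrial activity, powder processes have been increasing in order to develop new raw materials and raise production efficiency. Because handling fine dust makes dust clouds easier to form and ignite, the risk of dust explosions and fires is increasing. To use, store, and handle dust safely, it is important to determine in advance the minimum ignition temperature (MIT) as a hazard index prior to ignition. The ignition temperature of a dust is an explosion characteristic value used, from a practical standpoint, to manage ignition hazards inside equipment and accident prevention measures in dust-handling processes. The ignition temperature of a dust also depends on dust concentration, and the lowest temperature found as the concentration is varied is called the MIT. This study experimentally investigated the minimum ignition temperature as a function of composition ratio for Mg and Mg-Al alloys (60:40 wt%, 50:50 wt%, 40:60 wt%), for which the frequency of fire and explosion accidents has not declined. The mean particle diameters of the Mg, Mg-Al (60:40 wt%), Mg-Al (50:50 wt%), and Mg-Al (40:60 wt%) samples were 142, 160, 151, and $152{\mu}m$, respectively. The MIT apparatus was built in accordance with IEC 61241-2-1 (Methods for Determining the Minimum Ignition Temperatures of Dust, 1994). It consists of a furnace, a dust cloud sample holder, a temperature controller, a compressed-air control unit, and related components. In the test procedure, the test dust is loaded into the dust holder and dispersed as a dust cloud into the furnace, heated to a set temperature, using compressed air at 0.5 bar for 0.3 sec; ignition is judged by recording with a digital video camera whether the dust cloud ignites and the flame propagates to the opening at the bottom of the furnace. The measured MIT was $740^{\circ}C$ for Mg and $820^{\circ}C$ for Mg-Al (60:40 wt%). For Mg-Al (50:50 wt%) and Mg-Al (40:60 wt%), however, no ignition occurred even when the concentration was varied with the furnace set temperature raised to a maximum of $890^{\circ}C$. According to the literature, the oxide film on the surface of Mg particles is porous, so oxidation at a given temperature increases linearly with time, whereas the oxide film on Al is protective, so the oxidation rate at a given temperature depends on the diffusion rate driven by the concentration gradient between the surface and the interior. Considering the ignition characteristics of Mg-Al alloys on the basis of these results, an increase in the Al component, which has low self-propagation, is presumed to lengthen the ignition delay and reduce combustibility, leading to a higher minimum ignition temperature. In addition, because the ignition temperature varies with the length of time the dust remains in the temperature field under given conditions, experimentally measured values differ with the measuring apparatus and method; when applying ignition temperatures at industrial sites, the residence time of the dust inside the equipment therefore needs to be considered.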
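The definition of MIT given in this abstract (the lowest ignition temperature found as dust concentration is varied) can be written as a small helper. This is only a sketch of that definition, not the IEC 61241-2-1 test procedure. The trial records are invented for illustration and are not data from the study.

```python
def minimum_ignition_temperature(trials):
    """Return the lowest furnace temperature (deg C) at which any trial ignited.

    `trials` is a list of (concentration, furnace_temp_c, ignited) tuples.
    Returns None when no trial ignited at any tested temperature.
    """
    ignited_temps = [temp for _conc, temp, ignited in trials if ignited]
    return min(ignited_temps) if ignited_temps else None


# Invented records: (dust concentration in g/m^3, furnace temperature in deg C, ignited?)
trials = [
    (250, 720, False),
    (250, 760, True),
    (500, 720, False),
    (500, 740, True),
    (750, 780, True),
]
mit = minimum_ignition_temperature(trials)

# A sample that never ignites up to the furnace limit has no measurable MIT.
no_ignition = [(250, 890, False), (500, 890, False)]
mit_none = minimum_ignition_temperature(no_ignition)
```

Returning `None` rather than a number mirrors the case reported for the 50:50 and 40:60 alloys, where no ignition was observed up to the furnace limit.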


The influence of utterance length on speech rate in spontaneous speech (자연발화 음성 코퍼스에서 발화 속도에 대한 발화 길이의 영향)

  • Kim, Jungsun
    • Phonetics and Speech Sciences / v.9 no.1 / pp.9-17 / 2017
  • This study examined speech rate and its variance in spontaneous Seoul Korean speech, focusing on factors that affect this variance: utterance length, individual speakers, and gender. First, utterance length had a significant influence on speech rate: longer utterances were spoken at a faster rate. Second, individual speakers differed significantly in how utterance length affected their speaking rate, and variation both between and within speakers tended to increase as utterance length increased. Third, there were gender differences: male speakers spoke considerably faster than female speakers. The results also suggest that non-linguistic factors in spontaneous speech can affect the variance of speakers' speaking rate.

A Comparative Study on the Speech Rate of Advanced Korean(L2) Learners and Korean Native Speakers in Conversational Speech (자유 대화에서의 한국어 원어민 화자와 한국어 고급 학습자들의 발화 속도 비교)

  • Hong, Minkyoung
    • Journal of Korean language education / v.29 no.3 / pp.345-363 / 2018
  • The purpose of this study is to compare the speech rate of advanced Korean (L2) learners and Korean native speakers in spontaneous utterances. Specifically, it investigated how the two groups' speech patterns differ according to utterance length. Eight advanced Korean (L2) learners and eight Korean native speakers participated. The data were collected by recording their conversations, and physical measurements (speaking rate, articulatory rate, pauses, and several types of speech disfluency) were taken on 120 utterances extracted from 12 of the 16 participants. The findings show that advanced Korean learners' speech patterns are similar to those of native speakers in short utterances. In long utterances, however, the two groups differ: the articulatory rate of native speakers increased, whereas that of learners decreased. This suggests that the frequency of speech disfluency factors might affect this result.
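The two rate measures named in this abstract are conventionally distinguished by whether pause time is counted. The sketch below assumes those conventional definitions (syllables per second, with or without pauses); the abstract does not state the exact units or formulas used, and the numbers are invented.

```python
def speaking_rate(syllables, total_sec):
    """Syllables per second over the whole utterance, pauses included."""
    return syllables / total_sec


def articulation_rate(syllables, total_sec, pause_sec):
    """Syllables per second over phonation time only (pauses removed)."""
    return syllables / (total_sec - pause_sec)


# Invented utterance: 24 syllables, 5.0 s long, of which 1.0 s is silent pause.
sr = speaking_rate(24, 5.0)
ar = articulation_rate(24, 5.0, 1.0)
```

Because pause time is removed from the denominator, the articulation rate is never lower than the speaking rate for the same utterance.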

Topic and Topic Change Detection in Instance Messaging (인스턴트 메시징에서의 대화 주제 및 주제 전환 탐지)

  • Choi, Yoon-Jung;Shin, Wook-Hyun;Jeong, Yoon-Jae;Myaeng, Sung-Hyon;Han, Kyoung-Soo
    • Journal of the Korea Society of Computer and Information / v.13 no.7 / pp.59-66 / 2008
  • This paper describes a novel method for identifying the main topic and detecting topic changes in text-based dialogue such as Instant Messaging (IM). Compared to other forms of text, dialogues are characterized by short texts with few words, two or more participants, and a history that affects the current utterance. Given these characteristics, our method detects the main topic of a dialogue by considering keywords not only in the user's utterances but also in the dialogue system's responses. Dialogue histories are also considered in the detection process to increase accuracy. For topic change detection, the similarity between the topic of the previous utterance and the topic of the current utterance is calculated. If the similarity is smaller than a certain threshold, our system judges that the topic has changed at the current utterance. We obtained 88.2% and 87.4% accuracy in topic detection and topic change detection, respectively.
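The topic-change rule in this abstract (flag a change when similarity to the previous utterance's topic falls below a threshold) can be sketched as below. The abstract does not name the similarity measure or the threshold, so Jaccard similarity over keyword sets and a threshold of 0.2 are assumptions made purely for illustration, as are the function names and the toy dialogue.

```python
def jaccard(a, b):
    """Jaccard similarity of two keyword sets (1.0 when both are empty)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def detect_topic_changes(utterance_keywords, threshold=0.2):
    """Return indices of utterances whose topic differs from the previous one.

    A change is flagged when the similarity to the previous utterance's
    keywords falls below the (hypothetical) threshold.
    """
    changes = []
    for i in range(1, len(utterance_keywords)):
        if jaccard(utterance_keywords[i - 1], utterance_keywords[i]) < threshold:
            changes.append(i)
    return changes


# Invented dialogue, already reduced to keyword sets per utterance.
dialogue = [
    {"movie", "ticket", "tonight"},
    {"movie", "ticket", "price"},
    {"weather", "rain", "umbrella"},
    {"rain", "umbrella", "forecast"},
]
changes = detect_topic_changes(dialogue)
```

Only the third utterance (index 2) shares no keywords with its predecessor, so it is the only one flagged.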


A Study on the Spontaneous Ignition Characteristics of Wood Pellets related to Change in Flow Rate (공기유량의 변화에 대한 우드펠릿의 자연발화 특성에 관한 연구)

  • Kim, Hyeong-Seok;Choi, Yu-Jung;Choi, Jae-Wook
    • Journal of the Korea Academia-Industrial cooperation Society / v.20 no.4 / pp.590-596 / 2019
  • The use of fossil fuels such as coal and oil increases with industrial development, and problems such as abnormal climate arise as greenhouse gases increase. Accordingly, eco-friendly renewable energy is being actively studied as a replacement for these main resources, and wood pellets, with their high thermal efficiency, are drawing particular attention as an alternative fuel for thermal power stations and gas boilers. Despite the steady increase in their use, however, few studies have examined their hazards, such as fire and spontaneous combustion. This study therefore determined the auto-ignition temperature and critical ignition temperature of wood pellets at different flow rates in a thermostatic bath, using a sample vessel 20 cm long, 20 cm high, and 14 cm thick, in order to predict their ignition characteristics. At a flow rate of 0 NL/min, the core temperature of the sample rose above the ambient temperature and the sample ignited at $153^{\circ}C$; the critical ignition temperature was $152.5^{\circ}C$. At flow rates of 0.5 NL/min and 1.0 NL/min, it was $149.5^{\circ}C$, and at 1.5 NL/min it was $147.5^{\circ}C$. Thus, under the same storage conditions, the higher the flow rate, the lower the critical ignition temperature.

Forest Fire Direction and Spread Characteristics by Field Investigations (사례 조사를 통한 산불 방향 및 확산 특성)

  • Lee, Byung-Do;Koo, Kyo-Sang;Lee, Myung-Bo
    • Fire Science and Engineering / v.23 no.5 / pp.96-102 / 2009
  • Forest fire ignition and spread characteristics are needed as basic data for fire management. Using 101 forest fires that broke out between 2007 and the spring of 2009, this study analyzed the slope aspect of the ignition point, the spread direction, and the wind direction at the time of the fire, and proposed regression equations for predicting burned area, fire perimeter, head spread rate, and flank spread rate from combustion time. Of the fires investigated, 57% ignited on south-, southwest-, or southeast-facing slopes, and 68% spread to the east, southeast, or northeast under the influence of westerly winds. About 11.8 ha of burned forest and a 0.5 km increase in fire perimeter were predicted per hour. Head and flank spread rates were calculated as 0.13 km and 0.05 km, respectively.

A Research on the Interlanguage of Chinese Speaking Korean Language Learners: Focusing on MLU and Characteristics Found in Vocabulary Usage (중국인 한국어 학습자의 중간언어 연구 - 평균발화길이(MLU)와 어휘적 특성을 중심으로)

  • Kim, Seon-Jung;Kim, Mok-Ah
    • Cross-Cultural Studies / v.22 / pp.303-327 / 2011
  • This study aims to uncover the language proficiency shown in the writing data of Chinese elementary- and intermediate-level learners of Korean. Because error analysis provides only partial information about learners' proficiency, this study analyzes the interlanguage of Korean learners in terms of mean length of utterance (MLU) to capture the overall picture of their proficiency more systematically. It first examines learners' general language development using MLU-m (morpheme) and MLU-w (word) in compositions by Chinese-speaking learners of Korean, and then analyzes vocabulary. MLU increased slightly from the elementary to the intermediate level; however, morphemes appeared to be difficult to use, as there was a notable difference between the Chinese learners and Korean university students. For vocabulary, the study examines vocabulary diversity, usage by word class, and use of predicates; learners tend to use a greater number and variety of words as proficiency increases. In predicate use, Chinese learners use fewer vocabulary types.
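MLU-w and MLU-m are averages of words and morphemes per utterance, respectively. The sketch below computes both on a hand-segmented toy sample. The English sentences and the `+` morpheme-boundary notation are invented for illustration; an actual analysis of Korean compositions would rely on a morphological analyzer, which is not shown here.

```python
def mean_length_of_utterance(utterances):
    """Average number of units (words or morphemes) per utterance."""
    if not utterances:
        raise ValueError("need at least one utterance")
    return sum(len(u) for u in utterances) / len(utterances)


# Invented sample, segmented by hand: spaces separate words,
# and '+' marks morpheme boundaries inside a word.
sample = ["the dog+s bark+ed", "I walk+ed home"]

words = [u.split() for u in sample]
morphemes = [[m for w in u.split() for m in w.split("+")] for u in sample]

mlu_w = mean_length_of_utterance(words)      # (3 + 3) / 2
mlu_m = mean_length_of_utterance(morphemes)  # (5 + 4) / 2
```

The gap between the two values grows with morphological complexity, which is why the abstract reports both measures.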