• Title/Summary/Keyword: 자유발화 (spontaneous speech)

Search Result 34, Processing Time 0.03 seconds

Common Speech Database Collection (공통음성 DB 구축)

  • Kim Sanghum;Oh Seungshin;Jung Ho-Young;Jeong Hyung-Bae;Kim Jeong-Se
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.21-24 / 2002
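  • This paper describes the construction of the Common Speech Database being carried out by the ETRI Speech Information Research Center. Over a total of three years (November 2001 to October 2004), speech databases for various purposes such as speech recognition, speech synthesis, and speaker recognition are to be collected, and a total of 14 speech databases are planned for 2002, the first year. The Common Speech Database was designed to account for various channels (microphone, headset, VoIP, wired and wireless telephone networks), regions, genders, and recording environments (office, subway, road, etc.). The utterance targets are digits, words, and sentences, and the database covers a range of speaking styles, including spontaneous speech, conversational speech, and read speech. This paper describes the contents of the 14 Common Speech Databases, the construction method, and the construction schedule.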

Meaning and Intonation of Endings with Polysemous Modality: Through the Analysis of the Spontaneous Speech (인식·행위 양태 다의성 어미의 의미와 억양 -구어 자유발화 분석을 통하여-)

  • Jo, Min-ha
    • Korean Linguistics / v.77 / pp.331-357 / 2017
  • The purpose of this paper is to identify how intonation realized on sentence endings works in spoken language. To this end, the paper analyzes 300 minutes of spontaneous speech by women from Seoul and discusses the meanings of modality and their relationship with intonation. Intonation plays a significant role in polysemous modal endings that express epistemic and act modality. Epistemic modality is usually expressed through indirect, soft intonations such as L:, M:, and LH, whereas act modality is expressed through direct, strong intonations such as H, HL, and LHL. Intonation appears to be related to the degree of certainty of the information rather than to the classification of modality: lengthening relates to indirectness, H to uncertainty, L to statement or affirmation, and HL and LHL to an assertive attitude. This paper is significant in that it overcomes the abstractness of existing modality studies and provides an objective and comprehensive analysis based on actual spontaneous speech data.

Study on Method Constructing Dialog Act Tagged Corpus for Dialog System in Car (차량용 대화 시스템을 위한 Dialog Act 태깅 코퍼스 구축 방법 연구)

  • Choi, Sung-Kwon;Kwon, Oh-Woog;Kim, Young-Gil
    • Annual Conference on Human and Language Technology / 2012.10a / pp.181-184 / 2012
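  • This paper describes a method for constructing a Dialog Act tagged corpus for the in-car dialog system being developed by the Language Processing Research Team at the Electronics and Telecommunications Research Institute (ETRI). The construction method has two main parts: collecting an in-car dialog corpus, and semi-automatically tagging the collected corpus with Dialog Acts. The in-car dialog corpus was built in the following order: 1) building a dialog plan map, 2) building standard dialogs, 3) building free dialogs, and 4) building paraphrases of user utterances. For Dialog Act tagging, slot candidates were extracted from the collected dialog corpus to build a slot scheme, semi-automatic slot tagging was then performed, and the slot tagging results were combined with Dialog Act Types to produce the Dialog Act tagged corpus. The resulting corpus is being applied to in-car dialog systems such as the vehicle climate control system (air conditioner, heater, etc.) and a vehicle emergency response information service.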

Developing an Instrument for Analysing Students' Behavioral Engagement in School Science Classroom (과학수업에서 나타나는 학생들의 행동적 참여 분석을 위한 영상 분석 도구의 개발)

  • Choi, Joonyoung;Na, Jiyeon;Song, Jinwoong
    • Journal of The Korean Association For Science Education / v.35 no.2 / pp.247-258 / 2015
  • Students are engaged in classroom learning, and classroom learning occurs not only through conversation but also through nonverbal behavior. In science classrooms especially, there are meaningful nonverbal behaviors, such as the practical activities of observation and measurement. However, these behaviors have not been properly investigated by existing instruments that try to measure students' engagement. This study aims to develop a new instrument for analyzing students' behavioral engagement, especially in science classrooms. The instrument was developed in three steps. First, student behaviors were classified into fourteen categories through a literature review and a series of observations of elementary science classrooms. Second, based on these categories, a framework for analyzing student behavioral engagement was developed. With the framework, every student moment can be labeled as Participatory Speech, Participatory Silence, Non-Participatory Speech, or Non-Participatory Silence. Third, an instrument applying the framework was developed using Microsoft Excel. As a trial, two fourth-grade students in an elementary science class were analyzed with this instrument. The results of the trial analysis show that the largest share of the science lesson was occupied by Participatory Silence (63% and 72%). Within Participatory Silence, 'listening' was the most common behavior (51% and 42% of the trial lesson), and 'observing', a behavior specific to science, ranked fourth (17% and 17% of the trial lesson). The developed instrument is expected to help improve our understanding of the patterns of student engagement in science classrooms.
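
The four-way labeling described in this abstract can be illustrated with a minimal Python sketch. The function names, the boolean flags, and the sample observations are assumptions made for illustration; the study's instrument was built in Microsoft Excel, and its internal layout is not described here.

```python
from collections import Counter

def label_moment(participatory: bool, speaking: bool) -> str:
    """Map one observed moment to one of the four framework labels."""
    prefix = "Participatory" if participatory else "Non-Participatory"
    mode = "Speech" if speaking else "Silence"
    return f"{prefix} {mode}"

def category_shares(moments):
    """Return the percentage of moments that fall under each label."""
    counts = Counter(label_moment(p, s) for p, s in moments)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: 100.0 * n / total for label, n in counts.items()}

# Made-up (participatory, speaking) observations, not data from the study.
moments = [(True, False), (True, False), (True, True), (False, False)]
shares = category_shares(moments)
```

Here `shares` maps each label that occurs to its share of the observed moments, which is the kind of figure the abstract reports for Participatory Silence.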

A Study of Intonation Curve Slopes in Korean Spontaneous Speech (자유 발화 자료에서 나타나는 한국어 억양 곡선의 기울기 특성에 대한 연구)

  • Oh, Jeahyuk
    • Phonetics and Speech Sciences / v.6 no.1 / pp.21-30 / 2014
  • This study discusses pitch slope in Korean intonation curves using spontaneous speech data. For this study, 656 utterances were taken from a spoken corpus and processed with 'close-copy stylization', and the physical features of the pitch movements were then extracted. The pitch slope was calculated on the basis of time and pitch range in each utterance. The results show that, apart from essential differences, the average and distribution of pitch slope are similar for men and women within the range of pitch movement; the slope of pitch movement thus shows no difference between men and women. Pitch slopes on a scale of -10 to 10 account for 90% of all pitch slopes; slopes that move by time scale without curve account for 33.1%; slopes that move half of the pitch bandwidth during the average time for pitch movement account for 23.4%; and slopes that move 100% of the pitch bandwidth during half of the average time for pitch movement account for 10.4%. These results suggest the possibility of standardizing Korean intonation by pitch slope.

Phoneme Frequency of 3 to 8-year-old Korean Children (3세~8세 아동의 자유 발화 분석을 바탕으로 한 한국어 말소리의 빈도 관련 정보)

  • Sin, Ji-Yeong
    • Proceedings of the KSPS conference / 2005.04a / pp.15-19 / 2005
  • The aim of this study is to provide information on the frequencies of occurrence of Korean phonemes and syllables by analyzing the spontaneous speech of 3- to 8-year-old Korean children. Forty-nine Korean children (7 to 10 children for each age) participated in this study. Speech data were recorded and phonemically transcribed. For each child, 120 utterances were selected for analysis, except for one child whose data comprised only 91 utterances. The data of the present study comprised 5,971 utterances, 51,554 syllables, and 105,491 phonemes. Among the 19 consonants, /n/ showed the highest frequency, and the four most frequent consonants together accounted for over 50% for all age groups. Among the 18 vowels, /a/ was the most frequent, and /i/ and /ʌ/ were second and third, respectively. Frequently occurring syllable types were part of a grammatical word in most cases. Only 5 to 6% of syllable types covered 50% of speech.
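
The coverage figure in this abstract (a small share of syllable types accounting for half of all tokens) can be computed along the lines of the sketch below. The function name and the toy data are hypothetical, and this is not the procedure reported by the author.

```python
from collections import Counter

def types_needed_for_coverage(tokens, target=0.5):
    """Return (k, share): the number of most-frequent types whose tokens
    reach `target` of all tokens, and k as a fraction of all types."""
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return 0, 0.0
    covered = 0
    # Walk the types from most to least frequent, accumulating tokens.
    for k, (_, n) in enumerate(counts.most_common(), start=1):
        covered += n
        if covered / total >= target:
            return k, k / len(counts)
    return len(counts), 1.0

# Toy syllable sequence, made up for illustration (not the study's data).
syllables = ["ga"] * 5 + ["na"] * 3 + ["da", "ra"]
k, share = types_needed_for_coverage(syllables)
```

With real transcriptions, `share` would be the fraction of syllable types needed to reach 50% of the speech, comparable to the 5 to 6% reported above.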

Study on Participants' Perceptions of Sharing Economy Policies: A Text Mining Approach to Online Community Posts (공유경제 참여자의 비즈니스 등록정책에 대한 인식과 심적기재: 온라인 발화에 대한 텍스트마이닝)

  • Park, Soo Kyung
    • Journal of Digital Convergence / v.20 no.2 / pp.47-56 / 2022
  • With the advent of online platforms, individuals have been able to trade small resources, such as a room, in the market. However, because there is no clear regulation of these economic activities, various side effects have emerged. Accordingly, the government reestablished related policies to resolve the unintended consequences of these activities. The policy has not yet been implemented, however, and many participants do not comply with it. This study therefore examines their perceptions in detail. For this purpose, a text mining technique was applied: posts and comments from major online communities were collected, and 5 topics were derived by topic modeling. Because compliance with the government's policy is a voluntary decision, an in-depth understanding of the policy targets is necessary. Based on this study, methods to induce participants to conform to the policy can be discussed in detail in the future.

A Chatter Bot for a Task-Oriented Dialogue System (목적지향 대화 시스템을 위한 챗봇 연구)

  • Huang, Jin-Xia;Kwon, Oh-Woog;Lee, Kyung-Soon;Kim, Young-Kil
    • KIPS Transactions on Software and Data Engineering / v.6 no.11 / pp.499-506 / 2017
  • Chatter bots are normally used in task-oriented dialogue systems to support free conversations. However, there is little research on how a chatter bot used as an auxiliary system should differ from an independent one. In this paper, we developed a chatter bot for a dialogue-based computer assisted language learning (DB-CALL) system. We compared the chatter bot in two different cases: as an independent bot and as an auxiliary system. The results showed that the chatter bot as an auxiliary system received much lower satisfaction than the independent one. The difference between an auxiliary chatter bot and an independent bot is discussed. In addition, we evaluated a search-based chatter bot and a deep learning based chatter bot, and we discuss the advantages and disadvantages of both methods.

Intonation Patterns of Korean Spontaneous Speech (한국어 자유 발화 음성의 억양 패턴)

  • Kim, Sun-Hee
    • Phonetics and Speech Sciences / v.1 no.4 / pp.85-94 / 2009
  • This paper investigates the intonation patterns of Korean spontaneous speech through an analysis of four dialogues in the domain of travel planning. The speech corpus, which is a subset of the spontaneous speech database recorded and distributed by ETRI, is labeled in APs and IPs based on the K-ToBI system using Momel, an intonation stylization algorithm. It was found that, unlike in English, a significant number of APs and IPs include hesitation lengthening, which is known to be a disfluency phenomenon due to speech planning. This paper also claims that hesitation lengthening is different from IP-final lengthening and that it should be treated as a new category, as it greatly affects the intonation patterns of the language. Except for the fact that 19.09% of APs show hesitation lengthening, spontaneous speech shows the same AP patterns as read speech, with a higher frequency of falling patterns such as LHL, whereas read speech shows more LH and LHLH patterns. The IP boundary tones of spontaneous speech show the same five patterns as in read speech (L%, HL%, LHL%, H%, and LH%) but with a higher frequency of rising patterns (H% and LH%) and contour tones (HL%, LH%, LHL%), while read speech shows a higher frequency of falling patterns and simple tones at the end of IPs.

Usability Testing for a Mobile Augmentative Alternative Communication(AAC) Software and Users' Preference for the Size of Mobile Devices (모바일 보완대체의사소통(AAC) 소프트웨어의 사용성 평가 및 모바일 기기의 크기에 대한 선호도 조사)

  • Lee, H-Y.;Hong, K-H.
    • Journal of rehabilitation welfare engineering & assistive technology / v.6 no.1 / pp.37-43 / 2012
  • We conducted user-centered usability testing of Android-based mobile Augmentative Alternative Communication (AAC) software. In this paper, we examined functionality, satisfaction, and ease of searching for information on a specific function using a task scenario, and we investigated the appropriateness of the development purposes, contents, instructional strategies, usability, management-mode functions, and user interface of the mobile AAC for the communication needs of children who are nonverbal. We also examined user requirements, preferences, satisfaction, and other personal opinions about the mobile AAC using open feedback. In addition, we investigated users' preferences for the size of mobile devices using 4.3", 5.0", and 7.0" devices.
