• 제목/요약/키워드: speech corpora

검색결과 39건 처리시간 0.02초

SiTEC의 공동 이용을 위한 음성 코퍼스 구축 현황 및 계획 (Current States and Future Plans at SiTEC for Speech Corpora for Common Use)

  • 김봉완;최대림;김영일;이광현;이용주
    • 대한음성학회지:말소리
    • /
    • 제46호
    • /
    • pp.175-185
    • /
    • 2003
  • To support speech information technology industry it is vital to create and distribute standardized speech corpora to be used for the development of products and technologies. In this article we introduce speech corpora created by Speech Information Technology & Industry Promotion Center(SiTEC) during its 1st and 2nd fiscal years (2001/5/1-2003/4/30) and plans for those corpora which is being created currently or will be created in near future. We introduce the corpus for car application to expand speech information technology to the field of traditional industry, the corpora for foreign languages to support exportation, the corpus for basic research for the sake of application in the industry, the corpora for common use, and others.

  • PDF

다양한 음성코퍼스의 통합관리시스템의 설계 및 구현에 관한 검토 (An Investigation for Design and Implementation of an Integrated Data Management System of Various Speech Corpora)

  • 황경훈;정창원;김영일;김봉완;이용주
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 10월 학술대회지
    • /
    • pp.69-72
    • /
    • 2003
  • In this paper, we investigate various factors that are relevant to design and implementation of an integrated management system for various speech corpora. The purpose of this paper is to manage an integrated management system for various kinds of speech corpora necessary for speech research and speech corpora consrtructed in different data formats. In addition, ways are considered to allow users to search with effect for speech corpora that meet various conditions which they want, and to allow them to add with ease corpora that are constructed newly. In order to achieve this goal, we design a global schema for an integrated management of new additional information without changing old speech corpora, and construct a web-based integrated management system based on the scheme that can be accessed without any temporal and spatial restrictions. And we show the steps by which these can be implemented, and describe related future study topics, examining the system.

  • PDF

다양한 음성코퍼스의 통합 관리시스템 구축 (Construction of Integration Management System of Various Speech Corpora)

  • 유경택;정창원;김도관;이용주
    • 한국컴퓨터정보학회논문지
    • /
    • 제11권1호
    • /
    • pp.259-271
    • /
    • 2006
  • 본 논문에서는 다양한 음성코퍼스의 통합 관리 시스템을 설계하고 구현하기 위한 여러 고려 사항들을 검토 하고자 한다. 본 논문의 목적은 음성 연구에 필요한 다양한 음성 데이터베이스의 종류 그리고 서로 다른 데이터 형태로 구축된 음성코퍼스를 통합적으로 관리하는데 있다. 또한, 부가적으로 사용자가 요청하는 다양한 조건에 맞는 음성 데이터들을 효과적으로 검색 가능하고 새로 구성된 음성코퍼스를 손쉽게 추가 할 수 있도록 고려하였다. 이를 위해 기존의 구축된 음성코퍼스의 수정 없이 새로운 정보를 통합 관리하기 위한 전역 스키마(global schema)를 설계하고, 이를 기반으로 시 공간의 제약 없이 액세스 할 수 있는 웹 기반의 통합 관리 시스템을 구축하였다. 끝으로 서비스에 포함된 수행 결과인 웹기반 인터페이스를 기술하고, 통합 관리 시스템을 구현하기 위해 인덱스 뷰를 사용한 효과성을 보인다.

  • PDF

대화형 코퍼스의 설계 및 구조적 문서화에 관한 연구 (A Study in Design and Construction of Structured Documents for Dialogue Corpus)

  • 강창규;남명우;양옥렬
    • 한국콘텐츠학회논문지
    • /
    • 제4권4호
    • /
    • pp.1-10
    • /
    • 2004
  • 음성인식의 연구 대상은 낭독음성에서 대화음성으로 발전해가고 있다. 이를 위해서는 대량의 대화코퍼스가 필요하다. 그러나 아직 충분한 양의 대화코퍼스가 구축되어 있지 못하며 코퍼스의 주석 정보 또한 복잡하고 다양하게 표현하고 있어 효율적인 활용이 어렵다. 따라서 본 논문에서는 TEI를 기반으로 하여 대화 영역을 텔레뱅킹으로 설정하고 대화코퍼스를 구축하여 구축된 대화코퍼스의 주석 정보를 XML(extensible Markup Language)로 표준화할 수 있도록 DTD (Document Type Definition) 정의하고 저장 시스템을 설계하였다.

  • PDF

SiTEC의 STiLL관련 음성 코퍼스의 구축 현황 (Creation of Speech Corpora for STiLL at SiTEC)

  • 김영일;김봉완;최대림;이광현;정은순;이용주
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 추계 학술대회 발표논문집
    • /
    • pp.13-16
    • /
    • 2005
  • As language learning that utilizes speech and information processing technology is getting popular. Speech Information Technology & Promotion Center(SiTEC) has created and is distributing speech corpora for STiLL in order to support basic research and development of products. We will introduce the corpus for Korean and those for English which we have created and are distributing.

  • PDF

자동차 환경에서의 노이즈 DB 및 한국어 음성 DB 구축 (Creation and Assessment of Korean Speech and Noise DB in Car Environments)

  • 이광현;김봉완;이용주
    • 대한음성학회지:말소리
    • /
    • 제48호
    • /
    • pp.141-153
    • /
    • 2003
  • Researches into robust recognition in noise environments, especially in car environments, are being carried out actively in speech community. In this paper we will report on three types of corpora that SiTEC (Speech Information TEchnology & industry promotion Center) has created for research into speech recognition in car noise environments. The first is the recordings of 900 Korean native speakers, distributed according to gender, age, and region, who uttered application words in car environments. The second is the collections of mixed noise in 3 car types by model while setting up various noise patterns which can be obtained with the car engine on or off, at different driving speed, and in different road conditions with windows open or closed. The third is the recordings of simulated speech by HATS (Head and Torso Simulator) in car environments with the internal and external noise factors added. These three types of recordings were all made through synchronized 8 channel microphones that are fixed in a car. The creation and applications of these corpora will be reported on in detail.

  • PDF

Vocabulary Analyzer Based on CEFR-J Wordlist for Self-Reflection (VACSR) Version 2

  • Yukiko Ohashi;Noriaki Katagiri;Takao Oshikiri
    • 아시아태평양코퍼스연구
    • /
    • 제4권2호
    • /
    • pp.75-87
    • /
    • 2023
  • This paper presents a revised version of the vocabulary analyzer for self-reflection (VACSR), called VACSR v.2.0. The initial version of the VACSR automatically analyzes the occurrences and the level of vocabulary items in the transcribed texts, indicating the frequency, the unused vocabulary items, and those not belonging to either scale. However, it overlooked words with multiple parts of speech due to their identical headword representations. It also needed to provide more explanatory result tables from different corpora. VACSR v.2.0 overcomes the limitations of its predecessor. First, unlike VACSR v.1, VACSR v.2.0 distinguishes words that are different parts of speech by syntactic parsing using Stanza, an open-source Python library. It enables the categorization of the same lexical items with multiple parts of speech. Second, VACSR v.2.0 overcomes the limited clarity of VACSR v.1 by providing precise result output tables. The updated software compares the occurrence of vocabulary items included in classroom corpora for each level of the Common European Framework of Reference-Japan (CEFR-J) wordlist. A pilot study utilizing VACSR v.2.0 showed that, after converting two English classes taught by a preservice English teacher into corpora, the headwords used mostly corresponded to CEFR-J level A1. In practice, VACSR v.2.0 will promote users' reflection on their vocabulary usage and can be applied to teacher training.

Momel을 이용한 한국어의 억양 연구 (A Study on Korean Intonation Using Momel)

  • 김선희;유현지;홍혜진;이호영
    • 대한음성학회지:말소리
    • /
    • 제63호
    • /
    • pp.85-100
    • /
    • 2007
  • This paper aims to propose how to extract intonation patterns using Momel, a pitch stylization algorithm, and to present results of analyzing speech corpora in comparison with those in earlier researches. Two speech corpora are used: one is the sound files obtained from the K-ToBI web site, and the other consists of 80 passages pronounced by 4 speakers (2 male and 2 female). The results show that Momel provides significant pitch targets which can be labeled as H and L tones within prosodic units such as Accentual Phrase (AP) and Intonation Phrase (IP). The resulting AP patterns and IP boundary tone patterns correspond to those in earlier researches. Thus, this study will contribute to the study of intonation as well as to the development of automatic intonation labeling systems.

  • PDF

Enhancement of a language model using two separate corpora of distinct characteristics

  • 조세형;정태선
    • 한국지능시스템학회논문지
    • /
    • 제14권3호
    • /
    • pp.357-362
    • /
    • 2004
  • 언어 모델은 음성 인식이나 필기체 문자 인식 등에서 다음 단어를 예측함으로써 인식률을 높이게 된다. 그러나 언어 모델은 그 도메인에 따라 모두 다르며 충분한 분량의 말뭉치를 수집하는 것이 거의 불가능하다. 본 논문에서는 N그램 방식의 언어모델을 구축함에 있어서 크기가 제한적인 말뭉치의 한계를 극복하기 위하여 두개의 말뭉치, 즉 소규모의 구어체 말뭉치와 대규모의 문어체 말뭉치의 통계를 이용하는 방법을 제시한다. 이 이론을 검증하기 위하여 수십만 단어 규모의 방송용 말뭉치에 수백만 이상의 신문 말뭉치를 결합하여 방송 스크립트에 대한 퍼플렉시티를 30% 향상시킨 결과를 획득하였다.