• Title/Summary/Keyword: 음성 발달

Search Result 297, Processing Time 0.025 seconds

Real-time Implementation of Fast LMS and MDF Algorithms using dSPACE board (dSPACE 보드를 이용한 고속 LMS와 MDF 알고리즘의 실시간 구현)

  • 조우근;정원용
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2000.08a
    • /
    • pp.149-152
    • /
    • 2000
  • 통신기술의 발달과 정보화 사회로 빠르게 변화되면서, 유선ㆍ무선, 핸즈프리, 원거리 화상회의 등의 다양한 방식의 통신이 이루어지고 있다. 음성통신의 어려운 문제 중에 하나는 주위의 소음이다. 소음은 상황에 따라서 다양하고 복잡하여 그 특성을 분석하기가 어렵다. 소음의 특성과 반향 등을 분석하기 위해서는 수 천 개의 적응필터 탭이 필요하게 된다. 따라서 실시간 소음제거를 위해서는 계산량이 많아 어려움이 따르므로 계산량 감소를 위해 FFT연산에 근거한 주파수 영역의 FDAF 적응필터를 이용하게 되었다. 하지만 계산량은 상당히 감소되었지만, 적응필터의 차수가 증가하면서 시간지연과 하드웨어적으로 복잡하게 되어 블록의 차수를 줄일 수 있는 MDF를 비교 검토하였다.

  • PDF

A Design and Implementation of Improving Children's Memory Application Based on Kinect Sensor (Kinect Sensor 기반의 아동 기억력 향상 애플리케이션 설계 및 구현)

  • Won Joo Lee;Gyeong Min Kim;Gi Jae Sin;Su Ji Kim;Seo Yeong Lee
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2023.01a
    • /
    • pp.53-54
    • /
    • 2023
  • 본 논문에서는 키넥트 센서 기반의 아동 기억력 향상 애플리케이션을 설계하고 구현한다. 이 애플리케이션은 유아층의 기억력을 향상시키고 팔 동작으로 소근육 발달에 도움을 주는 카드 짝 맞추기 게임의 기능을 구현한다. 카드 짝 맞추기 게임은 키넥트 센서에서 인식한 사용자의 스켈레톤, 뎁스스트림, 조인트, 음성 정보를 활용하여 플레이어의 오른손을 인식하여 카드를 뒤집고 짝이 맞는 경우는 그대로 두고 짝이 맞지 않는 경우에는 다시 뒤집는다. 사용자는 카드의 위치와 그림을 기억하며 16장의 카드를 모두 맞출때까지 계속 진행한다. 이 게임은 유아들이 재미있게 게임을 즐기면서 기억력을 향상시킬 수 있다.

  • PDF

An Integrated E-model Implementation for Speech Quality Measurement in VoIP and VoLTE (VoIP와 VoLTE 음성 품질 측정을 위한 통합 E-model 구현)

  • Kim, Bog-Soon;Baek, Kwang-Hyun;Cho, Gi-Hwan
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.7
    • /
    • pp.10-18
    • /
    • 2013
  • With advancing of mobile communication services and commercializing of VoLTE (Voice of LTE), it is getting to pay attention on QoS of VoLTE. This paper proposes an integrated E-model in which some factors influenced to service quality of VoIP and VoLTE based voice communication system are considered in calculating the voice quality of Wideband Codec. The model aims to calculate R value which reflects the situations of access network, network characteristics, terminals' usage and mobility. We mainly deal with the integrated E-model's structure, related algorithms and optimal parameters for VoLTE. Some experiments show that the voice quality difference between VoIP and VoiceChecker, and VoLTE and POLQA, is below 10%. With the proposed model, we can calculate the voice quality by making use of the factors directly affected to service quality and the environment of VoLTE terminal and network. As a result, we can estimate the service quality in advance, without measuring it in real wireless environment.

Jitter and Shimmer of the Deaf Voice (농자 음성의 주파수 변동율 및 진폭 변동율)

  • Ok-ran Jeong
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.7 no.1
    • /
    • pp.39-42
    • /
    • 1996
  • The present study analyed jitter and shimmer of the deaf in 4 different voicing conditions. Thirty-two male subjects and 27 female subjects participated in the study on a voluntary basis. The age ranged from 6 to 18 for male and 8 to 21 for female subjects. The subjects were either congenitally or prelingually deaf The four different voicing conditions included /a/ prolongation, counting, reading, and conversation. The experiment utilized CSL Visi-Pitch Model 6095(Kay Elemetrics Corp.) to sample and analyze the data. Both jitter and shimmer means were higher than the threshold values(normative data) reported. In addition, this investigation performed two separate 2-factor ANOVAs in order to determine if jitter and shimmer change as a function of gender and voicing condition. The results showed the following. First of all there was the gender effect on shimmer but not on jitter, in that male subjects 'shimmer was higher than females'. secondly, there was the voicing condition effect both on jitter and shimmer. /a/ prolongation and reading produced lower jitter than counting and conversation. /a/ prolongation produced lower shimmer than the remaining conditions. Finally, no interaction between gender and voicing condition existed.

  • PDF

Design of an Visitor Identification system for the Front Door of an Apartment using Deep learning (딥러닝 기반 이용한 공동주택현관문의 출입자 식별 시스템 설계)

  • Lee, Min-Hye;Mun, Hyung-Jin
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.4
    • /
    • pp.45-51
    • /
    • 2022
  • Fear of contact exists due to the prevention of the spread of infectious diseases such as COVID-19. When using the common entrance door of an apartment, access is possible only if the resident enters a password or obtains the resident's permission. There is the inconvenience of having to manually enter the number and password for the common entrance door to enter. Also, contactless entry is required due to COVID-19. Due to the development of ICT, users can be easily identified through the development of face recognition and voice recognition technology. The proposed method detects a visitor's face through a CCTV or camera attached to the common entrance door, recognizes the face, and identifies it as a registered resident. Then, based on the registered information of the resident, it is possible to operate without contact by interworking with the elevator on the server. In particular, if face recognition fails with a hat or mask, the visitor is identified by voice or additional authentication of the visitor is performed based on the voice message. It is possible to block the spread of contagiousness without leaving any contactless function and fingerprint information when entering and exiting the front door of an apartment house, and without the inconvenience of access.

Diadochokinetic Skills in Typically developing Children Aged 4-6 Years : Pilot Study (학령전기 정상발달 아동의 자모음 교대운동특성 : 예비연구)

  • Jeong, Han-Jin;Lee, Ok-Bun;Sehr, Kyeung-Hee
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.12 no.7
    • /
    • pp.3149-3155
    • /
    • 2011
  • The purpose of this study was to know the characteristics of DDK performance between CV(e.g. 'pa') and VV(e.g., 'ai') syllables in typically developing children aged 4 to 6 years old. 12 TD children performed DDK with CV structure(/pʰə/, /tʰə/, /kʰə/, /pʰətʰə/, /tʰəkʰə?/, /pʰətʰəkʰə/) and with VV structure(/ai/, /ɔi/, /ɑɔi/). Spoken syllables were counted in one second, and all spoken DDK were measured by PC-quirer. The results showed that all spoken DDK became faster as the age of children were increased. This trend was also appeared in both CV and VV syllables repetition. In addition, there was no differences in DDK rate with CV and VV syllables. The frequency of articulatory error during DDK performance was very high in the age of 3, and there was no pattern in the frequency of articulatory error according to the developmental age.

Cross-sectional perception studies of children's monosyllabic word by naive listeners (일반 청자의 아동 발화 단음절에 대한 교차 지각 분석)

  • Ha, Seunghee;So, Jungmin;Yoon, Tae-Jin
    • Phonetics and Speech Sciences
    • /
    • v.14 no.1
    • /
    • pp.21-28
    • /
    • 2022
  • Previous studies have provided important findings on children's speech production development. They have revealed that essentially all aspects of children's speech shift toward adult-like characteristics over time. Nevertheless, few studies have examined the perceptual aspects of children's speech tokens, as perceived by naive adult listeners. To fill the gap between children's production and adults' perception, we conducted cross-sectional perceptual studies of monosyllabic words produced by children aged two to six years. Monosyllabic words in the consonant-vowel-consonant form were extracted from children's speech samples and presented aurally to five listener groups (20 listeners in total). Generally, the agreement rate between children's production of target words and adult listeners' responses increases with age. The perceptual responses to tokens produced by two-year old children induced the largest discrepancies and the responses to words produced by six years olds agreed the most. Further analyses were conducted to identify the sources of disagreement, including the types of segments and syllable structure. This study makes an important contribution to our understanding of the development and perception of children's speech across age groups.

유아의 언어치료와 청각의 중요성

  • 김양희
    • Proceedings of the KSPS conference
    • /
    • 1996.02a
    • /
    • pp.1-2
    • /
    • 1996
  • 청각의 중요성은 새잠스럽게 말활 필요도 없고 농아가 말못하는 것은 누구나 다 알 수 있는 사실입니다. 그러나 음을 하나하나 습득하고 단어를 하나하나 반복하면서 언어습득을 시작하는 유년기에 있어서 청각은 더욱 독특한 역활을 합니다. 조국에 돌아와서 일하기 시작한지 일년이 조금 넘었으나 최초부터 우리 연구소에 찾아오는 어린이가 구주제국보다 훨씬 연소하고 또 수가 훨씬 많은 데 놀랐습니다. 그 중 대다수가 조음장애라든가 언어지연입니다. 더욱 놀라운 점은 이러한 장애가 정상지능의 어린이에게 많은 것입니다. 일반상식으로 어휘력과 발표력이 부족한 어린이들은 정신지체아와 혼동하게 됩니다. 연구소에 진단받으러 오는 어린이들을 체계적으로 청각 검사를 한 결과를 슬라이드를 통해서 말씀드리겠습나다. 검사받은 어린이 중 50-60%가 청각에 이상이 있는 것이 발견되었습니다. 동반한 어머니들은 너무나 놀라서 "우리 아이는 검사를 받았어요! 모두 정상이라고 그랬어요" 이 엄마 말씀도 정당하고 전검사도 정당활 것입니다. 그러나 이러한 어린이들의 문제는 특수합니다. 즉 경도난청에다 또 일시적 난청이기 때문에 명시에 생활하는 데는 큰 지장이 없고 때에 따라서는 청각이 거의 완전히 회복되고 또 몇 주후에 감기가 들거나 하면 다시 난청이 되는 것입니다. 이러한 난청문제가 일년에 3-4번씩 반복되어 어린이가 만 3-4세가 되면 약 1년간 청확한 음을 청취못한 셈이 됩니다. 조석에 기온차가 대단한 계절, 난방의 발달로 인하여 실내는 영상 24도이고 문 한겹만 열고 나가면 영하 10도 그 차이는 34도, 거리로 나가면 일산화탄소를 뿜고 쾌주 하는 차량, 버스나 트럭에셔 나오는 연기는 키가 작은 어린이 코속으로 직통하고 에어컨 시스댐으로 난방.냉방하는 지하상가, 백화점, 지하철 기타 대건물에는 바이러스 만연의 적절한 곳이 됩니다. 생리적 저항력이 없는 어린이들은 이러한 공해와 생활조건의 제일희생자가 되는 것입니다. 엄마들이 "얘는 감기, 비염, 편도선을 달고 삽니다...." "얘는 코감기, 목감기 번갈아 가면서 하도 앓고 있어서 양약율 중지하고 현재 한약을 먹고 있습니다." 이러한 역경은 극복할 수 있는가\ulcorner 질병의 메카니즘은 어떻게 작용되는가\ulcorner 등등을 육미회 센타에서 체험한 사례를 가지고 말씀드리고자 합니다.

  • PDF

A Computer Access System for the Physically Disabled Using Eye-Tracking and Speech Recognition (아이트래킹 및 음성인식 기술을 활용한 지체장애인 컴퓨터 접근 시스템)

  • Kwak, Seongeun;Kim, Isaac;Sim, Debora;Lee, Seung Hwan;Hwang, Sung Soo
    • Journal of the HCI Society of Korea
    • /
    • v.12 no.4
    • /
    • pp.5-15
    • /
    • 2017
  • Alternative computer access devices are one of the ways for the physically disabled to meet their desire to participate in social activities. Most of these devices provide access to computers by using their feet or heads. However, it is not easy to control the mouse by using their feet, head, etc. with physical disabilities. In this paper, we propose a computer access system for the physically disabled. The proposed system can move the mouse only by the user's gaze using the eye-tracking technology. The mouse can be clicked through the external button which is relatively easy to press, and the character can be inputted easily and quickly through the voice recognition. It also provides detailed functions such as mouse right-click, double-click, drag function, on-screen keyboard function, internet function, scroll function, etc.

On a Speech Coding Algorithm for Low Cost Implementation of Voice Telegram System (보이스 전보 시스템 구현을 위한 저가형 음성파형 부호화 알고리즘)

  • 나덕수;민소연;배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.2
    • /
    • pp.101-105
    • /
    • 2000
  • A telegram has been used to transmit the emergency news or celebration message. So, it has been very important media in our life. Although the telegram processing is more and more convenient, on the other hand, the telegram service contains only text message. The voice telegram is that delivering user's voice with text message. So, the voice telegram can be delivered sender's emotions and feelings. However, since voice information contains lots of data, large memory size and high cost processor are needed to deliver itself. In this paper, we proposed a new speech waveform coding method that has low complexity and low cost implementation for the voice telegram system. First, we fixed one basic speech waveform per pitch period and measured the waveform similarity between basic and neighbor speech waveform. Second, if the similarity satisfied threshold values, we compress the neighbor speech waveform with pitch and magnitude value per pitch period and if not, we save speech waveform. When the compression is about 45%, we obtained about 4 point in MOS.

  • PDF