• Title/Summary/Keyword: Voice Production

Search Result 141, Processing Time 0.021 seconds

Smart Mirror of Personal Environment using Voice Recognition (음성인식을 이용한 개인환경의 스마트 미러)

  • Yeo, Un-Chan;Park, Sin-Hoo;Moon, Jin-Wan;An, Seong-Won;Han, Yeong-Oh
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.14 no.1
    • /
    • pp.199-204
    • /
    • 2019
  • This paper introduces smart mirror that provides the contents needed for an individual's daily life. When a command that is designated as voice recognition is entered, Smart Mirror is produced that outputs desired contents from a display. The contents of the current smart mirror include time, weather, subway information, schedule and photography. Smart mirror sold for commercial private households is difficult to distribute due to high prices, but the smart mirror production presented in this paper can lower the manufacturing cost and can be more easily used by voice recognition.

The comparative Study of the Acoustic Representation between Pansori singer's and Spasmodic dysphonia patient's Voice (병적인 소리 떨림증과 소리꾼 떨림증의 음향학적인 비교연구)

  • Hong, K.H.;Kim, H.G.;Lee, J.K.;Choi, J.S.
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.143-145
    • /
    • 2007
  • Muscle groups that are located in and around the vocal tract can produce audible changes in frequency and/or intensity of the voice. Vocal vibrato is a characteristic feature in the singing of performers trained in the western classical tradition and vibrato is generally considered to result from modulation in frequency amplitude and timbre. Vocal tremor is also characterized by periodic fluctuations in the voice frequency or intensity and vocal tremor is symptom of a neurological disease as Spasmodic dysphonia , Parkinson's disease. Vocal vibrato and Vocal tremor may have many of the same origins and mechanisms in the voice production systems. The purpose of this study is to find acostic character of Korean traditional song Pansori singer's vibrato and Spasmodic dysphonia patient's vocal tremor. twelve Pansori singers and seven Spasmodic dysponia patients participated to this study. Power spectrum and Real time Spectrogram are used to analyze the acoustic characteristics of Pansori singing and Spasmodic dysphonia patient's voice The results are as follows; First, vowel formant differences between Pansori singing and Spasmodic dysphonia patient's voice are higher F1, F3. Second, The vibrato rate show differences between Pansori singing and Spasmodic dysphonia patients;$4^{\sim}6/sec$ and $5{\sim}6/sec$ Vibrato rate of pitch is 5.7 Hz ${\sim}$ 42.4 Hz for Pansori singing , 3.8 Hz ${\sim}$ 27.9 Hz for Spasmodic dysphonia patients ;Vibrato rate of intensity range is 0.07 dB ${\sim}$ 8.26 dB for Pansori singing and 0.07 dB ${\sim}$ 4.81 dB for Spasmodic dysphonia patients

  • PDF

The Analysis of Voice after Vertical Partial Laryngectomy with Mucosal Flap and Fat Graft Reconstruction (수직후두부분절제술 및 점막 피판과 지방 이식을 통한 성대 재건술 후의 음성분석)

  • Chu, Hyung-Ro;Choi, In-Ja;Kim, Jin-Hwan;Ahn, Hwoe-Young;Rho, Young-Soo
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.18 no.2
    • /
    • pp.134-137
    • /
    • 2007
  • Background and Objectives: The goals of laryngeal reconstruction have been prevention of aspiration, production of a functional voice, and maintenance of an adequate airway for decannulation. It is generally believed that the reconstruction of the glottic region after vertical partial laryngectomy (VPL) can improve laryngeal function. The objective of this study is to evaluate of voice function after VPL with mucosal flap and fat graft reconstruction. Materials and Methods: From 1994 to 2006, 13 patients, who had been treated with VPL with mucosal flap and fat graft reconstruction. The voice characteristics, acoustic, aerodynamic parameter were measured in 13 patients after vertical partial laryngectomy with mucosal flap and fat graft reconstruction. Acoustic analysis was carried out using Computerized Speech Lab (CSL) and aerodynamic analysis were carried out using Aerophon II,3 months and 12 months after surgery. Results: The GRBAS scale, jitter, shimmer, NHR were improved as time goes on after surgery. But, maximum phonation time was shortened after surgery and there is no significant differences between before and after surgery in mean flow rate. Conclusion: The voice function of the mucosal flap and fat graft reconstruction after VPL were satisfactory. This can be an excellent reconstruction method after vertical partial laryngectomy.

  • PDF

Domestic Development of Vibrational Film Forming Machine and Die and Mold in the High Speed Production(I) - Single production forming machine - (고속 생산형 필름 진동판 성형기 및 금형 국산화 개발(I) - 단수 생산 진동판 성형기 -)

  • Kim, Jung-Hyun
    • Journal of the Korean Society of Manufacturing Process Engineers
    • /
    • v.11 no.6
    • /
    • pp.9-15
    • /
    • 2012
  • Vibrational film has been more employed in ear-phones or small type of speakers along with a wide use of portable multi-media equipments such as MP3 and MP4. However, the current hand work production process of diaphragms is inefficient. In this study, a die-and-mold and a single production forming machine are developed, and they result in a multi-production forming machine. The multi-production forming machine consists primarily of a film feeding unit and an unwinding unit. A vacuum suction device provides the film feeding unit, while the unwinding unit is obtained using an appropriate damper. The advantage of the developed single production forming machine is shown according to a proper voice test.

Analysis of Singing Technique of Mongolian Traditional Singing Called Khoomei (몽골 전통 발성 흐미의 발성 방법 분석에 대한 사례연구)

  • Nam, Do-Hyun;Paik, Jae-Yeon;Hwang, Yoen-Shin;Choi, Hong-Shik
    • Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.145-156
    • /
    • 2008
  • The goal of this study was to investigate acoustic and physiologic characteristics of two phonation types of 'Khoomei' which is a traditional singing style of people who live around the Altai mountains or Mongolia region. It can be produced two pitches simultaneously - high melody pitch can be perceived along with a low drone pitch. Sygyt and kargyraa styles are the most popular and identifiable styles and they can be recognized as the different sounds depending on the method of voice production. Two trained Mongolians participated and have used at least 5 - 6 years. The characteristics of this voice production were measured by using flexible fiberscope, Stroboscopy, Lx Speech studio, Spead, and Doctor Speech. In Sygyt style, very high vocal fold closure (71.50%) with both true and false vocal folds contact and strong breathing support was observed. They also showed that tongue height and harmonics were increased (around 10dB) with resonance cavity movement. In contrast, it was found that Kargyraa sound had very low pitch with relaxed stomach, less laryngeal tension and lower vocal fold contact (69.50%) than hard Sygyt style sound without raising the tongue during phonation. 'Khoomei' phonation can be made by strong contact of both true and false vocal folds and by increasing the harmonics as well.

  • PDF

A Case of Thyroid Cartilage Fracture with Vocal Cord Paralysis (갑상연골 골절로 인한 성대마비의 치험례)

  • 조진규;차창일;안회영;조중생;홍남표
    • Proceedings of the KOR-BRONCHOESO Conference
    • /
    • 1983.05a
    • /
    • pp.14.2-14
    • /
    • 1983
  • Complications and sequelae of the laryngeal trauma are respiratory difficulties, edema or swelling, cellulitis or abscess, fistula, perichondrium and chondritis, chronic laryngeal stenosis, vocal cord paralysis, decannulation difficulty, and impaired voice production etc. Generally, the treatment of laryngeal injuries consists of initial tracheostomy for adequate airway and later surgical intervention for its complications and sequelae. Recently, authors experienced a case of closed laryngeal injury with thyroid cartilage fracture, left vocal cord paralysis, swallowing difficulty and right clavicular fracture owing to automobile accident. With reconstructive surgery for thyroid cartilage fracture, we established an adequate airway, improved swallowing function and better voice production.

  • PDF

The effects of length of residence (LOR) on voice onset time (VOT)

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.9-17
    • /
    • 2020
  • Changes in the first language (L1) sound system as a result of acquiring a second language (L2) (i.e., phonetic drift) have received considerable attention from a variety of speakers, settings, and environments. Less attention has been given to phonetic drift in adult speakers' L2 learning as their length of residence in America (LOR) increases. This study examines the effects of LOR on voice onset time (VOT) in L1 Korean stops. Three different groups of Korean adult learners of L2 English were compared to assess how malleable their L1 representations are in terms of LOR and whether there is any relationship between L1 change and L2 acquisition. The results showed that the effect of LOR was linguistically unimportant in the production of Korean stops. However, VOT merger as evidence of sound change in Korean stops were robust in the speech production of most of the female speakers across the groups. The results suggest that L2 English may not be the primary cause of L1 sound change. For generalizability, further study is necessary to see whether other acoustic cues show a similar pattern.

L1-L2 Transfer in VOT and f0 Production by Korean English Learners: L1 Sound Change and L2 Stop Production

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences
    • /
    • v.4 no.3
    • /
    • pp.31-41
    • /
    • 2012
  • Recent studies have shown that the stop system of Korean is undergoing a sound change in terms of the two acoustic parameters, voice onset time (VOT) and fundamental frequency (f0). Because of a VOT merger of a consonantal opposition and onset-f0 interaction, the relative importance of the two parameters has been changing in Korean where f0 is a primary cue and VOT is a secondary cue in distinguishing lax from aspirated stops in speech production as well as perception. In English, however, VOT is a primary cue and f0 is a secondary cue in contrasting voiced and voiceless stops. This study examines how Korean English learners use the two acoustic parameters of L1 in producing L2 English stops and whether the sound change of acoustic parameters in L1 affects L2 speech production. The data were collected from six adult Korean English learners. Results show that Korean English learners use not only VOT but also f0 to contrast L2 voiced and voiceless stops. However, unlike VOT variations among speakers, the magnitude effect of onset consonants on f0 in L2 English was steady and robust, indicating that f0 also plays an important role in contrasting the [voice] contrast in L2 English. The results suggest that the important role of f0 in contrasting lax and aspirated stops in L1 Korean is transferred to the contrast of voiced and voiceless stops in L2 English. The results imply that, for Korean English learners, f0 rather than VOT will play an important perceptual cue in contrasting voiced and voiceless stops in L2 English.

Plan for the Development of a Standardized Dummy for Persons in Need of Rescue in a Confined Space (밀폐공간 구조 요구자를 위한 더미 표준화 개발 방안)

  • Choi, Seo-Yeon;Rie, Dong-Ho;Kim, Hyung-Jun
    • Journal of the Korea Safety Management & Science
    • /
    • v.18 no.4
    • /
    • pp.99-105
    • /
    • 2016
  • This study was conducted to develop a dummy in an environment similar to the human body, to prepare a standard for evaluation and to present the process of the production in order to evaluate the performance of the robot that can detect the persons needing rescue in a confined space, who are difficult for fire-fighting officials to rescue in case of fire and disaster. As a result, a standard for evaluation was developed and standardized into four parts 'Normal,' 'Risk Stage 1,' 'Risk Stage 2' and 'Risk Stage 3'based on the number of breath cycles, carbon dioxide concentration, core temperature and criteria for hearing to recognize the voice. In addition, in order to produce a dummy, fever, breathing capacity and voice output function were compared and analyzed. This study has significance that it built up basic data of the method of producing the actual dummy, by presenting characteristics and controlling methods using the waterproof insulation heating coil for the function, solenoid valve for the consecutive output of breathing capacity and USB program sound board for voice output.

A perception-based analysis of voice onset time (VOT) dissimilation in Korean

  • Hijo Kang;Mira Oh
    • Phonetics and Speech Sciences
    • /
    • v.16 no.1
    • /
    • pp.25-31
    • /
    • 2024
  • This study examines the perceptual motivation behind dissimilation. Consistent with previous arguments suggesting that dissimilation originates from perception rather than production (Coetzee, 2005; Kiparsky, 2003; Scheer, 2013), we hypothesized that an oral stop with short of voice onset time (VOT) would be recognized as non-aspirated more often when it is followed by an aspirated stop with a long VOT. This hypothesis was tested through a perception experiment in which 32 Korean listeners made judgments on the first consonant of C1VC2V words manipulated with C1 VOT and C2 types. The results revealed that aspirated-based C1 was recognized as aspirated or tense depending on the duration of VOT, while lenis-based C1 was consistently recognized as lenis. The dissimilatory effect of aspirated C2 was confirmed as anticipated, and furthermore, tense C2 increased the ratio of tense responses more than aspirated C2. These results provide evidence of a perceptual bias against recurrent aspirated stops, which may play a role in activating a dissimilatory rule or constraint in a language. The assimilatory effect of tense C2 is in consistent with findings indicating that word-initial tensification is facilitated by the following tense stop in Korean (Kang & Oh, 2016; H. Kim, 2016).