• Title/Summary/Keyword: Voice function

Search Result 434, Processing Time 0.03 seconds

Chaucer's Storytelling: The Clerk's Tale in Terms of Bakhtin's Concept (초서의 이야기하기 -바흐친의 개념을 통해 본 「서생의 이야기」)

  • Lee, Dongchoon
    • Journal of English Language & Literature
    • /
    • v.53 no.2
    • /
    • pp.281-306
    • /
    • 2007
  • M. M. Bakhtin's dialogic concept of multi-voiced discourse allows us to open up the text of The Clerk's Tale and to account for its radical heterogeneity. Once we recognize the multi-voiced character of The Clerk's Tale, then what was heretofore regarded as discontinuous or ignored can be seen as the clash of several different world-views. Such a conceptual framework gives an added depth and scope to such thematic subjects as sovereignty, the status of women, and rhetorical style. There are three different and antagonistic voices involved in the tale's narration. These voices project different viewpoints or world-views, and they consequently engage each other in a polemic debate. Their relationship with each other is discontinuous and dialectical rather than continuous and harmonious. The first voice is the Petrarchan voice of moral allegory, which is the voice of tradition, authority, and high seriousness. This voice of moral allegory regards the story of Griselda as an exemplum of spiritual constancy and virtuous suffering. The second voice is the Clerkly voice of pathos based on human experience and feeling. This voice is defined by the Clerk's asides and apostrophes interspersed in the narrative proper, which function to engage the Petrarchan voice in a polemical debate. The third voice is the voice of parody, nominally identified with Chaucer the poet, which is located in the second ending, including Envoy. Whereas the other two voices are earnest and serious, the voice of parody is irrelevant, playful and antagonistic to both the Petrarchan voice of moral allegory and the Clerkly voice of secular humility.

Bilingual Voice Conversion Using Frequency Warping on Formant Space (포만트 공간에서의 주파수 변환을 이용한 이중 언어 음성 변환 연구)

  • Chae, Yi-Geun;Yun, Young-Sun;Jung, Jin Man;Eun, Seongbae
    • Phonetics and Speech Sciences
    • /
    • v.6 no.4
    • /
    • pp.133-139
    • /
    • 2014
  • This paper describes several approaches to transform a speaker's individuality to another's individuality using frequency warping between bilingual formant frequencies on different language environments. The proposed methods are simple and intuitive voice conversion algorithms that do not use training data between different languages. The approaches find the warping function from source speaker's frequency to target speaker's frequency on formant space. The formant space comprises four representative monophthongs for each language. The warping functions can be represented by piecewise linear equations, inverse matrix. The used features are pure frequency components including magnitudes, phases, and line spectral frequencies (LSF). The experiments show that the LSF-based voice conversion methods give better performance than other methods.

Voice conversion using low dimensional vector mapping (낮은 차원의 벡터 변환을 통한 음성 변환)

  • Lee, Kee-Seung;Doh, Won;Youn, Dae-Hee
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.35S no.4
    • /
    • pp.118-127
    • /
    • 1998
  • In this paper, we propose a voice personality transformation method which makes one person's voice sound like another person's voice. In order to transform the voice personality, vocal tract transfer function is used as a transformation parameter. Comparing with previous methods, the proposed method can obtain high-quality transformed speech with low computational complexity. Conversion between the vocal tract transfer functions is implemented by a linear mapping based on soft clustering. In this process, mean LPC cepstrum coefficients and mean removed LPC cepstrum modeled by the low dimensional vector are used as transformation parameters. To evaluate the performance of the proposed method, mapping rules are generated from 61 Korean words uttered by two male and one female speakers. These rules are then applied to 9 sentences uttered by the same persons, and objective evaluation and subjective listening tests for the transformed speech are performed.

  • PDF

GMM based Nonlinear Transformation Methods for Voice Conversion

  • Vu, Hoang-Gia;Bae, Jae-Hyun;Oh, Yung-Hwan
    • Proceedings of the KSPS conference
    • /
    • 2005.11a
    • /
    • pp.67-70
    • /
    • 2005
  • Voice conversion (VC) is a technique for modifying the speech signal of a source speaker so that it sounds as if it is spoken by a target speaker. Most previous VC approaches used a linear transformation function based on GMM to convert the source spectral envelope to the target spectral envelope. In this paper, we propose several nonlinear GMM-based transformation functions in an attempt to deal with the over-smoothing effect of linear transformation. In order to obtain high-quality modifications of speech signals our VC system is implemented using the Harmonic plus Noise Model (HNM)analysis/synthesis framework. Experimental results are reported on the English corpus, MOCHA-TlMlT.

  • PDF

Control of a welfare liferobot guided by voice commands

  • Han, Seong-Ho;Yoshihiro, Takita
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.47.3-47
    • /
    • 2001
  • This paper describes the control of a health care robot (called Welfare Liferobot) with voice commands. The welfare liferobot is an intelligent autonomous mobile robot with its own control system on-board and the set of sensors to perceive an environment. It is a natural way to control the welfare liferobot by use of voice command for the usage of keyboard and mouse may present a difficult problem to the elderly and the handicapped. Voice input as the main control modality can offer many advantages. A set of oral commands is included, and each command has its associated function. These control words (commands) have to be chosen by user. Each time a voice command is recognized by the robot, it executes the pre-assigned action ...

  • PDF

The Acoustic Severity Index in the Pathologic Voice (음성장애에 대한 음향학적 중등도 지표)

  • Hong, Ki-Hwan;Kim, Hyun-Ki;Yang, Yoon-Soo
    • Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.201-219
    • /
    • 2003
  • Background: The perceptual assessment is generally performed by the voice specialist. The objective evaluation is performed in a voice laboratory. Research in voice laboratories has generated a variety of different objective tests and parameters. The perceptual evaluation is one of the most controversial topics in voice research. Review of literature reveals a wide variety of rating scales and reliability data fluctuating from study to study. Unfortunately, there is no widely accepted valid method for classifying voice disorders and assessing outcome after voice treatment. Objectives: The goals of this research were to identify important objective acoustic parameters of vocal quality, and to establish an objective and quantitative correlate of the perceived vocal quality. Materials and Methods : We evaluated the voice analyzed data from 122 dysphonic patients and 20 normal volunteers. A computerized speech lab. 4300B(CSL) was used to carry out the analysis of each voice sample. Results: Three dysphonia severity indices(DSI) were created using discriminant analysis. DSI is based on the weighted combination of the following selected set of acoustic parameters: absolute jitter(Jita in us), smoothed pitch period perturbation (sPPQ in %), amplitude perturbation quotient(APQ in %), soft phonation index(SPI), average fundamental frequency(Fo in Hz), lowest fundamental frequency(Flo in Hz), and smoothed amplitude perturbation quotient(sAPQ in %). The DSI, being the discriminating rule calculated by the logistic regression, consists of three equation based on statistically significant acoustic parameters. Three DSI were created to reflects best the degree of hoarseness as expressed by G from the GRBAS scale. The more positive this DSI is for a patient, the worse the vocal quality. The more it is negative, the better it is. The effect of sex is included implicitly in the DSI-1 and DSI-2, so that a separate DSI-1 and DSI-2 for males and females need not be used. The DSI is objective because no perceptual input is required for its calculation. Conculsion : This research demonstrates that the voice function values calculated from three different multivariate objective dysphonia severity indices are significantly associated with subjective voice assessments. These multivariate objective dysphonia severity indices may be appropriate for use in clinical trials and outcomes research on treatment effectiveness for voice disorders.

  • PDF

Intelligent Steering Control System Based on Voice Instructions

  • Seo, Ki-Yeol;Oh, Se-Woong;Suh, Sang-Hyun;Park, Gyei-Kark
    • International Journal of Control, Automation, and Systems
    • /
    • v.5 no.5
    • /
    • pp.539-546
    • /
    • 2007
  • The important field of research in ship operation is related to the high efficiency of transportation, the convenience of maneuvering ships and the safety of navigation. For these purposes, many intelligent technologies for ship automation have been required and studied. In this paper, we propose an intelligent voice instruction-based learning (VIBL) method and discuss the building of a ship's steering control system based on this method. The VIBL system concretely consists of two functions: a text conversion function where an instructor's inputted voice is recognized and converted to text, and a linguistic instruction based learning function where the text instruction is understood through a searching process of given meaning elements. As a study method, the fuzzy theory is adopted to build maneuvering models of steersmen and then the existing LIBL is improved and combined with the voice recognition technology to propose the VIBL. The ship steering control system combined with VIBL is tested in a ship maneuvering simulator and its validity is shown.

Vibration Analysis of Micro Speaker Diaphragm (마이크로 스피커 다이어프램의 진동해석)

  • Hong, D.K.;Woo, B.C.;Ahn, C.W.;Han, G.J.
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2005.05a
    • /
    • pp.551-554
    • /
    • 2005
  • This study uses a characteristic function to explain correlations between the objective function and design variables. Analysis of means and table of orthogonal array were carried out. The change of shape of diaphragm, thickness of diaphragm and voice coil weight based on the table of orthogonal array is made. Therefore this study carried to decide shape of diaphragm, voice coil weight and thickness of diaphragm for minimizing 1st natural frequency and maximizing 2nd natural frequency of diaphragm using design of experiments and characteristic function with constraints. we showed improved design factors that minimized 1st natural frequency and maximized 2nd natural frequency of diaphragm.

  • PDF

Voice Handicap Index and Voice-Related Quality of Life in Idiopathic Parkinson's Disease (파킨슨병 환자의 음성장애지수 및 음성관련 삶의 질 연구)

  • Yu, Gyung;Jang, Insoo;Kim, Lakhyung
    • Journal of Oriental Neuropsychiatry
    • /
    • v.24 no.2
    • /
    • pp.155-162
    • /
    • 2013
  • Objectives : The purpose of this study is to evaluate the voice handicaps of the idiopathic Parkinson's Diseases (PD) and their voice-related quality of life. Methods : Voice handicap index-10 (VHI-10) and Voice related Quality of Life were completed by 17 idiopathic PD patients, and Unified Parkinson's Disease Rating Scale (UPDRS) part I, II, III were assessed. The relations between VHI-10, VRQOL and UPDRS scores were analysed. Results : VHI-10 score of PD patients was $14.35{\pm}8.07$ and VRQOL total score of PD patients was $59.12{\pm}20.25$, social-emotional $59.93{\pm}20.50$, physical function $58.58{\pm}21.77$. There were significant relations between VHI-10, VRQOL score and UPDRS II (activities of daily living). Conclusions : These results suggest that voice impairments affect the daily living of PD patients and their quality of lives.

The Effect of An Increase of Closed Quotient on Improvement of Voice Quality after Type I Thyroplasty in Patients with Unilateral Vocal Cord Paralysis (일측 성대마비 환자에서 성대내전술 후 성대접촉율의 증가가 음질 개선에 미치는 영향)

  • Kim, Han-Su;Choi, Seung-Hee;Lim, Jae-Yol;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.16-20
    • /
    • 2004
  • Purpose : To assess perceptual, acoustic and aerodynamic measure of voice quality in patients with unilateral vocal cord paralysis before and after type I thyroplasty. Methods : The clinical records of patients operated type I thyroplasty in the Departement of otorhinoalryngolgy, Yongdong Severance hospital from November 2001 to November 2003 were reviewed. All patients uderwent a vocal function evaluation including perceptual, acoustic and aerodynamic measures of voice preoperative and on $60^{th}$ postoperative day. The perceptual and acoustic measures were obtained from recording of patients' reading a 'Sanchak' passage. The perceptual evaluation was performed by 2 speech pathologist using a 4-point rating scale. Acoustic parameters(voice range profile low(RAL), voice range profile high(RAH), average fundamental frequency(AFX), closed quotient, harmonic to noise ratio, jitter and shimmer) were investigated by Lx speech studio. Mean flow rate(MFR), subglottic pressure(Psub) and intensity were measured using the Phonatory function analyzer. The maximum phonation time was also measured. The data were statistically analyzed. A paired t-test (p<0.1) was used to compare preoperative and postoperative results. And multiple regression test was used to find which parameter was most correlated to improvement of postoperative voice quality. Results : Among aerodynamic parameters, Psub $(88.11mmH_2O{\rightarrow}58.7mmH_2O)$, MPT(7.87sec${\rightarrow}$12.53sec), MFR (359.8ml/sec${\rightarrow}$161.06ml/sec) were statistically improved. AFx(205.5Hz${\rightarrow}$163.27Hz), AQx(23.9%${\rightarrow}$48.3%), RAL, RAH. Jotter and shimmer were improved. In multiple regression test, AFx and AQx was noted as the two meost correlated parameters to improvement of postoperative breathiness. But general grade of voice quality was more correlated to Psub and shimmer. Conclusion : Vocal fold medialization procedures effectively reduce glottic gap. Increasing of contact area of both vocal folds induced improvement in aerodynamic parameters and leaded stabilizing of vocal fold vibration. That effect results in improvement in acoustic parameters (shimmer, jitter, signal-to-noise ratio, voice range profile) and voice quality.

  • PDF