• Title/Summary/Keyword: speech task

Search results: 316

Performance in a phonological deletion awareness task according to age and gender : Development of a phonological awareness screening test for preschool children (연령과 성에 따른 음운인식 탈락과제 수행력 : 학령전기 아동을 위한 음운인식 선별검사 개발)

  • Kim, Soo Jin;Oh, Gyung Ah;Seo, Eun Young;Ko, Yoo Kyeong
    • Phonetics and Speech Sciences / v.10 no.2 / pp.61-68 / 2018
  • Phonological awareness, or consciousness of speech sounds and operational skill with them, develops in the order word > syllable > phoneme between the ages of four and seven. Among the various types of phonological awareness tasks, the deletion task is more difficult because it requires manipulating and deleting sounds within words. Performance on this task also correlates highly with reading proficiency. This study administered a 20-question deletion task to typically developing four- to six-year-old children (N = 90) to see how this operational skill develops with age and gender. The results showed that phonological awareness performance improved with age. This age effect was not accompanied by a gender effect; age and gender interacted. The study confirmed the development of phonological awareness in typically developing four- to six-year-old children. The deletion task can be used to effectively detect the risk of difficulties with phonological awareness in preschoolers with speech, language, and reading problems.

A study on the speech feature extraction based on the hearing model (청각 모델에 기초한 음성 특징 추출에 관한 연구)

  • 김바울;윤석현;홍광석;박병철
    • Journal of the Korean Institute of Telematics and Electronics B / v.33B no.4 / pp.131-140 / 1996
  • In this paper, we propose a method that extracts speech features using a hearing model through signal processing techniques. The proposed method consists of the following steps: normalization of the short-time speech block by its maximum value, multi-resolution analysis using the discrete wavelet transform and re-synthesis using the inverse discrete wavelet transform, differentiation after analysis and synthesis, full-wave rectification, and integration. To verify the performance of the proposed speech feature in a speech recognition task, Korean digit recognition experiments were carried out using both DTW and VQ-HMM. With DTW, the recognition rates were 99.79% and 90.33% for the speaker-dependent and speaker-independent tasks, respectively; with VQ-HMM, the rates were 96.5% and 81.5%, respectively. These results indicate that the proposed speech feature has the potential to serve as a simple and efficient feature for recognition tasks.

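The processing chain described in the abstract above (normalization, wavelet analysis and re-synthesis, differentiation, full-wave rectification, integration) can be sketched in a few lines. This is only an illustrative sketch, not the authors' implementation: the Haar wavelet, the three decomposition levels, normalization by the maximum absolute value, and the choice of one integrated value per band are all assumptions made here, and the function names are hypothetical.

```python
import math

def haar_dwt(x):
    """One level of the orthonormal Haar DWT: returns (approximation, detail)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail

def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleaves the reconstructed sample pairs."""
    s = math.sqrt(2.0)
    x = []
    for a, d in zip(approx, detail):
        x.extend(((a + d) / s, (a - d) / s))
    return x

def band_signals(x, levels):
    """Split x into levels + 1 time-domain band signals that sum back to x.

    len(x) must be a multiple of 2 ** levels (assumption of this sketch).
    """
    approx, details = list(x), []
    for _ in range(levels):
        approx, d = haar_dwt(approx)
        details.append(d)
    bands = []
    # Re-synthesize one band at a time by zeroing all other coefficients.
    for keep in range(levels + 1):
        a = approx if keep == levels else [0.0] * len(approx)
        for lvl in range(levels - 1, -1, -1):
            d = details[lvl] if keep == lvl else [0.0] * len(details[lvl])
            a = haar_idwt(a, d)
        bands.append(a)
    return bands

def hearing_model_features(block, levels=3):
    """Return one feature per band for a short-time speech block."""
    # Step 1: normalize the block by its maximum absolute value (assumed).
    peak = max(abs(v) for v in block)
    scale = peak if peak > 0 else 1.0
    normalized = [v / scale for v in block]
    feats = []
    # Step 2: multi-resolution analysis and per-band re-synthesis.
    for band in band_signals(normalized, levels):
        diff = [b - a for a, b in zip(band, band[1:])]  # Step 3: differentiate
        rectified = [abs(v) for v in diff]              # Step 4: full-wave rectify
        feats.append(sum(rectified))                    # Step 5: integrate
    return feats

# Synthetic block used only for illustration (not speech data from the paper).
block = [math.sin(0.3 * n) + 0.5 * math.sin(2.1 * n) for n in range(64)]
print(hearing_model_features(block))
```

Because of the normalization step, scaling the input block by a constant leaves the features essentially unchanged, which is one plausible reason for normalizing each block before analysis.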

Correlation between Physical Fatigue and Speech Signals (육체피로와 음성신호와의 상관관계)

  • Kim, Taehun;Kwon, Chulhong
    • Phonetics and Speech Sciences / v.7 no.1 / pp.11-17 / 2015
  • This paper deals with the correlation between physical fatigue and speech signals. A treadmill task to induce fatigue and a subjective questionnaire for rating tiredness were designed. The questionnaire results and the collected bio-signals showed that the designed task imposes physical fatigue. A paired-samples t-test between the speech signals and fatigue showed that the parameters significantly related to fatigue are fundamental frequency, first and second formant frequencies, long-term average spectral slope, smoothed pitch perturbation quotient, relative average perturbation, pitch perturbation quotient, cepstral peak prominence, and harmonics-to-noise ratio. The experimental results indicate that, as physical fatigue accumulates, the mouth opens less and the voice becomes breathy.

Disfluencies and Speech Rates of Standard Korean Speakers in Story-telling and Reading Contexts

  • Shim, Hong-Im;Chon, Hee-Cheong;Ko, Do-Heung
    • Speech Sciences / v.12 no.1 / pp.45-51 / 2005
  • The purpose of this study is to compare the disfluencies and speech rates (overall speech rate and articulation rate) of normal adult speakers of standard Korean across two different speech tasks (story-telling and text-reading). Participants were 100 Korean adult speakers. The results are summarized as follows: First, the most frequent type of disfluency in the story-telling task was 'interjection', whereas that in the text-reading task was 'revision'. Second, the overall speech rates (syllables per second and syllables per minute) differed significantly between the speech tasks. Third, the articulation rates (syllables per second and syllables per minute) also differed significantly between the speech tasks.


The Locus of the Word Frequency Effect in Speech Production: Evidence from the Picture-word Interference Task (말소리 산출에서 단어빈도효과의 위치 : 그림-단어간섭과제에서 나온 증거)

  • Koo, Min-Mo;Nam, Ki-Chun
    • MALSORI / no.62 / pp.51-68 / 2007
  • Two experiments were conducted to determine the exact locus of the frequency effect in speech production. Experiment 1 addressed the question of whether the word frequency effect arises at the stage of lemma selection. A picture-word interference task was performed to test the significance of interactions between the effects of target frequency, distractor frequency, and semantic relatedness. There were significant interactions between distractor frequency and semantic relatedness and between target frequency and distractor frequency. Experiment 2 examined whether the word frequency effect is attributable to the lexeme level, which represents the phonological information of words. The methodological logic of Experiment 2 was the same as that of Experiment 1. There was no significant interaction between distractor frequency and phonological relatedness. These results demonstrate that word frequency influences the processes involved in selecting the correct lemma corresponding to an activated lexical concept in speech production.


Transformer-based transfer learning and multi-task learning for improving the performance of speech emotion recognition (음성감정인식 성능 향상을 위한 트랜스포머 기반 전이학습 및 다중작업학습)

  • Park, Sunchan;Kim, Hyung Soon
    • The Journal of the Acoustical Society of Korea / v.40 no.5 / pp.515-522 / 2021
  • It is hard to prepare sufficient training data for speech emotion recognition due to the difficulty of emotion labeling. In this paper, we apply transfer learning with large-scale speech recognition training data to a transformer-based model to improve the performance of speech emotion recognition. In addition, we propose a method that utilizes context information without decoding, through multi-task learning with speech recognition. In speech emotion recognition experiments using the IEMOCAP dataset, our model achieves a weighted accuracy of 70.6% and an unweighted accuracy of 71.6%, which shows that the proposed method is effective in improving the performance of speech emotion recognition.
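
The abstract above reports both weighted and unweighted accuracy without defining them. A minimal sketch follows, assuming the convention common in speech emotion recognition work: weighted accuracy is the overall fraction of correct predictions, and unweighted accuracy is the mean of the per-class recalls. The labels and predictions are toy values invented for illustration, not IEMOCAP data, and the function names are hypothetical.

```python
from collections import Counter

def weighted_accuracy(y_true, y_pred):
    """Overall accuracy: every utterance counts equally."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

def unweighted_accuracy(y_true, y_pred):
    """Mean per-class recall: every emotion class counts equally."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return sum(hits[c] / totals[c] for c in totals) / len(totals)

# Toy, imbalanced example: 6 "neutral" and 2 "angry" utterances.
y_true = ["neutral"] * 6 + ["angry"] * 2
y_pred = ["neutral"] * 6 + ["angry", "neutral"]

print(weighted_accuracy(y_true, y_pred))    # 7 of 8 correct -> 0.875
print(unweighted_accuracy(y_true, y_pred))  # (6/6 + 1/2) / 2 -> 0.75
```

The two measures diverge when classes are imbalanced, which is why results on emotion corpora are often reported with both.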

Speech Task Force and Quality of Life after Surgery in Children with Cleft Lip and Palate: Limitation of Professionals

  • Benjamas Prathanee;Panida Thanawirattananit;Phrutthinun Surit;Ratchanee Mitkitti;Kalyanee Makarabhirom
    • Archives of Plastic Surgery / v.51 no.3 / pp.275-283 / 2024
  • Background: A shortage of speech and language therapists results in a lack of speech services. The aim of this study was to determine the effectiveness of a combined speech therapy model at Level IV: General speech and language pathologist (GSLP) and Level V: Specific speech and language pathologist (SSLP) in reducing the number of articulation errors and promoting quality of life (QoL) for children with cleft palate with or without cleft lip (CP ± L). Methods: Fifteen children with CP ± L, aged 4 years 1 month to 10 years 9 months (median = 76 months; minimum:maximum = 49:129 months), were enrolled in this study. Pre- and post-assessments included an oral peripheral examination; articulation tests via the Articulation Screening Test and the Thai Universal Parameters of Speech Outcomes for People with Cleft Palate; a hearing evaluation; and the World Health Organization Quality of Life Brief_Thai (WHOQOL-BRIEF-THAI) version questionnaire for QoL. Speech therapy included a 3-day intensive speech camp by SSLP, five 30-minute speech therapy sessions by a GSLP, and five 1-day follow-up speech camps by SSLP that provided four 45-minute speech therapy sessions for each child. Results: Post-assessment of articulation revealed a statistically significant reduction in the number of articulation errors at the word, sentence, and screening levels (median difference [MD] = 3, 95% confidence interval [CI] = 2-5; MD = 6, 95% CI = 4.5-8; MD = 2.25, 95% CI = 1.5-3, respectively) and an improvement in QoL. Conclusion: A speech task force consisting of a combination of Level IV: GSLP and Level V: SSLP could significantly reduce the number of articulation errors and promote QoL.

Development and Evaluation of an English Speaking Task Using Smartphone and Text-to-Speech (스마트폰과 음성합성을 활용한 영어 말하기 과제의 개발과 평가)

  • Moon, Dosik
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.16 no.5 / pp.13-20 / 2016
  • This study explores the effects of a video-recording English speaking task model on learners. The learning model, a form of mobile learning, was developed to facilitate learners' output practice by applying the advantages of a smartphone and Text-to-Speech. The survey results show positive effects of the speaking task on students' confidence in the domains of pronunciation, speaking, listening, and writing, as well as on general English ability. The study further examines the possibilities and limitations of the speaking task in helping Korean learners improve their speaking ability, given that they lack sufficient exposure to English input or output practice because English is learned as a foreign language.

Usability Improvement for the Speech Interface of Mobile Phones While Driving (운전 상황에서 휴대폰 음성인터페이스의 사용성 향상에 관한 연구)

  • Kang, Yun-Hwan;Jeong, Seong-Wook;Jung, Ga-Hun;Choi, Jae-Ho;Jung, Eui-S.
    • Journal of Korean Institute of Industrial Engineers / v.35 no.1 / pp.109-118 / 2009
  • While driving, manual use of a mobile phone is heavily restricted because it interferes with the primary driving task. An alternative is a speech interface. The current study aims to provide guidelines for implementing a speech interface for mobile phones. To that end, an expert evaluation was conducted; it revealed that a speech interface imposes less workload and causes less degradation of driving performance than a keypad interface. To make speech interfaces more usable, new improvements are suggested. Subjective workload can be reduced and user satisfaction improved without degrading primary task performance by, for instance, letting the user interrupt the phone's speech, eliminating repetitive words, telling the user clearly what caused an error, providing a way to go back to the previous state, reducing the use of keypad buttons, and reducing the amount of information on the screen.

A Basic Study on the Development of a Grading Scale of Discourse Competence in Korean Speaking Assessment -Focusing on the Scale of 'REFUSAL' Task (한국어 말하기 평가에서 '담화 능력' 등급 기술을 위한 기초 연구 -'부탁'에 대한 '거절하기' 과제를 중심으로-)

  • Lee, Haeyong;Lee, Hyang
    • Journal of Korean language education / v.29 no.3 / pp.255-292 / 2018
  • Most grading scales of Korean language proficiency tests are based on existing grading scales that have not been empirically verified. The purpose of this study is to develop an empirically verified scale descriptor. The 'Performance data-driven approach' suggested by Fulcher (1987) was used to develop a detailed description of the characteristics of each level of performance. This study focuses on the functional phase of speech sample analysis (coding data) to create explanatory categories of discourse skills into which individual observations of speech phenomena can be scored. The speech samples collected in this study demonstrated stages of speech that can serve as the foundation of a grading scale. The data were collected from 23 native speakers of Korean. Speech samples were recorded from simulated speaking tests using the 'REFUSAL' task and transcribed for analysis. The transcripts were analyzed using discourse analysis. The results showed that the 'REFUSAL' task goes through four functional phases in actual communication. Furthermore, this study identified specific and detailed explanatory categories of discourse competence based on actual native speakers' speech data. These findings are expected to contribute to the development of more valid and reliable speaking assessments.