• Title/Summary/Keyword: Spectrogram Analysis


Acoustic Characteristics of 'Short Rushes of Speech' using Alternate Motion Rates in Patients with Parkinson's Disease (파킨슨병 환자의 교대운동속도 과제에서 관찰된 '말 뭉침'의 음향학적 특성)

  • Kim, Sun Woo;Yoon, Ji Hye;Lee, Seung Jin
    • Phonetics and Speech Sciences / v.7 no.2 / pp.55-62 / 2015
  • It is widely accepted that Parkinson's disease (PD) is the most common cause of hypokinetic dysarthria, and that its characteristic 'short rushes of speech' become more evident as the motor disorder grows more severe. Speech alternate motion rates (AMRs) are particularly useful for observing not only rate abnormalities but also deviant speech. However, relatively little is known about 'short rushes of speech' in the AMRs of patients with PD beyond their perceptual characteristics. The purpose of this study was to examine which acoustic features of 'short rushes of speech' in AMRs are robust indicators of Parkinsonian speech. Syllabic repetitions (/pə/, /tə/, /kə/) in AMR tasks from 9 patients with PD were analyzed acoustically using spectrograms from the Computerized Speech Lab. Acoustically, we found three characteristics of 'short rushes of speech': 1) vocalized consonants without closure duration (VC), 76.3%; 2) no consonant segmentation (NC), 18.6%; 3) no vowel formant frequency (NV), 5.1%. These results suggest that 'short rushes of speech' may be related to a failure to reach and maintain phonatory targets. To best achieve therapeutic goals and make treatment most efficacious, it is important to incorporate training methods based on both phonation and articulation.

Experimental Phonetic Study of Kyungsang and Cholla Dialect Using Power Spectrum and Laryngeal Fiberscope (파워스펙트럼 및 후두내시경을 이용한 방언 음성(方言 音聲)의 실험적 연구(實驗的 硏究): 경상방언 및 전라방언을 중심으로)

  • Kim, Hyun-Gi;Lee, Eung-Young;Hong, Ki-Hwan
    • Speech Sciences / v.9 no.2 / pp.25-47 / 2002
  • Human language activity in the information society has been developing the communication system between humans and machines. The aim of this study was to analyze dialectal speech in Korea. One hundred Kyungsang and one hundred Cholla informants participated in this study. A CSL and a flexible laryngeal fiberscope were used to analyze the acoustic and glottal gestures of all the vowels and consonants. Test words were presented on picture cards and letter cards, which contained each vowel and each consonant, respectively. The dialogue between the examiner and the informants was recorded in a question-and-answer manner. The acoustic results for the two dialects were as follows. Kyungsang and Cholla informants showed neutralization between /e/ and /$\varepsilon$/. However, the apertures of Kyungsang vowels /i, w, u, o/ were higher than those of Cholla vowels. The /wi/ and /$\varepsilon$/ of Kyungsang diphthong vowels were shown as simple vowels /i/ and /$\varepsilon$/ in Cholla dialect. The VOT of Cholla dialect was longer than that of Kyungsang dialect. The fricative frequency of Kyungsang dialect was about 1000 Hz higher than that of Cholla dialect. The glottal widths on fiberscopic images showed that the consonant durations of Kyungsang and Cholla dialects were correlated with the acoustic duration on the spectrogram.

Human Laughter Generation using Hybrid Generative Models

  • Mansouri, Nadia;Lachiri, Zied
    • KSII Transactions on Internet and Information Systems (TIIS) / v.15 no.5 / pp.1590-1609 / 2021
  • Laughter is one of the most important nonverbal sounds that humans produce and a means of expressing emotion. The acoustic and contextual features of laughter differ from those of speech, and many difficulties arise in modeling them. In this work, we propose an audio laughter generation system based on unsupervised generative models: the autoencoder (AE) and its variants. The procedure comprises three main sub-processes: (1) analysis, which consists of extracting the log-magnitude spectrogram from the laughter database; (2) training of the generative models; and (3) synthesis, which involves an intermediate mechanism, the vocoder. To improve synthesis quality, we suggest hybrid models (LSTM-VAE, GRU-VAE and CNN-VAE) that combine the representation learning capacity of the variational autoencoder (VAE) with the temporal modelling ability of a long short-term memory RNN (LSTM) and the ability of the CNN to learn invariant features. To assess the performance of the proposed laughter generation process, an objective evaluation (RMSE) and a perceptual audio quality test (listening test) were conducted. According to these evaluation metrics, the GRU-VAE outperforms the other VAE models.
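
The analysis stage named in this abstract, extracting a log-magnitude spectrogram, can be illustrated with a minimal sketch. It is not the authors' pipeline: the frame length, hop size, Hann window, the `eps` floor and the synthetic 1 kHz tone standing in for a laughter recording are all assumptions chosen for illustration, and a plain DFT is used so that only the Python standard library is needed.

```python
import cmath
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def dft_magnitudes(frame):
    """Magnitudes of the non-negative-frequency DFT bins of one frame."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = 0j
        for t, x in enumerate(frame):
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        mags.append(abs(acc))
    return mags


def log_magnitude_spectrogram(signal, frame_len=64, hop=32, eps=1e-8):
    """Return one list of log-magnitudes (per frequency bin) for each frame."""
    window = hann(frame_len)
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        # eps avoids log(0) in silent bins (an assumed floor, not from the paper).
        frames.append([math.log(m + eps) for m in dft_magnitudes(frame)])
    return frames


# Synthetic stand-in for a recording: a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2.0 * math.pi * 1000.0 * t / sample_rate) for t in range(512)]
spec = log_magnitude_spectrogram(tone)

# Bin spacing is sample_rate / frame_len = 125 Hz, so the tone peaks in bin 8.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
```

In practice an FFT-based routine would replace the direct DFT, which is only acceptable here because the frames are short.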

Acoustic Emission and Burr Comparison of Circular Sawing and Milling in Fiber Reinforced Plastic Cutting (원형 톱과 엔드밀의 복합재료 절단 음향과 버 비교연구)

  • Joo, Chang-Min;Baek, Jong-Hyun;Kim, Su-Jin;Lee, Gun-Myung
    • Journal of the Korean Society of Manufacturing Process Engineers / v.21 no.7 / pp.98-104 / 2022
  • Circular sawing and milling are general machining processes used for routing fiber-reinforced plastics (FRP). In this study, the productivity and cutting quality of a circular saw and a flat end mill were compared. The productivity of the circular saw was approximately ten times higher than that of the flat end mill for the same tool life, and the burr size of the circular saw was 14 times smaller than that of the flat end mill. Spectrogram analysis of the cutting sound also showed that the acoustic emission of the circular saw was more uniform than that of the flat end mill. Circular sawing is thus a more suitable process than flat end milling for the straight cutting of pultruded FRP.

Data Analysis of Inertial Sensors for Train Positioning Detection System (열차위치검지 시스템을 위한 관성센서 데이터 분석 연구)

  • Kim, Seong Jin;Park, Sungsoo;Lee, Jae-Ho;Kang, Donghoon
    • Journal of the Korean Society for Nondestructive Testing / v.35 no.1 / pp.18-24 / 2015
  • Train positioning detection information is fundamental for high-speed railroad inspection, making it possible to simultaneously determine the status and evaluate the integrity of railroad equipment. This paper presents the results of measurements and an analysis of an inertial measurement unit (IMU) used as a positioning detection sensor. Acceleration and angular rate measurements from the IMU were analyzed in the amplitude and frequency domains, with a discussion of vibration and train motions. Using these results and GPS information, the positioning detection of a Korean tilting train express was performed from Naju station to Illo station on the Honam line. The results of a synchronized analysis of sensor measurements and train motion can help in the design of a train location detection system and improve the positioning detection performance.
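
As a rough illustration of analyzing an acceleration trace in the amplitude and frequency domains, the sketch below computes an RMS vibration level and a one-sided amplitude spectrum. The 100 Hz sample rate, the synthetic signal (gravity offset plus 5 Hz and 12 Hz components) and the function names are hypothetical; none of them come from the paper.

```python
import cmath
import math


def amplitude_spectrum(samples, sample_rate):
    """Return (frequency_hz, amplitude) pairs for the one-sided DFT, DC excluded."""
    n = len(samples)
    mean = sum(samples) / n
    centered = [x - mean for x in samples]  # remove the constant (gravity) offset
    spectrum = []
    for k in range(1, n // 2):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(centered))
        spectrum.append((k * sample_rate / n, 2.0 * abs(acc) / n))
    return spectrum


def rms(samples):
    """Root-mean-square level about the mean (an amplitude-domain measure)."""
    mean = sum(samples) / len(samples)
    return math.sqrt(sum((x - mean) ** 2 for x in samples) / len(samples))


# Hypothetical vertical acceleration in m/s^2, 2 seconds at 100 Hz.
sample_rate = 100.0
accel = [
    9.81
    + 0.5 * math.sin(2.0 * math.pi * 5.0 * t / sample_rate)
    + 0.1 * math.sin(2.0 * math.pi * 12.0 * t / sample_rate)
    for t in range(200)
]

spectrum = amplitude_spectrum(accel, sample_rate)
dominant_hz, dominant_amp = max(spectrum, key=lambda pair: pair[1])
vibration_rms = rms(accel)
```

Here the strongest spectral line falls at the 5 Hz component, which is the kind of feature one might then relate to a specific train motion.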

An Analysis of Timbre Comparison between Jeongak Daegeum and Sanjo Daegeum (정악대금과 산조대금의 음색 특징 분석)

  • Sung, Ki-Young
    • Journal of Korea Entertainment Industry Association / v.14 no.3 / pp.229-236 / 2020
  • In this paper, the timbre of the Daegeum, one of Korea's most representative wind instruments, was analyzed. The Daegeum is widely used in two forms: the Jeongak Daegeum, which is played in royal and wind music, and the Sanjo Daegeum, which is mainly played in Sanjo, Sinawi and folk music. The two instruments are played in different genres because changes to the length of the pipe and the location of the finger holes allow the Sanjo Daegeum to be played faster than the Jeongak Daegeum and with a variety of techniques, and because the difference in tone lets the choice of instrument be matched to the music. For the timbre analysis of the Jeongak Daegeum and Sanjo Daegeum, the composition of the overtones was visually verified with a spectrogram and a spectrum analyzer, using recordings of the octave-low, flat, and octave-high positions played with the same power. The results show that the Jeongak Daegeum, which is rich in low-pitched sound, harmonizes with solemn music such as royal music, while the Sanjo Daegeum, which has a relatively clear high-pitched sound, is well suited to bright music such as solo music.
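
The paper inspects overtone composition visually. A numerical counterpart, sketched here under assumptions, is to project a recorded note onto sinusoids at multiples of its fundamental and compare the resulting levels. The 220 Hz fundamental, the harmonic amplitudes and the sample rate below are invented test values, not measurements of either instrument.

```python
import math


def harmonic_amplitudes(samples, sample_rate, f0, n_harmonics):
    """Estimate the amplitude of each harmonic h * f0 by sinusoidal projection."""
    n = len(samples)
    amps = []
    for h in range(1, n_harmonics + 1):
        w = 2.0 * math.pi * h * f0 / sample_rate
        re = sum(x * math.cos(w * t) for t, x in enumerate(samples))
        im = sum(x * math.sin(w * t) for t, x in enumerate(samples))
        amps.append(2.0 * math.hypot(re, im) / n)
    return amps


def relative_db(amps, floor=1e-12):
    """Level of each harmonic relative to the fundamental, in decibels."""
    ref = max(amps[0], floor)
    return [20.0 * math.log10(max(a, floor) / ref) for a in amps]


# Invented test tone: 220 Hz fundamental with two weaker overtones.
# 400 samples at 8800 Hz hold exactly 10 periods of the fundamental.
sample_rate = 8800.0
f0 = 220.0
note = [
    1.00 * math.sin(2.0 * math.pi * 1 * f0 * t / sample_rate)
    + 0.50 * math.sin(2.0 * math.pi * 2 * f0 * t / sample_rate)
    + 0.25 * math.sin(2.0 * math.pi * 3 * f0 * t / sample_rate)
    for t in range(400)
]

amps = harmonic_amplitudes(note, sample_rate, f0, n_harmonics=4)
levels_db = relative_db(amps)
```

The projection is exact only when the analysis span covers a whole number of periods, as it does here; real recordings would need windowing and a pitch estimate first.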

Sound Enhancement of low Sample rate Audio Using LMS in DWT Domain (DWT영역에서 LMS를 이용한 저 샘플링 비율 오디오 신호의 음질 향상)

  • 백수진;윤원중;박규식
    • The Journal of the Acoustical Society of Korea / v.23 no.1 / pp.54-60 / 2004
  • To mitigate the storage space and network bandwidth demands of full CD-quality audio, current digital audio is restricted in sampling rate and bandwidth. This restriction normally results in low-sample-rate audio or calls for a data compression scheme such as MP3. However, such audio can only reproduce a lower frequency range than regular CD-quality audio because of the Nyquist sampling theorem, and consequently loses the rich spatial information embedded in the high frequencies. The purpose of this paper is to propose an efficient high-frequency enhancement of low-sample-rate audio using adaptive filtering and DWT analysis and synthesis. The proposed algorithm uses the LMS adaptive algorithm to estimate the missing high-frequency content in the DWT domain and then reconstructs the spectrally enhanced audio using the DWT synthesis procedure. Several experiments with real speech and audio were performed and compared with another algorithm. From the spectrogram and sonic test results, we confirm that the proposed algorithm outperforms the other algorithm and works reasonably well for most audio cases.
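
The two building blocks named in this abstract, DWT analysis and synthesis and the LMS adaptive algorithm, can be sketched separately as below. This is not the paper's enhancement method: the abstract does not say which wavelet or filter configuration was used, so the one-level Haar transform, the 4-tap filter, the step size `mu` and the system-identification demo are all assumptions made for illustration.

```python
import math
import random


def haar_dwt(signal):
    """One-level Haar analysis: returns (approximation, detail) coefficients."""
    s = 1.0 / math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) * s for i in pairs]
    return approx, detail


def haar_idwt(approx, detail):
    """One-level Haar synthesis: inverse of haar_dwt for even-length input."""
    s = 1.0 / math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * s)
        out.append((a - d) * s)
    return out


def lms(inputs, desired, n_taps=4, mu=0.05):
    """Standard LMS: adapt FIR weights so the filter output tracks `desired`."""
    weights = [0.0] * n_taps
    errors = []
    for n in range(len(inputs)):
        # Most recent n_taps inputs, newest first, zero-padded at the start.
        x = [inputs[n - k] if n - k >= 0 else 0.0 for k in range(n_taps)]
        y = sum(w * xi for w, xi in zip(weights, x))
        e = desired[n] - y
        weights = [w + mu * e * xi for w, xi in zip(weights, x)]
        errors.append(e)
    return weights, errors


# Demo 1: analysis followed by synthesis reconstructs the signal.
signal = [0.3, -0.1, 0.8, 0.5, -0.7, 0.2, 0.0, 0.4]
approx, detail = haar_dwt(signal)
rebuilt = haar_idwt(approx, detail)
max_rebuild_error = max(abs(a - b) for a, b in zip(signal, rebuilt))

# Demo 2: LMS learns an (invented) 4-tap FIR system from input/output data.
random.seed(0)
true_taps = [0.5, -0.3, 0.2, 0.1]
inputs = [random.uniform(-1.0, 1.0) for _ in range(2000)]
desired = [
    sum(true_taps[k] * inputs[n - k] for k in range(4) if n - k >= 0)
    for n in range(len(inputs))
]
learned, errors = lms(inputs, desired)
```

An enhancement scheme along the lines the abstract describes would run the adaptive estimate on wavelet coefficients and then call the synthesis step; how the paper wires these together is not stated here.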

Acoustic Characteristics of Korean Spoken by the Women Immigrants from Japan and Philippine (여성 결혼이민자들의 한국어 조음에 나타나는 음향음성학 특성 연구 - 일본과 필리핀 출신 여성 결혼이민자들을 대상으로)

  • Jo, Seon-Hui;Kim, Hyun-Gi;Kim, Sun-Jun
    • Speech Sciences / v.15 no.3 / pp.203-217 / 2008
  • The number of Asian women immigrants in Korea is growing, and their communication problems in Korean cause not only difficulty adapting to Korean society but also speech-language disorders in their children. To date there has been little research on the acoustic characteristics and articulatory errors of their speech. Therefore, this study focuses on the acoustic characteristics and articulatory error patterns of women immigrants from Japan and the Philippines, based on the theory of "contrastive analysis". The subjects were 16 Japanese women immigrants (age: 42.5$\pm$4.4) and 14 Philippine women immigrants (age: 31.64$\pm$6.7), and the control group consisted of 10 Korean women (age: 28.3$\pm$1.2). Speech and hearing of all subjects and controls were within normal limits. Speech samples were analyzed on a computer using CSL; F1, F2, and F3 of vowels were measured in an FFT window, and the VOT of plosives and affricates was measured on a wideband spectrogram. The results were as follows. The Japanese women immigrants showed articulatory patterns for /e/, /a/, /u/, /o/, /$\varepsilon$/, /m/ that differed from those of Koreans, and made articulatory errors on fortis and aspirated sounds. The reason is that Japanese has only two distinctive features for plosives and affricates: voiced and voiceless. The Philippine women immigrants showed the same error patterns as the Japanese women immigrants. Errors on aspirated sounds were especially prominent because their mother tongue has no distinctive feature for aspiration. For vowels, they showed errors on /a/, /o/, /c/.

An Interdisciplinary Study of A Leaders' Voice Characteristics: Acoustical Analysis and Members' Cognition

  • Hahm, SangWoo;Park, Hyungwoo
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.12 / pp.4849-4865 / 2020
  • The traditional roles of leaders are to influence members and motivate them to achieve shared goals in organizations. However, leaders such as top managers and chief executive officers, in practice, do not always directly meet or influence other company members. In fact, they tend to have the greatest impact on their members through formal speeches, company procedures, and the like. As such, official speech is directly related to the motivation of company employees. In an official speech, not only the content of the speech but also the voice characteristics of the speaker have an important influence on listeners, as different vocal characteristics can have different effects on the listener. Therefore, depending on the voice characteristics of a leader, the cognition of the members may change, and the degree to which the members are influenced and motivated will differ. This study identifies how members may perceive a speech differently according to the different voice characteristics of leaders in formal speeches. Further, different perceptions of voices will influence members' cognition of the leader, for example, in how trustworthy the leader appears. The study analyzed recorded speeches of leaders and extracted features of their speaking style through digital speech signal analysis. Parameters were then extracted and analyzed using time-domain, frequency-domain, and spectrogram-domain methods. We also analyzed the parameters for use in Natural Language Processing. We investigated which voice characteristics of leaders had more influence on members or were more effective. A person's voice characteristics can be changed. Therefore, leaders who seek to influence members in formal speeches should develop effective voice characteristics to motivate followers.

A Study on the Effects of Speech Training for Adults Focusing on the Analysis of Voices Before and After Speech Training (성인 스피치교육 전후 효과에 관한 목소리변화스펙트로그램 비교 연구)

  • Chung, Eun-Ee;Lee, Sang-Ho
    • Journal of Digital Contents Society / v.18 no.6 / pp.1049-1056 / 2017
  • This study focused on changes in the voice as a way of determining the effects of speech training. It aimed to evaluate, more visibly and scientifically, the voice changes that are among the substantial effects of speech training. As a result, objective differences from before the training were found in the voice of every learner. Each learner showed gradual technical improvement in a variety of vocal elements, including resonance and timbre, accuracy of pronunciation, and pausing; that is, the voice became more powerful, more accurately pronounced, more stable, and marked by more pauses than before the speech training. This study determined whether speech training can change a voice, and the results are expected to help speech learners participate actively in speech training and improve their speaking ability.