• Title/Summary/Keyword: speeches

An Interdisciplinary Study of A Leaders' Voice Characteristics: Acoustical Analysis and Members' Cognition

  • Hahm, SangWoo;Park, Hyungwoo
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.12 / pp.4849-4865 / 2020
  • The traditional roles of leaders are to influence members and motivate them to achieve shared goals in organizations. However, leaders such as top managers and chief executive officers, in practice, do not always directly meet or influence other company members. In fact, they tend to have the greatest impact on their members through formal speeches, company procedures, and the like. As such, official speech is directly related to the motivation of company employees. In an official speech, not only the content of the speech but also the voice characteristics of the speaker have an important influence on listeners, as different vocal characteristics can have different effects on the listener. Therefore, depending on the voice characteristics of a leader, the cognition of the members may change, and the degree to which the members are influenced and motivated will differ. This study identifies how members may perceive a speech differently according to the voice characteristics of leaders in formal speeches. Further, different perceptions of voices will influence members' cognition of the leader, for example, how trustworthy the leader appears. The study analyzed recorded speeches of leaders and extracted features of their speaking style through digital speech signal analysis. Parameters were then extracted and analyzed using time-domain, frequency-domain, and spectrogram-domain methods. We also analyzed the parameters for use in Natural Language Processing. We investigated which voice characteristics of leaders had more influence on members or were more effective for them. A person's voice characteristics can be changed. Therefore, leaders who seek to influence members in formal speeches should develop voice characteristics that effectively motivate followers.

Improved Orthogonal Projection Method for Cancelling Acoustic Echo Signals (음향반향신호의 제거를 위한 개선된 직교투사법)

  • Yun Hyun-min
    • Journal of the Korea Institute of Information and Communication Engineering / v.9 no.4 / pp.703-711 / 2005
  • This paper proposes an improved orthogonal projection method as a new technique for advancing the echo cancellation performance of an acoustic echo canceller for speech. Compared with the NLMS adaptive algorithm currently in use, the method improves echo cancellation performance for signals with large auto-correlation. To verify the performance of the proposed orthogonal projection method, we wrote a simulation program and ran computer simulations. We observed the convergence curves of the two adaptive algorithms for noise and speech inputs. For both input signals, the simulation results show that the proposed method achieves high ERLE, fast convergence, and stable operation for speech as well as noise.
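
For orientation, the NLMS baseline and the ERLE metric mentioned in this abstract can be sketched as follows. This is a minimal illustration, not the paper's orthogonal projection method: the echo path, filter length, step size, and noise level are all assumed values, and the far-end signal is white noise rather than speech (speech has large auto-correlation, which is the case where NLMS converges more slowly).

```python
import math
import random

def nlms_echo_cancel(far_end, mic, num_taps=4, mu=0.5, eps=1e-8):
    """Return the residual error signal after NLMS echo cancellation."""
    w = [0.0] * num_taps       # adaptive filter weights (echo path estimate)
    x_buf = [0.0] * num_taps   # most recent far-end samples, newest first
    errors = []
    for x, d in zip(far_end, mic):
        x_buf = [x] + x_buf[:-1]
        y = sum(wi * xi for wi, xi in zip(w, x_buf))    # estimated echo
        e = d - y                                       # residual after cancellation
        norm = sum(xi * xi for xi in x_buf) + eps       # input power normalization
        w = [wi + mu * e * xi / norm for wi, xi in zip(w, x_buf)]
        errors.append(e)
    return errors

def erle_db(mic, errors):
    """Echo return loss enhancement: microphone power over residual power, in dB."""
    p_mic = sum(d * d for d in mic) / len(mic)
    p_err = sum(e * e for e in errors) / len(errors)
    return 10.0 * math.log10(p_mic / p_err)

random.seed(0)
echo_path = [0.5, -0.3, 0.2, 0.1]   # hypothetical echo impulse response
far_end = [random.gauss(0.0, 1.0) for _ in range(3000)]
mic = []
for n in range(len(far_end)):
    echo = sum(h * far_end[n - k] for k, h in enumerate(echo_path) if n - k >= 0)
    mic.append(echo + random.gauss(0.0, 0.01))   # echo plus a little background noise

errors = nlms_echo_cancel(far_end, mic)
steady_state_erle = erle_db(mic[-1000:], errors[-1000:])  # measured after convergence
```

A higher ERLE means more of the echo has been removed; plotting ERLE over time gives the kind of convergence curve the abstract refers to.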

A Study on Acoustic Masking Effect by Frame-Based Formant Enhancement (프레임 기반의 포먼트 강조에 의한 음향 마스킹 현상 발생에 대한 연구)

  • Jeon, Yu-Yong;Kim, Kyu-Sung;Lee, Sang-Min
    • Journal of Biomedical Engineering Research / v.30 no.6 / pp.529-534 / 2009
  • One characteristic of the hearing impaired is that their frequency selectivity is poorer than that of people with normal hearing. To compensate for this, formant enhancement algorithms and spectral contrast enhancement algorithms have been developed. However, in some cases these algorithms fail to improve the frequency selectivity of the hearing impaired. One reason is acoustic masking among the enhanced formants. In this study, we enhanced the formants based on the individual masking characteristic of each subject. The masking characteristic used was the minimum level difference (MLD) between the first and second formants at which acoustic masking occurred. If the level difference between the two formants in a frame was larger than the MLD, the gain of the first formant was decreased to reduce the acoustic masking among formants. In a speech discrimination test using formant-enhanced speech, the speech discrimination score (SDS) for speech with differently enhanced formants was significantly higher than the SDS for speech with equally enhanced formants. This means that suppressing acoustic masking among formants improves the frequency selectivity of the hearing impaired.
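
The per-frame rule described in this abstract can be sketched as below. The abstract does not say how much the first-formant gain is reduced, so this sketch assumes the gain is lowered by exactly the amount the level difference exceeds the MLD; the function name, the dB units, and the sample values are likewise illustrative assumptions.

```python
def adjust_f1_gain(f1_level_db, f2_level_db, mld_db, f1_gain_db):
    """Lower the first-formant gain when F1 exceeds F2 by more than the MLD.

    Assumption: the gain is cut by the excess over the MLD, so the
    resulting F1-F2 level difference equals the MLD.
    """
    diff = f1_level_db - f2_level_db
    if diff > mld_db:
        return f1_gain_db - (diff - mld_db)
    return f1_gain_db   # no masking expected, keep the gain unchanged

# Hypothetical (F1 level, F2 level) pairs in dB for two frames.
frames = [(70.0, 50.0), (60.0, 50.0)]
subject_mld_db = 15.0   # hypothetical per-subject minimum level difference
gains = [adjust_f1_gain(f1, f2, subject_mld_db, 6.0) for f1, f2 in frames]
```

In the first frame the 20 dB difference exceeds the 15 dB MLD, so the gain drops from 6 dB to 1 dB; the second frame is left alone.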

Podcasting and Politics in Singapore: An Experimental Study of Medium Effects

  • Skoric, Marko M.;Sim, Clarice;Juan, Han Teck;Fang, Pam
    • Journal of Contemporary Eastern Asia / v.8 no.2 / pp.27-43 / 2009
  • A ban on political podcasting during the General Elections 2006 in Singapore was justified by the Singaporean government on the grounds that the new medium had a greater power to influence voters than traditional modes of political discourse. A between-subjects controlled experiment was conducted to test whether podcasts of political speeches had a greater power to influence voters' evaluations of political candidates and likelihood of voting for them than online text-based transcripts of the same speeches. The study also examined whether mere exposure to political speeches online, irrespective of the modality, had an effect on voters' more general political preferences, i.e., the likelihood of supporting and voting for the opposition. The findings suggest that political podcasts were no more persuasive than text-based websites and that the effects on political preferences, if any, were likely due to exposure to political content online, not to the nature of the medium. The implications of the findings are discussed.

Fast offline transformer-based end-to-end automatic speech recognition for real-world applications

  • Oh, Yoo Rhee;Park, Kiyoung;Park, Jeon Gue
    • ETRI Journal / v.44 no.3 / pp.476-490 / 2022
  • With recent advances in technology, automatic speech recognition (ASR) has been widely used in real-world applications. Converting large amounts of speech into text accurately and efficiently with limited resources has become more vital than ever. In this study, we propose a method to rapidly recognize a large speech database via a transformer-based end-to-end model. Transformers have improved the state-of-the-art performance in many fields. However, they are not easy to use for long sequences. In this study, various techniques to accelerate the recognition of real-world speech are proposed and tested, including decoding via multiple-utterance-batched beam search, detecting the end of speech based on connectionist temporal classification (CTC), restricting the CTC-prefix score, and splitting long speeches into short segments. Experiments are conducted with the LibriSpeech dataset and real-world Korean ASR tasks to verify the proposed methods. In the experiments, the proposed system converts 8 h of speech spoken at real-world meetings into text in less than 3 min with a 10.73% character error rate, a 27.1% relative reduction compared with conventional systems.
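
As a reading aid for the figures in this abstract, the sketch below shows one common way to compute a character error rate (edit distance divided by reference length) and a relative reduction between two error rates. The function names are illustrative and the paper's exact scoring procedure is not given here, so treat this as an assumption about the usual definitions.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two character sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]

def cer(ref, hyp):
    """Character error rate: edit distance normalized by reference length."""
    return edit_distance(ref, hyp) / len(ref)

def relative_reduction(baseline, proposed):
    """Fraction by which the proposed error rate is lower than the baseline."""
    return (baseline - proposed) / baseline

example_cer = cer("kitten", "sitting")          # toy strings, not ASR output
example_gain = relative_reduction(20.0, 15.0)   # hypothetical error rates in percent
```

On the speed figure, 8 h is 480 min, so finishing in under 3 min corresponds to running more than 160 times faster than real time.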

Improved Orthogonal Projection Method for Implementing Acoustic Echo Canceller (음향반향제거기의 구현을 위한 개선된 직교투사법)

  • Lee Haeng-Woo
    • Journal of the Institute of Electronics Engineers of Korea SP / v.43 no.2 s.308 / pp.73-81 / 2006
  • This paper proposes an improved orthogonal projection method as a new technique for advancing the performance of the acoustic echo canceller. Compared with the widely used NLMS adaptive algorithm, which is simple and stable, the method improves the convergence speed for signals with large auto-correlation and requires little computation. To verify the performance of the proposed orthogonal projection method, we wrote a simulation program and ran computer simulations. We observed the convergence curves of the two adaptive algorithms for noise and speech inputs. For both input signals, the simulation results show that the proposed method achieves high ERLE, fast convergence, and stable operation for speech as well as noise.

Grammatical Properties of Kes Constructions in a Speech Corpus (연설문 말뭉치에서 나타나는 '것' 구문의 문법적 특징)

  • Kim, Jong-Bok;Lee, Seung-Han;Kim, Kyung-Min
    • Korean Journal of Cognitive Science / v.19 no.3 / pp.257-281 / 2008
  • The expression 'kes' is one of the most widely used expressions in Korean, and its uses are highly dependent on context. These highly context-dependent uses make it hard to determine its grammatical properties. To examine these properties in a relatively controlled context, this paper collects a series of speeches made by government officials and examines the grammatical properties of the expression in the corpus. In particular, based on the 539 instances of 'kes' extracted from the corpus, the paper focuses on the 7 types of 'kes' constructions most widely used in the collected speech corpus.

Efficient Part-of-Speech Set for Knowledge-based Word Sense Disambiguation of Korean Nouns (한국어 명사의 지식기반 의미중의성 해소를 위한 효과적인 품사집합)

  • Kwak, Chul-Heon;Seo, Young-Hoon;Lee, Chung-Hee
    • The Journal of the Korea Contents Association / v.16 no.4 / pp.418-425 / 2016
  • This paper presents a part-of-speech set that is highly efficient for knowledge-based word sense disambiguation of Korean nouns. The test set consists of 174,000 sentences extracted from the Sejong semantically tagged corpus, whose senses are based on the Standard Korean Dictionary. We disambiguate selected nouns in the test set using glosses and examples from the Standard Korean Dictionary. We select 15 parts of speech that give the best performance over the whole test set and 17 parts of speech that give the best average accuracy over the selected nouns. These part-of-speech sets yield a 12% performance gain over the full 45 part-of-speech set.
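
The idea of restricting context words to a chosen part-of-speech set before matching them against dictionary glosses can be sketched with a simplified Lesk-style overlap count. The abstract does not name the disambiguation algorithm, so the scoring rule here is an assumption, and the sense inventory, tags, and English words are toy values for illustration only.

```python
def disambiguate(target_senses, context, allowed_pos):
    """Pick the sense whose gloss shares the most words with the filtered context.

    target_senses: dict mapping a sense id to the words of its gloss/examples.
    context: list of (word, pos_tag) pairs from the sentence.
    allowed_pos: set of part-of-speech tags that may contribute evidence.
    """
    context_words = {word for word, pos in context if pos in allowed_pos}
    best_sense, best_score = None, -1
    for sense_id, gloss_words in target_senses.items():
        score = len(context_words & set(gloss_words))   # gloss overlap
        if score > best_score:
            best_sense, best_score = sense_id, score
    return best_sense

# Toy sense inventory and tagged context (hypothetical data).
senses = {
    "bank_1": ["money", "deposit", "institution"],
    "bank_2": ["river", "side", "land"],
}
context = [("money", "NNG"), ("deposit", "VV"), ("river", "MAG")]
chosen = disambiguate(senses, context, allowed_pos={"NNG", "VV"})
```

Shrinking or growing `allowed_pos` changes which context words count as evidence, which is the knob the paper tunes when comparing part-of-speech sets.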

On the Control of Energy Flow between the Connection Parts of Syllables for the Korean Multi-Syllabic Speech Synthesis in the Time Domain Using Mono-syllables as a Synthesis Unit (단음절 합성단위음을 사용한 시간영역에서의 한국어 다음절어 규칙합성을 위한 음절간 접속구간에서의 에너지 흐름 제어에 관한 연구)

  • 강찬희;김윤석
    • The Journal of Korean Institute of Communications and Information Sciences / v.24 no.9B / pp.1767-1774 / 1999
  • This paper synthesizes Korean multi-syllabic speech in the time domain using mono-syllables as the synthesis unit. In particular, it controls the shape of the speech energy flow across the connection parts between syllables when mono-syllables are concatenated. To this end, the energy flow is controlled with prosody parameters extracted from speech waveforms in the time domain, and we present experimental results in which the energy flow is controlled using concatenation rules derived from the shapes of Korean syllables at their connection parts. In the experiments, the discontinuities in energy flow produced at the connection parts by concatenating mono-syllables in the time domain were removed, and the quality and naturalness of the synthesized speech improved.