• Title/Summary/Keyword: 발화속도

Search Result 127, Processing Time 0.033 seconds

Visual Voice Activity Detection and Adaptive Threshold Estimation for Speech Recognition (음성인식기 성능 향상을 위한 영상기반 음성구간 검출 및 적응적 문턱값 추정)

  • Song, Taeyup;Lee, Kyungsun;Kim, Sung Soo;Lee, Jae-Won;Ko, Hanseok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.34 no.4
    • /
    • pp.321-327
    • /
    • 2015
  • In this paper, we propose an algorithm for achieving robust Visual Voice Activity Detection (VVAD) for enhanced speech recognition. In conventional VVAD algorithms, the motion of lip region is found by applying an optical flow or Chaos inspired measures for detecting visual speech frames. The optical flow-based VVAD is difficult to be adopted to driving scenarios due to its computational complexity. While invariant to illumination changes, Chaos theory based VVAD method is sensitive to motion translations caused by driver's head movements. The proposed Local Variance Histogram (LVH) is robust to the pixel intensity changes from both illumination change and translation change. Hence, for improved performance in environmental changes, we adopt the novel threshold estimation using total variance change. In the experimental results, the proposed VVAD algorithm achieves robustness in various driving situations.

The Relationship between Flash Point and Fire Properties of Flammable Liquids (가연성 액체의 인화점과 화재특성치와의 관계)

  • Song, Young-Ho;Ha, Dong-Myeong
    • Journal of the Korean Institute of Gas
    • /
    • v.11 no.2 s.35
    • /
    • pp.10-14
    • /
    • 2007
  • Flash point is one of the major physical properties used to evaluate fire hazards of the combustible liquids. Properties showing relative fire hazards of the combustible liquids are heat release rate(HRR), peak heat release rate(PHRR), time to ignition(TTI), mass loss rate, and yield of $CO/CO_2$. The relationships between flash points and fire properties of the combustible liquids were examined in this study. For this study, mass loss rate and time to ignition were measured to calculate fire properties of the combustible liquids. The results showed that good correlations could be found between flash point and time to ignition, time to peak heat release rate, and the propensity to flashover. From a presented results, the parameters can be used to evaluate relative hazards of the combustible liquids on fire.

  • PDF

The Behavior Characteristics of the 2005 Yangyang Forest Fire (2005년 강원도 양양산불 행동 특성)

  • Lee Byung-Doo;Lee Si-Young;Chung Joo-Sang
    • Fire Science and Engineering
    • /
    • v.19 no.4 s.60
    • /
    • pp.1-6
    • /
    • 2005
  • To control forest fire effectively, it is necessary to understand forest fire behavior and relevance to forest fire environmental factors. In this paper, the behavior characteristics of the 2005 Yangyang forest fire were analyzed into the spread patterns and severity grades. The spread processes of the forest fire could be divided into two steps. At the first step, the fire ran fast to the east due to the strong west wind and then spreaded out in irregular direction. The maximum spread rate of the fire was 1.21km/hr and the mean was 0.65 km/hr. The result of the fire severity classification indicated that about $80\%$(1,110ha) of the whole study site was extremely burned and the remaining $15\%(211 ha)\;and\;5\%(61 ha)$ were damaged slightly and moderately respectively.

A Method of Automated Quality Evaluation for Voice-Based Consultation (음성 기반 상담의 품질 평가를 위한 자동화 기법)

  • Lee, Keonsoo;Kim, Jung-Yeon
    • Journal of Internet Computing and Services
    • /
    • v.22 no.2
    • /
    • pp.69-75
    • /
    • 2021
  • In a contact-free society, online services are becoming more important than classic offline services. At the same time, the role of a contact center, which executes customer relation management (CRM), is increasingly essential. For supporting the CRM tasks and their effectiveness, techniques of process automation need to be applied. Quality assurance (QA) is one of the time and resource consuming, and typical processes that are suitable for automation. In this paper, a method of automatic quality evaluation for voice based consultations is proposed. Firstly, the speech in consultations is transformed into a text by speech recognition. Then quantitative evaluation based on the QA metrics, including checking the elements in opening and closing mention, the existence of asking the mandatory information, the attitude of listening and speaking, is executed. 92.7% of the automated evaluations are the same to the result done by human experts. It was found that the non matching cases of the automated evaluations were mainly caused from the mistranslated Speech-to-Text (STT) result. With the confidence of STT result, this proposed method can be employed for enhancing the efficiency of QA process in contact centers.

Deep-Learning Based Real-time Fire Detection Using Object Tracking Algorithm

  • Park, Jonghyuk;Park, Dohyun;Hyun, Donghwan;Na, Youmin;Lee, Soo-Hong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.1
    • /
    • pp.1-8
    • /
    • 2022
  • In this paper, we propose a fire detection system based on CCTV images using an object tracking technology with YOLOv4 model capable of real-time object detection and a DeepSORT algorithm. The fire detection model was learned from 10800 pieces of learning data and verified through 1,000 separate test sets. Subsequently, the fire detection rate in a single image and fire detection maintenance performance in the image were increased by tracking the detected fire area through the DeepSORT algorithm. It is verified that a fire detection rate for one frame in video data or single image could be detected in real time within 0.1 second. In this paper, our AI fire detection system is more stable and faster than the existing fire accident detection system.

Robust Real-time Pose Estimation to Dynamic Environments for Modeling Mirror Neuron System (거울 신경 체계 모델링을 위한 동적 환경에 강인한 실시간 자세추정)

  • Jun-Ho Choi;Seung-Min Park
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.3
    • /
    • pp.583-588
    • /
    • 2024
  • With the emergence of Brain-Computer Interface (BCI) technology, analyzing mirror neurons has become more feasible. However, evaluating the accuracy of BCI systems that rely on human thoughts poses challenges due to their qualitative nature. To harness the potential of BCI, we propose a new approach to measure accuracy based on the characteristics of mirror neurons in the human brain that are influenced by speech speed, depending on the ultimate goal of movement. In Chapter 2 of this paper, we introduce mirror neurons and provide an explanation of human posture estimation for mirror neurons. In Chapter 3, we present a powerful pose estimation method suitable for real-time dynamic environments using the technique of human posture estimation. Furthermore, we propose a method to analyze the accuracy of BCI using this robotic environment.

Proper frequency band as EMG fatigue indices of biceps femoris muscles during treadmill walking (드레트밀 보행시 대퇴이두근의 EMG 근피로지수로서 적당한 주파수 대역)

  • Jongchil Won;Kiyoung Lee
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.17 no.3
    • /
    • pp.141-145
    • /
    • 2024
  • Because of muscle fatigue, motor unit recruitment and firing rates decrease and EMG power spectrum shifts toward lower frequencies as spectral compression which represented by a falling shift in the median frequency. However, changes of this frequency shows relatively less than those of the magnitudes of the low frequency band. This paper aims to examine the moderate ranges of the frequency bands in the existed ones as spectral fatigue indices of biceps femoris muscle. Twelve subjects participate in this experiment, and EMG signals are measured from these muscles during treadmill walking on the speed of 4.5 km/h. ANOVA analysis is used to compare changes of the low and high frequency band with reference to those of median frequency. Experimental results demonstrate that the low frequency band 25-82 Hz and the high frequency band 142-300 Hz could be appropriate for spectral fatigue indices of biceps femoris muscles.

Numerical Simulations of Dynamic Response of Cased Reactive System Subject to Bullet Impact (총탄 충격이 가해진 반응 시스템의 파괴 거동에 관한 수치적 연구)

  • Kim, Bohoon;Kim, Minsung;Doh, Youngdae;Kim, Changkee;Yoo, Jichang;Yoh, Jai-Ick
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.38 no.6
    • /
    • pp.525-538
    • /
    • 2014
  • Safety of reactive systems is one of the most important research areas in the field of weapon development. A NoGo response or at least a low-order explosion should be ensured to prevent unexpected accidents when the reactive system is impacted by high-velocity projectile. We investigated the shock-induced detonation of cased reactive systems subject to a normal projectile impact to the cylindrical surface based on two-dimensional hydrodynamic simulations using the I&G chemical rate law. Two types of energetic materials, namely LX-17 and AP-based solid propellant, were considered to compare the dynamic responses of the reactive system when subjected to the threshold impact velocity. It was found that shock-to-detonation transition phenomena occurred in the cased LX-17, whereas no full reaction occurred in the propellant.

Change in acoustic characteristics of voice quality and speech fluency with aging (노화에 따른 음질과 구어 유창성의 음향학적 특성 변화)

  • Hee-June Park;Jin Park
    • Phonetics and Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.45-51
    • /
    • 2023
  • Voice issues such as voice weakness that arise with age can have social and emotional impacts, potentially leading to feelings of isolation and depression. This study aimed to investigate the changes in acoustic characteristics resulting from aging, focusing on voice quality and spoken fluency. To this end, tasks involving sustained vowel phonation and paragraph reading were recorded for 20 elderly and 20 young participants. Voice-quality-related variables, including F0, jitter, shimmer, and Cepstral Peak Prominence (CPP) values, were analyzed along with speech-fluency-related variables, such as average syllable duration (ASD), articulation rate (AR), and speech rate (SR). The results showed that in voice quality-related measurements, F0 was higher for the elderly and voice quality was diminished, as indicated by increased jitter, shimmer, and lower CPP levels. Speech fluency analysis also demonstrated that the elderly spoke more slowly, as indicated by all ASD, AR, and SR measurements. Correlation analysis between voice quality and speech fluency showed a significant relationship between shimmer and CPP values and between ASD and SR values. This suggests that changes in spoken fluency can be identified early by measuring the variations in voice quality. This study further highlights the reciprocal relationship between voice quality and spoken fluency, emphasizing that deterioration in one can affect the other.

Exploring Small Group Argumentation and Epistemological Framing of Gifted Science Students as Revealed by the Analysis of Their Responses to Anomalous Data (변칙 사례에 대한 과학 영재 학생들의 반응에서 드러난 인식론적 프레이밍과 소집단 논변활동 탐색)

  • Lee, Eun Ju;Yun, Sun Mi;Kim, Heui-Baik
    • Journal of The Korean Association For Science Education
    • /
    • v.35 no.3
    • /
    • pp.419-429
    • /
    • 2015
  • In this study, we explored students' epistemological framing during scientific argumentation and how interactions among group members influenced group argumentation. Twenty-one gifted science students divided into groups of three or four participated in this study. Students' discussions related to data interpretation concerning the rate of photosynthesis were analyzed. Students' activities were videotaped in groups so the discourse could be transcribed and students' behavioral cues analyzed. Students' epistemological framing has been identified through analysis of their speech and behavioral responses to the anomalous data from the inquiry process. Subsequently, their sources of warrant and group argumentation levels were explored. We found out that group members framed the inquiry in two ways: "understanding phenomena" and "classroom game." Group members whose framing was "understanding phenomena" required other members to justify the anomalous data by examining its validity and reliability, which conclusively demonstrated a high level of argumentation. On the other hand, when group members used "classroom game" to frame their argumentation, they did not recognize the necessity of explaining the anomalous data; rather, these students used simple empirical justification to explain the data, reflecting a low level of argumentation. When students using different epistemological framing disagreed over interpretations of anomalous data throughout the discussion, clashes ensued that resulted in emotional conflict and a lack of discussion. Students' framing shifts were observed during the discussion on which group leaders seemed to have a huge influence. This study lays the foundation for future work on establishing productive framing to prompt scientific argumentation in science classrooms.