• Title/Summary/Keyword: Audio Analysis

Search Result 536, Processing Time 0.027 seconds

A Study on Audio-visual Stimulation Based Unconstrained Stress Analysis using Chair-type BCG Measurement System (의자형 심탄도 측정시스템을 이용한 시청각 자극 기반의 무구속 스트레스 분석 연구)

  • Kim, Byeong-Ju;Noh, Yun-hong;Jeong, Do-Un
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.04a
    • /
    • pp.1012-1013
    • /
    • 2014
  • 본 논문에서는 일상생활 중 지속적으로 심장 상태를 모니터링 할 수 있는 무구속 의자형 심탄도 측정시스템을 개발하였다. 또한 구현된 시스템에서 측정된 생체신호를 이용하여 주관적인 감정자극의 스트레스를 분석하기 위한 연구를 수행하였다. 수준을 분석하고자 하였다. 실험은 시스템에 착석하여 실시간으로 시청각 자극 실험을 수행하였고, 심박수와 심박변이도의 시간영역 및 주파수영역 파라미터를 확인하였다. 확인된 심박변이도의 파라미터는 시청각 도중 기술한 인간의 감정들을 체계화하여 2차원 공간에 여러 감정들의 관계를 나타낸 제임스 러셀(J. Russell)의 감정모델을 주관적인 감정 자극에 의한 스트레스 지표 나타내어 비교 분석하였다. 실험결과는 RMSSD, LF/HF 파라미터가 스트레스 수준 분류에 사용될 수 있는 잠재력을 가지고 있음을 증명한다.

A Study on the Analysis of the Audio DAC Performance (음성 DAC 의 성능 분석에 대한 고찰)

  • Sung, Kyunghun;Park, Seungsang;Nam, Wongtae;Go, Junghwan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2018.05a
    • /
    • pp.484-485
    • /
    • 2018
  • DAC 는 디지털-아날로그 변환 회로는 디지털 전기 신호를 아날로그 전기 신호로 변환하는 전자 회로이다. 특히 최근 음성 신호는 그 효율성 및 경제성 때문에 디지털 데이터 형태로 저장/전송되고 있어 DAC 는 음성 관련 사업에서 필수적으로 쓰이고 있다. 본 논문은 음성 신호의 디지털-아날로그 변환 시 DAC 의 성능에 대한 분석 및 시험 결과를 소개한다.

The Performance Analysis of Optical CDMA based Acoustic Sensor System using Optical Fiber Sensors (광 CDMA 기반광섬유 센서를 이용한 음파탐지 시스템의 특성 분석)

  • Park, Sang-Jo;Kim, Bong-Kyu
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.5
    • /
    • pp.956-962
    • /
    • 2008
  • We propose the optical CDMA based audio acoustic sensor using optical fiber sensors which can be used in the bottom of the sea. In the proposed network, we analyzed the performance of noise power. Numerical simulations confirm that the performance can be improved by increasing the measuring time of optical sensors compared with using conventional WDMA method.

A Study on the Lexical Diversity of Korean-Chinese Bilingual Children (한국어·중국어 이중 언어 사용 아동의 어휘 다양성)

  • Choi, Jiyoung
    • Journal of Korean language education
    • /
    • v.28 no.4
    • /
    • pp.245-271
    • /
    • 2017
  • This study aimed at investigating the lexical diversity in the "Frog Story" narratives of Korean-Chinese bilingual children. Six bilingual speakers of Korean children- four boys and two girls- were audio recorded as they produced narratives based on pictures from the Mercer Mayer book "Frog, where are you?" The order of narration was counterbalanced. The vocabularies from narratives were analyzed by type, token, TTR (type-token Ratio) and D value using the CLAN (Computerized Language Analysis) program. The findings showed that the pattern of lexical diversity in Korean is similar with the Chinese, but the TTR and D value of Chinese still remain low in comparison with those of Korean. In addition, Korean language seems to have significant influence on Chinese in the language usage pattern and vice versa.

Analysis on the Possibility of Electronic Surveillance Society in the Intelligence Information age

  • Chung, Choong-Sik
    • Journal of Platform Technology
    • /
    • v.6 no.4
    • /
    • pp.11-17
    • /
    • 2018
  • In the smart intelligence information society, there is a possibility that the social dysfunction such as the personal information protection issue and the risk to the electronic surveillance society may be highlighted. In this paper, we refer to various categories and classify electronic surveillance into audio surveillance, visual surveillance, location surveillance, biometric information surveillance, and data surveillance. In order to respond to new electronic surveillance in the intelligent information society, it requires a change of perception that is different from that of the past. This starts with the importance of digital privacy and results in the right to self-determination of personal information. Therefore, in order to preemptively respond to the dysfunctions that may arise in the intelligent information society, it is necessary to further raise the awareness of the civil society to protect information human rights.

Development of Automative Loudness Control Technique based on Audio Contents Analysis using Deep Learning (딥러닝을 이용한 오디오 콘텐츠 분석 기반의 자동 음량 제어 기술 개발)

  • Lee, Young Han;Cho, Choongsang;Kim, Je Woo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2018.11a
    • /
    • pp.42-43
    • /
    • 2018
  • 국내 디지털 방송 프로그램은 2016년 방송법 개정 이후, ITU-R / EBU에서 제안한 측정 방식을 활용하여 채널 및 프로그램 간의 음량을 맞추어 제공되고 있다. 일반적으로 뉴스나 중계와 같이 실시간으로 음량을 맞춰야 하는 분야를 제외하고는 평균 음량을 규정에 맞춰 송출하고 있다. 본 논문에서는 일괄적으로 평균 음량을 맞출 경우 발생하는 저음량의 명료도를 높이기 위한 기술을 제안한다. 즉, 방송 음량을 조절하는 기술 중의 하나로 오디오 콘텐츠를 분석하여 구간별 음량 조절 정도를 달리함으로써 저음량에서의 음성은 상대적으로 높은 음량을 가지고 배경음악 등을 상대적으로 낮음 음량을 가지도록 생성함으로써 명료도를 높이는 방식을 제안한다. 제안한 방식의 성능을 확인하기 위해 오디오 콘텐츠 분석 정확도 측정과 오디오 파형 분석을 실시하였으며 이를 통해 기존의 음량 제어 기술과 비교하여 음성 구간에 대해 음량을 증폭시키는 것을 확인하였다.

  • PDF

A Case Study on the Healing Forest Development Plan of Kangwon Province (강원도 치유의 숲 조성 기본계획 수립에 관한 연구)

  • Kim, Myeong-Jun;Lee, Joon-Woo;Cha, Du-Song
    • Journal of Forest and Environmental Science
    • /
    • v.26 no.1
    • /
    • pp.53-63
    • /
    • 2010
  • This study carried out to establish a master plan about healing forest in Gangwon-do focusing on healing road and visitor center. The site of this study was approximately 721 ha of mountain in Imgye-myeon, Gangwon-do, and the master plan was established through analysis of humanities-social and natural environments. The healing forest was developed 6 healing trails(10.5 km), devided by 3 steps, and each healing trail was designed to make rest area, wooden bridge, and open space. Also, visitor center, the core place of healing forest, was devided to several spaces as health measurement room, AV room, etc. and was planed for audio-visual education room for visitors.

Design and Construction of a FFT Analyzer Using a Microcomputer (마이크로컴퓨터를 이용한 FFT 분석기의 설계 및 제작)

  • Lee, Hyeun Tae;Kim, Jung Gyu;Lee, Sang Bae
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.23 no.6
    • /
    • pp.944-949
    • /
    • 1986
  • By improving the ability of arithmatic processing with an arithmatic processor in a microcomputer and realizing the data input system for real time analysis, an FFT analyzer that is usable within the range of audio frequency is designed and constructed. The input signal passes through a gain programmable pre-amplifier and anti-aliasing lowpass filter into an analogditital converter to be converted into digital form. The converted input data is processed by an Apple II microcomputer. The results of the processing are displayed using a microcomputer display unit and can be copied on a printer or stored in a floppy disk.

  • PDF

Recent deep learning methods for tabular data

  • Yejin Hwang;Jongwoo Song
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.2
    • /
    • pp.215-226
    • /
    • 2023
  • Deep learning has made great strides in the field of unstructured data such as text, images, and audio. However, in the case of tabular data analysis, machine learning algorithms such as ensemble methods are still better than deep learning. To keep up with the performance of machine learning algorithms with good predictive power, several deep learning methods for tabular data have been proposed recently. In this paper, we review the latest deep learning models for tabular data and compare the performances of these models using several datasets. In addition, we also compare the latest boosting methods to these deep learning methods and suggest the guidelines to the users, who analyze tabular datasets. In regression, machine learning methods are better than deep learning methods. But for the classification problems, deep learning methods perform better than the machine learning methods in some cases.

Digital enhancement of pronunciation assessment: Automated speech recognition and human raters

  • Miran Kim
    • Phonetics and Speech Sciences
    • /
    • v.15 no.2
    • /
    • pp.13-20
    • /
    • 2023
  • This study explores the potential of automated speech recognition (ASR) in assessing English learners' pronunciation. We employed ASR technology, acknowledged for its impartiality and consistent results, to analyze speech audio files, including synthesized speech, both native-like English and Korean-accented English, and speech recordings from a native English speaker. Through this analysis, we establish baseline values for the word error rate (WER). These were then compared with those obtained for human raters in perception experiments that assessed the speech productions of 30 first-year college students before and after taking a pronunciation course. Our sub-group analyses revealed positive training effects for Whisper, an ASR tool, and human raters, and identified distinct human rater strategies in different assessment aspects, such as proficiency, intelligibility, accuracy, and comprehensibility, that were not observed in ASR. Despite such challenges as recognizing accented speech traits, our findings suggest that digital tools such as ASR can streamline the pronunciation assessment process. With ongoing advancements in ASR technology, its potential as not only an assessment aid but also a self-directed learning tool for pronunciation feedback merits further exploration.