• Title/Summary/Keyword: Audio Quality

Audio Stream Delivery Using AMR(Adaptive Multi-Rate) Coder with Forward Error Correction in the Internet (인터넷 환경에서 FEC 기능이 추가된 AMR음성 부호화기를 이용한 오디오 스트림 전송)

  • 김은중;이인성
    • The Journal of Korean Institute of Communications and Information Sciences / v.26 no.12A / pp.2027-2035 / 2001
  • In this paper, we present an audio stream delivery scheme for the Internet that uses the AMR (Adaptive Multi-Rate) coder, adopted by ETSI and 3GPP as a standard vocoder for the next-generation IMT-2000 service, and that combines a sender-based technique (FEC) with a receiver-based reconstruction technique. With the media-specific FEC scheme, repair data is added to the main data stream so that the contents of lost packets can be recovered, which greatly increases the chance of recovering lost packets. Because the AMR codec is based on the code-excited linear predictive (CELP) coding model, we use a frame erasure concealment method for CELP-based coders. The proposed scheme is evaluated against the ITU-T G.729 (CS-ACELP) coder and AMR at 12.2 kbit/s using SNR (Signal-to-Noise Ratio) and MOS (Mean Opinion Score) tests. At 10% packet loss, the proposed scheme scores 1.1 higher in MOS and 5.61 dB higher in SNR than AMR at 12.2 kbit/s, and it maintains communicable speech quality at frame erasure rates up to 20%.
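
The media-specific FEC idea described in this abstract, repair data carried alongside the main stream, can be illustrated with a minimal sketch. It assumes the simplest possible layout, in which packet n also carries a copy of frame n-1; in practice the redundant copy is usually a lower-bit-rate encoding, and the paper's actual packet format is not given here. The names `packetize` and `reconstruct` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# Assumed packet layout: (sequence number, primary frame, redundant previous frame).
Packet = Tuple[int, bytes, Optional[bytes]]


def packetize(frames: List[bytes]) -> List[Packet]:
    """Piggyback frame n-1 onto packet n as repair data."""
    packets = []
    for seq, frame in enumerate(frames):
        redundant = frames[seq - 1] if seq > 0 else None
        packets.append((seq, frame, redundant))
    return packets


def reconstruct(received: List[Packet], total: int) -> List[Optional[bytes]]:
    """Rebuild the frame sequence; None marks frames left for concealment."""
    primary: Dict[int, bytes] = {}
    repair: Dict[int, bytes] = {}
    for seq, frame, redundant in received:
        primary[seq] = frame
        if redundant is not None:
            repair[seq - 1] = redundant
    # Prefer the primary frame; fall back to repair data from the next packet.
    return [primary.get(i, repair.get(i)) for i in range(total)]


frames = [bytes([i]) * 4 for i in range(6)]
sent = packetize(frames)
# Drop packet 2 (an isolated loss) and packets 4 and 5 (a loss at the end of the stream).
received = [p for p in sent if p[0] not in (2, 4, 5)]
output = reconstruct(received, len(frames))
```

In this sketch the isolated loss is repaired from the following packet, while the trailing losses have no later packet to draw on and are left as `None`, which is where a receiver-side frame erasure concealment step would take over.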

Speech Animation Synthesis based on a Korean Co-articulation Model (한국어 동시조음 모델에 기반한 스피치 애니메이션 생성)

  • Jang, Minjung;Jung, Sunjin;Noh, Junyong
    • Journal of the Korea Computer Graphics Society / v.26 no.3 / pp.49-59 / 2020
  • In this paper, we propose a speech animation synthesis method specialized for Korean that uses a rule-based co-articulation model. Speech animation is widely used in the cultural industry, in movies, animations, and games that require natural and realistic motion. However, because techniques for audio-driven speech animation have been developed mainly for English, the animation results for domestic content are often visually very unnatural. For example, a voice actor's dubbing is played with no mouth motion at all or, at best, with an unsynchronized loop of simple mouth shapes. Language-independent speech animation models exist, but they are not specialized for Korean and do not yet deliver the quality needed for domestic content production. Therefore, we propose a natural speech animation synthesis method, driven by input audio and text, that reflects the linguistic characteristics of Korean. Reflecting the fact that vowels mostly determine the mouth shape in Korean, we define a co-articulation model that separates the lips and the tongue to solve the earlier problems of lip distortion and occasional loss of some phoneme characteristics. Our model also reflects differences in prosodic features to improve the dynamics of the speech animation. Through user studies, we verify that the proposed model can synthesize natural speech animation.

Information Technologies as an Incentive to Develop the Creative Potential of the Educational Process

  • Natalia, Vdovychenko;Volodymyr, Kukorenchuk;Alina, Ponomarenko;Mykola, Honcharenko;Eduard, Stranadko
    • International Journal of Computer Science & Network Security / v.22 no.4 / pp.408-416 / 2022
  • The new millennium is characterized by an unprecedented breakthrough in knowledge and in information and communication technologies, and the challenges of the 21st century require modernized paradigms of interaction in all spheres of life. Education continues to play a key role in national and global growth. The key role of education and its leadership in developing creative potential, as the main paradigm of countries' stability, have significantly influenced educational centers. The developers of educational programs use information technologies as an incentive to develop the creative potential of the educational process. The professional training of education applicants is enhanced by the use of information technologies, so applicants should develop technological skills to be productive members of society. Using the latest achievements in information technologies to organize the educational process helps to form an operational style of thinking in education applicants, which provides the ability to acquire skills in processing information presented in text, graphic, and tabular form, and raises the level of general and informational culture necessary for better orientation in the modern information space. The purpose of the research is to determine, on the basis of a survey, the effectiveness of information technologies as an incentive to develop the creative potential of the educational process, and to establish their advantages and their ability to provide high-quality education. Methods of research: comparative analysis, systematization, generalization, and survey. Results: Based on the survey conducted among students and teachers, it was found that teachers use the following information technologies to develop the creative potential of the educational process: tools for video and audio communication (100%), Moodle (95.6%), Duolingo (89.7%), LinguaLeo (89%), Google Forms (88%), and Adobe Captivate Prime (80.6%). It was determined that modular digital learning environments (97.9%), interactive exercise tools (96.3%), ICT for video and audio communication (96%), and interactive exercise tools (95.1%) are most conducive to the development of the creative potential of the educational process. The research revealed that implementing information technologies to develop the creative potential of the educational process in educational institutions is a complex process because of the large number of variables that must be taken into account at both the course level and the individual level. It was determined that using the model for implementing information technologies to develop creative potential in the educational process benefits both students and teachers by establishing a reliable two-way connection between teacher and education applicant.

Real data-based active sonar signal synthesis method (실데이터 기반 능동 소나 신호 합성 방법론)

  • Yunsu Kim;Juho Kim;Jongwon Seok;Jungpyo Hong
    • The Journal of the Acoustical Society of Korea / v.43 no.1 / pp.9-18 / 2024
  • Active sonar systems are growing in importance as underwater targets become quieter and ambient noise rises with increasing maritime traffic. However, the low signal-to-noise ratio of the echo signal, caused by multipath propagation, various clutter, ambient noise, and reverberation, makes it difficult to identify underwater targets using active sonar. Attempts have been made to apply data-based methods such as machine learning or deep learning to improve the performance of underwater target recognition systems, but it is difficult to collect enough training data because of the nature of sonar datasets. Methods based on mathematical modeling have mainly been used to compensate for insufficient active sonar data, but they are limited in how accurately they can simulate complex underwater phenomena. Therefore, in this paper, we propose a sonar signal synthesis method based on a deep neural network. To apply the neural network model to sonar signal synthesis, the proposed method adapts the attention-based encoder and decoder, the main modules of the Tacotron model widely used in speech synthesis, to sonar signals. Training the proposed model on a dataset collected by placing a simulated target in an actual marine environment makes it possible to synthesize signals that more closely resemble actual signals. To verify the performance of the proposed method, a Perceptual Evaluation of Audio Quality (PEAQ) test was conducted, and the score difference from the actual signal was within -2.3 across a total of four different environments. These results show that the active sonar signal generated by the proposed method approximates the actual signal.

Development of an Eye Patch-Type Biosignal Measuring Device to Measure Sleep Quality (수면의 질을 측정하기 위한 안대형 생체신호 측정기기 개발)

  • Changsun Ahn;Jaekwan Lim;Bongsu Jung;Youngjoo Kim
    • KIPS Transactions on Computer and Communication Systems / v.12 no.5 / pp.171-180 / 2023
  • The three major sleep disorders in Korea are snoring, sleep apnea, and insomnia. Lack of sleep is the root of all diseases. Some of the most serious potential problems associated with sleep deprivation are cardiovascular problems, cognitive impairment, obesity, diabetes, colitis, and prostate cancer. To address these problems, the Korean government began providing low-cost national health insurance benefits for polysomnography tests in July 2018. However, insomnia patients still face barriers of time, place, and cost in getting treated. Therefore, it would be better for insomnia patients to be able to test at home. The measuring device can measure six biosignals (eye movement, tossing and turning, body temperature, oxygen saturation, heart rate, and audio). A gyroscope sensor (MPU9250, InvenSense, USA) was used for eye movement and for tossing and turning. The input range of the sensor was 258°/sec to 460°/sec, and the measured data fell within the input range. Body temperature, oxygen saturation, and heart rate were measured by a sensor (MAX30102, Analog Devices, USA). The body temperature measurement range was 30 ℃ to 45 ℃, and the oxygen saturation reading was 0% when the device was not in use and 20% to 90% when in use. The heart rate measurement range was 40 bpm to 180 bpm. The audio signal was measured by an audio sensor (AMM2742-T-R, PUIaudio, USA). The sensitivity was -42 dB ±1 dB, and the frequency range was 20 Hz to 20 kHz. The measured data was successfully received under wireless network conditions. The system consisted of a PC and a mobile app for biosignal measurement and data collection. The measured data was collected by mobile phones and desktops. The collected data can be used as preliminary data to determine sleep stages and to screen for sleep induction and sleep disturbances. In the future, this convenient sleep measurement device could be beneficial for treating insomnia.

MPEG2-TS to RTP Transformation and Application system (MPEG2-TS의 RTP 변환 및 적용 시스템)

  • Im, Sung-Jin;Kim, Ho-Kyom;Hong, Jin-Woo;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2010.10a / pp.643-645 / 2010
  • Internet-based multimedia services such as IPTV are expanding as technology develops to support the convergence of broadcasting and telecommunications, and the role of control technology appears to be growing. In particular, for real-time TV broadcasting, multicast control technology that supports authentication and resource control is expected to be developed, along with technologies that add value to a variety of two-way services. Internet-based transmission systems deliver video content using RTP (Real-time Transport Protocol). Within the IETF (Internet Engineering Task Force), the standardization body for RTP, a separate transmission format (RTP payload format) standard is established for each audio and video format, and standardization of the "RTP Payload Format for SVC (Scalable Video Coding) Video" for scalable video content is currently in progress. In this paper, to improve the quality of broadcasting and telecommunication systems, we design and implement a system that converts existing MPEG2-TS to RTP so that the upper-layer application can react adaptively, improving ETE (End-to-End) QoS (Quality of Service) when a variety of content is delivered to a variety of consumer devices.

Speech Packet Transmission Using the AMR-WB Coder with FEC (FEC기능을 추가한 AMR-WB 음성 부호화기를 이용한 음성 패킷 전송)

  • 황정준;이인성
    • Journal of the Institute of Electronics Engineers of Korea TC / v.40 no.11 / pp.63-71 / 2003
  • This paper proposes a packet loss recovery method for real-time communication over the Internet. To reduce the effects of packet loss, Forward Error Correction (FEC), which adds redundant information to voice packets, can be used. We use the Adaptive Multi-Rate Wideband (AMR-WB) codec, which was recently selected by the Third Generation Partnership Project (3GPP) for GSM and the third-generation WCDMA mobile communication system and has also been standardized by ITU-T for wideband speech services. The major cause of speech quality degradation in IP networks is packet loss. We therefore recover single lost packets using the FEC method and conceal consecutive losses. The proposed scheme is evaluated in the Gilbert Internet channel model. High audio quality was maintained at packet loss rates up to 30%.
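
The evaluation setup named in this abstract, a Gilbert channel combined with single-loss recovery, can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's simulator: it uses the common two-state form of the Gilbert model (good state delivers, bad state drops), the transition probabilities `p` and `q` are hypothetical values, and recovery is assumed to succeed whenever the packet after a lost one arrives.

```python
import random
from typing import List


def gilbert_losses(n: int, p: float, q: float, seed: int = 0) -> List[bool]:
    """Return n loss flags from a two-state Gilbert model.

    p: probability of moving from the good (received) state to the bad (lost) state.
    q: probability of moving from the bad state back to the good state.
    """
    rng = random.Random(seed)
    lost = False  # start in the good state
    flags = []
    for _ in range(n):
        if lost:
            lost = rng.random() >= q  # stay lost with probability 1 - q
        else:
            lost = rng.random() < p   # become lost with probability p
        flags.append(lost)
    return flags


def residual_loss_after_fec(flags: List[bool]) -> float:
    """Loss rate when packet n+1 can repair packet n (single-loss recovery)."""
    unrecovered = 0
    for i, lost in enumerate(flags):
        # Treat the packet after the last one as lost: no repair data arrives.
        next_lost = flags[i + 1] if i + 1 < len(flags) else True
        if lost and next_lost:
            unrecovered += 1
    return unrecovered / len(flags)


p, q = 0.1, 0.4  # hypothetical parameters; long-run loss rate is p / (p + q) = 0.2
flags = gilbert_losses(100_000, p, q, seed=1)
raw_loss = sum(flags) / len(flags)
residual = residual_loss_after_fec(flags)
```

Because losses in this model arrive in bursts, a scheme that repairs only single losses leaves the interior of each burst unrecovered, which is why the abstract pairs FEC with concealment of consecutive errors.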

Sound event detection based on multi-channel multi-scale neural networks for home monitoring system used by the hard-of-hearing (청각 장애인용 홈 모니터링 시스템을 위한 다채널 다중 스케일 신경망 기반의 사운드 이벤트 검출)

  • Lee, Gi Yong;Kim, Hyoung-Gook
    • The Journal of the Acoustical Society of Korea / v.39 no.6 / pp.600-605 / 2020
  • In this paper, we propose a sound event detection method using multi-channel multi-scale neural networks for a sound-sensing home monitoring system for the hearing impaired. In the proposed system, the two channels with the highest signal quality are selected from several wireless microphone sensors in the home. Three features extracted from the sensor signals (time difference of arrival, pitch range, and the outputs obtained by applying a multi-scale convolutional neural network to the log mel spectrogram) are fed to a classifier based on a bidirectional gated recurrent neural network to further improve sound event detection performance. The detected sound event is converted into text, together with the sensor position of the selected channel, and provided to the hearing impaired. The experimental results show that the sound event detection method of the proposed system outperforms the existing method and can effectively deliver sound information to the hearing impaired.

Experiences of Treatment-Related Side Effects and Supportive Care with Korean Medicine in Women with Breast Cancer - A Focus Group Study (유방암 환자의 항암 치료 부작용 및 한의학적 보완치료 경험에 관한 포커스 그룹 연구)

  • Han, Sola;Jang, Bo-Hyoung;Hwang, Deok-Sang;Suh, Hae Sun
    • The Journal of Korean Obstetrics and Gynecology / v.30 no.1 / pp.85-94 / 2017
  • Objectives: To explore experiences of treatment-related side effects and supportive care among Korean breast cancer survivors (BCS). Methods: A focus group interview was conducted with six Korean women with breast cancer. Participants were recruited through snowball sampling. The interview was audio-recorded and transcribed verbatim. NVivo-11 was used to code the data into themes. Results: Two major themes were identified: (1) experiences of Western medicine, including treatment, side effects, needs, and costs; (2) experiences of supportive care with Korean medicine, including the same aspects. All participants experienced Western medicine in the treatment phase and reported impairment of physical, emotional, and social functioning during and after Western medicine treatment. Only three participants used Korean medicine after treatment ended. Negative responses from Western medicine doctors were the most important factor keeping participants from accessing Korean medicine when treatment-related side effects occurred. For this reason, some participants used Korean medicine without disclosing it. Participants usually acquired information about Korean medicine from online communities or other BCS, which was another important factor because it raised concerns about the side effects and credibility of Korean medicine. High cost was also reported as a barrier to using Korean medicine. During cancer treatment, participants tended to endure their treatment-related side effects. Conclusions: Korean BCS may be at high risk of physical or emotional distress during the treatment period. The findings suggest a high need for supportive care to relieve treatment-related side effects and improve patients' quality of life. Furthermore, systematic guidance or credible information sources should be developed to help patients find the best supportive care options, including Korean medicine.

An Exploratory Investigation on Multimedia Information Needs and Searching Behavior among College Students (멀티미디어 정보요구와 검색행태에 관한 탐색적 연구)

  • Chung, Eun-Kyung
    • Journal of the Korean Society for Library and Information Science / v.46 no.3 / pp.251-270 / 2012
  • Multimedia needs and searching have become important in everyday life, especially among the younger generation. Multimedia needs and searching behaviors differ from textual information needs and searching behaviors in a wide variety of ways. By interviewing and observing college students from 20 areas in Seoul, this study aims to improve the understanding of users' multimedia needs and of how users search for multimedia. The findings are presented in terms of searching sources, multimedia needs, relevance criteria, and searching barriers. The primary searching sources for multimedia are Naver and Google, and distinctive features appear depending on the individual multimedia type. When multimedia needs are categorized as generic, specific, or abstract, most are classified as specific rather than generic, but there are differences depending on the type of multimedia. In addition, relevance criteria and searching barriers reflect the characteristics of the individual multimedia types. The findings of this study demonstrate that distinct indexing and searching environments for each type of multimedia might be necessary to improve the quality of multimedia searching.