• 제목/요약/키워드: audio data

Search Result 879, Processing Time 0.028 seconds

A Study of Real-Time Implementation of Audio/Data Processor for Digital/Analog Dual mode Mobile Phone (디지탈/아날로그 겸용 이동통신 단말기를 위한 오디오/데이타 프로세서의 실시간 구현에 관한 연구)

  • Byun, Kyung-Jin;Kim, Jong-Jae;Han, Ki-Chun;Yoo, Hah-Young;Cha, Jin-Jong;Kim, Kyung-Su
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.80-88
    • /
    • 1997
  • In this paper, the implementation of audio/data processor using ETRI DSP to support analog mode in digital/analog dual mode mobile phone is presented. Audio/data processor performs the wideband data processing, audio signal processing, demodulation function, and data rate conversion when it is operated in analog mode. These functions are programmed in assembly language, and then loaded to ETRI DSP together with vocoder program for the digital mode operation. This is a very efficient implementation of the dual mode cellular phone ASIC since the vocoder for the digital mode and audio/data processor for the analog mode are programmed together in the same hardware.

  • PDF

Study on data augmentation methods for deep neural network-based audio tagging (Deep neural network 기반 오디오 표식을 위한 데이터 증강 방법 연구)

  • Kim, Bum-Jun;Moon, Hyeongi;Park, Sung-Wook;Park, Young cheol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.37 no.6
    • /
    • pp.475-482
    • /
    • 2018
  • In this paper, we present a study on data augmentation methods for DNN (Deep Neural Network)-based audio tagging. In this system, an audio signal is converted into a mel-spectrogram and used as an input to the DNN for audio tagging. To cope with the problem associated with a small number of training data, we augment the training samples using time stretching, pitch shifting, dynamic range compression, and block mixing. In this paper, we derive optimal parameters and combinations for the augmentation methods through audio tagging simulations.

Security of Generalized Patchwork Algorithm for Audio Signal (오디오 신호에 적용된 Generalized Patchwork Algorithm의 안전성)

  • Kim Ki-Seob;Kim Hyoung-Joong;;Yang Jae-Soo
    • 한국정보통신설비학회:학술대회논문집
    • /
    • 2006.08a
    • /
    • pp.219-222
    • /
    • 2006
  • In this paper we present a cryptanalysis of the generalized patchwork algorithm under the assumption that the attacker possesses only a single copy of the watermarked audio. In the scheme, watermark is inserted by modifying randomly chosen DCT values in each block of the original audio. Towards the attack we first fit low degree polynomials (which minimize the mean square error) on the data available from each block of the watermarked content. Then we replace the corresponding DCT data of the at-tacked audio by the available data from the polynomials to construct an attacked audio. The technique nullifies the modification achieved during watermark embedding. Experimental results show that recovery of the watermark becomes difficult after the attack.

  • PDF

Implementation of Slide-Show Functionality for the Terrestrial Digital Multimedia Broadcasting (지상파 디지털 멀티미디어 방송을 위한 슬라이드 쇼 기능 구현)

  • 박성일;김광석;김용한
    • Journal of Broadcast Engineering
    • /
    • v.8 no.3
    • /
    • pp.217-227
    • /
    • 2003
  • This paper describes an implementation of the slide-show functionality, which is one of the services that can be provided by the Digital Multimedia Broadcasting (DMB). While the existing analog radio broadcasting services provide audio only, DMB slide-show is the functionality that can deliver still images associated with the audio. For example, it can deliver the photographs of the singer, album cover images, or the lyrics of the song that correspond to the audio. There are two modes for the transmission of the slide-show. Firstly. the program-associated data (PAD) field within the DMB audio frame can be utilized and secondly, the slide-show data can be transmitted, after being multiplexed, with other service data as individual data stream separated from the audio. This paper describes PC-based implementations of a transmitter-side module that inserts slide-show data into the PAD area within audio bitstream and a receiver-side application module that plays the slide-show through decoding the PAD within the received audio bitstream and demonstrates their validity through experiments.

Additional data packetizing method for providing multichannel audio service on T-DMB environment (지상파 DMB 환경에서 멀티채널 오디오 서비스를 제공하기 위한 부가정보 패킷화 방법 연구)

  • Lee, Yong-Ju;Seo, Jeong-Il;Beack, Seung-Kwon;Kang, Kyeong-Ok;Lim, Jong-Soo
    • Journal of Broadcast Engineering
    • /
    • v.14 no.3
    • /
    • pp.332-341
    • /
    • 2009
  • Terrestrial digital multimedia broadcasting(T-DMB) is one of mobile broadcasting services, and the commercial service was started in December 2005 in Korea. The performance targets of T-DMB are providing VCD(video CD) quality video and FM radio quality audio. In recent years, the researches for providing high quality video or audio service on T-DMB environments have been being carried out. To provide high-quality video or audio service, some additional data should be transmitted to the receiver as well as T-DMB video and audio data. Since the data rate for one T-DMB program is low, it is important to transmit the additional data at a low bit rate. In this paper, we propose a packetizing method for efficient transmission of the additional data to provide multichannel audio service on T-DMB environment.

Reversible Watermarking for Audio Using Recompression Method (재압축 기술을 이용한 오디오 파일에서의 가역 정보은닉)

  • Whang, Ho Young;Kim, Hyoung Joong
    • Journal of Digital Contents Society
    • /
    • v.14 no.2
    • /
    • pp.199-206
    • /
    • 2013
  • Various methods of data compression have been developed to handle data within limited storage capacity and limited transmission speed. Recompression technology, a technology most recent among them, is a technology that can embed data regardless of the information entropy of a data. Recompression technology separates original multimedia data in to blocks and embeds 0 or 1 according to whether each block is flipped or not. In this paper, this technology has been applied on audio files. And was able to implement reversible watermarking for audio files.

A study on the implementation of a digital video/audio system to support multi-audio format (다양한 오디오 포맷을 지원하는 비디오/오디오 시스템 구현에 관한 연구)

  • Park In-Gyu
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.43 no.4 s.310
    • /
    • pp.123-132
    • /
    • 2006
  • In this paper, the digital video and audio system is improved so that various digital video data formats in DVD disc, and digital audio data formats through the S/PDIF ports may be decoded. It is not easy to implement all decoding functions of video and audio by a DVD processor. The special structure in audio decoding circuit is proposed in this system so as to have simultaneously almost same video and audio performance in quality. By dividing the decoding circuit separately into video and audio part, the audio quality can be dramatically improved together with supporting several audio formats and with several effects. In order to satisfy the perfect audio system to support to audio decoding formats, it is just enough to get the expensive, complicated decoder. However, it may be not easy to get this expensive decoder in near future. Therefore it is rather to adopt the downloading method by which the host should download the appropriate code into memory by detecting the corresponding audio bit streams. It is proved that this method may be efficient in the point of sharing resource of audio data for video decoding.

Analysis of Storage and Retrieval Results of Audio Sources and Signatures using Blockchain and Distributed Storage System

  • Lee, Kyoung-Sik;Kim, Sang-Kyun
    • Journal of Broadcast Engineering
    • /
    • v.24 no.7
    • /
    • pp.1228-1236
    • /
    • 2019
  • Recently, media platforms such as YouTube and Twitch provide services that can generate personal revenue by utilizing media content produced by individuals. In this regard, interest in the copyright of media content is increasing. In particular, in the case of an audio source, competition for securing audio source copyright is fierce because it is an essential element for almost all media content production. In this paper, we propose a method to store the audio source and its signature using a blockchain and distributed storage system to verify the copyright of music content. To identify the possibility of extracting the audio signature of the audio source and to include it as blockchain transaction data, we implement the audio source and its signature file upload system based on the proposed scheme. In addition, we show the effectiveness of the proposed method through experiments on uploading and retrieving audio files and identify future improvements.

A Study on Digital Image Watermarking for Embedding Audio Logo (음성로고 삽입을 위한 디지털 영상 워터마킹에 관한 연구)

  • Cho, Gang-Seok;Koh, Sung-Shik
    • Journal of the Institute of Electronics Engineers of Korea TE
    • /
    • v.39 no.3
    • /
    • pp.21-27
    • /
    • 2002
  • The digital watermarking methods have been proposed as a solution for solving the illegal copying and proof of ownership problems in the context of multimedia data. But it is still difficult to have been overcame the problem of the protection of property to multimedia data, such as digital images, digital video, and digital audio. This paper describes a watermarking algorithm that embeds non-linearly audio logo watermark data which is converted from audio signal of the ownership in the components of pixel intensities in an original image and that insists of ownership by hearing the audio signal transformed from the extracted audio logo through the speaker. Experimental results show that our algorithm using audio logo proposed in this paper is robust against attacks such as particularly lossy JPEG image compression. 

Performance Analysis of Audio Data Hiding Method based on Phase Information with Various Window Length (주파수 변환의 길이에 따른 위상 기반 오디오 정보 은닉 기술의 음질 및 성능 분석)

  • Cho, Kiho;Kim, Nam Soo
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.12
    • /
    • pp.232-237
    • /
    • 2013
  • The role of the window length of time-frequency transformation is important for the audio data hiding methods utilizing phase information. In this paper, the experiments for our audio data hiding method were conducted in order to evaluate the audio quality and robustness against reverberant environment. The experimental results showed the tendency that the worse audio quality but better robustness were obtained when the lengthy window was applied. The important reason for quality degradation was pre-echo which flatters the percussive sound. The results also indicated that the wireless communication theory related to the length of time-frequency transform can be applied in the field of audio data hiding and acoustic data transmission.