Search | Korea Science

Music Genre Classification using Time Delay Neural Network (시간 지연 신경망을 이용한 음악 장르 분류)

이재원;조찬윤;김상균
- Journal of Korea Multimedia Society / v.4 no.5 / pp.414-422 / 2001
This paper proposes a music genre classifier that uses a time delay neural network (TDNN) for audio data retrieval systems. The classifier considers eight genres: Blues, Country, Hard Core, Hard Rock, Jazz, R&B (Soul), Techno, and Thrash Metal. The unit of comparison for classifying genres is the melody between bars. The melody pattern is extracted based on the snare drum sound, which represents the periodicity of the rhythm effectively. The classifier is built with the TDNN and takes the Fourier-transformed feature vector of the melody as its input pattern. We evaluated the classifier on eighty training samples (ten pieces of music per genre) and forty test samples (five pieces per genre) and obtained correct classification rates of 92.5% and 60%, respectively.
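The input preparation described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the frame length, the direct DFT, and the helper names `dft_magnitude` and `time_delay_windows` are assumptions introduced here, and the snare-based segmentation and the network itself are omitted.

```python
import cmath
import math

def dft_magnitude(frame):
    """Magnitude spectrum (bins 0..N/2) of a real-valued frame via a direct DFT."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(frame))
        spectrum.append(abs(acc))
    return spectrum

def time_delay_windows(feature_frames, width):
    """Concatenate each run of `width` consecutive frames into one input vector,
    which is how a time-delay layer sees a short stretch of temporal context."""
    return [
        [v for frame in feature_frames[i:i + width] for v in frame]
        for i in range(len(feature_frames) - width + 1)
    ]

# Toy segment (hypothetical data): a pure tone completing 3 cycles in 16 samples.
segment = [math.cos(2 * math.pi * 3 * i / 16) for i in range(16)]
features = dft_magnitude(segment)
peak_bin = max(range(len(features)), key=features.__getitem__)
```

In a real system the frames would come from the melody segments between bars, and the windowed vectors would be fed to the TDNN input layer.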

Dynamic Timed Multimedia Synchronization Model for Efficient Quality of Service (효율적인 서비스 품질을 위한 동적 시간형 멀티미디어 동기화 모델)

이근왕;오해석
- Journal of the Korean Institute of Telematics and Electronics C / v.36C no.10 / pp.75-80 / 1999
Developing multimedia application software requires a multimedia synchronization model for distributed, continuous, or discrete media that guarantees high quality of service. In this paper we define a specific object controller, called the dynamic key medium, that changes when user events are generated. It therefore becomes a medium whose event occurrences and periods cannot be predicted. Not only audio but also text and images can be chosen as the key medium when an event occurs and can perform that role. The object controller transfers the information needed for the next transition. The proposed model offers high quality of service by permitting the maximum allowed jitter and skew in playout time, and its effectiveness is verified by simulation.
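To make the jitter and skew terms concrete, here is a small sketch of how playout timestamps might be checked against allowed limits. The function names, the timestamps, and the limit values are all hypothetical; the abstract does not give the model's actual tolerances or its controller logic.

```python
def max_jitter(playout_times, period):
    """Largest deviation of an inter-unit interval from the nominal period."""
    intervals = [b - a for a, b in zip(playout_times, playout_times[1:])]
    return max(abs(iv - period) for iv in intervals)

def max_skew(key_times, other_times):
    """Largest playout-time offset between the key medium and another medium."""
    return max(abs(k - o) for k, o in zip(key_times, other_times))

def within_limits(key_times, other_times, period, jitter_limit, skew_limit):
    """True when the non-key medium stays inside both allowed bounds."""
    return (max_jitter(other_times, period) <= jitter_limit
            and max_skew(key_times, other_times) <= skew_limit)

# Hypothetical playout times in milliseconds; audio is taken as the key medium.
audio = [0, 40, 80, 120]
video = [0, 45, 80, 125]
ok = within_limits(audio, video, period=40, jitter_limit=10, skew_limit=80)
```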

B-ISDN Signalling Protocol for Internet-Based Service (인터넷 서비스를 위한 B-ISDN 신호 프로토콜의 표준화 동향)

Kim, J.Y.;Joo, S.S.
- Electronics and Telecommunications Trends / v.13 no.6 s.54 / pp.83-93 / 1998
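As demand grows to run new multimedia applications such as voice, audio, and video communication over today's Internet, which supports only best-effort service quality, the need for an Internet capable of delivering multimedia services is increasing. Providing such services also requires a communication method that can guarantee quality of service (QoS) and a mechanism that can carry large volumes of traffic efficiently. Asynchronous Transfer Mode (ATM) is regarded as the best-suited communication method for delivering these multimedia services over the Internet, because it offers high-speed switching, logical multiplexing of VPI/VCI, and flexible QoS management. This article centers on the results of the meetings of Signalling Support of Internet-Based Applications (SoI), a coordination group of ITU-T SG11 formed to support Internet services over ATM networks. It describes SoI's standardization goals, the procedures that use the B-ISDN signalling protocol to set up ATM connections for the Internet traffic of long-lived sessions and QoS-sensitive sessions, the methods for conveying Internet session information, and the multicasting methods for Internet services. The aim of the article is to state clearly which functions of the B-ISDN signalling protocol need to be extended to support Internet services and protocols.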
https://doi.org/10.22648/ETRI.1998.J.130607

Proposals for successful implementation of hybrid radio service (하이브리드라디오의 성공적 도입을 위한 제언)

Lim, Jaeyoon
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2017.11a / pp.189-192 / 2017
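This paper proposes the policy conditions needed to introduce hybrid radio service successfully. Because of its UI (User Interface) characteristics and its services beyond audio broadcast content, hybrid radio requires platforms in addition to the existing terrestrial platform, and this brings with it new platform operators who build and run those platforms. Unlike the terrestrial broadcasting industry, where the content supplier and the platform operator were the same, an industry in which content providers and platform operators are separate gives a few platform operators a bargaining advantage over many content providers. Terrestrial radio broadcasters, unfamiliar with these rules of the game and small in business scale, may find themselves thrown into a difficult and unfair business environment. Carefully designed policy and institutional measures are therefore needed so that platform operators and content providers can prosper together while competition becomes more efficient. In addition, for hybrid radio to move beyond zero-sum competition within the existing radio industry, enlarge the overall audience, and convert that audience effectively into revenue, institutional reform is needed that fully activates the business potential of hybrid radio so that its technical characteristics become meaningful benefits to listeners. Finally, to introduce hybrid radio as a new content distribution platform within a meaningful time window amid a rapidly changing "media big bang", unnecessary misunderstandings among industry players should be reduced so that pointless disputes are minimized.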

A Digital Audio Watermark Using Wavelet Transform and Masking Effect (웨이브릿과 마스킹 효과를 이용한 디지털 오디오 워터마킹)

Hwang, Won-Young;Kang, Hwan-Il;Han, Seung-Soo;Kim, Kab-Il;Kang, Hwan-Soo
- Proceedings of the IEEK Conference / 2003.11b / pp.243-246 / 2003
In this paper, we propose a new digital audio watermarking technique based on the wavelet transform. The watermark is embedded by eliminating unnecessary information from the audio signal according to the human auditory system (HAS). The method does not require the original audio in the watermark extraction process. The masking effect used for watermarking is the post-temporal masking effect. We construct the window with the synchronization signal and extract the best frame in the window using the zero-crossing rate (ZCR) and the energy of the audio signal. The watermark may be extracted using the correlation between the watermark signal and the portion of the frame. Experimental results show good robustness against MPEG1-layer3 compression and other common signal processing manipulations. All the attacks are made after the D/A/D conversion.
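The two frame features named in this abstract, zero-crossing rate and energy, can be computed as sketched below. The abstract does not state how the "best" frame is chosen from them, so the selection rule here (highest energy) is only a placeholder assumption, as are the function names and the toy signal.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def short_time_energy(frame):
    """Mean squared amplitude of the frame."""
    return sum(x * x for x in frame) / len(frame)

def split_frames(signal, frame_len):
    """Cut the signal into non-overlapping frames, dropping any partial tail."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, frame_len)]

def pick_frame(signal, frame_len):
    """Index of the frame with the highest energy (assumed selection rule)."""
    frames = split_frames(signal, frame_len)
    return max(range(len(frames)), key=lambda i: short_time_energy(frames[i]))

# Hypothetical toy signal: a silent frame, a loud alternating frame, a flat frame.
signal = [0, 0, 0, 0, 2, -2, 2, -2, 1, 1, 1, 1]
best = pick_frame(signal, 4)
```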

The Realtime method of 3D Sound Rendering for Virtual Reality : Complexity Reduction of Scene and Sound Sources (장면 및 음원 복잡도 축소에 의한 3차원 사운드 재현의 실시간화 기법)

Seong SukJeong;Yi JeongSeon;Oh SuJin;Nam YangHee
- Proceedings of the Korean Information Science Society Conference / 2005.07b / pp.550-552 / 2005
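In virtual reality applications where realistic reproduction matters, research has aimed to increase the sense of presence and immersion by presenting users with a high-quality graphical environment and giving immediate feedback to their interactions. Using vision and hearing together is effective for conveying presence and a sense of space, but research on 3D sound rendering that reflects the characteristics of the virtual space remains at an early stage both in Korea and abroad. To render 3D sound that reflects presence and spatial impression, the propagation, reflection, reverberation, and related effects of the sound sources must be recomputed in response to user interaction. However, testing the sound propagation paths for collisions against every polygon that makes up the space and computing the reflections is impractical in virtual reality applications where real-time performance is essential, so the amount of computation must be reduced to guarantee real-time operation. To render 3D sound in a complex virtual space containing many sound sources, this paper proposes an algorithm that restructures the space into an audio scene graph holding only the minimum information needed for the sound scene and its computation, and applies sound source reduction and clustering to the many sources so that 3D sound effects are rendered in real time.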
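As a rough illustration of the source reduction and clustering idea, the sketch below merges nearby sources into one representative source. It is not the paper's algorithm: the greedy seed-based grouping, the gain-weighted centroid, the summed gain, and the name `cluster_sources` are all assumptions made for this example.

```python
import math

def cluster_sources(sources, radius):
    """Greedily group sources lying within `radius` of a cluster's first member,
    then replace each group by one source at the gain-weighted centroid.

    `sources` is a list of ((x, y, z), gain) pairs.
    """
    clusters = []
    for pos, gain in sources:
        for members in clusters:
            seed_pos = members[0][0]
            if math.dist(pos, seed_pos) <= radius:
                members.append((pos, gain))
                break
        else:
            clusters.append([(pos, gain)])

    merged = []
    for members in clusters:
        total = sum(g for _, g in members)
        centroid = tuple(sum(p[i] * g for p, g in members) / total
                         for i in range(3))
        merged.append((centroid, total))  # summing gains is a simplification
    return merged

# Hypothetical scene: two sources close together and one far away.
scene = [((0, 0, 0), 1.0), ((1, 0, 0), 1.0), ((10, 0, 0), 2.0)]
reduced = cluster_sources(scene, radius=2.0)
```

With fewer representative sources, the per-source propagation and reflection work shrinks in proportion, which is the point of the reduction step.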

The 3D Sound Contents Authoring Tool (멀티미디어 컨텐트 제작을 위한 입체음향 생성저작도구)

Kim, Jae-Woo;Myung, Hyun;Kim, Hyun-Bin
- Proceedings of the Korean Society of Broadcast Engineers Conference / 1999.06b / pp.75-78 / 1999
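This paper discusses the development of a 3D sound authoring tool for creating multimedia content. The tool runs on PCs under Windows 95/98 and Windows NT. Besides the editing and sound-effect functions of a typical audio editor, it provides the 3D sound functions of sound image localization and sound field control, as well as a function that cancels the crosstalk that arises when 3D sound is heard through loudspeakers. With the tool, sound files stored in mono or stereo form can be processed purely in software into binaural 3D sound, the position and trajectory of a virtual sound source can be defined, and the spatial impression of a virtual space can be realized. A convenient GUI lets the user specify the position and trajectory of the virtual sound source in 3D space as well as the virtual space itself. The tool provides a GUI environment in which a user on an ordinary PC can turn any audio file into 3D sound, and it enables effective 3D sound content to be produced at low cost. It is therefore expected to contribute to higher added value in game and multimedia content production and to the industrialization of 3D sound technology.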

Design of the New Third-Order Cascaded Sigma-Delta Modulator for Switched-Capacitor Application (스위치형 커패시터를 적용한 새로운 형태의 3차 직렬 접속형 시그마-델타 변조기의 설계)

Ryu Jee-Youl;Noh Seok-Ho
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2006.05a / pp.906-909 / 2006
This paper proposes a new body-effect compensated switch configuration for low-voltage, low-distortion switched-capacitor (SC) applications. The proposed circuit allows rail-to-rail switching operation for low-voltage SC circuits and improves total harmonic distortion by 19 dB over the conventional bootstrapped circuit. A 2-1 cascaded sigma-delta modulator is provided for high-resolution analog-to-digital conversion in the audio codec of a communication transceiver. An experimental prototype of a single-stage folded-cascode operational amplifier (opamp) and a 2-1 cascaded sigma-delta modulator has been implemented in a 0.25 micron double-poly, triple-metal standard CMOS process with a 2.7 V supply voltage.
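For readers unfamiliar with sigma-delta conversion, the behavioural sketch below shows the basic loop of integrator, 1-bit quantizer, and feedback. Note that it is a plain first-order, single-loop model chosen for brevity, not the 2-1 cascaded third-order architecture of the paper, and it says nothing about the switch circuit; the function name and test input are assumptions.

```python
def first_order_sigma_delta(samples):
    """Behavioural first-order sigma-delta loop for inputs in [-1, 1].

    Each step integrates the error between the input and the previous
    1-bit output, then quantizes the integrator state to +1 or -1.
    """
    integrator = 0.0
    feedback = 0.0
    bits = []
    for x in samples:
        integrator += x - feedback
        feedback = 1.0 if integrator >= 0 else -1.0
        bits.append(feedback)
    return bits

# A constant input of 0.5: the average of the bitstream should track it.
bits = first_order_sigma_delta([0.5] * 1000)
average = sum(bits) / len(bits)
```

Averaging (low-pass filtering and decimating) the bitstream recovers the input, and higher-order or cascaded loops push more quantization noise out of the signal band.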

Headphone-based multi-channel 3D sound generation using HRTF (HRTF를 이용한 헤드폰 기반의 다채널 입체음향 생성)

Kim Siho;Kim Kyunghoon;Bae Keunsung;Choi Songin;Park Manho
- Journal of the Institute of Electronics Engineers of Korea SP / v.42 no.1 / pp.71-77 / 2005
In this paper we implement a headphone-based 5.1 channel 3-dimensional (3D) sound generation system using the HRTF (Head Related Transfer Function). Each mono sound source in the 5.1 channel signal is localized at its virtual location by binaural filtering with the corresponding HRTFs, and a reverberation effect is added for spatialization. To reduce the computational burden, we reduce the number of taps in the HRTF impulse response and model the early reverberation effect with several tens of impulses extracted from the whole impulse sequence. We modify the spectrum of the HRTF by weighting the difference between the front and back spectra to reduce the front-back confusion caused by a non-individualized HRTF database. An informal listening test confirms that the implemented 3D sound system generates livelier and richer 3D sound than simple stereo or 2-channel downmixing.
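Binaural filtering with a shortened impulse response, as described in this abstract, amounts to convolving each mono channel with a left-ear and a right-ear filter. The sketch below assumes made-up impulse response values and a hypothetical `binauralize` helper; it omits the reverberation model and the front-back spectral weighting.

```python
def convolve(x, h):
    """Direct-form linear convolution of two sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y

def binauralize(mono, hrir_left, hrir_right, taps=None):
    """Filter one mono channel with left/right head-related impulse responses.

    `taps` optionally truncates both responses to cut the computation,
    mirroring the tap reduction mentioned in the abstract.
    """
    if taps is not None:
        hrir_left, hrir_right = hrir_left[:taps], hrir_right[:taps]
    return convolve(mono, hrir_left), convolve(mono, hrir_right)

# Hypothetical data: a unit impulse and invented 3-tap impulse responses.
left, right = binauralize([1, 0, 0], [0.5, 0.25, 0.1], [0.2, 0.1, 0.05], taps=2)
```

For a 5.1 signal, each channel would be filtered with the responses for its virtual loudspeaker position and the per-ear results summed.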

A Study on Multiple Sensorial Media Application Format (다중 감각 미디어 응용 포맷의 구성 방법 연구)

Jung, Yup Oh;Kim, Sang-Kyun
- Journal of Broadcast Engineering / v.21 no.3 / pp.330-340 / 2016
This paper explains the structure of the multiple sensorial media application format (ISO/IEC 23000-17), which was newly standardized as an MPEG-A project. The format facilitates effective storage, playback, and management of media with multiple sensorial effects. The ISO base media file format from MPEG-4 Part 12 and the sensory effect metadata (SEM) from MPEG-V Part 3 are used to compose the multiple sensorial media application format. The paper presents a fragmentation method that breaks a SEM XML document into valid SEM samples. Several binarization methods for compressing the SEM samples are also compared and evaluated. In both compression ratio and processing time, the MPEG-V binary representation and the Binary MPEG format for XML (BiM) are superior to gzip compression.
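The gzip baseline in such a comparison can be measured with the standard library, as in the sketch below. The XML line is an invented placeholder and not valid SEM markup, and the MPEG-V binary representation and BiM encoders are not shown because they need dedicated codecs.

```python
import gzip
import time

def gzip_ratio(payload: bytes) -> float:
    """Original size divided by gzip-compressed size."""
    return len(payload) / len(gzip.compress(payload))

def timed_gzip(payload: bytes):
    """Return (compressed bytes, elapsed seconds) for one gzip pass."""
    start = time.perf_counter()
    compressed = gzip.compress(payload)
    return compressed, time.perf_counter() - start

# Hypothetical, repetitive XML-like sample standing in for a metadata document.
sample = ("<effect type='hypothetical' intensity='50'/>\n" * 200).encode("utf-8")
ratio = gzip_ratio(sample)
compressed, elapsed = timed_gzip(sample)
```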
https://doi.org/10.5909/JBE.2016.21.3.330

Search Result 178, Processing Time 0.03 seconds
