• Title/Summary/Keyword: Audio Generation

The Study on Development of a Digital Internet Radio Receiver (디지털 인터넷 라디오 수신기 구현에 대한 연구)

  • Park, In-Gyu
    • Journal of KIISE: Computing Practices and Letters / v.12 no.2 / pp.102-110 / 2006
  • This paper describes the design and development of a stand-alone, high-sound-quality Internet Radio system aimed at small embedded audio devices rather than general-purpose PCs. The device is designed to work with an Internet connection. Systems of this kind have not yet been standardized, and the related algorithms are not open to the public. It is therefore necessary to analyze several receiving algorithms of current radio receivers and to develop our own hardware in order to overcome these obstacles and ultimately obtain high-quality radio sound. The main electronic components of this Internet Radio are TCP/IP interfaces, an audio MP3 decoder, an I/O interface, and a Flash Memory Card with advanced audio multicasting for the next-generation Internet Radio. Basic structures and implementation issues of next-generation, highly versatile digital music players and Internet Radio receivers are discussed.

A Preliminary Examination on the Multimedia Information Needs and Web Searches of College Students in Korea

  • Chung, Eun-Kyung
    • Journal of the Korean Society for Library and Information Science / v.44 no.4 / pp.95-114 / 2010
  • Multimedia searching is an important activity on the Web, especially among the younger generation. This study examines college students' multimedia information needs and searching on the Internet. While there is a clear pattern among students with respect to their multimedia uses, searching sources, relevance criteria and searching barriers, some differences exist depending on the multimedia type being searched, such as image, audio and video. For multimedia uses, information/data-focused uses are frequently found for image and video, while audio is mainly used for object-focused searches. As multimedia searching sources, audio and video files present a similar pattern of being high in media-specific searching sources and low in generic search engines. Browsing through related blogs and homepages is an important part of searching for media files, accounting for approximately 20% of total searches for each media type. The relevance criteria used by study participants when searching for image files were primarily concerned with topicality, while contextual and media quality criteria were also considered important for audio and video. Searching barriers for audio and video files are categorized into three broad aspects: access and search quality, preview limitations, and collection limitations. Obstacles for image file searching include access difficulties and the low quality of various collections.

MPEG-2 AAC Encoder Implementation Using a floating-Point DSP (부동 소수점 DSP를 이용한 MPEG-2 AAC 부호차기 구현)

  • Kim Seung-Woo
    • Journal of Korea Multimedia Society / v.8 no.7 / pp.882-888 / 2005
  • MPEG-2 Advanced Audio Coding (AAC) has already been standardized as a sophisticated next-generation technology. AAC provides an audio signal that has CD quality at 96-128kbps/stereo. This paper describes a high-quality and efficient software implementation of an MPEG-2 AAC LC Profile encoder. Common scalefactor and noiseless coding are accelerated by $45\%$ and $27\%$, respectively, through the use of TMS320C30 instructions. The implemented encoder uses 7.5kWords of program memory, 18kWords of data ROM and 92kBytes of data RAM. The results of a subjective quality test showed that the sound quality achieved at 96kbps/stereo was equivalent to that of MP3 at 128kbps/stereo.
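
To put the reported equivalence (AAC at 96kbps versus MP3 at 128kbps) in concrete terms, the following is a small arithmetic sketch. Only the two bitrates come from the abstract; the function names and the convention of 1 kB = 1000 bytes are assumptions made for illustration.

```python
# Bitrates quoted in the abstract (kbps, stereo).
AAC_KBPS = 96
MP3_KBPS = 128


def bitrate_saving(new_kbps: float, ref_kbps: float) -> float:
    """Fractional bitrate reduction of new_kbps relative to ref_kbps."""
    return 1.0 - new_kbps / ref_kbps


def kbytes_per_minute(kbps: float) -> float:
    """Stream size for one minute of audio, assuming 1 kB = 1000 bytes."""
    return kbps * 60 / 8


saving = bitrate_saving(AAC_KBPS, MP3_KBPS)
print(f"bitrate saving: {saving:.0%}")
print(f"AAC: {kbytes_per_minute(AAC_KBPS):.0f} kB/min")
print(f"MP3: {kbytes_per_minute(MP3_KBPS):.0f} kB/min")
```

Under these figures, equivalent subjective quality is reached with a quarter less bitrate.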

A comprehensive design cycle for car engine sound: from signal processing to software component to be integrated in the audio system of the vehicle

  • Orange, Francois;Boussard, Patrick
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2012.04a / pp.208-209 / 2012
  • This paper describes a comprehensive process and a range of design tools and components for providing improved perception of engine sound in mass-production vehicles through the generation of finely tuned engine harmonics.

Automatic Generation of Video Metadata for the Super-personalized Recommendation of Media

  • Yong, Sung Jung;Park, Hyo Gyeong;You, Yeon Hwi;Moon, Il-Young
    • Journal of information and communication convergence engineering / v.20 no.4 / pp.288-294 / 2022
  • The media content market has been growing, as various types of content are being mass-produced owing to the recent proliferation of the Internet and digital media. In addition, platforms that provide personalized services for content consumption are emerging and competing with each other to recommend personalized content. Existing platforms use a method in which a user directly inputs video metadata. Consequently, significant amounts of time and cost are consumed in processing large amounts of data. In this study, keyframes and audio spectra based on the YCbCr color model of a movie trailer were extracted for the automatic generation of metadata. The extracted audio spectra and image keyframes were used as learning data for genre recognition in deep learning. Deep learning was implemented to determine genres among the video metadata, and suggestions for utilization were proposed. A system that can automatically generate metadata established through the results of this study will be helpful for studying recommendation systems for media super-personalization.
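
As a rough illustration of keyframe selection on a YCbCr representation, the sketch below converts RGB pixels with the full-range BT.601 (JPEG-style) formula and keeps a frame whenever its mean luma changes sharply. The abstract does not specify the conversion variant or the selection rule, so `pick_keyframes`, its threshold, and the toy frames are hypothetical and not the authors' method.

```python
def rgb_to_ycbcr(r, g, b):
    """Full-range BT.601 (JPEG-style) RGB -> YCbCr for 8-bit inputs."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128.0 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128.0 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def mean_luma(frame):
    """Average Y over a frame given as a list of (r, g, b) pixels."""
    return sum(rgb_to_ycbcr(*pixel)[0] for pixel in frame) / len(frame)


def pick_keyframes(frames, threshold=30.0):
    """Hypothetical heuristic: keep a frame when its mean luma differs
    from the last kept frame by more than `threshold`."""
    if not frames:
        return []
    kept = [0]
    last = mean_luma(frames[0])
    for index, frame in enumerate(frames[1:], start=1):
        luma = mean_luma(frame)
        if abs(luma - last) > threshold:
            kept.append(index)
            last = luma
    return kept


# Toy "trailer": two dark frames, two bright frames, one dark frame.
dark = [(10, 10, 10)] * 4
bright = [(200, 200, 200)] * 4
keyframes = pick_keyframes([dark, dark, bright, bright, dark])
print(keyframes)
```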

Test Stream Generation Method for UHDTV Broadcasting Standard (UHD 방송 표준 검증을 위한 시험 스트림 개발에 관한 연구)

  • Kim, Jaeil;Bae, Sungpo;Yang, Jinyoung;Kwon, Donghyun
    • The Journal of Korean Institute of Communications and Information Sciences / v.41 no.7 / pp.823-832 / 2016
  • This paper presents a method for generating test streams that verify the conformance of a UHD broadcasting receiver, including its video and audio decoders as well as its parsers for PSIP and closed caption data. The proposed test streams for video/audio signals can evaluate conformance to the HEVC, AC-3 and DTS-HD standards. In particular, the test streams for the HEVC video compression standard can be used to test the syntax compliance and error resilience of an HEVC decoder. Moreover, the proposed test streams for system/program and closed caption data can be used to verify parsers for the PSIP and CEA-708 standards.

The Analysis of Verbal Interaction on the Process of Elementary Students' Hypothesis Generation Learning

  • Park, Hee-Young;Lee, Il-Sun;Byeon, Jung-Ho;Kim, Won-Jung;Kwon, Yong-Ju
    • Journal of The Korean Association For Science Education / v.32 no.8 / pp.1269-1280 / 2012
  • The purpose of this study is to analyze verbal interaction during elementary students' hypothesis generation learning. For this study, 32 6th graders were selected and assigned to heterogeneous small groups by achievement level. The topics of hypothesis generation learning were developed by analyzing the current elementary school curriculum. Each group's verbal interactions were audio- and video-recorded and transcribed. After coding the protocol and conducting retrospective interviews with the students, the types and frequency of verbal interaction were analyzed. The frequency of verbal interaction was highest during observation and lowest during questioning situation identification. Regarding the quality of verbal interactions, low-level interactions were significantly frequent during observation. On the other hand, hypothetical explicans generation showed a high frequency of high-level interactions. The results revealed that elementary students can engage in high-level verbal interactions through hypothesis generation learning.

A Deep Learning System for Emotional Cat Sound Classification and Generation (감정별 고양이 소리 분류 및 생성 딥러닝 시스템)

  • Joo Yong Shim;SungKi Lim;Jong-Kook Kim
    • The Transactions of the Korea Information Processing Society / v.13 no.10 / pp.492-496 / 2024
  • Cats are known to express their emotions through a variety of vocalizations during interactions. These sounds reflect their emotional states, making the understanding and interpretation of these sounds crucial for more effective communication. Recent advancements in artificial intelligence have introduced research related to emotion recognition, particularly focusing on the analysis of voice data using deep learning models. Building on this background, the study aims to develop a deep learning system that classifies and generates cat sounds based on their emotional content. The classification model is trained to accurately categorize cat vocalizations by emotion. The sound generation model, which uses deep-learning-based models such as SampleRNN, is designed to produce cat sounds that reflect specific emotional states. Finally, the study proposes an integrated system that takes recorded cat vocalizations, classifies them by emotion, and generates cat sounds based on user requirements.

Continuance Intention Toward Second-generation Mobile Instant Messaging App of LINE in Taiwan

  • Bao Q. Duong;Charlie Chen;Craig Van Slyke
    • Asia pacific journal of information systems / v.33 no.4 / pp.899-933 / 2023
  • Second-generation mobile instant messaging (SMIM) is proliferating with various relationship management functions: group chats, audio/video chats, file sharing, real-time location sharing, and nonverbal graphics, such as emojis and stickers. This study integrates the important but often overlooked affordances theory into innovation diffusion and proposes an SMIM continuance intention model. SMIM is a social affordance platform for users to develop new friendships and maintain their relationships. Integrating the diffusion of innovation and affordance theoretical frameworks, this study investigates the influence of four factors on the success of using SMIM apps to improve friendship development and relationship management. Data were collected from 231 participants using a survey questionnaire at a public university in Taiwan. The findings confirm the effects of friendship development and relationship maintenance on the intention of users to continue using SMIM apps. Implications for research and practice are discussed.

Ultra-low-power DSP for Audio Signal Processing (오디오 신호 처리를 위한 초저전력 DSP 프로세서)

  • Kwon, Kiseok;Ahn, Minwook;Jo, Seokhwan;Lee, Yeonbok;Lee, Seungwon;Park, Young-Hwan;Kim, Sukjin;Kim, Do-Hyung;Kim, Jaehyun
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2014.06a / pp.157-159 / 2014
  • In this paper, we introduce SlimSRP, an ultra-low-power digital signal processor (DSP) solution for mobile audio and voice applications. So far, application processors (APs) have taken charge of all the tasks in mobile devices. However, they have suffered from short battery life when dealing with complex usage scenarios, such as always-on voice trigger with continuous audio playback. Based on extensive analysis of audio and voice application characteristics, SlimSRP is designed to relieve the performance and power burden of APs. It employs a three-issue VLIW architecture, and the major low-power and high-performance techniques include: (1) an optimized register-file architecture friendly to constant generation, (2) a powerful instruction set that reduces the number of register file accesses and (3) a unique instruction compression scheme that contributes to smaller memory size and fewer cache misses. An implementation of SlimSRP runs at up to 200MHz and the logic occupies 95K NAND2 gates in the Samsung 28LPP process. The experimental results demonstrate that an MP3 decoder application with a 128kbps 44.1kHz input can run at 5.1MHz and the logic consumes only 22uW/MHz.
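
The figures quoted above can be combined into a rough power estimate for the MP3 decoding case. This is a back-of-the-envelope sketch that assumes logic power scales linearly with clock frequency, which the per-MHz figure suggests but the abstract does not state; the function and constant names are illustrative.

```python
# Figures quoted in the abstract.
UW_PER_MHZ = 22.0      # logic power per MHz, in microwatts
MP3_CLOCK_MHZ = 5.1    # clock needed for 128kbps 44.1kHz MP3 decoding
MAX_CLOCK_MHZ = 200.0  # maximum reported clock


def logic_power_uw(clock_mhz: float, uw_per_mhz: float = UW_PER_MHZ) -> float:
    """Estimated logic power in microwatts, assuming linear scaling with clock."""
    return clock_mhz * uw_per_mhz


mp3_power_uw = logic_power_uw(MP3_CLOCK_MHZ)
utilization = MP3_CLOCK_MHZ / MAX_CLOCK_MHZ

print(f"MP3 decoding logic power: about {mp3_power_uw:.1f} uW")
print(f"share of maximum clock used: {utilization:.2%}")
```

Under that linearity assumption, MP3 decoding needs roughly 112 uW of logic power and about 2.6% of the maximum clock.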
