• Title/Abstract/Keywords: Audio Generation

Search Result 103, Processing Time 0.025 seconds

Trends and Development Prospects in Broadcasting Technology (방송 기술 동향 및 발전 전망)

  • J.S. Um;B.M. Lim;H.Y. Jung;S.K. Ahn;H.J. Yim;J.H. Seo
    • Electronics and Telecommunications Trends
    • /
    • v.39 no.2
    • /
    • pp.43-53
    • /
    • 2024
  • The media environment is rapidly evolving to be tailored to viewers using personal mobile devices in accordance with technological evolution and changes in social structures. Broadcast media technology is also advancing to enable new services, including data casting, in various reception environments beyond the existing fixed environment and one-way audio/video content services. In addition, technologies to increase the transmission capacity to accommodate next-generation large-capacity media content as well as communication network utilization and convergence technologies are being developed to facilitate interactive services and expand the broadcasting coverage. We discuss the current status and future prospects in broadcasting technology for terrestrial and mobile communication systems and analyze broadcasting technology elements for upcoming media environments relying on generative artificial intelligence.

DisplayPort 1.1a Standard Based Multiple Video Streaming Controller Design (디스플레이포트1.1a 표준 기반 멀티플 비디오 스트리밍 컨트롤러 설계)

  • Jang, Ji-Hoon;Im, Sang-Soon;Song, Byung-Cheol;Kang, Jin-Ku
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.48 no.11
    • /
    • pp.27-33
    • /
    • 2011
  • As the display market grows, many display devices now support digital display interfaces. DisplayPort is a next-generation display interface that is increasingly used as a connection solution for PCs, projectors, and high-definition content applications. This paper implements multiple video streams based on the behavior of the main link, in conformance with the DisplayPort v1.1a standard. An interface between one sink device and another, a limitation of DisplayPort, is also implemented. As a result, two or more different image data streams can be output to two or more display devices through the four lanes specified in DisplayPort v1.1a, without adding a separate lane. The Multiple Video Streaming Controller is implemented with 6,222 ALUTs, 6,686 registers, and 999,424 block memory bits, synthesized using Quartus II on an Altera Audio/Video Development board (Stratix II GX FPGA chip).

Haptic Media Broadcasting (촉각방송)

  • Cha, Jong-Eun;Kim, Yeong-Mi;Seo, Yong-Won;Ryu, Je-Ha
    • Broadcasting and Media Magazine
    • /
    • v.11 no.4
    • /
    • pp.118-131
    • /
    • 2006
  • With the rapid development of ultra-fast communication and digital multimedia, realistic broadcasting technology that can stimulate the five human senses, beyond the conventional audio-visual service, is emerging as a new-generation broadcasting technology. In this paper, we introduce a haptic broadcasting system and the related core system and component techniques by which we can 'touch and feel' objects in an audio-visual scene. The system includes haptic media acquisition and creation and contents authoring. In haptic broadcasting, the haptic media can be 3-D geometry, dynamic properties, haptic surface properties, movement, and tactile information, which enable active touch and manipulation as well as passive movement following and tactile effects. The proposed system can provide active haptic exploration and manipulation of a 3-D mesh, active haptic exploration of depth video, passive kinesthetic interaction, and passive tactile interaction as potential haptic interaction scenarios. Home shopping, a movie with tactile effects, and conducting education scenarios are produced to show the feasibility of the proposed system.

An Analysis of Hanliu Phenomenon on the Chinese Street Fashion Style (중국의 스트리트 패션에 나타난 한류현상 분석)

  • Park, Kil-Soon
    • Korean Journal of Human Ecology
    • /
    • v.13 no.6
    • /
    • pp.967-983
    • /
    • 2004
  • The purpose of this study is to review the Hanliu phenomenon, a social and cultural phenomenon in China, and to analyze its effects on the fashion style of China's young generation. In this study, the Hanliu phenomenon means the enthusiasm of Asian people for Korean mass culture, including Korean dramas, pop songs, and fashion, since the late 1990s. This research adopts two kinds of methods for analyzing the phenomenon: qualitative and quantitative. As a qualitative method, we analyzed several sources of documentary and audio-visual material: articles from newspapers and magazines, special TV reports, and documentary movie files from the Internet. As a quantitative method, we surveyed approximately 100 female students of Beijing university and asked how they feel about Korean culture and fashion. The Hanliu phenomenon led to the popularity of Korean products as well as the general culture of Korea. It also influenced the Chinese young generation so much that Korean fashion has become prevalent. This influence on the street fashion of Chinese youths can be summarized in three factors. First, Korean entertainers' fashion is widely imitated. For example, H.O.T-like hairstyles, hip-hop styles, large-heel shoes with boot-cut pants, and long-curled permanent hairstyles have caught on among Chinese youths. Second, the preference for Korean fashion products has greatly increased. The number of stores dealing in Korean fashion products has increased. Even 'Kim Hee Seen,' a fashion brand named after a famous Korean actress, was introduced. Finally, Korean culture and products have been widely imitated in China as the popularity of Korean fashion products has increased. This study reveals that the Hanliu phenomenon is widespread in China and that Chinese youths are largely affected by the fashion styles of Korean entertainers. Korean fashion products are also widely imitated and benchmarked in China. The Hanliu phenomenon is a major opportunity to approach the fashion market of China, which has the largest buying power in the world. To make inroads into the Chinese fashion market, we suggest that we need to have our own brand and to make the most of culture, stars, and the Internet in marketing. We also need a well-planned strategy to succeed in the Chinese fashion market.

Research on Effects of Three Different Designs and Implementations on Cyber Education (정보활용기술 발전에 따른 효과적 사이버 교육을 위한 설계 및 구현의 차이에 대한 연구)

  • Ha, Tai-Hyun;Kang, Jung-Hwa
    • The Journal of Korean Association of Computer Education
    • /
    • v.6 no.4
    • /
    • pp.71-83
    • /
    • 2003
  • This study aims to develop and evaluate different approaches for cyber education. The project involved the development of sample cyber education programs using different design approaches, with built-in evaluation mechanisms. The design approaches differ in the delivery technologies involved. The First Generation uses text, Flash, and animation, whereas the Second and Third Generations use content synchronized with video and audio and differ in the delivery method used for the video clip. Tests were carried out through self-assessment to measure and analyze teaching efficiency. The results show that the Third Generation technologies were the most effective method for cyber education. However, since the Third Generation program is developed in multimedia, it tends to require higher development costs, more advanced hardware and software, and higher network bandwidth. Therefore, the research indicates that technical support issues, such as loading speed, have to be solved together with the development of multimedia products for effective cyber education.

High Efficiency Switch-Mode LED driver for Visible Light Communication System (가시광 통신 시스템을 위한 고효율 스위치모드 LED 구동회로)

  • Kang, Jung-Min;Cho, Sang-Ho;Hong, Sung-Soo;Han, Sang-Kyoo;SaKong, Suk-Chin
    • The Transactions of the Korean Institute of Power Electronics
    • /
    • v.16 no.4
    • /
    • pp.358-365
    • /
    • 2011
  • Recently, the LED (Light Emitting Diode), which is replacing incandescent light bulbs and fluorescent lights, has attracted great attention as the most promising candidate for the next-generation lighting source due to its environment-friendly characteristics, long life, and excellent efficiency. Moreover, since it is a semiconductor device that can convert electric energy to visible light at very high speed, it can also be used as a communication device. Therefore, VLC (Visible Light Communication) using the LED can perform near-field communication and lighting at the same time without additional expense. However, since the switching device of the conventional LED driver for VLC is operated in the linear region, it has several drawbacks, such as poor power conversion efficiency and serious heat generation. In contrast, since the proposed driver is operated in the on/off switching region, it features higher efficiency and reduced heat generation. To verify the validity of the proposed LED driver, experimental results are given from a prototype of a 20W rated LED driver applied to a 3MHz bps broadcasting audio system.

Recent Trends and Prospects of 3D Content Using Artificial Intelligence Technology (인공지능을 이용한 3D 콘텐츠 기술 동향 및 향후 전망)

  • Lee, S.W.;Hwang, B.W.;Lim, S.J.;Yoon, S.U.;Kim, T.J.;Kim, K.N.;Kim, D.H;Park, C.J.
    • Electronics and Telecommunications Trends
    • /
    • v.34 no.4
    • /
    • pp.15-22
    • /
    • 2019
  • Recent technological advances in three-dimensional (3D) sensing devices and machine learning such as deep learning have enabled data-driven 3D applications. Research on artificial intelligence has developed over the past few years, and 3D deep learning has been introduced. This is the result of the availability of high-quality big data, increases in computing power, and the development of new algorithms; before the introduction of 3D deep learning, the main targets for deep learning were one-dimensional (1D) audio files and two-dimensional (2D) images. The research field of deep learning has extended from discriminative models such as classification/segmentation/reconstruction models to generative models such as those including style transfer and generation of non-existing data. Unlike 2D learning, it is not easy to acquire 3D learning data. Although low-cost 3D data acquisition sensors have become increasingly popular owing to advances in 3D vision technology, the generation/acquisition of 3D data is still very difficult. Even if 3D data can be acquired, post-processing remains a significant problem. Moreover, it is not easy to directly apply existing network models such as convolution networks owing to the various ways in which 3D data is represented. In this paper, we summarize technological trends in AI-based 3D content generation.

Voice Synthesis Detection Using Language Model-Based Speech Feature Extraction (언어 모델 기반 음성 특징 추출을 활용한 생성 음성 탐지)

  • Seung-min Kim;So-hee Park;Dae-seon Choi
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.34 no.3
    • /
    • pp.439-449
    • /
    • 2024
  • Recent rapid advancements in voice generation technology have enabled the natural synthesis of voices using text alone. However, this progress has led to an increase in malicious activities, such as voice phishing (vishing), where generated voices are exploited for criminal purposes. Numerous models have been developed to detect the presence of synthesized voices, typically by extracting features from the voice and using these features to determine the likelihood of voice generation. This paper proposes a new model for extracting voice features to address misuse cases arising from generated voices. It utilizes a deep learning-based audio codec model and the pre-trained natural language processing model BERT to extract novel voice features. To assess the suitability of the proposed voice feature extraction model for voice detection, four generated voice detection models were created using the extracted features, and performance evaluations were conducted. For performance comparison, three voice detection models based on Deepfeature proposed in previous studies were evaluated against other models in terms of accuracy and EER. The model proposed in this paper achieved an accuracy of 88.08% and a low EER of 11.79%, outperforming the existing models. These results confirm that the voice feature extraction method introduced in this paper can be an effective tool for distinguishing between generated and real voices.
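
The abstract above reports accuracy and EER (equal error rate) without defining them. As a rough illustration only, the sketch below approximates the EER as the threshold at which the false positive rate (real voices flagged as generated) and the false negative rate (generated voices missed) are closest. The function name, the score convention, and the toy data are assumptions made for this example; they are not the paper's evaluation code or results.

```python
def equal_error_rate(scores, labels):
    """Approximate the EER by trying every observed score as a threshold.

    Assumed convention (not from the paper): a higher score means
    "more likely generated"; label 1 = generated voice, 0 = real voice.
    """
    generated = [s for s, y in zip(scores, labels) if y == 1]
    real = [s for s, y in zip(scores, labels) if y == 0]
    best_gap, eer = float("inf"), None
    for threshold in sorted(set(scores)):
        # False negative: a generated voice scored below the threshold.
        fnr = sum(s < threshold for s in generated) / len(generated)
        # False positive: a real voice scored at or above the threshold.
        fpr = sum(s >= threshold for s in real) / len(real)
        gap = abs(fpr - fnr)
        if gap < best_gap:
            # Keep the point where the two error rates are closest.
            best_gap, eer = gap, (fpr + fnr) / 2
    return eer


# Toy scores, invented purely for illustration.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
toy_labels = [1, 1, 0, 1, 0, 1, 0, 0]
toy_eer = equal_error_rate(toy_scores, toy_labels)
```

A lower EER means the two error types balance at a lower rate, which is why the abstract describes an EER of 11.79% as low relative to the compared models.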

Speech Packet Transmission Using the AMR-WB Coder with FEC (FEC기능을 추가한 AMR-WB 음성 부호화기를 이용한 음성 패킷 전송)

  • 황정준;이인성
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.40 no.11
    • /
    • pp.63-71
    • /
    • 2003
  • This paper proposes a packet loss recovery method for real-time communication over the Internet. To reduce the effects of packet loss, Forward Error Correction (FEC), which adds redundant information to voice packets, can be used. We use the Adaptive Multi-Rate Wideband (AMR-WB) codec, which was recently selected by the Third Generation Partnership Project (3GPP) for GSM and the third-generation mobile communication WCDMA system and has also been standardized by ITU-T for wideband speech services. The major cause of speech quality degradation in IP networks is packet loss. We therefore recover single lost packets using the FEC method and conceal consecutive errors. The proposed scheme is evaluated in the Gilbert Internet channel model. High audio quality is maintained at packet loss rates of up to 30%.
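
The abstract above does not say how the redundant information is arranged, so the following is only a sketch under stated assumptions. It assumes a simple piggyback scheme in which each packet carries its own frame plus a copy of the previous frame, and a simplified two-state Gilbert model in which every packet sent in the bad state is lost. The function names and transition probabilities are hypothetical.

```python
import random


def gilbert_losses(n, p_good_to_bad, p_bad_to_good, seed=0):
    """Return n booleans (True = packet lost) from a two-state Gilbert model.

    Simplifying assumption: every packet sent in the bad state is lost.
    """
    rng = random.Random(seed)
    lost, bad = [], False
    for _ in range(n):
        # Move between the good (no loss) and bad (loss) states.
        if bad:
            bad = rng.random() >= p_bad_to_good
        else:
            bad = rng.random() < p_good_to_bad
        lost.append(bad)
    return lost


def receive_with_fec(frames, lost):
    """Assumed scheme: packet i carries frame i plus a copy of frame i-1.

    Returns the frames available at the receiver. None marks a frame that
    could not be recovered and would need error concealment instead.
    """
    received = []
    for i, frame in enumerate(frames):
        if not lost[i]:
            received.append(frame)   # primary copy arrived
        elif i + 1 < len(frames) and not lost[i + 1]:
            received.append(frame)   # redundant copy arrived in packet i+1
        else:
            received.append(None)    # no copy arrived: conceal instead
    return received


# A single loss (packet 1) is recovered; the first packet of a burst (3) is not.
example = receive_with_fec(list(range(6)), [False, True, False, True, True, False])
```

Under this assumed scheme an isolated loss is always recoverable from the next packet, while a run of consecutive losses leaves gaps that only concealment can fill, which matches the split between recovery and concealment described in the abstract.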

An Integrated File System for Guaranteeing the Quality of Service of Multimedia Stream (멀티미디어 스트림의 QoS를 보장하는 통합형 파일시스템)

  • 김태석;박경민;최정완;김두한;원유집;고건;박승민;김정기
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.31 no.9
    • /
    • pp.527-535
    • /
    • 2004
  • Handling mixed workloads in a digital set-top box or streaming server is becoming an important issue as the integrated file system gains momentum as the choice for the next-generation file system. The next-generation file system is required to handle real-time audio/video playback while also handling text requests such as web pages and image files. Legacy file systems provide only best-effort I/O service and thus cannot properly support the QoS of soft real-time I/O. In this paper, we present our experience in developing a file system that can guarantee the QoS of multimedia streams. We classify all application I/O requests into two categories: periodic I/O and sporadic I/O. The QoS requirement of a multimedia stream can be guaranteed by giving periodic requests a higher priority than sporadic requests. The prototype file system (Qosfs) is developed on the Linux operating system.
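
The priority rule described above, periodic requests ahead of sporadic ones, can be illustrated with a small priority queue. This is a minimal sketch, not the Qosfs scheduler, whose internals the abstract does not describe. The class name, constants, and request strings are hypothetical.

```python
import heapq
from itertools import count

# Hypothetical priority classes: the lower value is served first.
PERIODIC, SPORADIC = 0, 1


class IOQueue:
    """Serve periodic (stream) requests before sporadic (best-effort) ones."""

    def __init__(self):
        self._heap = []
        self._order = count()  # tie-breaker keeps FIFO order within a class

    def submit(self, kind, request):
        heapq.heappush(self._heap, (kind, next(self._order), request))

    def next_request(self):
        _kind, _seq, request = heapq.heappop(self._heap)
        return request


queue = IOQueue()
queue.submit(SPORADIC, "web page")
queue.submit(PERIODIC, "video block 1")
queue.submit(SPORADIC, "image file")
queue.submit(PERIODIC, "audio block 1")
served = [queue.next_request() for _ in range(4)]
```

Strict priority like this can starve sporadic requests when the stream load is heavy, so a real design would need some form of admission control or bandwidth reservation. The abstract does not say how Qosfs handles that.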