• Title/Summary/Keyword: Voice broadcast

Search Result 57, Processing Time 0.033 seconds

DNN based Speech Detection for the Media Audio (미디어 오디오에서의 DNN 기반 음성 검출)

  • Jang, Inseon;Ahn, ChungHyun;Seo, Jeongil;Jang, Younseon
    • Journal of Broadcast Engineering
    • /
    • v.22 no.5
    • /
    • pp.632-642
    • /
    • 2017
  • In this paper, we propose a DNN based speech detection system using acoustic characteristics and context information of media audio. The speech detection for discriminating between speech and non-speech included in the media audio is a necessary preprocessing technique for effective speech processing. However, since the media audio signal includes various types of sound sources, it has been difficult to achieve high performance with the conventional signal processing techniques. The proposed method improves the speech detection performance by separating the harmonic and percussive components of the media audio and constructing the DNN input vector reflecting the acoustic characteristics and context information of the media audio. In order to verify the performance of the proposed system, a data set for speech detection was made using more than 20 hours of drama, and an 8-hour Hollywood movie data set, which was publicly available, was further acquired and used for experiments. In the experiment, it is shown that the proposed system provides better performance than the conventional method through the cross validation for two data sets.

Vocal Separation Using Selective Frequency Subtraction Considering with Energies and Phases (에너지와 위상을 고려한 선택적 주파수 차감법을 이용한 보컬 분리)

  • Kim, Hyuntae;Park, Jangsik
    • Journal of Broadcast Engineering
    • /
    • v.20 no.3
    • /
    • pp.408-413
    • /
    • 2015
  • Recently, According to increasing interest to original sound Karaoke instrument, MIDI type karaoke manufacturer attempt to make more cheap method instead of original recoding method. The specific method is to make the original sound accompaniment to remove only the voice of the singer in the singer music album. In this paper, a system to separate vocal components from music accompaniment for stereo recordings were proposed. Proposed system consists of two stages. The first stage is a vocal detection. This stage classifies an input into vocal and non vocal portions by using SVM with MFCC. In the second stage, selective frequency subtractions were performed at each frequency bin in vocal portions. In this case, it is determined in consideration not only the energies for each frequency bin but also the phase of the each frequency bin at each channel signal. Listening test with removed vocal music from proposed system show relatively high satisfactory level.

A Vision Disabled-Aid using the Context of Internet of Things (사물인터넷을 이용한 시각 장애자 보조 방법)

  • Sahu, Nevadita;Jeong, Min Hyuk;Chun, Jonghoon;Kim, Sang-Kyun
    • Journal of Broadcast Engineering
    • /
    • v.22 no.1
    • /
    • pp.78-86
    • /
    • 2017
  • The Internet of Things can offer disabled people the assistance and support, which is essential to achieve a good quality of life. The visually impaired people need assistance in finding locations, detecting obstacles on the way, and getting directions while moving around to reach their destination. Based on this persistent need, this paper proposes a navigation system for blind people using Internet of Things. The technologies used in our proposed system are: a smart cane containing an RFID reader and an ultrasonic sensor, a smart phone and Internet. The sensed data from the ultrasonic sensor for detecting obstacle is converted to International Standard format from ISO/IEC 23005-5 (MPEG-V Part 5). The system detects the blind person's location using the RFID tags implemented on the way. The system uses voice message in the smart phone to communicate with the blind person to lead him to his destination. The proposed system has been tested to navigate successfully in the campus.

Participation of Television Viewers in Social Community : Social Television (TV 매체를 통한 시청자의 사회적 커뮤니티 참여 : 소셜 TV를 중심으로)

  • Oh, Jong-Sir
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2009.05a
    • /
    • pp.268-272
    • /
    • 2009
  • Reportedly it says that 45% teenagers in the United State exchange the SMS with their friends during television watching. In other word TV viewing moulds the social community between audiences. In terms of social television it is all about interaction or communication technology relevant to TV watching as well as social behaviour. Besides it integrates voice communication, text chat, context awareness, TV recommendations, ratings, video conference and so forth. So far it approaches the conceptual stage or pilot production and remains more research and development. This study is to scrutinise whether the functionality of social TV enables to substitute for social activities of TV viewers or not.

  • PDF

A Study on Optical internet Transmission technic Using DWDM based on network (네트워크 기반에서의 DWDM을 이용한 광 인터넷 전송 기술에 관한 연구)

  • 장우순;정진호
    • Journal of Internet Computing and Services
    • /
    • v.2 no.1
    • /
    • pp.87-96
    • /
    • 2001
  • This article proposes traffic dispersion with optical transmission technical and development of transmission rate for the safe multicast computer communication in the high bandwidth, Recently multicast traffic such as distance conference or Internet broadcast increases therefore the importance of traffic dispersion and transmission rate is emphasized. Ultimately this article offers the way of carrying out the above suggestion, First this paper points out traffic problems occurred in voice and text centered transmission. Next, transmission rate can be controlled by optical transmission technic to solve above difficulties in the multimedia and Internet. We investigated the feature and output on Add-Drop Mux/Demux and Also presented charges of length accord each stage in interference. We can show, the best data of design as a result of this experiment.

  • PDF

Malaysia's 13th General Election: Political Communication and Public Agenda in Social Media

  • Sern, Tham Jen;Zanuddin, Hasmah
    • Asian Journal for Public Opinion Research
    • /
    • v.1 no.2
    • /
    • pp.73-89
    • /
    • 2014
  • Everyone has a voice and can broadcast it to the world. We hear about the old maxim of media do not tell people what to think but what to think about. Under this theory or approach, a key function of political communication is to make the public think about an issue in a way that is favorable to the sender of the message. In a democracy, political communication is seen as crucial for the building of a society where the state and its people feel they are connected. Thus, this is a study on how social media (e.g., Facebook, blogs, and YouTube) were used in the domain of Malaysian politics during the 13th general election campaigning period in order to set the agenda to form public opinion. The study found that Facebook was the most popular social media tool that political parties actively engaged with during the 13th general election campaign period. Apart from that, issues pertaining to the election were significantly highlighted by the political parties in social media, especially Facebook. However, other issues that were also important to the people such as the economy, crime, and education were not sufficiently highlighted during the election campaign period. This indicates that the political parties influence the public on what to think about using social media.

Dialogic Male Voice Triphone DB Construction (남성 음성 triphone DB 구축에 관한 연구)

  • Kim, Yu-Jin;Baek, Sang-Hoon;Han, Min-Soo;Chung, Jae-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.2
    • /
    • pp.61-71
    • /
    • 1996
  • In this paper, dialogic triphone data base construction for triphone synthesis system is discussed. Particularly, in this work, dialogic speech data is collected from the broadcast media, and three different transcription steps are taken. Total 10 hours of speech data are collected. Among them, six hours of speech data are used for the triphone data base construction, and the rest four hours of data are reserved. Dialogic speech data base construction is far different from the reciting speech data base construction. This paper describes various steps that necessary for the dialogic triphone data base construction from collecting speech data to triphone unit labeling.

  • PDF

Comparison of Sound Pressure Level and Speech Intelligibility of Emergency Broadcasting System at T-junction Corridor Space (T자형 복도 공간의 비상 방송용 확성기 배치별 음압 레벨과 음성 명료도 비교)

  • Jeong, Jeong-Ho;Lee, Sung-Chan
    • Fire Science and Engineering
    • /
    • v.33 no.1
    • /
    • pp.105-112
    • /
    • 2019
  • In this study, an architectural acoustics simulation was conducted to examine the clear and uniform transmission of emergency broadcasting sound in a T junction corridor space. The sound absorption performance of the corridor space and the location and spacing of the loudspeaker for emergency broadcasting were varied. The distribution of the sound pressure level and the distribution of sound transmission indices (STI, RASTI) were compared. The simulation showed that the loudspeaker for emergency broadcasting should be installed approximately 10 m from the center of the T junction corridor connection for clear voice transmission. Narrowing the 25 m installation interval of the NFSC shows that an even clearer and sufficient volume of emergency broadcast sound can be delivered evenly.

Efficient Design of a Disaster Broadcasting System using LTE Modem (이동 LTE모뎀을 활용한 재난방송시스템 설계)

  • Moon, Chaeyoung;Kim, Semin;Ryoo, Kwangki
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.292-294
    • /
    • 2018
  • Recently, damage caused by natural disasters such as fire, earthquake, heavy rains and heavy snow is increasing. In addition, traffic accidents due to freezing, fog and fire in tunnels and bridges are frequently occurring. In such a disaster situation, it is very important to take prompt action by the person in charge of managing the facility and area.To this end, a disaster broadcasting system is used, but in the existing system, the broadcasting room and the speaker are connected by a wired connection. Also, the person in charge has to be in the broadcasting room to broadcast, which has a problem of delaying the time. In this paper, we design a disaster broadcasting system using LTE modem. The designed system enables a broadcasting person to make a call to a broadcasting system from anywhere using a cellular phone and a public telephone. Broadcasting via telephone is possible only with the telephone number pre-registered in the system and can be registered / deleted by the administrator. The registered telephone number, incoming voice file, and announcement voice for automatic broadcasting are stored in the system internal SD memory for convenient management. This disaster broadcasting system is expected to contribute to quick and convenient disaster broadcasting.

  • PDF

IPTV Service Provider over FTTH (광가입자망을 통한 IPTV 서비스 제공)

  • Park In-Gyu
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.43 no.5 s.347
    • /
    • pp.7-16
    • /
    • 2006
  • IPTV is referred to the service which provide integrated IPTV services for providing video, 10/100-Mbit/sec Internet, voice, video-on-demand (VOD), and other broadband applications including home security, video conferencing, and telemedicine. All services are integrated into an IP (Internet Protocol) architecture designed specifically for Gigabit Ethernet FTTH systems, HFC or xDLC. It is absolutely necessary that telecon operators provide IP video delivery platforms that enable service providers to transform their business. With their own products, they can better manage their existing services and generate new revenues from broadcast TV, movies on demand and multimedia. Triple-play is a combination of broadcast, telephony and broadband services offered through IPTV networks. With cable operators allowed to offer a triple-play bundle, the nation's telecom operators are beginning to get a little anxious. Cable operators assert that triple-play is a must-have and natural extension of the cable service bundle. The Korean Cable TV Association asserts that the triple-play model is of paramount importance to the cable industry's future growth. But the telecom sector considers itself unfairly disadvantaged, saying they cannot compete until regulatory issues are resolved. The start of web-based television in Korea may still be some time off with a confrontation between the nation's IT regulator and broadcasting sector over the service's legal boundaries shows no signs of being resolved my time soon. korea should be is the fastest-growing provider of IPTV solutions in the industry, with over worldwide customers.