• Title/Summary/Keyword: Voice signal

Search Result 433, Processing Time 0.025 seconds

Noise Removal using Modified Switching Filter in Mixed Noise Environments (복합잡음 환경에서 변형된 스위칭 필터를 이용한 잡음 제거)

  • Kwon, Se-Ik;Kim, Nam-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.20 no.6
    • /
    • pp.1215-1220
    • /
    • 2016
  • As society has developed rapidly toward a highly advanced digital information age, a multimedia communication service for acquisition, transmission and storage of image data as well as voice has being commercialized. However, image data is always corrupted by various noises during image processing, so researches for removing noises have been continued until now. There are diverse types of noise on the image including salt and pepper noise, AWGN, and mixed noise. Hence, the filter algorithm for the image recovery was proposed that salt and pepper noise was processed by linear interpolation, histogram weighted values and median filter after defining the noise to lessen the impact of mixed noise added in the image, and AWGN was processed by the pixel information of local mask establishing the weighted values in this study. In addition, the algorithm was compared with the conventional methods for objectively and used the PSNR(peak signal to noise ratio) as the basis of the determination.

DNN based Speech Detection for the Media Audio (미디어 오디오에서의 DNN 기반 음성 검출)

  • Jang, Inseon;Ahn, ChungHyun;Seo, Jeongil;Jang, Younseon
    • Journal of Broadcast Engineering
    • /
    • v.22 no.5
    • /
    • pp.632-642
    • /
    • 2017
  • In this paper, we propose a DNN based speech detection system using acoustic characteristics and context information of media audio. The speech detection for discriminating between speech and non-speech included in the media audio is a necessary preprocessing technique for effective speech processing. However, since the media audio signal includes various types of sound sources, it has been difficult to achieve high performance with the conventional signal processing techniques. The proposed method improves the speech detection performance by separating the harmonic and percussive components of the media audio and constructing the DNN input vector reflecting the acoustic characteristics and context information of the media audio. In order to verify the performance of the proposed system, a data set for speech detection was made using more than 20 hours of drama, and an 8-hour Hollywood movie data set, which was publicly available, was further acquired and used for experiments. In the experiment, it is shown that the proposed system provides better performance than the conventional method through the cross validation for two data sets.

A Study on the Channel Converting and Monitoring of the Remote Control Transceiver (원격제어 송수신기의 채널변환과 모니터용 모듈의 구현)

  • 조학현;최조천;김기문
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.3 no.2
    • /
    • pp.347-354
    • /
    • 1999
  • Generally, transceiver is operated to the remote control for the purpose of breading the traffic zone which is established on a mountain peak, island and building top using private line. Therefor, the remote control system of public radio station have been the very important role that is decision the quality relate to the quickness, accuracy, safety of communication on old type transceiver of SSB, VHF etc. In the case of using the only 1 private line which is exchanged voice signal with data signal had mixed or interrupted for up/down of channel, PTT control and monitoring of transmission channel and power. The up/down of channel and PTT control is according to the ASK and the data of monitoring is transfered to the FSK modulation, additional algorithm is studied on the serial protocol and traffic sequence using the MCS-51 processor in the simplex communication methode.

  • PDF

Mobile Guidance System for Evacuation based on Wi-Fi System and Node Architecture

  • Raju, Timalsina;Kim, Woo Sung
    • Journal of Information Technology Applications and Management
    • /
    • v.26 no.5
    • /
    • pp.41-56
    • /
    • 2019
  • Recently great loss of life and property is occurring because of fire, natural disaster, earth quake, tsunami and so on. People spend 80~90% of their time indoor environment like office, supermarket, campus. Therefore Indoor navigation and guidelines system became so essential for most of all. Incase of emergency we must be careful earlier, in such a cases 5G kind of new technology may also cannot work. So immediate action and quick routing notification for guidelines and protection is the most. Considering this issue We proposed indoor evacuating guidance system based on microcontroller Wi-Fi board for Indoor APP using mobile. Focusing various kind of technology like, ok google, voice search APP we purposed node architecture based system. When we listen fire alarm while living inside the room. Then to be safe we connect with server and start Arduino UNO+IoT ESP8266 Wi-Fi shield version1-IoT module to store data in MySQL DB server. We make application to escape out from the building up-to the three exits giving information from source point to destination. Our program can send information to the users emergency location and situations. For this when the user get sound or vibration in their mobile device it indicate fire out near by. At that time we update message from Arduino to DB server for the fixed current position inside the building which give routing signal for that fire out location by changing values from 0 to 1. We have user in point 10 where user is near by. Later we detect Wi-Fi signal form Nodemcu as room of each floor and try to connect with user. Main purpose of this paper is to save life of people in short time and find out the shortest path up to nearest exits in the time of emergencies and rescue them.

Coast Evaluation Techniques for Mode Selection in Video Coding (동영상에서 모드 선택을 위한 코스트 평가 방법)

  • Song, Dae-Geon
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.13 no.6
    • /
    • pp.275-280
    • /
    • 2013
  • Recently, access networking BroadBand the high performance of the video equipment to the Internet via voice, video, multimedia services, such as dealing with the media information dissemination is becoming increasingly attracting attention. More video devices and network environments in the future to keep pace with the high-quality video using the form dealing with an increasingly diversified and shall utilization is expected. Among them, video encoding technology, image compression encoding technology of information technology is one of the central role. Video coding technology that requires a vast amount of information contained in the video signal and the appropriate amount of information to eliminate redundancy as the efficiency of the digital code representing video signal is developed as a technology is going. Therefore, this study applied to video coding mode selection in the cost evaluation methods to examine and to maximize the coding efficiency and the proposed method compared to the conventional method was confirmed excellence.

Sasang Constitution Classification by Speech Signal Processing (음성 신호 분석에 의한 사상 체질 분류)

  • Cho Dong-Uk
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.5C
    • /
    • pp.548-555
    • /
    • 2006
  • This paper proposes on the Sasang constitution classification method which is the most important things in the Sasang constitution medicine. Pre-existing methods of Sasang constitution classification are a shape of the body and its countenance & morpological aspect and temper. Many diagnostic methods have been developed and used including the questionnaires on personal life style and propensities(QSCC, QSCC II), and the tonal analysis of person's voice. Recently the constitutional acupunture and the herbal medicine response analyses are developed and used additionally. But these methods which is done by the doctor's intuition. In this article, I propose a methodology to classify the Sasang constitution. pitch, intensity and formants are used to classify the Sasang constitution by comparing the similarities and differencies of tonal analysis. Finally, the validity of the method is proven through the experiments.

A Study of Enemy Aptitude of Pistol Sound Source for Space Estimation (공간평가를 위한 피스톨음원의 적정성에 관한 연구)

  • Shon, Jang-Ryul;Kim, Jung-Joong
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.15 no.3 s.96
    • /
    • pp.320-328
    • /
    • 2005
  • Last target of architectural acoustics is that people wish to convey voice effectively from the space adaptively in use purpose in building. But, how exactly through space sound (sound source) that wish to deliver from indoor can be passed method to do quantification and evaluate quantity of sound by method to serve indoor architectural acoustics estimation summer period and methods to estimate definition propose. This Study searches special quality of sound source about MLS signal that is occurred short-answer sound source (pistol sound source) and nondirectional speaker among indoor sound estimation method, and measure and analyzed reverberation time (RT60), definition (C80, D50) by regulation of each ISO 3382 in age place (classroom, hall, gymnasium). Analysis result and sound factor among could know that d of two sound sources converges in measurement error extent about reverberation time (RT60) of analysis incidental and sound factors and value shows change irregularly about sound factor of D50, C80, pistol sound source judged there is problem. Also, could know that problem is happened in deflection except reverberation time is in deflection analysis with wave that measure each in fixed distance in branch. Finally, when differ size of sound source and measure about change of sound pressure level in case measure sound pressure level giving difference about 10 dB, sound factor could know that there is no different effect.

A Study on Kidney Diseases Diagnosis System for Sensation Type Using Physiological Signal Analysis (생체 신호 분석을 이용한 감각형 신장 질환 진단 시스템 연구)

  • Cho, Dong-Uk;Kim, Bong-Hyun;Lee, Se-Hwan
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.10C
    • /
    • pp.964-972
    • /
    • 2006
  • The kidney keeps with close relationship in the internal organs, that the kidney function filtering eliminate the wastes to the urine on the processing to replace the old with the new blood. In case of these problem in the kidney, there is no way to catch out with self-awakening symptom except for serious illness. This problem can solve with keeping the systematic diagnosis method in the kidney trouble shooting. Under the circumstances, the importance of the diagnosis for the kidney disease is growing day after day. In this paper, among the great four diagnosises, using the way of ocular inspection & auscultation, we would like to propose rouble shooting in the way of the kidney. To do this, through the assistance of the input image, extract the value of the color with appropriate output, analysing the color of the face with related to the kidney, using the results we would like to get the accurate symptoms on the kidney's problems. Also, through analysing and comparing with the relationship the kidney and the signal of voice, we would like to realize the proof system of human health. Finally, we'd like to make proof of the usefulness for proposed method from this study.

Multi-Modal Instruction Recognition System using Speech and Gesture (음성 및 제스처를 이용한 멀티 모달 명령어 인식 시스템)

  • Kim, Jung-Hyun;Rho, Yong-Wan;Kwon, Hyung-Joon;Hong, Kwang-Seok
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2006.06a
    • /
    • pp.57-62
    • /
    • 2006
  • 휴대용 단말기의 소형화 및 지능화와 더불어 차세대 PC 기반의 유비쿼터스 컴퓨팅에 대한 관심이 높아짐에 따라 최근에는 펜이나 음성 입력 멀티미디어 등 여러 가지 대화 모드를 구비한 멀티 모달 상호작용 (Multi-Modal Interaction MMI)에 대한 연구가 활발히 진행되고 있다. 따라서, 본 논문에서는 잡음 환경에서의 명확한 의사 전달 및 휴대용 단말기에서의 음성-제스처 통합 인식을 위한 인터페이스의 연구를 목적으로 Voice-XML과 Wearable Personal Station(WPS) 기반의 음성 및 내장형 수화 인식기를 통합한 멀티 모달 명령어 인식 시스템 (Multi-Modal Instruction Recognition System : MMIRS)을 제안하고 구현한다. 제안되어진 MMIRS는 한국 표준 수화 (The Korean Standard Sign Language : KSSL)에 상응하는 문장 및 단어 단위의 명령어 인식 모델에 대하여 음성뿐만 아니라 화자의 수화제스처 명령어를 함께 인식하고 사용함에 따라 잡음 환경에서도 규정된 명령어 모델에 대한 인식 성능의 향상을 기대할 수 있다. MMIRS의 인식 성능을 평가하기 위하여, 15인의 피험자가 62개의 문장형 인식 모델과 104개의 단어인식 모델에 대하여 음성과 수화 제스처를 연속적으로 표현하고, 이를 인식함에 있어 개별 명령어 인식기 및 MMIRS의 평균 인식율을 비교하고 분석하였으며 MMIRS는 문장형 명령어 인식모델에 대하여 잡음환경에서는 93.45%, 비잡음환경에서는 95.26%의 평균 인식율을 나타내었다.

  • PDF

Efficient Resource Allocation Technique for LTE-Advanced based Interference Avoidance of Heterogeneous Network (LTE-Advanced 기반 이기종 네트워크 시스템의 간섭회피를 위한 효율적인 자원할당 기법)

  • Jang, Sung-Won;Seong, Hyeon-Kyeong
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.17 no.1
    • /
    • pp.46-52
    • /
    • 2016
  • LTE-Advanced system consisting of the number of cells in the cellular environment because it is built to allow efficient use of limited frequency resources of adjacent cell interference avoidance should be considered. Transition services in accordance with the development of the mobile communication technology, wireless multimedia content from voice-centric mobile communications services and causing a lot of mobile data traffic, such as smart phones and tablet terminals spread of a data-driven surge in mobile data traffic base stations in urban areas by increasing became a reality that can not be prevented. In this paper, we propose a new Hybrid resource allocation technique for improving the performance of the cell boundary and analyzed the performance of the proposed new techniques to perform the simulation using LTE-Advanced system level simulator based on 19cell of cellular system model.