Search | Korea Science

A Study on Speech Separation using Sinusoidal Model and Psycoacoustics Model (정현파 모델과 사이코어쿠스틱스 모델을 이용한 음성 분리에 관한 연구)

Hwang, Sun-Il;Han, Doo-Jin;Kwon, Chul-Hyun;Shin, Dae-Kyu;Park, Sang-Hui
- Proceedings of the KIEE Conference
- /
- 2001.07d
- /
- pp.2622-2624
- /
- 2001
In this thesis, speaker separation is employed when speech from two talkers has been summed into one signal and it is desirable to recover one or both of the speech signals from the composite signal. This paper proposed the method that separated the summed speeches and proved the similarity between the signals by the cross correlation between the signals for exact between original signal and separated signal. This paper uses frequency sampling method based on sinusoidal model to separate the composite signal with vocalic speech and vocalic speech and noise masking method based on psycoacoustics model to separate the composite signal with vocalic speech and nonvocalic speech.
PDF

A Study on Image Coding using the Human Visual System and DCT (시각특성과 DCT를 이용한 영상부호화에 관한 연구)

남승진;최성남;전중남;박규태
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.17 no.4
- /
- pp.323-335
- /
- 1992
In this paper, an adaptive cosine transform coding scheme which incorporate human visual properties into the coding scheme is investigated. Human vision is relatively sensitive to mid-frequency band, and insensitive to very low and very high frequency band. These property was mathematically modelled with MTF(Modulation Transfer Function) through many psychovisual experiment. DCT transforms energy in spatial domain into frequency domain, so can exploit the MTF very efficiently. Another well-known visual characteristics is spatial masking effect that visibility of noise is less in regions of high activity than in regions of low activity. Proposed coding scheme imploys quantization matrix which represent the properties of these spatial frequency response of human vision, and adaptively quality of an image. To compute the activity index of an image block, simple operation is performed in spatial domain, and according to activity index. block of low activity region is more exactly quantized relatively than that of high activity region. Results showed that, at low bit rate, the subjective quality of the reconstructed images by proposed coding scheme is acceptible than that of coding scheme without HVS properties.
PDF

Video Watermarking Using Human Visual System and Wavelet Transform (인간 시각 시스템 및 웨이블릿 변환을 이용한 비디오 워터마킹)

권성근;김병주;김태수;이석환;권기룡;이건일
- Journal of Korea Multimedia Society
- /
- v.6 no.3
- /
- pp.436-443
- /
- 2003
A digital video watermarking algorithm is proposed that uses HVS and DWT. In this algorithm, each video frame is decomposed into four-level by DWT which reveals the characteristics of the human eyes and watermark is embedded into DWT coefficients using HVS. For robustness, the lowest level subbands which represent the highest frequency component are excluded in watermark embedding step and watermark is embedded into the perceptually significant coefficients (PSCs) of the rest subbands. PSCs of the baseband are selected according to the amplitude of the coefficients and PSCs of the high frequency subbands are selected by successive subband quantization (SSQ). Watermark is embedded into the PSCs of the baseband and high frequency subbands by Weber's law and spatial masking effect, respectively, for the invisibility and robustness. We tested the performance of the proposed algorithm compared with the conventional watermarking algorithm by computer simulation. Experimental results show that the proposed watermarking algorithm produces a better invisibility and robustness than the conventional algorithm.
PDF

Extraction of Characteristics of Concrete Surface Cracks

Ahn, Sang-Ho
- Journal of information and communication convergence engineering
- /
- v.5 no.2
- /
- pp.126-130
- /
- 2007
This paper proposes a method that automatically extracts characteristics of cracks such as length, thickness and direction, etc., from a concrete surface image with image processing techniques. This paper, first, uses the closing morphologic operation to adjust the effect of light extending over the whole concrete surface image. After applying the high-pass filtering operation to sharpen boundaries of cracks, we classify intensity values of the image into 8 groups and remove intensity values belong to the highest frequency group among them for the removal of background. Then, we binarize the preprocessed image. The auxiliary lines used to measure cracks of concrete surface are removed from the binarized image with position information extracted by the histogram operation. Then, cracks broken by the removal of background are extended to reconstruct an original crack with the $5{\times}5$ masking operation. We remove unnecessary information by applying three types of noise removal operations successively and extracts areas of cracks from the binarized image. At last, the opening morphologic operation is applied to compensate extracted cracks and characteristics of cracks are measured on the compensated ones. Experiments using real images of concrete surface showed that the proposed method extracts cracks well and precisely measures characteristics of cracks.
PDF KSCI

A Common Synthesis Filter for MPEG-2 BC/AAC Audio Using Recursive Structure (Recursive 구조를 이용한 MPEG-2 BC/AAC 오디오 공용 합성 필터)

강명수;박세기;오신범;이채욱
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.29 no.6C
- /
- pp.874-882
- /
- 2004
MPEG Audio compression algorithm is the international standard for the digital compression of high quality audio using mechanism of the perceptual coding based on psychoacoustic masking. It is necessary to discuss the constraints on designing of common filter banks for MPEG-2 BC and MPEG-2 AAC decoder system, which is not Down yet, mapping audio signals from the time domain into the frequency domain. In this paper, we present an architecture of common synthesis filter whcih can be used for MPEG-2 BC and MPEG-2 AAC decoder using recursive structure. The proposed algorithm is based on recursive architecture that effectively performs common compulsion.
PDF KSCI

HVS Based Digital Watermarking Using the POCS Theory (POCS 이론을 이용한 인간시각시스템 기반 디지털 워터마킹)

Kim, Hee-Jung;Seo, Yong-Su;Kim, Ji-Hong
- Journal of Korea Multimedia Society
- /
- v.8 no.4
- /
- pp.516-524
- /
- 2005
In this paper, a new watermarking scheme based on the POCS theory and human visual system is proposed. Using the POCS theory, watermarks are embedded into imperceptible image regions such as edge and strong texture area in the spatial domain. Also it is inserted into middle frequency band in the transform domain to achieve the robustness against compression and filtering, etc. In addition, different gain factors are employed into blocks classified by considering texture masking effect. By doing so, the proposed method has a novel property of having both the imperceptibility and the robustness simultaneously. Simulation results show that the proposed method has an excellent performance better than conventional approaches.
PDF

Digital Audio Watermarking in The Cepstrum Domain (켑스트럼 영역에서의 오디오 워터마킹 방법)

이상광;호요성
- Journal of Broadcast Engineering
- /
- v.6 no.1
- /
- pp.13-20
- /
- 2001
In this paper, we propose a new digital audio watermarking scheme In the cepstrum domain. We insert a digital watermark signal Into the cepstral components of the audio signal using a technique analogous to spread spectrum Communications, hiding a narrow band signal in a wade band channel. In our proposed method, we use pseudo-random sequences to watermark the audio signal. The watermark Is then weighted in the cepstrum domain according to the distribution of cepstral coefficients and the frequency masking characteristics of the human auditory system. The proposed watermark embedding scheme minimizes audibility of the watermark signal. and the embedded watermark is robust to mu1tip1e watermarks, MPEG audio ceding and additive noose.
PDF

Recent Views of Tardive Dyskinesia (지연성 운동장애(Tardive Dyskinesia)의 최근 견해)

Kim, Yong-Sik;Kang, Ung-Gu;Joo, Yeon-Ho
- Korean Journal of Biological Psychiatry
- /
- v.3 no.1
- /
- pp.30-36
- /
- 1996
Tardive dyskinesia is a syndrome of involuntary hyperkinetic abnormal movements that occurs during or shortly after the cessation of neuroleptic drug treatment. Typically, the movements are choreoatheoid. Other movements such as tics and dystonia may be present. Nonetheless, any dyskinesia seen in a neuroleptic-treated patient is not always neuroleptic-induced tardive dyskinesia. The prevalence of tardive dyskinesia varies widely, which reflects many methodological problems, such as differential diagnosis. symptom fluctuation, masking effect of neuroleptics, validated diagnostic criteria. Of suggested risk factors, only old age has been consistently found to be associated with an increased frequency of tardive dyskinesia. Many hypotheses about the pathophysiolgy of tardive kinesia are proposeed, but time-honored ones are not present. No consistently safe and effective treatments are found. Various treatment modalities signifies the general ineffectiveness of these agents for most patients. In general, reduction or cessation of neuroleptics, if possible, is recommended. Remission or improvemets of tardive dyskinesia after neuroleptics withdrawal usually occurs among most patients within three months.
PDF

Analysis of Windowing Effects in the Estimation of Beat Frequencies (비트 주파수 추정에서의 윈도잉 효과 분석)

Lee, Jong-Gil
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2010.05a
- /
- pp.668-670
- /
- 2010
It is necessary to estimate the range and Doppler shifted spectrum for the extraction of useful information from the return echoes in the frequency modulated continuous wave radar systems used for the remote sending purpose such as detection of moving targets. However, the spectrum estimation using the FFT method causes the very large sidolobes of clutter masking the essential signal information if the acquisition time of an echo signal is pretty short. Therefore, in this paper, the efficient data windowing method is investigated to suppress the strong sidelobe levels of the clutter and results are analyzed.
PDF

The Noise Characteristics and Appropriate Talk Distance in Dental Clinic (치과병원의 소음특성과 적절한 대화거리)

Ji, Dong-Ha;Choi, Mi-Suk
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.14 no.5
- /
- pp.2516-2523
- /
- 2013
Noise occurred when medical treatment in dental clinic will affect the patients. This study was measured the noise level and frequency in case of medical examination and also has evaluated the degree of indoor noise using the NR-curve, NRN and a distance to conversation between worker and patients using the PSIL. It shows that noise level was 69.3~81.5dB(A) and frequency was very high (more than 4K(Hz)) and analysis by NR-curve showed that it was exceed the noise permit level and distance to conversation was less than 1meter by PSIL. To remedy a fear of noise in patients and provide a conversational satisfaction, it's considered that choosing the low noise-vib. equipment, using the masking effect and set the room to explain. So It is possible to improve their competitiveness.
https://doi.org/10.5762/KAIS.2013.14.5.2516 인용 PDF KSCI

Search Result 102, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)