Search | Korea Science

IMAGE DENOISING BASED ON MIXTURE DISTRIBUTIONS IN WAVELET DOMAIN

Bae, Byoung-Suk;Lee, Jong-In;Kang, Moon-Gi
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2009.01a
- /
- pp.246-249
- /
- 2009
Due to the additive white Gaussian noise (AWGN), images are often corrupted. In recent days, Bayesian estimation techniques to recover noisy images in the wavelet domain have been studied. The probability density function (PDF) of an image in wavelet domain can be described using highly-sharp head and long-tailed shapes. If a priori probability density function having the above properties would be applied well adaptively, better results could be obtained. There were some frequently proposed PDFs such as Gaussian, Laplace distributions, and so on. These functions model the wavelet coefficients satisfactorily and have its own of characteristics. In this paper, mixture distributions of Gaussian and Laplace distribution are proposed, which attempt to corporate these distributions' merits. Such mixture model will be used to remove the noise in images by adopting Maximum a Posteriori (MAP) estimation method. With respect to visual quality, numerical performance and computational complexity, the proposed technique gained better results.
PDF

Automatic Basal Cell Carcinoma Detection using Confocal Raman Spectra (공초점 라만스펙트럼을 이용한 자동 기저세포암 검출)

Min, So-Hee;Park, Aaron;Baek, Seong-Joon;Kim, Jin-Young
- Proceedings of the IEEK Conference
- /
- 2006.06a
- /
- pp.255-256
- /
- 2006
Raman spectroscopy has strong potential for providing noninvasive dermatological diagnosis of skin cancer. In this study, we investigated two classification methods with maximum a posteriori (MAP) probability and multi-layer perceptron (MLP) classification. The classification framework consists of preprocessing of Raman spectra, feature extraction, and classification. In the preprocessing step, a simple windowing method is proposed to obtain robust features. Classification results with MLP involving 216 spectra preprocessed with the proposed method gave 97.3% sensitivity, which is very promising results for automatic Basal Cell Carcinoma (BCC) detection.
PDF

Online Adaptation of Continuous Density Hidden Markov Models Based on Speaker Space Model Evolution (화자공간모델 진화에 근거한 연속밀도 은닉 마코프모델의 온라인 적응)

Kim Dong Kook;Kim Young Joon;Kim Hyun Woo;Kim Nam Soo
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.69-72
- /
- 2002
본 논문에서 화자공간모델 evolution에 기반한 continuous density hidden Markov model (CDHMM)의 online 적응에 대한 새로운 기법을 제안한다. 학습화자의 a priori knowledge을 나타내는 화자공간모델은 factor analysis (FA) 또는 probabilistic principal component analysis (PPCA)와 같은 은닉변수모델(latent variable model)에 의해 효과적으로 나타내어진다. 은닉 변수모델은 화자공간모델뿐아니라 CDHMM 파라메터의 ajoint prior분포를 표시함으로, maximum a posteriori(MAP)적응기법에 직접 적용되어진다. 화자공간모델의 hyperparameters와 CDHMM파라메터를 동시에 순차적으로 적응하기 위해 quasi-Bayes (QB)추정 기술에 기반한 online 적응기법을 제안한다. 연속숫자음 인식과 관련된 화자적응 실험을 통해 제안된 기법은 적은 적응데이터에서 좋은 성능을 나타내며, 데이터가 증가함에 따라 성능이 지속적으로 증가함을 보여준다.
PDF

Break Strength Prediction Using Maximum a Posterior Probability (MAP 확률을 이용한 끊어 읽기 강도 예측)

Kim Sanghun;Park Jun;Lee Youngjik
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.75-78
- /
- 2000
본 논문은 자연스러운 합성음 생성을 위한 끊어 읽기 강도 예측에 관한 것으로, 문장에 대한 품사열이 주어졌을 때 Posteriori 확률을 최대화하는 끊어 읽기 강도를 비터비 디코딩으로 예측한다. 훈련용 데이터는 여성화자 1인이 발성한 2,100 문장이며, 음성 데이터로부터 휴지길이(pause)에 따라 끊어 읽기 강도를 2단계로 할당하고, 텍스트에서는 30개의 품사 태그 심볼을 이용하여 형태소분석 및 태깅을 수행하였다. 관측확률은 3개 연속하는 품사열이 발생할 확률로 하고 끊어 읽기 강도 천이확률은 bigram으로 했을 때, cross validation 방법으로 성능 평가를 수행하였다 평가결과, 훈련데이타에 대해서는 $89.7\%$, 테스트 데이터에 대해서는 $84.9\%$의 예측정확률을 보였다.
PDF

Head Pose Classification using Multi-scale Block LBP and Random Forest (다중 크기 블록 지역 이진 패턴을 이용한 랜덤 포레스트 기반의 머리 방향 분류 기법)

Kang, Minjoo;Lee, Hayeon;Kang, Je-Won
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2016.06a
- /
- pp.253-255
- /
- 2016
본 논문에서는 다중 지역 이진 패턴(Multi-scale Bock LBP, MB-LBP) 특징과 랜덤 포레스트에 기반한 새로운 기법의 머리 방향 분류 기법을 제안한다. 제안 기법에서는 occlusion 과 조명의 변화에 강인한 분류 정확도를 얻기 위해서 랜덤화된 트리를 학습하는 것을 목표로 한다. 우선, 얼굴 이미지로부터 많은 MB-LBP 특징을 추출하고, 얼굴 영상들을 랜덤하게 입력하고 MB-LBP 크기 파라미터와 같은 랜덤 특징과 블록 좌표들을 사용하여 트리를 생성한다. 게다가 각 노드에서 정보 이득을 최대화 하는 트리의 내부 노드를 생성하기 위해서 uniform LBP 의 특성을 고려한 분할 함수를 개발한다. 랜덤화된 트리는 랜덤 포레스트에 포함되어 있으며 마지막 결정단계에서 Maximum-A-Posteriori criterion 으로 최종 결정을 한다. 실험 결과는 제안 기법이 다양한 조명, 자세, 표현, occlusion 상황에서 기존의 방법보다 개선된 성능으로 머리 방향을 분류 할 수 있음을 보여준다.
PDF

The Comparison of Speaker Adaptation Methods (화자 적응 방법들의 비교)

황영수
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.1
- /
- pp.61-66
- /
- 1999
In this paper, we proposed various speaker adaptation methods and studied the performance of these methods. Methods which were studied in this paper are MAPE(Maximum A Posteriori Probability Estimation), Linear Spectral Estimating, Multi-Layer Perceptron and ARTMAP. In order to evaluate the performance of these methods, we used Korean isolated digits as the experimental data, the hybrid speaker adaptation method, which unified MAPE, linear spectral estimating and output probability of SCHMM, showed the better recognition result than those which performed other methods. And the method using ARTMAP showed the similar result to above hybrid method.
PDF

Speech Recognition Using the Energy and VQ (에너지와 VQ를 이용한 음성 인식)

Hwang, Young-Soo
- The Journal of The Korea Institute of Intelligent Transport Systems
- /
- v.6 no.3
- /
- pp.87-94
- /
- 2007
In this paper, the performance of the speech recognition and speaker adaptation methods are studied. The speech recognition using energy state and VQ(Vector Quantization) is suggested and the speaker adaptation methods(Maximum a posteriori probability estimation, linear specrum estimation) are considered. The experimental results show that recognition ration using energy state is 2-3 % better than that of general VQ.
PDF

Recognition of Emotion and Emotional Speech Based on Prosodic Processing

Kim, Sung-Ill
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.3E
- /
- pp.85-90
- /
- 2004
This paper presents two kinds of new approaches, one of which is concerned with recognition of emotional speech such as anger, happiness, normal, sadness, or surprise. The other is concerned with emotion recognition in speech. For the proposed speech recognition system handling human speech with emotional states, total nine kinds of prosodic features were first extracted and then given to prosodic identifier. In evaluation, the recognition results on emotional speech showed that the rates using proposed method increased more greatly than the existing speech recognizer. For recognition of emotion, on the other hands, four kinds of prosodic parameters such as pitch, energy, and their derivatives were proposed, that were then trained by discrete duration continuous hidden Markov models(DDCHMM) for recognition. In this approach, the emotional models were adapted by specific speaker's speech, using maximum a posteriori(MAP) estimation. In evaluation, the recognition results on emotional states showed that the rates on the vocal emotions gradually increased with an increase of adaptation sample number.
PDF KSCI

Adaptive Image Enhancement Algorithm using Local Statistics (국부통계특성을 이용한 적응적 영상 Enhancement 알고리듬)

Kim Kyoung Ho;Hong Min-Cheol
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2004.11a
- /
- pp.71-74
- /
- 2004
본 논문에서는 MAP(maximum a posteriori) 추정방식과 국부통계특성을 이용한 적응적 영상 향상 방법을 제안한다. 원 영상의 에지를 보존 할 수 있는 MAP추정 방식과 인간의 시각 특성을 나타내는 시각 함수를 이용한 가중치 행렬을 사용하였다. MAP 추정 방식은 컨벡스 함수를 최적화하여 원 영상의 에지를 보존하는 방법을 이용하였으며, 시각 함수는 국부 정보의 평균, 분산을 이용하여 정의하였다. 제안 방식으로부터 국부영역의 비용함수에 의해 발생되는 스무딩 정도를 다르게 하여 보간된 영상의 화질을 개선시킨다. 제안된 방식의 성능을 실험 결과로부터 확인한 수 있었다.
PDF

Iterative Turbo Decoding Using Three Cascade MAP Decoder (3개의 직렬 MAP 복호기를 이용한 반복 터보 복호화기)

김동원;이호웅;강철호
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.26 no.6B
- /
- pp.709-716
- /
- 2001
반복 복호 알고리듬에 의해 복호화된 터보 코드는 가산성 백색 가우시안 잡음(AWGN) 채널 환경에서 이론적으로 Shannon의 한계에 근접한 뛰어난 코딩 이득을 나타내는 것으로 보여지고 있다. 그러나, 터보 코드의 성능은 터보 부호화기에서 프레임의 크기 즉, 인터리버의 크기에 의존한다. IMT-2000과 같은 이동 통신 채널 환경에서 음성을 전송하는 경우에는 터보 코드의 프레임 크기는 매우 작다. 그리고, 그것은 터보 코드의 성능을 떨어뜨리는 직접적인 원인이 된다. 본 논문에서는 차세대 이동 통신 시스템에서 프레임 크기가 작은 음성 프레임을 이용하여 터보 코드의 성능을 검증하며, 작은 프레임 크기에 알맞은 3개의 직렬 MAP(Maximum A Posteriori probability) 복호기를 이용한 반복 복호의 터보 코드를 제안하고 부호율 1/3, 구속장의 길이 3 또는 4, 프레임 크기 24, 192 비트에 대하여 컴퓨터 모의실험을 통해 터보 코드의 성능을 분석한다.
PDF

Search Result 162, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)