Search | Korea Science

Statistical Model-Based Voice Activity Detection Using the Second-Order Conditional Maximum a Posteriori Criterion with Adapted Threshold (적응형 문턱값을 가지는 2차 조건 사후 최대 확률을 이용한 통계적 모델 기반의 음성 검출기)

Kim, Sang-Kyun;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.1
- /
- pp.76-81
- /
- 2010
In this paper, we propose a novel approach to improve the performance of a statistical model-based voice activity detection (VAD) which is based on the second-order conditional maximum a posteriori (CMAP). In our approach, the VAD decision rule is expressed as the geometric mean of likelihood ratios (LRs) based on adapted threshold according to the speech presence probability conditioned on both the current observation and the speech activity decisions in the pervious two frames. Experimental results show that the proposed approach yields better results compared to the statistical model-based and the CMAP-based VAD using the LR test.
https://doi.org/10.7776/ASK.2010.29.1.076 인용 PDF KSCI

Image Adaptive Block DCT-Based Perceptual Digital Watermarking (영상 특성에 적응적인 블록 DCT 기반 지각적 디지털 워터마킹)

최윤희;최태선
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.6
- /
- pp.221-229
- /
- 2004
We present new digital watermarking scheme that embeds a watermark according to the characteristics of the image or video. The scheme is compatible with established image compression standard. We define a weighting function using a parent-child structure of the DCT coefficients in a block to embed a maximum watermark. The spatio-frequency localization of the DCT coefficients can be achieved with this structure. In the detection stage, we present an optimum a posteriori threshold with a given false detection error probability based on the statistical analysis. Simulation results show that the proposed algorithm is efficient and robust against various signal processing techniques. Especially, they are robust against widely used coding standards, such as JPEG and MPEG.
PDF KSCI

A Statistical Model-Based Voice Activity Detection Employing the Conditional MAP Criterion with Spectral Deviation (조건 사후 최대 확률과 음성 스펙트럼 변이 조건을 이용한 통계적 모델 기반의 음성 검출기)

Kim, Sang-Kyun;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea
- /
- v.30 no.6
- /
- pp.324-329
- /
- 2011
In this paper, we propose a novel approach to improve the performance of a statistical model-based voice activity detection (VAD) which is based on the conditional maximum a posteriori (CMAP) with deviation. In our approach, the VAD decision rule is expressed as the geometric mean of likelihood ratios (LRs) based on adapted threshold according to the speech presence probability conditioned on both the speech activity decisions and spectral deviation in the pervious frame. Experimental results show that the proposed approach yields better results compared to the CMAP-based VAD using the LR test.
https://doi.org/10.7776/ASK.2011.30.6.324 인용 PDF KSCI

Adaptive Threshold for Speech Enhancement in Nonstationary Noisy Environments (비정상 잡음환경에서 음질향상을 위한 적응 임계 치 알고리즘)

Lee, Soo-Jeong;Kim, Sun-Hyob
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.7
- /
- pp.386-393
- /
- 2008
This paper proposes a new approach for speech enhancement in highly nonstationary noisy environments. The spectral subtraction (SS) is a well known technique for speech enhancement in stationary noisy environments. However, in real world, noise is mostly nonstationary. The proposed method uses an auto control parameter for an adaptive threshold to work well in highly nonstationary noisy environments. Especially, the auto control parameter is affected by a linear function associated with an a posteriori signal to noise ratio (SNR) according to the increase or the decrease of the noise level. The proposed algorithm is combined with spectral subtraction (SS) using a hangover scheme (HO) for speech enhancement. The performances of the proposed method are evaluated ITU-T P.835 signal distortion (SIG) and the segment signal to-noise ratio (SNR) in various and highly nonstationary noisy environments and is superior to that of conventional spectral subtraction (SS) using a hangover (HO) and SS using a minimum statistics (MS) methods.
https://doi.org/10.7776/ASK.2008.27.7.386 인용 PDF KSCI

QRS detection based on maximum a-posteriori estimation (MAP Estimation을 이용한 QRS Detection)

정희교;신건수;이명호
- 제어로봇시스템학회:학술대회논문집
- /
- 1987.10b
- /
- pp.709-712
- /
- 1987
In this paper, a mathematical model for the purpose of QRS detection is considered in the case of the occurrence of nonoverlapping pulse-shaped waveforms corrupted with white noise. The number of waveforms, the arrival times, amplitudes, and widths of QRS complexes are regarded as random variables. The joint MAP estimation of all the unknown quantities consists of linear filtering followed by an optimization procedure. Because of time-consuming, the optimization procedure is modified so that a threshold test is obtained. The model formulation with nonoverlapping waveforms leads to a standard procedure covering a segment before as well as after an accepted event. Adaptivity of the detector is gained by utilizing past signal properties in determining threshold for QRS detection.
PDF

An Improved Speech Absence Probability Estimation based on Environmental Noise Classification (환경잡음분류 기반의 향상된 음성부재확률 추정)

Son, Young-Ho;Park, Yun-Sik;An, Hong-Sub;Lee, Sang-Min
- The Journal of the Acoustical Society of Korea
- /
- v.30 no.7
- /
- pp.383-389
- /
- 2011
In this paper, we propose a improved speech absence probability estimation algorithm by applying environmental noise classification for speech enhancement. The previous speech absence probability required to seek a priori probability of speech absence was derived by applying microphone input signal and the noise signal based on the estimated value of a posteriori SNR threshold. In this paper, the proposed algorithm estimates the speech absence probability using noise classification algorithm which is based on Gaussian mixture model in order to apply the optimal parameter each noise types, unlike the conventional fixed threshold and smoothing parameter. Performance of the proposed enhancement algorithm is evaluated by ITU-T P.862 PESQ (perceptual evaluation of speech quality) and composite measure under various noise environments. It is verified that the proposed algorithm yields better results compared to the conventional speech absence probability estimation algorithm.
https://doi.org/10.7776/ASK.2011.30.7.383 인용 PDF KSCI

Fully Automatic Liver Segmentation Based on the Morphological Property of a CT Image (CT 영상의 모포러지컬 특성에 기반한 완전 자동 간 분할)

서경식;박종안;박승진
- Progress in Medical Physics
- /
- v.15 no.2
- /
- pp.70-76
- /
- 2004
The most important work for early detection of liver cancer and decision of its characteristic and location is good segmentation of a liver region from other abdominal organs. This paper proposes a fully automatic liver segmentation algorithm based on the abdominal morphology characteristic as an easy and efficient method. Multi-modal threshold as pre-processing is peformed and a spine is segmented for finding morphological coordinates of an abdomen. Then the liver region is extracted using C-class maximum a posteriori (MAP) decision and morphological filtering. In order to estimate results of the automatic segmented liver region, area error rate (AER) and correlation coefficients of rotational binary region projection matching (RBRPM) are utilized. Experimental results showed automatic liver segmentation obtained by the proposed algorithm provided strong similarity to manual liver segmentation.
PDF

A Study on Adaptive Model Updating and a Priori Threshold Decision for Speaker Verification System (화자 확인 시스템을 위한 적응적 모델 갱신과 사전 문턱치 결정에 관한 연구)

진세훈;이재희;강철호
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.5
- /
- pp.20-26
- /
- 2000
In speaker verification system the HMM(hidden Markov model) parameter updating using small amount of data and the priori threshold decision are crucial factor for dealing with long-term variability in people voices. In the paper we present the speaker model updating technique which can be adaptable to the session-to-intra speaker variability and the priori threshold determining technique. The proposed technique decreases verification error rates which the session-to-session intra-speaker variability can bring by adapting new speech data to speaker model parameter through Baum Welch re-estimation. And in this study the proposed priori threshold determining technique is decided by a hybrid score measurement which combines the world model based technique and the cohen model based technique together. The results show that the proposed technique can lead a better performance and the difference of performance is small between the posteriori threshold decision based approach and the proposed priori threshold decision based approach.
PDF

Bayesian estimates of genetic parameters of non-return rate and success in first insemination in Japanese Black cattle

Setiaji, Asep;Arakaki, Daichi;Oikawa, Takuro
- Animal Bioscience
- /
- v.34 no.7
- /
- pp.1100-1104
- /
- 2021
Objective: The objective of present study was to estimate heritability of non-return rate (NRR) and success of first insemination (SFI) by using the Bayesian approach with Gibbs sampling. Methods: Heifer Traits were denoted as NRR-h and SFI-h, and cow traits as NRR-c and SFI-c. The variance covariance components were estimated using threshold model under Bayesian procedures THRGIBBS1F90. Results: The SFI was more relevant to evaluating success of insemination because a high percentage of animals that demonstrated no return did not successfully conceive in NRR. Estimated heritability of NRR and SFI in heifers were 0.032 and 0.039 and the corresponding estimates for cows were 0.020 and 0.027. The model showed low values of Geweke (p-value ranging between 0.012 and 0.018) and a low Monte Carlo chain error, indicating that the amount of a posteriori for the heritability estimate was valid for binary traits. Genetic correlation between the same traits among heifers and cows by using the two-trait threshold model were low, 0.485 and 0.591 for NRR and SFI, respectively. High genetic correlations were observed between NRR-h and SFI-h (0.922) and between NRR-c and SFI-c (0.954). Conclusion: SFI showed slightly higher heritability than NRR but the two traits are genetically correlated. Based on this result, both two could be used for early indicator for evaluate the capacity of cows to conceive.
https://doi.org/10.5713/ajas.20.0150 인용 PDF KSCI

Ordinal Measure of DCT Coefficients for Image Correspondence and Its Application to Copy Detection

Changick Kim
- Journal of Broadcast Engineering
- /
- v.7 no.2
- /
- pp.168-180
- /
- 2002
This paper proposes a novel method to detect unauthorized copies of digital images. This copy detection scheme can be used as either an alternative approach or a complementary approach to watermarking. A test image is reduced to 8$\times$8 sub-image by intensity averaging, and the AC coefficients of its discrete cosine transform (DCT) are used to compute distance from those generated from the query image, of which a user wants to find copies. Copies may be Processed to avoid copy detection or enhance image quality. We show ordinal measure of DCT coefficients, which is based on relative ordering of AC magnitude values and using distance metrics between two rank permutations, are robust to various modifications of the original image. The optimal threshold selection scheme using the maximum a posteriori (MAP) criterion is also addressed.
PDF KSCI

Search Result 13, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)