Search | Korea Science

Recognition of Conducting Motion using HMM (HMM을 이용한 지휘 동작의 인식)

문형득;구자영
- Journal of the Korea Society of Computer and Information / v.9 no.1 / pp.25-30 / 2004
In this paper, a method is proposed for recognizing beats from a sequence of images of a person conducting. Hand position is detected using color discrimination and symbolized by quantization, so that the conductor's motion is represented as a sequence of symbols. A hidden Markov model (HMM), which is well suited to recognizing sequence patterns with some level of variation, is then used to recognize the symbol sequence as the motion for a beat.
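The pipeline described in this abstract (quantize hand positions into symbols, then score the symbol sequence with an HMM) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the uniform grid quantizer, the function names `quantize` and `forward_log_likelihood`, and all parameter values are assumptions made for the example.

```python
import math

def quantize(points, grid=3, width=1.0, height=1.0):
    """Map (x, y) hand positions to symbols 0..grid*grid-1 on a uniform grid.

    The uniform grid is an assumption; the abstract only says positions
    are 'symbolized by quantization'.
    """
    symbols = []
    for x, y in points:
        col = max(0, min(int(x / width * grid), grid - 1))
        row = max(0, min(int(y / height * grid), grid - 1))
        symbols.append(row * grid + col)
    return symbols

def forward_log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm for a discrete HMM: returns log P(obs | model).

    start[i]    : initial probability of state i
    trans[i][j] : probability of moving from state i to state j
    emit[i][k]  : probability that state i emits symbol k
    """
    if not obs:
        return 0.0
    n = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    log_p = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs[t]]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence impossible under this model
        log_p += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_p
```

In a recognizer of this kind, one model would typically be trained per beat pattern, and the pattern whose model gives the highest log-likelihood for the observed symbol sequence would be chosen.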

Condition Monitoring Of Rotating Machine With Mass Unbalance Using Hidden Markov Model (은닉 마르코프 모델을 이용한 질량 편심이 있는 회전기기의 상태진단)

Ko, Jungmin;Choi, Chankyu;Kang, To;Han, Soonwoo;Park, Jinho;Yoo, Honghee
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2014.10a / pp.833-834 / 2014
In recent years, pattern recognition methods have been widely used by researchers for fault diagnosis of mechanical systems. A pattern recognition method determines the soundness of a mechanical system by detecting variations in the system's vibration characteristics. The hidden Markov model (HMM) has recently been used as a pattern recognition method in various fields. In this study, an HMM method for the fault diagnosis of a mechanical system is introduced, and a rotating machine with mass unbalance is selected for fault diagnosis. Moreover, a diagnosis procedure to identify the size of a defect is proposed.

Fault Diagnosis of an Electric Tool using Automaton (거동 반응을 이용한 전동공구 고장진단)

Lee, Seung-Mock;Choi, Yeon-Sun
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2006.05a / pp.1328-1333 / 2006
For fault diagnosis of machines and equipment, knowledge-based methods have been widely used but have limitations for complex systems. These limitations can be covered by model-based methods. As one kind of model-based method, a qualitative modeling diagnosis method is developed in this research. The developed method uses the output signal only. In this method, quantization of the output signal makes automata that characterize the flow of the signal pattern for the normal and fault conditions respectively. As an example of the qualitative diagnosis method, an electric tool with faults in its gear and bearing was examined. The results show that the developed method can diagnose the fault clearly for the two fault cases.

A Video Traffic Model based on the Shifting-Level Process (Part II : An Efficient Analysis Method for SL/D/1/K Queueing System) (Shifting-Level Process에 기반한 영상트래픽 모델(2부: SL/D/1/K 대기체계 분석 방법))

안희준;김재균
- The Journal of Korean Institute of Communications and Information Sciences / v.24 no.10B / pp.1979-1985 / 1999
In this paper, we offer an analysis method for the SL/D/1/K queueing system, where the input is the shifting-level (SL) process proposed in Part I of this study [1]. Since an exact analysis of the SL/D/1/K queueing system is very difficult, we propose an approximation method in which the queue sizes at input state transition epochs are quantized, hence the name 'quantization reduction method'. We also provide upper and lower bounds of the approximation for the system size distribution. In addition, since the continuous version of the well-known DAR(1) model is a kind of SL process with an exponential correlation term only, the proposed method can be directly applied to the analysis of the DAR(1)/D/1/K queueing system as well.

Isolated Digit and Command Recognition in Car Environment (자동차 환경에서의 단독 숫자음 및 명령어 인식)

양태영;신원호;김지성;안동순;이충용;윤대희;차일환
- The Journal of the Acoustical Society of Korea / v.18 no.2 / pp.11-17 / 1999
This paper proposes an observation probability smoothing technique for improving the robustness of a speech recognizer based on the discrete hidden Markov model (DHMM). In addition, appropriate noise-robust processing for the car environment is suggested from experimental results. Noisy speech is often mislabeled during the vector quantization process. To reduce the effects of such mislabelings, the proposed technique increases the observation probability of similar codewords. For noise-robust processing in the car environment, liftering on the distance measure of feature vectors, high-pass filtering, and spectral subtraction are examined. Recognition experiments were performed on 14 isolated words consisting of Korean digits and command words. The database was recorded in both a stationary car and a running car. The recognition rates of the baseline recognizer were 97.4% in the stationary situation and 59.1% in the running situation. Using the proposed observation probability smoothing technique together with liftering, high-pass filtering, and spectral subtraction, the recognition rates were enhanced to 98.3% in the stationary situation and to 88.6% in the running situation.
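The idea of raising the observation probability of similar codewords can be illustrated with a short sketch. The abstract does not give the smoothing formula, so the exponential similarity kernel, the temperature `tau`, and the function name `smooth_observation_probs` below are assumptions chosen for illustration only.

```python
import math

def smooth_observation_probs(b, codebook, tau=1.0):
    """Spread each codeword's observation probability onto similar codewords.

    b        : list of P(codeword k | state) for one HMM state
    codebook : list of codeword vectors (same order as b)
    tau      : hypothetical kernel width; larger values smooth more

    Each codeword j shares its probability mass with codeword k in
    proportion to exp(-distance(j, k) / tau). The sharing weights for
    each j sum to 1, so the smoothed distribution still sums to 1.
    """
    K = len(b)
    smoothed = [0.0] * K
    for j in range(K):
        weights = [
            math.exp(-math.dist(codebook[j], codebook[k]) / tau) for k in range(K)
        ]
        total = sum(weights)
        for k in range(K):
            smoothed[k] += b[j] * weights[k] / total
    return smoothed
```

With this kind of smoothing, a frame that noise pushes onto a neighboring codeword no longer receives a near-zero observation probability, which is the effect the abstract describes.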

Codeword-Dependent Distance Normalization and Smoothing of Output Probabilities Based on the Instar-formed Fuzzy Contribution in the FVQ-DHMM (퍼지양자화 은닉 마르코프 모델에서 코드워드 종속거리 정규화와 Instar 형태의 퍼지 기여도에 기반한 출력확률의 평활화)

Choi, Hwan-Jin;Kim, Yeon-Jun;Oh, Yung-Hwan
- The Journal of the Acoustical Society of Korea / v.16 no.2 / pp.71-79 / 1997
In this paper, codeword-dependent distance normalization (CDDN) and an instar-formed fuzzy smoothing of the output distribution are proposed for robust estimation of output probabilities in the FVQ-DHMM (fuzzy vector quantization, discrete hidden Markov model). The FVQ-DHMM is a variant of the DHMM in which the state output probability for an input vector is estimated as the sum, over codewords, of the product of each codeword's output probability and its weighting factor. As the performance of the FVQ-DHMM is influenced by the weighting factors and the output distribution of a state, a method is needed for robust estimation of both. In the experiments, the proposed CDDN method reduced the error rate by 24% relative to the conventional FVQ-DHMM, and by 79% when the smoothing of the output distribution was also applied to the computation of the output probability. These results indicate that applying CDDN and fuzzy smoothing of the output distribution to the FVQ-DHMM leads to improved recognition, and that the approach may therefore be used as an alternative for robust estimation of output probabilities in HMMs.
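The FVQ-DHMM output probability described above, a weighted sum over codewords, can be sketched as follows. The weighting factors here use the standard fuzzy c-means membership formula, which is an assumption; the paper's CDDN normalization and instar-formed smoothing are not reproduced. The function names and the fuzziness parameter `m` are illustrative.

```python
import math

def fuzzy_memberships(x, codebook, m=2.0):
    """Fuzzy c-means style membership of input vector x in each codeword.

    u_k = d_k^(-2/(m-1)) / sum_j d_j^(-2/(m-1)), where d_k is the distance
    from x to codeword k. This is the textbook form, used as a stand-in.
    """
    dists = [math.dist(x, c) for c in codebook]
    for k, d in enumerate(dists):
        if d == 0.0:
            # x coincides with a codeword: full membership in that codeword
            return [1.0 if j == k else 0.0 for j in range(len(codebook))]
    exponent = 2.0 / (m - 1.0)
    inv = [d ** -exponent for d in dists]
    total = sum(inv)
    return [v / total for v in inv]

def fvq_output_prob(x, codebook, b_state, m=2.0):
    """State output probability: sum over codewords of weight * output prob."""
    u = fuzzy_memberships(x, codebook, m)
    return sum(u_k * b_k for u_k, b_k in zip(u, b_state))
```

For an input exactly halfway between two codewords, the two memberships are equal and the output probability is the average of the two codeword probabilities.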

The Effect of the Number of Phoneme Clusters on Speech Recognition (음성 인식에서 음소 클러스터 수의 효과)

Lee, Chang-Young
- The Journal of the Korea institute of electronic communication sciences / v.9 no.11 / pp.1221-1226 / 2014
In an effort to improve the efficiency of speech recognition, we investigate the effect of the number of phoneme clusters. For this purpose, codebooks with varying numbers of phoneme clusters are prepared by a modified k-means clustering algorithm. The subsequent processing uses fuzzy vector quantization (FVQ) and a hidden Markov model (HMM) for the speech recognition test. The results show two distinct regimes. For a large number of phoneme clusters, the recognition performance is roughly independent of that number. For a small number of phoneme clusters, however, the recognition error rate increases nonlinearly as the number is decreased. From numerical calculation, it is found that this nonlinear regime might be modeled by a power law function. The results also show that about 166 phoneme clusters would be the optimal number for recognition of 300 isolated words. This amounts to roughly 3 variations per phoneme.
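A power-law model of the kind mentioned in this abstract can be fitted by linear regression in log-log space. The sketch below assumes the form error ≈ a · N^b (with b negative in the small-N regime); the function name `fit_power_law` is illustrative, and any data passed to it would have to come from the paper or from one's own experiments, since none are given here.

```python
import math

def fit_power_law(ns, errs):
    """Least-squares fit of errs ~ a * ns**b, done on log-transformed data.

    ns   : numbers of phoneme clusters (all > 0)
    errs : corresponding error rates (all > 0)
    Returns (a, b).
    """
    xs = [math.log(n) for n in ns]
    ys = [math.log(e) for e in errs]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx                 # slope in log-log space is the exponent
    a = math.exp(mean_y - b * mean_x)
    return a, b
```

Fitting only the points in the small-cluster regime, and checking that the residuals stay small there, is one way to test whether a power law is a reasonable description.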
https://doi.org/10.13067/JKIECS.2014.9.11.1221

Implementation of FPGA-based Accelerator for GRU Inference with Structured Compression (구조적 압축을 통한 FPGA 기반 GRU 추론 가속기 설계)

Chae, Byeong-Cheol
- Journal of the Korea Institute of Information and Communication Engineering / v.26 no.6 / pp.850-858 / 2022
To deploy Gated Recurrent Units (GRU) on resource-constrained embedded devices, this paper presents a reconfigurable FPGA-based GRU accelerator that enables structured compression. Firstly, a dense GRU model is significantly reduced in size by hybrid quantization and structured top-k pruning. Secondly, the energy consumption of external memory access is greatly reduced by the proposed reuse computing pattern. Finally, the accelerator can handle a structured sparse model that benefits from the algorithm-hardware co-design workflow. Moreover, inference tasks can be flexibly performed using all functional dimensions, sequence length, and number of layers. Implemented on the Intel DE1-SoC FPGA, the proposed accelerator achieves 45.01 GOPs in a structured sparse GRU network without batching. Compared to CPU and GPU implementations, the low-cost FPGA accelerator achieves 57x and 30x improvements in latency and 300x and 23.44x improvements in energy efficiency, respectively. Thus, the proposed accelerator serves as an early study of real-time embedded applications, demonstrating the potential for further development in the future.
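Structured top-k pruning can be illustrated with a small sketch. The abstract does not specify the pruning granularity, so the version below assumes the simplest structured variant: keep the k largest-magnitude weights in every row, so that each row has the same number of nonzeros. The function name `topk_prune_rows` is hypothetical.

```python
def topk_prune_rows(weights, k):
    """Keep the k largest-magnitude entries in each row; zero the rest.

    weights : list of rows (lists of floats)
    k       : number of weights kept per row (assumed granularity)

    Every row ends up with at most k nonzeros, which gives hardware a
    fixed, predictable workload per row.
    """
    pruned = []
    for row in weights:
        order = sorted(range(len(row)), key=lambda i: abs(row[i]), reverse=True)
        keep = set(order[:k])
        pruned.append([w if i in keep else 0.0 for i, w in enumerate(row)])
    return pruned
```

A fixed nonzero count per row is what makes such a scheme "structured": the sparse matrix can be stored as k values plus k column indices per row, with no irregular row lengths.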
https://doi.org/10.6109/jkiice.2022.26.6.850

Joint Rate Control Scheme for Terrestrial Stereoscopic 3DTV Broadcast (스테레오스코픽 3차원 지상파 방송을 위한 합동 비트율 제어 연구)

Chang, Yongjun;Kim, Munchurl
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2010.11a / pp.14-17 / 2010
Following the proliferation of three-dimensional video content and displays, many terrestrial broadcasting companies are preparing to start stereoscopic 3DTV service. In terrestrial stereoscopic broadcast, it is difficult to code and transmit two video sequences while sustaining quality as high as that of 2DTV broadcast, because of the limited bandwidth defined by existing digital TV standards such as ATSC. Thus, a terrestrial 3DTV broadcasting system with heterogeneous video coding systems is considered, in which the left and right images are coded with MPEG-2 and H.264/AVC, respectively, in order to achieve both high-quality broadcasting service and compatibility for existing 2DTV viewers. Without significant change to the current terrestrial broadcasting systems, we propose a joint rate control scheme for stereoscopic 3DTV service. The proposed scheme applies to the MPEG-2 encoder the quadratic rate-quantization model adopted in H.264/AVC. The controller is then designed so that the sum of the two bit streams meets the bandwidth requirement of the broadcasting standard, while the sum of image distortions is minimized by adjusting the quantization parameter computed from the proposed optimization scheme. We also consider a condition on the quality difference between the left and right images in the optimization. Experimental results demonstrate that the proposed bit rate control scheme outperforms the method in which each video coding standard uses its own bit rate control algorithm, in terms of minimizing the mean image distortion as well as the mean value and the variation of absolute image quality differences.
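The quadratic rate-quantization model mentioned in this abstract is commonly written as R = X1·MAD/Q + X2·MAD/Q², where Q is the quantization step, MAD is a complexity measure of the frame, and X1, X2 are model parameters. The sketch below evaluates that model and inverts it for a target bit budget. It covers a single encoder only; the joint optimization over the MPEG-2 and H.264/AVC streams is not reproduced, and the function names and parameter values are assumptions.

```python
import math

def rate_from_qstep(qstep, mad, x1, x2):
    """Quadratic R-Q model: predicted texture bits for quantization step qstep."""
    return x1 * mad / qstep + x2 * mad / (qstep * qstep)

def qstep_for_target(target_bits, mad, x1, x2):
    """Invert the quadratic R-Q model for the quantization step.

    With u = 1/Q the model becomes (x2*mad)*u^2 + (x1*mad)*u - target = 0;
    the positive root is taken. Falls back to the linear model when x2 == 0.
    """
    if target_bits <= 0 or mad <= 0:
        raise ValueError("target_bits and mad must be positive")
    if x2 == 0:
        return x1 * mad / target_bits
    a = x2 * mad
    b = x1 * mad
    disc = b * b + 4.0 * a * target_bits
    u = (-b + math.sqrt(disc)) / (2.0 * a)
    return 1.0 / u
```

In a rate controller, X1 and X2 would be re-estimated from previously coded frames, and the resulting step size would be mapped to the codec's discrete quantization parameter and clipped to its valid range.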

The First Quantization Parameter Decision Algorithm for the H.264/AVC Encoder (H.264/AVC를 위한 초기 Quantization Parameter 결정 알고리즘)

Kwon, Soon-Young;Lee, Sang-Heon;Lee, Dong-Ha
- Journal of KIISE:Information Networking / v.35 no.3 / pp.235-242 / 2008
To improve video quality and coding efficiency, H.264/AVC adopted adaptive rate control. However, this method cannot predict an accurate quantization parameter (QP) for the first frame. The first QP is chosen from four constant values using encoder input parameters. It does not consider the encoded bits, which results in significant fluctuation of the image quality and decreases the average quality of the whole coded sequence. In this paper, we propose a new algorithm for the first-frame QP decision in the H.264/AVC encoder. The QP is first decided by the existing algorithm and the first frame is encoded. According to the encoded bits, a new initial QP is decided. We can predict the optimal value because there is a linear relationship between the encoded bits and the new initial QP. Next, we re-encode the first frame using the new initial QP. Experimental results show that the proposed algorithm not only achieves better quality than the state-of-the-art algorithm, but also enables rate control for sequences on which the existing algorithm could not. By reducing fluctuation, subjective quality is also improved.

