• Title/Summary/Keyword: Zero crossings

Search Result 26, Processing Time 0.022 seconds

Auditory Representations for Robust Speech Recognition in Noisy Environments (잡음 환경에서의 음성 인식을 위한 청각 표현)

  • Kim, Doh-Suk;Lee, Soo-Young;Kil, Rhee-M.
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.5
    • /
    • pp.90-98
    • /
    • 1996
  • An auditory model is proposed for robust speech recognition in noisy environments. The model consists of cochlear bandpass filters and nonlinear stages, and represents frequency and intensity information efficiently even in noisy environments. Frequency information of the signal is obtained by zero-crossing intervals, and intensity information is also incorporated by peak detectors and saturating nonlinearities. Also, the robustness of the zero-crossings in estimating frequency is verified by the developed analytic relationship of the variance of the level-crossing interval perturbations as a function of the crossing level values. The proposed auditory model is computationally efficient and free from many unknown parameters compared with other auditory models. Speaker-independent speech recognition experiments demonstrate the robustness of the proposed method.

  • PDF

A Study on the Start-up Control for HDD Spindle Motors (HDD 스핀들 모터의 초기 구동 제어에 관한 연구)

  • Jeong, Jun
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.18 no.10
    • /
    • pp.1065-1072
    • /
    • 2008
  • A HDD adopts a sensorless brushless DC (BLDC) motor as a spindle motor. Because there is no direct sensor measuring rotor position. open loop commutations with inductive sensing are used to increase the rotor speed up to a certain speed where the zero crossings of the back electromotive force (EMF) voltage are measurable. Therefore, successful open loop commutations are necessary for the stable start-up control of the spindle motors. In this paper, the time scale and the number of the open loop commutations are employed for design parameters to guarantee robustness to torque constant variation and initial rotor position. The design results are verified by experiments on a very low current start-up of the spindle motor with various environment. The experimental results show that the design results can decrease the start-up failure rate considerably.

A Comparison of Front-Ends for Robust Speech Recognition

  • Kim, Doh-Suk;Jeong, Jae-Hoon;Lee, Soo-Young;Kil, Rhee M.
    • The Journal of the Acoustical Society of Korea
    • /
    • v.17 no.3E
    • /
    • pp.3-11
    • /
    • 1998
  • Zero-crossings with Peak amplitudes (ZCPA) model motivated by human auditory periphery was proposed to extract reliable features form speech signals even in noisy environments for robust speech recognition. In this paper, the performance of the ZCPA model is further improved by incorporating conventional speech processing techniques into the model output. Spectral and cepstral representations of the ZCPA model output are compared, and the incorporation of dynamic features with several different lengths of time-derivative window are evaluated. Also, comparative evaluations with other front-ends in real-world noisy environments are performed, and result in the superiority of the ZCPA model.

  • PDF

The Comparison of features for Speech/Music Discrimination (음성/음악 분류를 위한 특징 비교)

  • Lee Kyong Rok;Seo Bong Su;Kim Jin Young
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.157-160
    • /
    • 2000
  • 본 논문에서는 멀티미디어 정보에서 원하는 정보를 추출하는 멀티미디어 인덱싱 중 오디오 인덱싱의 전처리 부격인 음성/음악 분류실험을 하였다. 오디오 인덱싱에 있어서 음성/음악 분류기는 원 오디오 신호에서 정보를 가진 음성 부분을 분리하는 역할을 한다. 실험에서는 음성/음악 분류에서 널리 쓰이는 멜캡스트럼(Mel Cepstrum), 정규화 로그 에너지(normalized log energy), 영교차(Zero-Crossings)를 특징 파라미터로 사용하였다[l, 2, 3]. 특징공간은 GMM(Gaussian Mixture Model)에 의해 모델링 되었고, 오디오 신호의 분류는 각각 3가지 분류항목(음성, 음악, 음성+음악)과 2가지 분류항목(음성, 음악)을 적용하였다. 실험결과 3가지 분류항목 적용시와 2가지 분류항목 적용시 모두 멜캡스트럼을 사용하였을 때 가장 좋은 결과를 보였다.

  • PDF

Practical Considerations for Hardware Implementations of the Auditory Model and Evaluations in Real World Noisy Environments

  • Kim, Doh-Suk;Jeong, Jae-Hoon;Lee, Soo-Young;Kil, Rhee M.
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.1E
    • /
    • pp.15-23
    • /
    • 1997
  • Zero-Crossings with Peak Amplitudes(ZCPA) model motivated by human auditory periphery was proposed to extract reliable features speech signals even in noisy environments for robust speech recognition. In this paper, some practical considerations for digital hardware implementations of the ZCPA model are addressed and evaluated for recognition of speech corrupted by several real world noises as well as white Gaussian noise. Infinite impulse response(IIR) filters which constitute the cochliar filterbank of the ZCPA are replaced by hamming bandpass filters of which frequency responses are less similar to biological neural tuning curves. Experimental results demonstrate that the detailed frequency response of the cochlear filters are not critical to performance. Also, the sensitivity of the model output to the variations in microphone gain is investigated, and results in good reliability of the ZCPA model.

  • PDF

Simplification of Transfer Function Via Walsh Function in Frequency Domain (주파수 영역에서 Walsh 함수에 의한 전달함수의 간단화)

  • Doo-Soo Ahn
    • The Transactions of the Korean Institute of Electrical Engineers
    • /
    • v.31 no.8
    • /
    • pp.33-38
    • /
    • 1982
  • This paper deals with the simplification of the transfer function in a frequency domain, viz. the integral of the squared errors between the original and the simplified model is minimized and the latter is estimated by the Walsh function. It tries to minimize the errors between the frequency responses of the two functions. This method is compared with the existing method by means of a numercal example. The frequency response of this simplified model approximates closely to that of the original model. The proposed method is simpler in analysis and easier in implementation than the existing methods. Though the Walsh function can be easily generated with the discrete values, it has errors because its zero crossings are not continuous. This method aims at the reduction of the errors in the real parts and the imaginary parts of the two functions by dividing into the more sub-intervals, and selecting the reduced-order model according to the response of the model. As a result, it can be applied for the simplification of higher order functions into lower order functions and for the design of control systems.

  • PDF

Performance Improvement of a Pedestrian Dead Reckoning System using a Low Cost IMU (저가형 관성센서를 이용한 보행자 관성항법 시스템의 성능 향상)

  • Kim, Yun-Ki;Park, Jae-Hyun;Kwak, Hwy-Kuen;Park, Sang-Hoon;Lee, ChoonWoo;Lee, Jang-Myung
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.19 no.6
    • /
    • pp.569-575
    • /
    • 2013
  • This paper proposes a method for PDR (Pedestrian Dead-Reckoning) using a low cost IMU. Generally, GPS has been widely used for localization of pedestrians. However, GPS is disabled in the indoor environment such as in buildings. To solve this problem, this research suggests the PDR scheme with an IMU attached to the pedestrian's waist. However, despite the fact many methods have been proposed to estimate the pedestrian's position, but their results are not sufficient. One of the most important factors to improve performance is, a new calibration method that has been proposed to obtain the reliable sensor data. In addition to this calibration, the PDR method is also proposed to detect steps, where estimation schemes of step length, attitude, and heading angles are developed. Peak and zero crossings are detected to count the steps from 3-axis acceleration values. For the estimation of step length, a nonlinear step model is adopted to take advantage of using one parameter. Complementary filter and zero angular velocity are utilized to estimate the attitude of the IMU module and to minimize the heading angle drift. To verify the effectiveness of this scheme, a real-time system is implemented and demonstrated. Experimental results show an accuracy of below 1% and below 3% in distance and position errors, respectively, which can be achievable using a high cost IMU.

Automated Vessels Detection on Infant Retinal Images

  • Sukkaew, Lassada;Uyyanonvara, Bunyarit;Barman, Sarah A;Jareanjit, Jaruwat
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.321-325
    • /
    • 2004
  • Retinopathy of Prematurity (ROP) is a common retinal neovascular disorder of premature infants. It can be characterized by inappropriate and disorganized vessel. This paper present a method for blood vessel detection on infant retinal images. The algorithm is designed to detect the retinal vessels. The proposed method applies a Lapalacian of Gaussian as a step-edge detector based on the second-order directional derivative to identify locations of the edge of vessels with zero crossings. The procedure allows parameters computation in a fixed number of operations independent of kernel size. This method is composed of four steps : grayscale conversion, edge detection based on LOG, noise removal by adaptive Wiener filter & median filter, and Otsu's global thresholding. The algorithm has been tested on twenty infant retinal images. In cooperation with the Digital Imaging Research Centre, Kingston University, London and Department of Opthalmology, Imperial College London who supplied all the images used in this project. The algorithm has done well to detect small thin vessels, which are of interest in clinical practice.

  • PDF

A Walsh-Based Distributed Associative Memory with Genetic Algorithm Maximization of Storage Capacity for Face Recognition

  • Kim, Kyung-A;Oh, Se-Young
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09a
    • /
    • pp.640-643
    • /
    • 2003
  • A Walsh function based associative memory is capable of storing m patterns in a single pattern storage space with Walsh encoding of each pattern. Furthermore, each stored pattern can be matched against the stored patterns extremely fast using algorithmic parallel processing. As such, this special type of memory is ideal for real-time processing of large scale information. However this incredible efficiency generates large amount of crosstalk between stored patterns that incurs mis-recognition. This crosstalk is a function of the set of different sequencies [number of zero crossings] of the Walsh function associated with each pattern to be stored. This sequency set is thus optimized in this paper to minimize mis-recognition, as well as to maximize memory saying. In this paper, this Walsh memory has been applied to the problem of face recognition, where PCA is applied to dimensionality reduction. The maximum Walsh spectral component and genetic algorithm (GA) are applied to determine the optimal Walsh function set to be associated with the data to be stored. The experimental results indicate that the proposed methods provide a novel and robust technology to achieve an error-free, real-time, and memory-saving recognition of large scale patterns.

  • PDF

Edge Detection Using the Information of Edge Structural Regions (에지의 구조적 영역정보를 이용한 에지검출)

  • 김수겸;박중순;최정희
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.24 no.2
    • /
    • pp.82-89
    • /
    • 2000
  • Edge detection is the first step and very important step in image analysis. In this paper, proposed edge detection operators based on informations of edge types and it is different from other classical edge detection operators such as gradient and surface fitting operators. The first, we defined characteristics of edge types such as localization, thinness, length. The second, we defined valid edge types and ideal edge pixel positions in $3\times3$window based on edge characteristics of edge types. And we proposed edge detection algorithm and twelve windows based on valid edge types. In specially, proposed algorithm was shown better performence of edge detection than other operators such as gradient operator and the LoG(Laplacian of Gaussian) operator of zero crossings.

  • PDF