• Title/Abstract/Keywords: Recognition Enhancement

362 search results (processing time: 0.028 s)

On Effective Dual-Channel Noise Reduction for Speech Recognition in Car Environment

  • Ahn, Sung-Joo;Kang, Sun-Mee;Ko, Han-Seok
    • 음성과학
    • /
    • Vol. 11, No. 1
    • /
    • pp.43-52
    • /
    • 2004
  • This paper presents an effective dual-channel noise reduction method for improving speech recognition performance in a car environment. While various single-channel methods have already been developed and dual-channel methods have been studied to some extent, their effectiveness in real environments, such as cars, has not yet been formally shown to reach an acceptable performance level. Our aim is to remedy the low performance of existing single- and dual-channel noise reduction methods. This paper proposes an effective dual-channel noise reduction method based on a high-pass filter and front-end processing with the eigendecomposition method. We experimented with a real multi-channel car database and compared the results across microphone arrangements. From the analysis and results, we show that the enhanced eigendecomposition method combined with a high-pass filter significantly improves speech recognition performance in a dual-channel environment.


Recognition of Contact Surfaces Using Optical Tactile and F/T Sensors Integrated by Fuzzy Fusion Algorithm

  • 고동환;한헌수
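    • 제어로봇시스템학회: Conference Proceedings
    • /
    • 제어로봇시스템학회, Proceedings of the 1996 Korea Automatic Control Conference (Domestic Session); Pohang University of Science and Technology, Pohang; 24-26 Oct. 1996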
    • /
    • pp.628-631
    • /
    • 1996
  • This paper proposes a surface recognition algorithm that determines the type of a contact surface by fusing the information collected by a multisensor system consisting of optical tactile and force/torque (F/T) sensors. Because the image shape measured by the optical tactile sensor, which is used to determine the surface type, varies with the force applied at the moment of measurement, the force information measured by the F/T sensor plays an important role. In this paper, an image contour is represented by its long and short axes, and each axis is fuzzified individually by a membership function formulated by observing how the axis lengths vary with the applied force. The fuzzified values of the long and short axes are fused using the average Minkowski distance. Compared with the case where only the contour information is used, the proposed algorithm improved the recognition ratio by about 14%. In particular, when the optimal force determined by experiments was applied, the measured recognition ratio exceeded 91%.


Integrated Visual and Speech Parameters in Korean Numeral Speech Recognition

  • Lee, Sang-won;Park, In-Jung;Lee, Chun-Woo;Kim, Hyung-Bae
    • 대한전자공학회: Conference Proceedings
    • /
    • 대한전자공학회, 2000 ITC-CSCC -2
    • /
    • pp.685-688
    • /
    • 2000
  • In this paper, we use image information to enhance Korean numeral speech recognition. First, a noisy environment was simulated with a Gaussian noise generator at 10 dB steps, and the generated noise was added to the original Korean numeral speech. The noisy speech was then analyzed for recognition: speech captured by microphone was pre-emphasized with a coefficient of 0.95, and Hamming windowing, autocorrelation, and LPC analysis were applied. Second, the image obtained by a camera was converted to gray level, autocorrelated, and analyzed using the same LPC algorithm that was applied in the speech analysis. Finally, Korean numeral speech recognition with image information performed better than speech-only recognition, especially for ‘3’, ‘5’, and ‘9’. Because the same LPC algorithm and simple image handling were used, and no additional computation such as filtering was required, the overall speech recognition algorithm remained simple.
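
The speech analysis chain named in the abstract above (pre-emphasis with 0.95, Hamming window, autocorrelation, LPC) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the function names, the LPC order of 10, and the use of the Levinson-Durbin recursion to solve for the coefficients are assumptions, since the abstract does not state them.

```python
import math

def pre_emphasize(signal, coeff=0.95):
    """y[n] = x[n] - coeff * x[n-1]; 0.95 is the coefficient quoted in the abstract."""
    return [signal[0]] + [signal[n] - coeff * signal[n - 1]
                          for n in range(1, len(signal))]

def hamming(frame):
    """Apply a Hamming window (assumes the frame has at least 2 samples)."""
    size = len(frame)
    return [x * (0.54 - 0.46 * math.cos(2 * math.pi * n / (size - 1)))
            for n, x in enumerate(frame)]

def autocorrelation(frame, max_lag):
    """Biased autocorrelation r[0..max_lag] of one frame."""
    return [sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
            for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve the LPC normal equations (assumed solver, not named in the abstract).

    Returns (a, error) where a[0] = 1 and x[n] is predicted as
    -sum(a[j] * x[n-j] for j in 1..order).
    """
    a = [1.0] + [0.0] * order
    error = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / error                 # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        error *= (1.0 - k * k)           # remaining prediction error
    return a, error

def lpc(frame, order=10):
    """Hypothetical wrapper: pre-emphasis -> window -> autocorrelation -> LPC."""
    windowed = hamming(pre_emphasize(frame))
    return levinson_durbin(autocorrelation(windowed, order), order)
```

Because the abstract applies the same autocorrelation and LPC steps to the gray-level image, a row or column of pixel intensities could presumably be passed through `autocorrelation` and `levinson_durbin` in the same way, although how the image is arranged into a sequence is not described.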


A Study on Quality System Management in Small and Medium Enterprises

  • 박노국
    • 한국산업정보학회논문지
    • /
    • Vol. 10, No. 4
    • /
    • pp.120-127
    • /
    • 2005
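  • This paper studies the quality management approaches that small and medium enterprises in Gangwon Province currently implement to raise their competitiveness. The results show that these enterprises are most interested in customer-centered quality management, in activities for automation, new technology, and process improvement, and in obtaining ISO 9000 certification, followed by 5S-based factory rationalization and suggestion systems. The enterprises studied were found to be striving to secure a market advantage over competitors, as well as price competitiveness, by providing products and services aimed at customer satisfaction.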


Recognition of GUI Widgets Utilizing Translational Embeddings based on Relational Learning

  • 박민수;석호식
    • 전기전자학회논문지
    • /
    • Vol. 22, No. 3
    • /
    • pp.693-699
    • /
    • 2018
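  • CNN-based object recognition is reported to perform very well, but when it is applied to an environment such as mobile app GUIs, which would generally be expected to contain little noise and be clearly recognizable, GUI input widgets that look very similar from a human perspective turn out to be recognized surprisingly poorly. This paper proposes a method that exploits the relationships among the objects composing a mobile app's GUI to improve CNN recognition of input widgets. The proposed method (1) first recognizes the input widgets of a mobile app with Faster R-CNN, a CNN-based object recognition tool, and then (2) combines this with a method that uses the relationships between objects to improve the widget recognition rate. Relationships between objects are represented as translations of vectors in the representation space. Applying the method to data generated from a total of 323 apps confirmed that the widget recognition rate can be improved considerably compared with using Faster R-CNN alone.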
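
The idea of representing a relationship between objects as a vector translation can be illustrated with a TransE-style score, in which a triple (head, relation, tail) is considered plausible when head + relation lands close to tail. The sketch below assumes that formulation. The abstract does not give the exact model, distance, or loss, so the function names, the L2 distance, the margin value, and the toy embeddings are all hypothetical.

```python
import math

def translation_score(head, relation, tail):
    """L2 distance ||head + relation - tail||; smaller means more plausible."""
    return math.sqrt(sum((h + r - t) ** 2
                         for h, r, t in zip(head, relation, tail)))

def margin_loss(positive_score, negative_score, margin=1.0):
    """Margin ranking loss often used to train translational embeddings.

    It is zero once the true triple scores at least `margin` better
    (lower) than the corrupted triple.
    """
    return max(0.0, margin + positive_score - negative_score)

# Toy 2-D embeddings (hypothetical values, for illustration only).
label = [1.0, 0.0]        # e.g. a text label widget
left_of = [0.0, 1.0]      # e.g. a "label is left of input" relation
text_field = [1.0, 1.0]   # e.g. a text input widget
checkbox = [3.0, -1.0]    # an unrelated widget class

good = translation_score(label, left_of, text_field)
bad = translation_score(label, left_of, checkbox)
print(good, bad, margin_loss(good, bad))
```

In a pipeline like the one described, such a score could presumably be used to re-rank uncertain Faster R-CNN labels against neighboring widgets, but how the two stages are combined is not specified in the abstract.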

Enhancement of the Correctness of Marker Detection and Marker Recognition based on Artificial Neural Network

  • 강선경;김영운;소인미;정성태
    • 한국컴퓨터정보학회논문지
    • /
    • Vol. 13, No. 1
    • /
    • pp.89-97
    • /
    • 2008
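  • This paper proposes a method that uses artificial neural networks to improve the detection and recognition of rectangular markers. The method first finds object contours in the input image and approximates them with line segments. It then uses geometric features of the approximated segments to find quadrilaterals, and normalizes each quadrilateral image to a square using warping and scaling transforms. After normalization, principal component analysis is applied to reduce the size of the feature vector, and an artificial neural network checks whether the image is a marker image. For images judged to be markers, an artificial neural network then recognizes the marker type. Recognition experiments showed that using artificial neural networks reduced marker detection errors and improved recognition accuracy.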


Improved Melody Recognition Performance of a Cochlear Implant Speech Processing Strategy Using Instantaneous Frequency Encoding Based on Teager Energy Operator

  • Choi, Sung-Jin;Ryu, Sang-Baek;Kim, Kyung-Hwan
    • 대한의용생체공학회:의공학회지
    • /
    • Vol. 31, No. 6
    • /
    • pp.417-426
    • /
    • 2010
  • We present a speech processing strategy incorporating instantaneous frequency (IF) encoding to enhance the melody recognition performance of cochlear implants. To extract the IF from incoming sound, we propose the use of a Teager energy operator (TEO), which is advantageous for its lower computational load. From time-frequency analysis, we verified that the TEO-based method provides proper IF encoding of input sound, which is crucial for melody recognition. A similar benefit could also be obtained from a Hilbert transform (HT), but at a much higher computational cost. The melody recognition performance of the proposed speech processing strategy was compared with that of a conventional strategy using envelope extraction and with that of HT-based IF encoding. Hearing tests on normal subjects were performed using acoustic simulation and a musical contour identification task. No significant difference in melody recognition performance was observed between the TEO-based and HT-based IF encodings, and both were superior to the conventional strategy. However, the TEO-based strategy was advantageous in that it was approximately 35% faster than the HT-based strategy.
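
To show why a Teager energy operator is cheap to compute, here is a minimal sketch of the discrete TEO together with one well-known TEO-based frequency estimator (DESA-1a). The abstract does not say which estimator the authors used, so the choice of DESA-1a, the function names, and the handling of low-energy samples are assumptions.

```python
import math

def teager(x):
    """Discrete TEO: psi[n] = x[n]^2 - x[n-1] * x[n+1], for n = 1 .. len(x)-2."""
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

def instantaneous_frequency(x, sample_rate):
    """DESA-1a style estimate (assumed variant), in Hz.

    With y[n] = x[n] - x[n-1], a pure sinusoid of digital frequency W
    satisfies psi[y] / (2 * psi[x]) = 1 - cos(W), so
    W = acos(1 - psi[y] / (2 * psi[x])).
    """
    diff = [x[n] - x[n - 1] for n in range(1, len(x))]
    psi_x = teager(x)      # psi_x[i] belongs to sample n = i + 1
    psi_y = teager(diff)   # psi_y[i] belongs to sample n = i + 2
    freqs = []
    for i, py in enumerate(psi_y):
        px = psi_x[i + 1]  # align both energies on the same sample
        if px <= 0.0:
            freqs.append(0.0)  # energy too small for a meaningful estimate
            continue
        cos_w = max(-1.0, min(1.0, 1.0 - py / (2.0 * px)))
        freqs.append(math.acos(cos_w) * sample_rate / (2.0 * math.pi))
    return freqs

# Usage: a 440 Hz tone sampled at 8 kHz should be tracked at about 440 Hz.
tone = [math.cos(2 * math.pi * 440 * n / 8000) for n in range(64)]
estimates = instantaneous_frequency(tone, 8000)
print(min(estimates), max(estimates))
```

Each output sample needs only a few multiplications and one `acos`, whereas a Hilbert-transform approach needs an analytic-signal computation first, which is consistent with the computational advantage the abstract reports.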

Statistical Model-Based Noise Reduction Approach for Car Interior Applications to Speech Recognition

  • Lee, Sung-Joo;Kang, Byung-Ok;Jung, Ho-Young;Lee, Yun-Keun;Kim, Hyung-Soon
    • ETRI Journal
    • /
    • Vol. 32, No. 5
    • /
    • pp.801-809
    • /
    • 2010
  • This paper presents a statistical model-based noise suppression approach for voice recognition in a car environment. To alleviate the spectral whitening and signal distortion problems of the traditional decision-directed Wiener filter, we combine a decision-directed method with an original spectrum reconstruction method and develop a new two-stage noise reduction filter estimation scheme. When the tradeoff between performance and computational efficiency on resource-constrained automotive devices is considered, the ETSI standard advanced distributed speech recognition front-end (ETSI-AFE) can be an effective solution, and ETSI-AFE is also based on the decision-directed Wiener filter. Thus, a series of voice recognition and computational complexity tests are conducted to compare the proposed approach with ETSI-AFE. The experimental results show that the proposed approach is superior to the conventional method in terms of speech recognition accuracy, while the computational cost and frame latency are significantly reduced.
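
For context, the traditional decision-directed Wiener filter that the abstract takes as its baseline can be sketched as below. This is the textbook baseline only, not the proposed two-stage scheme and not ETSI-AFE itself. The function name, the smoothing factor of 0.98, and the assumption of a known, stationary noise power per frequency bin are illustrative choices, not details from the paper.

```python
def decision_directed_wiener(noisy_power, noise_power, alpha=0.98):
    """Per-bin Wiener gains with decision-directed a priori SNR estimation.

    noisy_power: list of frames, each a list of |Y(k)|^2 values per bin.
    noise_power: list of noise power estimates per bin (assumed known).
    alpha:       smoothing factor (0.98 is a common choice, assumed here).
    """
    prev_clean_power = [0.0] * len(noise_power)
    gains = []
    for frame in noisy_power:
        frame_gains = []
        for k, (y2, d2) in enumerate(zip(frame, noise_power)):
            posterior_snr = y2 / d2
            # Blend the previous frame's clean-speech estimate with the
            # instantaneous (maximum-likelihood) SNR estimate.
            prior_snr = (alpha * prev_clean_power[k] / d2
                         + (1.0 - alpha) * max(posterior_snr - 1.0, 0.0))
            gain = prior_snr / (1.0 + prior_snr)   # Wiener gain
            frame_gains.append(gain)
            prev_clean_power[k] = gain * gain * y2  # |S_hat(k)|^2 for next frame
        gains.append(frame_gains)
    return gains

# Usage: one bin, noise power 1.0; speech appears with power around 100.
print(decision_directed_wiener([[101.0], [101.0], [101.0]], [1.0]))
```

The gain rises over successive frames as the previous clean-speech estimate feeds back into the a priori SNR. That one-frame dependence is part of what a reduction in frame latency would have to address, although the abstract does not detail how the proposed scheme achieves it.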

The Effect of Idesolide on Hippocampus-dependent Recognition Memory

  • Lee, Hye-Ryeon;Choi, Jun-Hyeok;Lee, Nuribalhae;Kim, Seung-Hyun;Kim, Young-Choong;Kaang, Bong-Kiun
    • Animal cells and systems
    • /
    • Vol. 12, No. 1
    • /
    • pp.11-14
    • /
    • 2008
  • Finding a way to strengthen human cognitive functions, such as learning and memory, has been of great concern since people realized that these functions can be affected and even altered by certain chemicals. Since then, many attempts have been made to find safe ways of improving cognitive performance without adverse side effects. Unfortunately, most of these efforts have so far been unsuccessful. In this study, we examine the effect of a natural compound, idesolide, on hippocampus-dependent recognition memory. We demonstrate that idesolide is effective in enhancing recognition memory, as measured by a novel object recognition task. Thus, idesolide might serve as a novel therapeutic medication for the treatment of memory-related brain anomalies such as mild cognitive impairment (MCI) and Alzheimer's disease.

Image Enhancement for Two-dimension bar code PDF417

  • Park, Ji-Hue;Woo, Hong-Chae
    • 한국정보기술응용학회: Conference Proceedings
    • /
    • 한국정보기술응용학회, 2005, 6th 2005 International Conference on Computers, Communications and System
    • /
    • pp.69-72
    • /
    • 2005
  • As lifestyles have become more complicated, many supporting technologies have been developed, and bar code technology is one of them. It was an innovative approach for the goods industry. However, with industry growth, the data storage capacity of one-dimensional bar codes reached its limit, and two-dimensional bar codes were proposed to overcome it. PDF417, the most commonly used of the standard two-dimensional bar codes, is well defined in the areas of data decoding and error correction, but more work could be done on the bar code image acquisition process. By applying various image enhancement algorithms, the recognition rate of PDF417 bar codes is improved.
