• Title/Summary/Keyword: recognition-rate

Search Result 2,809, Processing Time 0.03 seconds

Microphone Type Classification for Digital Audio Forgery Detection (디지털 오디오 위조검출을 위한 마이크로폰 타입 인식)

  • Seok, Jongwon
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.3
    • /
    • pp.323-329
    • /
    • 2015
  • In this paper we applied pattern recognition approach to detect audio forgery. Classification of the microphone types and models can help determining the authenticity of the recordings. Canonical correlation analysis was applied to extract feature for microphone classification. We utilized the linear dependence between two near-silence regions. To utilize the advantage of multi-feature based canonical correlation analysis, we selected three commonly used features to capture the temporal and spectral characteristics. Using three different microphones, we tested the usefulness of multi-feature based characteristics of canonical correlation analysis and compared the results with single feature based method. The performance of classification rate was carried out using the backpropagation neural network. Experimental results show the promise of canonical correlation features for microphone classification.

Smoke Detection System Research using Fully Connected Method based on Adaboost

  • Lee, Yeunghak;Kim, Taesun;Shim, Jaechang
    • Journal of Multimedia Information System
    • /
    • v.4 no.2
    • /
    • pp.79-82
    • /
    • 2017
  • Smoke and fire have different shapes and colours. This article suggests a fully connected system which is used two features using Adaboost algorithm for constructing a strong classifier as linear combination. We calculate the local histogram feature by gradient and bin, local binary pattern value, and projection vectors for each cell. According to the histogram magnitude, this paper applied adapted weighting value to improve the recognition rate. To preserve the local region and shape feature which has edge intensity, this paper processed the normalization sequence. For the extracted features, this paper Adaboost algorithm which makes strong classification to classify the objects. Our smoke detection system based on the proposed approach leads to higher detection accuracy than other system.

A Study on the Recognition-Rate Improvement by the Keyword Spotting System using CM Algorithm (CM 알고리즘을 이용한 핵심어 검출 시스템의 인식률 향상에 관한 연구)

  • Won Jong-Moon;Lee Jung-Suk;Kim Soon-Hyob
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.81-84
    • /
    • 2001
  • 본 논문은 중규모 단어급의 핵심어 검출 시스템에서 인식률 향상을 위해 미등록어 거절(Out-of-Vocabulary rejection) 기능을 제어하기 위한 연구이다. 이것은 핵심어 검출기에서 인식된 결과를 확인하는 과정으로 검증시스템이 구현되기 위해서는 매 음소마다 검증 기능이 필요하고, 이를 위해서 반음소(anti-phoneme model) 모델을 사용하였다. 검증의 역할은 인식기에서 인식된 단어가 등록어인지 미등록어인지 판별하는 것이다. 단어인식기는 비터비 탐색을 하므로, 기본적으로 단어단위로 인식을 하지만 그 인식된 단어는 내부적으로 음소단위로 인식된다. 따라서, 최소 검증 오류를 갖는 반음소 모델을 사용하고, 이를 이용하여 인식된 음소 단위들을 각각의 반음소 모델과 비교하여 통계적인 방법에 의해 신뢰도를 구한다 이 음소단위의 신뢰도를 단어 단위의 신뢰도로 환산하기 위해서 음소단위를 평균 내는 방식 을 취한다. 이렇게 함으로서, 등록어와 미등록어 사이의 분별력을 크게 하여 향상된 인식 성능을 얻었다.

  • PDF

Morphological analysis of spoken Korean using Viterbi search (Viterbi 검색 기법을 이용한 한국어 음성 언어의 형태소 분석)

  • 김병창
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1995.06a
    • /
    • pp.200-203
    • /
    • 1995
  • This paper proposes a spoken Korean processing model which is extensible to large vocabulary continuous spoken Korean system. The integration of phoneme level speech recognition with natural language processing can support a sophisticated phonological/morphological analysis. The model consists of a diphone speech recognizer, a viterbi dictionaly searcher and a morpheme connectivity information checker. Two-level hierarchical TDNNs recognize newly defined Korean diphones. The diphone sequences are segmented and converted to the most probable morpheme sequences by the Viterbi dictionary searcher. The morpheme sequency are then examined by the morpheme connectivity information checker and the correct morpheme sequence which has the greatest probability is collected. The experiments show that the morphological analysis for spoken Korean can be achieved for 328 Eojeols with 80.6% success rate.

  • PDF

Seafloor Classification Based on the Texture Analysis of Sonar Images Using the Gabor Wavelet

  • Sun, Ning;Shim, Tae-Bo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.27 no.3E
    • /
    • pp.77-83
    • /
    • 2008
  • In the process of the sonar image textures produced, the orientation and scale factors are very significant. However, most of the related methods ignore the directional information and scale invariance or just pay attention to one of them. To overcome this problem, we apply Gabor wavelet to extract the features of sonar images, which combine the advantages of both the Gabor filter and traditional wavelet function. The mother wavelet is designed with constrained parameters and the optimal parameters will be selected at each orientation, with the help of bandwidth parameters based on the Fisher criterion. The Gabor wavelet can have the properties of both multi-scale and multi-orientation. Based on our experiment, this method is more appropriate than traditional wavelet or single Gabor filter as it provides the better discrimination of the textures and improves the recognition rate effectively. Meanwhile, comparing with other fusion methods, it can reduce the complexity and improve the calculation efficiency.

The Stereoscopic Vision Robot System Design with DSP Processor (DSP를 이용한 스테레오 비젼 로봇의 설계에 관한 연구)

  • 노석환;강희조;류광렬
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2003.10a
    • /
    • pp.264-267
    • /
    • 2003
  • The stereoscopic vision robot system design with DSP processor is presented. The vision system is consists of control system, vision system and host computer. The vision system is based on 32bits DSP processor. The stereoscopic image processing applies the correlation coefficient method to execute the software. The result of experiment, image recognition rate is 95% on the stereoscopic vision robot system.

  • PDF

GMM-Based Maghreb Dialect Identification System

  • Nour-Eddine, Lachachi;Abdelkader, Adla
    • Journal of Information Processing Systems
    • /
    • v.11 no.1
    • /
    • pp.22-38
    • /
    • 2015
  • While Modern Standard Arabic is the formal spoken and written language of the Arab world; dialects are the major communication mode for everyday life. Therefore, identifying a speaker's dialect is critical in the Arabic-speaking world for speech processing tasks, such as automatic speech recognition or identification. In this paper, we examine two approaches that reduce the Universal Background Model (UBM) in the automatic dialect identification system across the five following Arabic Maghreb dialects: Moroccan, Tunisian, and 3 dialects of the western (Oranian), central (Algiersian), and eastern (Constantinian) regions of Algeria. We applied our approaches to the Maghreb dialect detection domain that contains a collection of 10-second utterances and we compared the performance precision gained against the dialect samples from a baseline GMM-UBM system and the ones from our own improved GMM-UBM system that uses a Reduced UBM algorithm. Our experiments show that our approaches significantly improve identification performance over purely acoustic features with an identification rate of 80.49%.

Normal Variants and Artifacts in Bone Scan: Potential for Errors in Interpretation (골스캔 판독시 오류를 범할 수 있는 정상 변이소견 및 인공물)

  • Sohn, Myung-Hee
    • The Korean Journal of Nuclear Medicine
    • /
    • v.38 no.1
    • /
    • pp.1-20
    • /
    • 2004
  • Bone scan is one of the most frequently peformed studios in nuclear medicine. In bone scan, the amount of radiolsotope taken up by lesion depends primarily on the local rate of bone turnover rather than on the bone mass. Bone scan is extremely sensitive for defecting bony abnormalities. However, abnormalities that appear on bone scan may not always represent disease. The normal scan appearances may be affected not only by skeletal physiology and anatomy but also by a variety of technical factors which can influence image quality. Many normal variants and artifacts may appear on bone scan. They could simulate a pathologic process and could mislead into the wrong diagnostic interpretation. Therefore, their recognition is necessary to avoid misdiagnosis. A nuclear medicine physician should be aware of variable appearance of the normal variants and artifacts on bone scan. In this article, a variety of normal variants and artifacts mimicking real pathologic lesion in bone scan interpretation are discussed and illustrated.

A study on the increase of user gesture recognition rate using data preprocessing (데이터 전처리를 통한 사용자 제스처 인식률 증가 방안)

  • Kim, Jun Heon;Song, Byung Hoo;Shin, Dong Ryoul
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2017.07a
    • /
    • pp.13-16
    • /
    • 2017
  • 제스처 인식은 HCI(Human-Computer Interaction) 및 HRI(Human-Robot Interaction) 분야에서 활발히 연구되고 있는 기술이며, 제스처 데이터의 특징을 추출해내고 그에 따른 분류를 통하여 사용자의 제스처를 정확히 판별하는 것이 중요한 과제로 자리 잡았다. 본 논문에서는 EMG(Electromyography) 센서로 측정한 사용자의 손 제스처 데이터를 분석하는 방안에 대하여 서술한다. 수집된 데이터의 노이즈를 제거하고 데이터의 특징을 극대화시키기 위하여 연속적인 데이터로 변환하는 전처리 과정을 거쳐 이를 머신 러닝 알고리즘을 사용하여 분류하였다. 이 때, 기존의 raw 데이터와 전처리 과정을 거친 데이터의 성능을 decision-tree 알고리즘을 통하여 비교하였다.

  • PDF

A survey on residential environment's improvement plan of silver town by means of applying ubiquitous system (유비쿼터스시스템 적용을 통한 실버타운의 주거환경 개선방안 연구)

  • Lee, Chulgoo
    • Journal of the Korean Society for Geothermal and Hydrothermal Energy
    • /
    • v.10 no.4
    • /
    • pp.8-14
    • /
    • 2014
  • With the advancement of the industry such as information technology, residential facilities has been consistently achieved high standard. Ubiquitous system has been introduced in apartments nowadays, and it will be general facilities in the near future. However elderly people is not generally accustomed to ubiquitous or internet environment, so degree of ubiquitous system usage is able to be less than young people. This study aims to grasp the relative importance of ubiquitous environment, and offer materials to realize residential environment's improvement of silver towns under investigation through a questionnaire survey. Skin recognition system was most preferred among various functions. Energy management system, protection system of solitude elderly person and security system by CCTV was also preferred at a high rate. Housework part such as cooking aid, height control of sink and basin etc., was not less important system relatively.