• Title/Summary/Keyword: Linear feature

Search Result 785, Processing Time 0.029 seconds

EAR: Enhanced Augmented Reality System for Sports Entertainment Applications

  • Mahmood, Zahid;Ali, Tauseef;Muhammad, Nazeer;Bibi, Nargis;Shahzad, Imran;Azmat, Shoaib
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.12
    • /
    • pp.6069-6091
    • /
    • 2017
  • Augmented Reality (AR) overlays virtual information on real world data, such as displaying useful information on videos/images of a scene. This paper presents an Enhanced AR (EAR) system that displays useful statistical players' information on captured images of a sports game. We focus on the situation where the input image is degraded by strong sunlight. Proposed EAR system consists of an image enhancement technique to improve the accuracy of subsequent player and face detection. The image enhancement is followed by player and face detection, face recognition, and players' statistics display. First, an algorithm based on multi-scale retinex is proposed for image enhancement. Then, to detect players' and faces', we use adaptive boosting and Haar features for feature extraction and classification. The player face recognition algorithm uses boosted linear discriminant analysis to select features and nearest neighbor classifier for classification. The system can be adjusted to work in different types of sports where the input is an image and the desired output is display of information nearby the recognized players. Simulations are carried out on 2096 different images that contain players in diverse conditions. Proposed EAR system demonstrates the great potential of computer vision based approaches to develop AR applications.

Development and Validity of Creativity Path Inventory (CPI) (창의성 경로 척도(Creativity Path Inventory)의 개발 및 타당화)

  • Lee, Hyunjoo;Lee, Mina;Park, Eunji
    • Journal of Gifted/Talented Education
    • /
    • v.25 no.4
    • /
    • pp.511-528
    • /
    • 2015
  • The development process from creative potential to realized talent is complex and non-linear. This feature of the process stands out more in the process of living a creative life in the long-term rather than in a situation to solve certain problems in the short-term. The purpose of this study is to develop Creativity Path Inventory (CPI) for undergraduate students based on Sawyer's Zigzag Model which is one of creative process theories and to verify reliability and validity of the inventory. Thus, reflecting the characteristics of each stage of the model, this study developed 88 items in 8 factors and finally confirmed 38 items in 7 factors through item analysis and verification process on construct validity. Internal consistency of a total of 38 items in CPI turned out to be .835, confirming the reliability of the inventory and goodness-of-fit index of the final model also demonstrated an appropriate result. CPI with verified reliability and validity will help enable people who want to manifest creativity in view of everyday creativity to realize self-improvement by self-reporting their strengths and weaknesses on their own.

Quantization Based Speaker Normalization for DHMM Speech Recognition System (DHMM 음성 인식 시스템을 위한 양자화 기반의 화자 정규화)

  • 신옥근
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.4
    • /
    • pp.299-307
    • /
    • 2003
  • There have been many studies on speaker normalization which aims to minimize the effects of speaker's vocal tract length on the recognition performance of the speaker independent speech recognition system. In this paper, we propose a simple vector quantizer based linear warping speaker normalization method based on the observation that the vector quantizer can be successfully used for speaker verification. For this purpose, we firstly generate an optimal codebook which will be used as the basis of the speaker normalization, and then the warping factor of the unknown speaker will be extracted by comparing the feature vectors and the codebook. Finally, the extracted warping factor is used to linearly warp the Mel scale filter bank adopted in the course of MFCC calculation. To test the performance of the proposed method, a series of recognition experiments are conducted on discrete HMM with thirteen mono-syllabic Korean number utterances. The results showed that about 29% of word error rate can be reduced, and that the proposed warping factor extraction method is useful due to its simplicity compared to other line search warping methods.

Vector Quantizer Based Speaker Normalization for Continuos Speech Recognition (연속음성 인식기를 위한 벡터양자화기 기반의 화자정규화)

  • Shin Ok-keun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.8
    • /
    • pp.583-589
    • /
    • 2004
  • Proposed is a speaker normalization method based on vector quantizer for continuous speech recognition (CSR) system in which no acoustic information is made use of. The proposed method, which is an improvement of the previously reported speaker normalization scheme for a simple digit recognizer, builds up a canonical codebook by iteratively training the codebook while the size of codebook is increased after each iteration from a relatively small initial size. Once the codebook established, the warp factors of speakers are estimated by comparing exhaustively the warped versions of each speaker's utterance with the codebook. Two sets of phones are used to estimate the warp factors: one, a set of vowels only. and the other, a set composed of all the Phonemes. A Piecewise linear warping function which corresponds to the estimated warp factor is adopted to warp the power spectrum of the utterance. Then the warped feature vectors are extracted to be used to train and to test the speech recognizer. The effectiveness of the proposed method is investigated by a set of recognition experiments using the TIMIT corpus and HTK speech recognition tool kit. The experimental results showed comparable recognition rate improvement with the formant based warping method.

Design of Speech Enhancement U-Net for Embedded Computing (임베디드 연산을 위한 잡음에서 음성추출 U-Net 설계)

  • Kim, Hyun-Don
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.15 no.5
    • /
    • pp.227-234
    • /
    • 2020
  • In this paper, we propose wav-U-Net to improve speech enhancement in heavy noisy environments, and it has implemented three principal techniques. First, as input data, we use 128 modified Mel-scale filter banks which can reduce computational burden instead of 512 frequency bins. Mel-scale aims to mimic the non-linear human ear perception of sound by being more discriminative at lower frequencies and less discriminative at higher frequencies. Therefore, Mel-scale is the suitable feature considering both performance and computing power because our proposed network focuses on speech signals. Second, we add a simple ResNet as pre-processing that helps our proposed network make estimated speech signals clear and suppress high-frequency noises. Finally, the proposed U-Net model shows significant performance regardless of the kinds of noise. Especially, despite using a single channel, we confirmed that it can well deal with non-stationary noises whose frequency properties are dynamically changed, and it is possible to estimate speech signals from noisy speech signals even in extremely noisy environments where noises are much lauder than speech (less than SNR 0dB). The performance on our proposed wav-U-Net was improved by about 200% on SDR and 460% on NSDR compared to the conventional Jansson's wav-U-Net. Also, it was confirmed that the processing time of out wav-U-Net with 128 modified Mel-scale filter banks was about 2.7 times faster than the common wav-U-Net with 512 frequency bins as input values.

Simulation of Tsunamis in the East Sea Using Dynamically-Interfaced Multi-Grid Model (동적결합둥지형 모형에 의한 동해안 쓰나미 시뮬레이션)

  • Choi, Byung-Ho;Efim, Pelinovsky;Woo, Seung-Buhm;Lee, Jong-Woong;Mun, Jong-Yoon
    • Journal of the Earthquake Engineering Society of Korea
    • /
    • v.7 no.1
    • /
    • pp.41-55
    • /
    • 2003
  • A dynamically-interfaced multi-grid finite difference model for simulation of tsunamis in the East Sea(Choi et al.) was established and further applied to produce detailed feature of coastal inundations along the whole eastern coast of Korea. The computational domain is composed of several sub-regions with different grid sizes connected in parallel of inclined directions with 16 innermost nested models. The innermost sub-region represents the coastal alignment reasonably well and has a grid size of about 30 meters. Numerical simulations have been performed in the framework of shallow-water equations(linear, as well as nonlinear) over the plane or spherical coordinate system, depending on the dimensions of the sub-region. Results of simulations show the general agreements with the observed data of run-up height for both tsunamis. The evolution of the distribution function of tsunami heights is studied numerically and it is shown that it tends to the log-normal curve for long distance from the source.

Real-Time Lane Detection Based on Inverse Perspective Transform and Search Range Prediction (역 원근 변환과 검색 영역 예측에 의한 실시간 차선 인식)

  • Jeong, Seung-Gweon;Kim, In-Soo;Kim, Sung-Han;Lee, Dong-Hwoal;Yun, Kang-Sup;Lee, Man-Hyung
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.18 no.3
    • /
    • pp.68-74
    • /
    • 2001
  • A lane detection based on a road model or feature all needs correct acquirement of information on the lane in an image. It is inefficient to implement a lane detection algorithm through the full range of an image when it is applied to a real road in real time because of the calculating time. This paper defines two (other proper terms including"modes") for detecting lanes on a road. First is searching mode that is searching the lane without any prior information of a road. Second is recognition mode, which is able to reduce the size and change the position of a searching range by predicting the position of a lane through the acquired information in a previous frame. It allows to extract accurately and efficiently the edge candidate points of a lane without any unnecessary searching. By means of inverse perspective transform which removes the perspective effect on the edge candidate points, we transform the edge candidate information in the Image Coordinate System(ICS) into the plan-view image in the World Coordinate System(WCS). We define a linear approximation filter and remove faulty edge candidate points by using it. This paper aims at approximating more correctly the lane of an actual road by applying the least-mean square method with the fault-removed edge information for curve fitting.e fitting.

  • PDF

Face Recognition Using Fisherface Algorithm and Fixed Graph Matching (Fisherface 알고리즘과 Fixed Graph Matching을 이용한 얼굴 인식)

  • Lee, Hyeong-Ji;Jeong, Jae-Ho
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.38 no.6
    • /
    • pp.608-616
    • /
    • 2001
  • This paper proposes a face recognition technique that effectively combines fixed graph matching (FGM) and Fisherface algorithm. EGM as one of dynamic link architecture uses not only face-shape but also the gray information of image, and Fisherface algorithm as a class specific method is robust about variations such as lighting direction and facial expression. In the proposed face recognition adopting the above two methods, linear projection per node of an image graph reduces dimensionality of labeled graph vector and provides a feature space to be used effectively for the classification. In comparison with a conventional EGM, the proposed approach could obtain satisfactory results in the perspectives of recognition speeds. Especially, we could get higher average recognition rate of 90.1% than the conventional methods by hold-out method for the experiments with the Yale Face Databases and Olivetti Research Laboratory (ORL) Databases.

  • PDF

Recommended Practice for o Reasonable Design Demand Factor and Analysis of Power Consumption Characteristics by Loads in Office Buildings (사무소용 빌딩의 부하종별 전력소비특성 분석 및 수용률 기준 정립에 관한 연구)

  • Kim, Se-Dong;Lee, Jin
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.19 no.3
    • /
    • pp.111-118
    • /
    • 2005
  • It is increased electrical energy consumption with the development of intelligence society in the once buildings and thus an energy conservation through efficient use of electricity became more important. This paper shows a reasonable design demand factor in office buildings, that was made by the systematic and statistical way considering actual conditions, such as investigated electric equipment capacity, peak power consumption, demand factor, etc., for 54 office buildings and 34 electrical design offices. In this dissertation it is necessary to analyse the key features and general trend from the investigated data. It made an analysis of the feature parameters, such as average, standard deviation, median, maximum, minimum and thus it was carried the linear and nonlinear regression analysis.

Hypergraph model based Scene Image Classification Method (하이퍼그래프 모델 기반의 장면 이미지 분류 기법)

  • Choi, Sun-Wook;Lee, Chong Ho
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.24 no.2
    • /
    • pp.166-172
    • /
    • 2014
  • Image classification is an important problem in computer vision. However, it is a very challenging problem due to the variability, ambiguity and scale change that exists in images. In this paper, we propose a method of a hypergraph based modeling can consider the higher-order relationships of semantic attributes of a scene image and apply it to a scene image classification. In order to generate the hypergraph optimized for specific scene category, we propose a novel search method based on a probabilistic subspace method and also propose a method to aggregate the expression values of the member semantic attributes that belongs to the searched subsets based on a linear transformation method via likelihood based estimation. To verify the superiority of the proposed method, we showed that the discrimination power of the feature vector generated by the proposed method is better than existing methods through experiments. And also, in a scene classification experiment, the proposed method shows a competitive classification performance compared with the conventional methods.