Search | Korea Science

Language Identification by Fusion of Gabor, MDLC, and Co-Occurrence Features (Gabor, MDLC, Co-Occurrence 특징의 융합에 의한 언어 인식)

Jang, Ick-Hoon;Kim, Ji-Hong
- Journal of Korea Multimedia Society
- /
- v.17 no.3
- /
- pp.277-286
- /
- 2014
In this paper, we propose a texture feature-based language identification by fusion of Gabor, MDLC (multi-lag directional local correlation), and co-occurrence features. In the proposed method, for a test image, Gabor magnitude images are first obtained by Gabor transform followed by magnitude operator. Moments for the Gabor magniude images are then computed and vectorized. MDLC images are then obtained by MDLC operator and their moments are computed and vectorized. GLCM (gray-level co-occurrence matrix) is next calculated from the test image and co-occurrence features are computed using the GLCM, and the features are also vectorized. The three vectors of the Gabor, MDLC, and co-occurrence features are fused into a feature vector. In classification, the WPCA (whitened principal component analysis) classifier, which is usually adopted in the face identification, searches the training feature vector most similar to the test feature vector. We evaluate the performance of our method by examining averaged identification rates for a test document image DB obtained by scanning of documents with 15 languages. Experimental results show that the proposed method yields excellent language identification with rather low feature dimension for the test DB.
https://doi.org/10.9717/kmms.2014.17.3.277 인용 PDF KSCI KPUBS HTML

Estimation of Automatic Video Captioning in Real Applications using Machine Learning Techniques and Convolutional Neural Network

Vaishnavi, J;Narmatha, V
- International Journal of Computer Science & Network Security
- /
- v.22 no.9
- /
- pp.316-326
- /
- 2022
The prompt development in the field of video is the outbreak of online services which replaces the television media within a shorter period in gaining popularity. The online videos are encouraged more in use due to the captions displayed along with the scenes for better understandability. Not only entertainment media but other marketing companies and organizations are utilizing videos along with captions for their product promotions. The need for captions is enabled for its usage in many ways for hearing impaired and non-native people. Research is continued in an automatic display of the appropriate messages for the videos uploaded in shows, movies, educational videos, online classes, websites, etc. This paper focuses on two concerns namely the first part dealing with the machine learning method for preprocessing the videos into frames and resizing, the resized frames are classified into multiple actions after feature extraction. For the feature extraction statistical method, GLCM and Hu moments are used. The second part deals with the deep learning method where the CNN architecture is used to acquire the results. Finally both the results are compared to find the best accuracy where CNN proves to give top accuracy of 96.10% in classification.
https://doi.org/10.22937/IJCSNS.2022.22.9.42 인용 PDF KSCI

Image Retrieval Using Multiresoluton Color and Texture Features in Wavelet Transform Domain (웨이브릿 변환 영역의 칼라 및 질감 특징을 이용한 영상검색)

Chun Young-Deok;Sung Joong-Ki;Kim Nam-Chul
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.43 no.1 s.307
- /
- pp.55-66
- /
- 2006
We propose a progressive image retrieval method based on an efficient combination of multiresolution color and torture features in wavelet transform domain. As a color feature, color autocorrelogram of the hue and saturation components is chosen. As texture features, BDIP and BVLC moments of the value component are chosen. For the selected features, we obtain multiresolution feature vectors which are extracted from all decomposition levels in wavelet domain. The multiresolution feature vectors of the color and texture features are efficiently combined by the normalization depending on their dimensions and standard deviation vector, respectively, vector components of the features are efficiently quantized in consideration of their storage space, and computational complexity in similarity computation is reduced by using progressive retrieval strategy. Experimental results show that the proposed method yields average $15\%$ better performance in precision vs. recall and average 0.2 in ANMRR than the methods using color histogram color autocorrelogram SCD, CSD, wavelet moments, EHD, BDIP and BVLC moments, and combination of color histogram and wavelet moments, respectively. Specially, the proposed method shows an excellent performance over the other methods in image DBs contained images of various resolutions.
PDF KSCI

A Study on the Automatic Inspection System using Invariant Moments Algorithm with the Change of Size and Rotation (크기와 회전 변화에 불변 모멘트 알고리즘을 이용한 자동 검사 시스템에 관한 연구)

Lee, Yong-Joong
- Transactions of the Korean Society of Machine Tool Engineers
- /
- v.13 no.3
- /
- pp.37-43
- /
- 2004
The purpose of this study is to develop a practical image inspection system that could recognize it correctly, endowing flexibility to the productive field, although the same object for work will be changed in the size and rotated. In this experiment, it selected a fighter, rotating the direction from $30^{\circ}$ to $45^{\circ}$ simultaneously while changing the size from 1/4 to 1/16, as an object inspection without using another hardware for exclusive image processing. The invariant moments, Hu has suggested, was used as feature vector moment descriptor. As a result of the experiment, the image inspection system developed from this research was operated in real-time regardless of the chance of size and rotation for the object inspection, and it maintained the correspondent rates steadily above from 94% to 96%. Accordingly, it is considered as the flexibility can be considerably endowed to the factory automation when the image inspection system developed from this research is applied to the productive field.
PDF KSCI

A New Hybrid Algorithm for Invariance and Improved Classification Performance in Image Recognition

Shi, Rui-Xia;Jeong, Dong-Gyu
- International journal of advanced smart convergence
- /
- v.9 no.3
- /
- pp.85-96
- /
- 2020
It is important to extract salient object image and to solve the invariance problem for image recognition. In this paper we propose a new hybrid algorithm for invariance and improved classification performance in image recognition, whose algorithm is combined by FT(Frequency-tuned Salient Region Detection) algorithm, Guided filter, Zernike moments, and a simple artificial neural network (Multi-layer Perceptron). The conventional FT algorithm is used to extract initial salient object image, the guided filtering to preserve edge details, Zernike moments to solve invariance problem, and a classification to recognize the extracted image. For guided filtering, guided filter is used, and Multi-layer Perceptron which is a simple artificial neural networks is introduced for classification. Experimental results show that this algorithm can achieve a superior performance in the process of extracting salient object image and invariant moment feature. And the results show that the algorithm can also classifies the extracted object image with improved recognition rate.
https://doi.org/10.7236/IJASC.2020.9.3.85 인용 PDF KSCI

Face Recognition Using A New Methodology For Independent Component Analysis (새로운 독립 요소 해석 방법론에 의한 얼굴 인식)

류재흥;고재흥
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2000.11a
- /
- pp.305-309
- /
- 2000
In this paper, we presents a new methodology for face recognition after analysing conventional ICA(Independent Component Analysis) based approach. In the literature we found that ICA based methods have followed the same procedure without any exception, first PCA(Principal Component Analysis) has been used for feature extraction, next ICA learning method has been applied for feature enhancement in the reduced dimension. However, it is contradiction that features are extracted using higher order moments depend on variance, the second order statistics. It is not considered that a necessary component can be located in the discarded feature space. In the new methodology, features are extracted using the magnitude of kurtosis(4-th order central moment or cumulant). This corresponds to the PCA based feature extraction using eigenvalue(2nd order central moment or variance). The synergy effect of PCA and ICA can be achieved if PCA is used for noise reduction filter. ICA methodology is analysed using SVD(Singular Value Decomposition). PCA does whitening and noise reduction. ICA performs the feature extraction. Simulation results show the effectiveness of the methodology compared to the conventional ICA approach.
PDF

Music Genre Classification Based on Timbral Texture and Rhythmic Content Features

Baniya, Babu Kaji;Ghimire, Deepak;Lee, Joonwhon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2013.05a
- /
- pp.204-207
- /
- 2013
Music genre classification is an essential component for music information retrieval system. There are two important components to be considered for better genre classification, which are audio feature extraction and classifier. This paper incorporates two different kinds of features for genre classification, timbral texture and rhythmic content features. Timbral texture contains several spectral and Mel-frequency Cepstral Coefficient (MFCC) features. Before choosing a timbral feature we explore which feature contributes less significant role on genre discrimination. This facilitates the reduction of feature dimension. For the timbral features up to the 4-th order central moments and the covariance components of mutual features are considered to improve the overall classification result. For the rhythmic content the features extracted from beat histogram are selected. In the paper Extreme Learning Machine (ELM) with bagging is used as classifier for classifying the genres. Based on the proposed feature sets and classifier, experiment is performed with well-known datasets: GTZAN databases with ten different music genres, respectively. The proposed method acquires the better classification accuracy than the existing approaches.
https://doi.org/10.3745/PKIPS.y2013m05a.204 인용 PDF

Implementation of System Retrieving Multi-Object Image Using Property of Moments (모멘트 특성을 이용한 다중 객체 이미지 검색 시스템 구현)

안광일;안재형
- Journal of Korea Multimedia Society
- /
- v.3 no.5
- /
- pp.454-460
- /
- 2000
To retrieve complex data such as images, the content-based retrieval method rather than keyword based method is required. In this paper, we implemented a content-based image retrieval system which retrieves object of user query effectively using invariant moments which have invariant properties about linear transformation like position transition, rotation and scaling. To extract the shape feature of objects in an image, we propose a labeling algorithm that extracts objects from an image and apply invariant moments to each object. Hashing method is also applied to reduce a retrieval time and index images effectively. The experimental results demonstrate the high retrieval efficiency i.e precision 85%, recall 23%. Consequently, our retrieval system shows better performance than the conventional system that cannot express the shale of objects exactly.
PDF

Effective Content-Based Image Retrieval Using Relevance feedback (관련성 피드백을 이용한 효과적인 내용기반 영상검색)

손재곤;김남철
- Proceedings of the IEEK Conference
- /
- 2001.09a
- /
- pp.669-672
- /
- 2001
We propose an efficient algorithm for an interactive content-based image retrieval using relevance feedback. In the proposed algorithm, a new query feature vector first is yielded from the average feature vector of the relevant images that is fed back from the result images of the previous retrieval. Each component weight of a feature vector is computed from an inverse of standard deviation for each component of the relevant images. The updated feature vector of the query and the component weights are used in the iterative retrieval process. In addition, the irrelevant images are excluded from object images in the next iteration to obtain additional performance improvement. In order to evaluate the retrieval performance of the proposed method, we experiment for three image databases, that is, Corel, Vistex, and Ultra databases. We have chosen wavelet moments, BDIP and BVLC, and MFS as features representing the visual content of an image. The experimental results show that the proposed method yields large precision improvement.
PDF

Enhanced SIFT Descriptor Based on Modified Discrete Gaussian-Hermite Moment

Kang, Tae-Koo;Zhang, Huazhen;Kim, Dong W.;Park, Gwi-Tae
- ETRI Journal
- /
- v.34 no.4
- /
- pp.572-582
- /
- 2012
The discrete Gaussian-Hermite moment (DGHM) is a global feature representation method that can be applied to square images. We propose a modified DGHM (MDGHM) method and an MDGHM-based scale-invariant feature transform (MDGHM-SIFT) descriptor. In the MDGHM, we devise a movable mask to represent the local features of a non-square image. The complete set of non-square image features are then represented by the summation of all MDGHMs. We also propose to apply an accumulated MDGHM using multi-order derivatives to obtain distinguishable feature information in the third stage of the SIFT. Finally, we calculate an MDGHM-based magnitude and an MDGHM-based orientation using the accumulated MDGHM. We carry out experiments using the proposed method with six kinds of deformations. The results show that the proposed method can be applied to non-square images without any image truncation and that it significantly outperforms the matching accuracy of other SIFT algorithms.
https://doi.org/10.4218/etrij.12.0111.0538 인용 PDF KSCI

Search Result 71, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)