Search | Korea Science

Content-based image retrieval using a fusion of global and local features

Hee Hyung Bu;Nam Chul Kim;Sung Ho Kim
- ETRI Journal
- /
- v.45 no.3
- /
- pp.505-517
- /
- 2023
Color, texture, and shape act as important information for images in human recognition. For content-based image retrieval, many studies have combined color, texture, and shape features to improve the retrieval performance. However, there have not been many powerful methods for combining all color, texture, and shape features. This study proposes a content-based image retrieval method that uses the combined local and global features of color, texture, and shape. The color features are extracted from the color autocorrelogram; the texture features are extracted from the magnitude of a complete local binary pattern and the Gabor local correlation revealing local image characteristics; and the shape features are extracted from singular value decomposition that reflects global image characteristics. In this work, an experiment is performed to compare the proposed method with those that use our partial features and some existing techniques. The results show an average precision that is 19.60% higher than those of existing methods and 9.09% higher than those of recent ones. In conclusion, our proposed method is superior over other methods in terms of retrieval performance.
https://doi.org/10.4218/etrij.2022-0071 인용 PDF

Music Genre Classification Based on Timbral Texture and Rhythmic Content Features

Baniya, Babu Kaji;Ghimire, Deepak;Lee, Joonwhon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2013.05a
- /
- pp.204-207
- /
- 2013
Music genre classification is an essential component for music information retrieval system. There are two important components to be considered for better genre classification, which are audio feature extraction and classifier. This paper incorporates two different kinds of features for genre classification, timbral texture and rhythmic content features. Timbral texture contains several spectral and Mel-frequency Cepstral Coefficient (MFCC) features. Before choosing a timbral feature we explore which feature contributes less significant role on genre discrimination. This facilitates the reduction of feature dimension. For the timbral features up to the 4-th order central moments and the covariance components of mutual features are considered to improve the overall classification result. For the rhythmic content the features extracted from beat histogram are selected. In the paper Extreme Learning Machine (ELM) with bagging is used as classifier for classifying the genres. Based on the proposed feature sets and classifier, experiment is performed with well-known datasets: GTZAN databases with ten different music genres, respectively. The proposed method acquires the better classification accuracy than the existing approaches.
https://doi.org/10.3745/PKIPS.y2013m05a.204 인용 PDF

Intra-and Inter-frame Features for Automatic Speech Recognition

Lee, Sung Joo;Kang, Byung Ok;Chung, Hoon;Lee, Yunkeun
- ETRI Journal
- /
- v.36 no.3
- /
- pp.514-517
- /
- 2014
In this paper, alternative dynamic features for speech recognition are proposed. The goal of this work is to improve speech recognition accuracy by deriving the representation of distinctive dynamic characteristics from a speech spectrum. This work was inspired by two temporal dynamics of a speech signal. One is the highly non-stationary nature of speech, and the other is the inter-frame change of a speech spectrum. We adopt the use of a sub-frame spectrum analyzer to capture very rapid spectral changes within a speech analysis frame. In addition, we attempt to measure spectral fluctuations of a more complex manner as opposed to traditional dynamic features such as delta or double-delta. To evaluate the proposed features, speech recognition tests over smartphone environments were conducted. The experimental results show that the feature streams simply combined with the proposed features are effective for an improvement in the recognition accuracy of a hidden Markov model-based speech recognizer.
https://doi.org/10.4218/etrij.14.0213.0181 인용 PDF KSCI KPUBS

Harmonic Structure Features for Robust Speaker Diarization

Zhou, Yu;Suo, Hongbin;Li, Junfeng;Yan, Yonghong
- ETRI Journal
- /
- v.34 no.4
- /
- pp.583-590
- /
- 2012
In this paper, we present a new approach for speaker diarization. First, we use the prosodic information calculated on the original speech to resynthesize the new speech data utilizing the spectrum modeling technique. The resynthesized data is modeled with sinusoids based on pitch, vibration amplitude, and phase bias. Then, we use the resynthesized speech data to extract cepstral features and integrate them with the cepstral features from original speech for speaker diarization. At last, we show how the two streams of cepstral features can be combined to improve the robustness of speaker diarization. Experiments carried out on the standardized datasets (the US National Institute of Standards and Technology Rich Transcription 04-S multiple distant microphone conditions) show a significant improvement in diarization error rate compared to the system based on only the feature stream from original speech.
https://doi.org/10.4218/etrij.12.0111.0455 인용 PDF KSCI

Content Based Image Retrieval Using Combined Features of Shape, Color and Relevance Feedback

Mussarat, Yasmin;Muhammad, Sharif;Sajjad, Mohsin;Isma, Irum
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.7 no.12
- /
- pp.3149-3165
- /
- 2013
Content based image retrieval is increasingly gaining popularity among image repository systems as images are a big source of digital communication and information sharing. Identification of image content is done through feature extraction which is the key operation for a successful content based image retrieval system. In this paper content based image retrieval system has been developed by adopting a strategy of combining multiple features of shape, color and relevance feedback. Shape is served as a primary operation to identify images whereas color and relevance feedback have been used as supporting features to make the system more efficient and accurate. Shape features are estimated through second derivative, least square polynomial and shapes coding methods. Color is estimated through max-min mean of neighborhood intensities. A new technique has been introduced for relevance feedback without bothering the user.
https://doi.org/10.3837/tiis.2013.12.011 인용 PDF KSCI KPUBS HTML

Steganalysis of Content-Adaptive Steganography using Markov Features for DCT Coefficients (DCT 계수의 마코프 특징을 이용한 내용 적응적 스테가노그래피의 스테그분석)

Park, Tae Hee;Han, Jong Goo;Eom, Il Kyu
- Journal of the Institute of Electronics and Information Engineers
- /
- v.52 no.8
- /
- pp.97-105
- /
- 2015
Content-adaptive steganography methods embed secret messages in hard-to-model regions of covers such as complicated texture or noisy area. Content-adaptive steganalysis methods often need high dimensional features to capture more subtle relationships of local dependencies among adjacent pixels. However, these methods require many computational complexity and depend on the location of hidden message and the exploited distortion metrics. In this paper, we propose an improved steganalysis method for content-adaptive steganography to enhance detection rate with small number features. We first show that the features form the difference between DCT coefficients are useful for analyzing the content-adaptive steganography methods, and present feature extraction mehtod using first-order Markov probability for the the difference between DCT coefficients. The extracted features are used as input of ensemble classifier. Experimental results show that the proposed method outperforms previous schemes in terms of detection rates and accuracy in spite of a small number features in various content-adaptive stego images.
https://doi.org/10.5573/ieie.2015.52.8.097 인용 PDF KSCI

Attack Detection on Images Based on DCT-Based Features

Nirin Thanirat;Sudsanguan Ngamsuriyaroj
- Asia pacific journal of information systems
- /
- v.31 no.3
- /
- pp.335-357
- /
- 2021
As reproduction of images can be done with ease, copy detection has increasingly become important. In the duplication process, image modifications are likely to occur and some alterations are deliberate and can be viewed as attacks. A wide range of copy detection techniques has been proposed. In our study, content-based copy detection, which basically applies DCT-based features for images, namely, pixel values, edges, texture information and frequency-domain component distribution, is employed. Experiments are carried out to evaluate robustness and sensitivity of DCT-based features from attacks. As different types of DCT-based features hold different pieces of information, how features and attacks are related can be shown in their robustness and sensitivity. Rather than searching for proper features, use of robustness and sensitivity is proposed here to realize how the attacked features have changed when an image attack occurs. The experiments show that, out of ten attacks, the neural networks are able to detect seven attacks namely, Gaussian noise, S&P noise, Gamma correction (high), blurring, resizing (big), compression and rotation with mostly related to their sensitive features.
https://doi.org/10.14329/apjis.2021.31.3.335 인용 PDF

Image Clustering using Color, Texture and Shape Features

Sleit, Azzam;Abu Dalhoum, Abdel Llatif;Qatawneh, Mohammad;Al-Sharief, Maryam;Al-Jabaly, Rawa'a;Karajeh, Ola
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.5 no.1
- /
- pp.211-227
- /
- 2011
Content Based Image Retrieval (CBIR) is an approach for retrieving similar images from an image database based on automatically-derived image features. The quality of a retrieval system depends on the features used to describe image content. In this paper, we propose an image clustering system that takes a database of images as input and clusters them using k-means clustering algorithm taking into consideration color, texture and shape features. Experimental results show that the combination of the three features brings about higher values of accuracy and precision.
https://doi.org/10.3837/tiis.2011.01.012 인용 PDF KSCI

Content-Based Image Retrieval Using Multi-Resolution Multi-Direction Filtering-Based CLBP Texture Features and Color Autocorrelogram Features

Bu, Hee-Hyung;Kim, Nam-Chul;Yun, Byoung-Ju;Kim, Sung-Ho
- Journal of Information Processing Systems
- /
- v.16 no.4
- /
- pp.991-1000
- /
- 2020
We propose a content-based image retrieval system that uses a combination of completed local binary pattern (CLBP) and color autocorrelogram. CLBP features are extracted on a multi-resolution multi-direction filtered domain of value component. Color autocorrelogram features are extracted in two dimensions of hue and saturation components. Experiment results revealed that the proposed method yields a lot of improvement when compared with the methods that use partial features employed in the proposed method. It is also superior to the conventional CLBP, the color autocorrelogram using R, G, and B components, and the multichannel decoded local binary pattern which is one of the latest methods.
https://doi.org/10.3745/JIPS.02.0138 인용 PDF KSCI

The Effects of YouTube Summary Contents Features and Contents Provider Credibility on Users' Flow and Satisfaction (유튜브 서머리 콘텐츠 특성과 콘텐츠 제공자 신뢰성이 이용자 몰입과 만족에 미치는 영향)

Jeong, Yu-Jin;Lee, Nam-Jung;Lee, Jung-Hoon
- Journal of the Korea Convergence Society
- /
- v.12 no.2
- /
- pp.35-44
- /
- 2021
Previous studies have studied short videos, short form content, snack culture and so on, but few studies have been conducted on the form of summary content that compressing and summarizing the original content. This study aims to contribute to the revitalization of the summary content market by exploring ways to enhance user satisfaction through analysis of the YouTube summary content features and the credibility of content providers that bring about flow and satisfaction of YouTube summary content users. The survey was conducted on 202 people who have watched YouTube summary contents for finding out the effects of YouTube summary contents features and content provider credibility on the details of flow. As a result, only entertainment had a significant impact on all flow details. This study is of academic significance in that it defines the features of YouTube summary contents, and has practical significance in that it suggests what direction the summary content should have in order to arouse user satisfaction in future.
https://doi.org/10.15207/JKCS.2021.12.2.035 인용 PDF KSCI

Search Result 1,188, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)