통합 검색 | Korea Science

Content-based image retrieval using a fusion of global and local features

Hee Hyung Bu;Nam Chul Kim;Sung Ho Kim
- ETRI Journal
- /
- 제45권3호
- /
- pp.505-517
- /
- 2023
Color, texture, and shape act as important information for images in human recognition. For content-based image retrieval, many studies have combined color, texture, and shape features to improve the retrieval performance. However, there have not been many powerful methods for combining all color, texture, and shape features. This study proposes a content-based image retrieval method that uses the combined local and global features of color, texture, and shape. The color features are extracted from the color autocorrelogram; the texture features are extracted from the magnitude of a complete local binary pattern and the Gabor local correlation revealing local image characteristics; and the shape features are extracted from singular value decomposition that reflects global image characteristics. In this work, an experiment is performed to compare the proposed method with those that use our partial features and some existing techniques. The results show an average precision that is 19.60% higher than those of existing methods and 9.09% higher than those of recent ones. In conclusion, our proposed method is superior over other methods in terms of retrieval performance.
https://doi.org/10.4218/etrij.2022-0071 인용 PDF

Music Genre Classification Based on Timbral Texture and Rhythmic Content Features

Baniya, Babu Kaji;Ghimire, Deepak;Lee, Joonwhon
- 한국정보처리학회:학술대회논문집
- /
- 한국정보처리학회 2013년도 춘계학술발표대회
- /
- pp.204-207
- /
- 2013
Music genre classification is an essential component for music information retrieval system. There are two important components to be considered for better genre classification, which are audio feature extraction and classifier. This paper incorporates two different kinds of features for genre classification, timbral texture and rhythmic content features. Timbral texture contains several spectral and Mel-frequency Cepstral Coefficient (MFCC) features. Before choosing a timbral feature we explore which feature contributes less significant role on genre discrimination. This facilitates the reduction of feature dimension. For the timbral features up to the 4-th order central moments and the covariance components of mutual features are considered to improve the overall classification result. For the rhythmic content the features extracted from beat histogram are selected. In the paper Extreme Learning Machine (ELM) with bagging is used as classifier for classifying the genres. Based on the proposed feature sets and classifier, experiment is performed with well-known datasets: GTZAN databases with ten different music genres, respectively. The proposed method acquires the better classification accuracy than the existing approaches.
https://doi.org/10.3745/PKIPS.y2013m05a.204 인용 PDF

Intra-and Inter-frame Features for Automatic Speech Recognition

Lee, Sung Joo;Kang, Byung Ok;Chung, Hoon;Lee, Yunkeun
- ETRI Journal
- /
- 제36권3호
- /
- pp.514-517
- /
- 2014
In this paper, alternative dynamic features for speech recognition are proposed. The goal of this work is to improve speech recognition accuracy by deriving the representation of distinctive dynamic characteristics from a speech spectrum. This work was inspired by two temporal dynamics of a speech signal. One is the highly non-stationary nature of speech, and the other is the inter-frame change of a speech spectrum. We adopt the use of a sub-frame spectrum analyzer to capture very rapid spectral changes within a speech analysis frame. In addition, we attempt to measure spectral fluctuations of a more complex manner as opposed to traditional dynamic features such as delta or double-delta. To evaluate the proposed features, speech recognition tests over smartphone environments were conducted. The experimental results show that the feature streams simply combined with the proposed features are effective for an improvement in the recognition accuracy of a hidden Markov model-based speech recognizer.
https://doi.org/10.4218/etrij.14.0213.0181 인용 PDF KSCI KPUBS

Harmonic Structure Features for Robust Speaker Diarization

Zhou, Yu;Suo, Hongbin;Li, Junfeng;Yan, Yonghong
- ETRI Journal
- /
- 제34권4호
- /
- pp.583-590
- /
- 2012
In this paper, we present a new approach for speaker diarization. First, we use the prosodic information calculated on the original speech to resynthesize the new speech data utilizing the spectrum modeling technique. The resynthesized data is modeled with sinusoids based on pitch, vibration amplitude, and phase bias. Then, we use the resynthesized speech data to extract cepstral features and integrate them with the cepstral features from original speech for speaker diarization. At last, we show how the two streams of cepstral features can be combined to improve the robustness of speaker diarization. Experiments carried out on the standardized datasets (the US National Institute of Standards and Technology Rich Transcription 04-S multiple distant microphone conditions) show a significant improvement in diarization error rate compared to the system based on only the feature stream from original speech.
https://doi.org/10.4218/etrij.12.0111.0455 인용 PDF KSCI

Content Based Image Retrieval Using Combined Features of Shape, Color and Relevance Feedback

Mussarat, Yasmin;Muhammad, Sharif;Sajjad, Mohsin;Isma, Irum
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제7권12호
- /
- pp.3149-3165
- /
- 2013
Content based image retrieval is increasingly gaining popularity among image repository systems as images are a big source of digital communication and information sharing. Identification of image content is done through feature extraction which is the key operation for a successful content based image retrieval system. In this paper content based image retrieval system has been developed by adopting a strategy of combining multiple features of shape, color and relevance feedback. Shape is served as a primary operation to identify images whereas color and relevance feedback have been used as supporting features to make the system more efficient and accurate. Shape features are estimated through second derivative, least square polynomial and shapes coding methods. Color is estimated through max-min mean of neighborhood intensities. A new technique has been introduced for relevance feedback without bothering the user.
https://doi.org/10.3837/tiis.2013.12.011 인용 PDF KSCI KPUBS HTML

DCT 계수의 마코프 특징을 이용한 내용 적응적 스테가노그래피의 스테그분석 (Steganalysis of Content-Adaptive Steganography using Markov Features for DCT Coefficients)

박태희;한종구;엄일규
- 전자공학회논문지
- /
- 제52권8호
- /
- pp.97-105
- /
- 2015
내용 적응적 스테가노그래피는 복잡한 텍스쳐 또는 잡음 영역과 같이 통계적 모델로는 기술하기 어려운 영역에 비밀 메시지를 은닉한다. 이러한 메시지를 검출하기 위해서는 인접 화소간의 국부적인 의존성을 정교하게 모델링해야 하기 때문에 종종 고차원의 특징벡터 추출이 필요하다. 이러한 스테그분석 방법은 계산량이 많을 뿐만 아니라 비밀 메시지의 검출 정확도가 은닉 영역과 사용된 왜곡 척도에 의존한다는 문제점을 가진다. 본 논문에서는 적은 수의 특징 벡터를 이용하여 비밀 메시지의 검출율을 높일 수 있는 개선된 내용 적응적 스테가노그래피의 스테그분석 방법을 제안하고자 한다. 먼저 이산 코사인 변환 계수의 차이를 이용한 특징이 내용 적응적 스테가노그래피의 분석에 유용함을 보이고, 이에 대한 1차 마코프 확률을 특징으로 사용하는 방법을 제시한다. 추출된 특징 벡터는 앙상블 분류기로 입력되어 커버 영상과 스테고 영상을 분류하기 위해 학습된다. 실험 결과 내용 기반 적응적 스테고 영상들에 대해 적은 수의 특징 벡터를 사용함에도 불구하고 기존의 방법에 비해 검출율과 정확도가 우수함을 확인할 수 있었다.
https://doi.org/10.5573/ieie.2015.52.8.097 인용 PDF KSCI

Attack Detection on Images Based on DCT-Based Features

Nirin Thanirat;Sudsanguan Ngamsuriyaroj
- Asia pacific journal of information systems
- /
- 제31권3호
- /
- pp.335-357
- /
- 2021
As reproduction of images can be done with ease, copy detection has increasingly become important. In the duplication process, image modifications are likely to occur and some alterations are deliberate and can be viewed as attacks. A wide range of copy detection techniques has been proposed. In our study, content-based copy detection, which basically applies DCT-based features for images, namely, pixel values, edges, texture information and frequency-domain component distribution, is employed. Experiments are carried out to evaluate robustness and sensitivity of DCT-based features from attacks. As different types of DCT-based features hold different pieces of information, how features and attacks are related can be shown in their robustness and sensitivity. Rather than searching for proper features, use of robustness and sensitivity is proposed here to realize how the attacked features have changed when an image attack occurs. The experiments show that, out of ten attacks, the neural networks are able to detect seven attacks namely, Gaussian noise, S&P noise, Gamma correction (high), blurring, resizing (big), compression and rotation with mostly related to their sensitive features.
https://doi.org/10.14329/apjis.2021.31.3.335 인용 PDF

Image Clustering using Color, Texture and Shape Features

Sleit, Azzam;Abu Dalhoum, Abdel Llatif;Qatawneh, Mohammad;Al-Sharief, Maryam;Al-Jabaly, Rawa'a;Karajeh, Ola
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제5권1호
- /
- pp.211-227
- /
- 2011
Content Based Image Retrieval (CBIR) is an approach for retrieving similar images from an image database based on automatically-derived image features. The quality of a retrieval system depends on the features used to describe image content. In this paper, we propose an image clustering system that takes a database of images as input and clusters them using k-means clustering algorithm taking into consideration color, texture and shape features. Experimental results show that the combination of the three features brings about higher values of accuracy and precision.
https://doi.org/10.3837/tiis.2011.01.012 인용 PDF KSCI

Content-Based Image Retrieval Using Multi-Resolution Multi-Direction Filtering-Based CLBP Texture Features and Color Autocorrelogram Features

Bu, Hee-Hyung;Kim, Nam-Chul;Yun, Byoung-Ju;Kim, Sung-Ho
- Journal of Information Processing Systems
- /
- 제16권4호
- /
- pp.991-1000
- /
- 2020
We propose a content-based image retrieval system that uses a combination of completed local binary pattern (CLBP) and color autocorrelogram. CLBP features are extracted on a multi-resolution multi-direction filtered domain of value component. Color autocorrelogram features are extracted in two dimensions of hue and saturation components. Experiment results revealed that the proposed method yields a lot of improvement when compared with the methods that use partial features employed in the proposed method. It is also superior to the conventional CLBP, the color autocorrelogram using R, G, and B components, and the multichannel decoded local binary pattern which is one of the latest methods.
https://doi.org/10.3745/JIPS.02.0138 인용 PDF KSCI

유튜브 서머리 콘텐츠 특성과 콘텐츠 제공자 신뢰성이 이용자 몰입과 만족에 미치는 영향 (The Effects of YouTube Summary Contents Features and Contents Provider Credibility on Users' Flow and Satisfaction)

정유진;이남정;이정훈
- 한국융합학회논문지
- /
- 제12권2호
- /
- pp.35-44
- /
- 2021
그간의 선행연구는 짧은 동영상, 숏폼 콘텐츠, 스낵 컬처 등에 한정되었으며, 원본 콘텐츠를 압축·요약한 형태의 서머리 콘텐츠에 대한 연구는 드물었다. 따라서 본 연구는 유튜브 서머리 콘텐츠 이용자의 몰입과 만족을 불러오는 유튜브 서머리 콘텐츠 특성과 콘텐츠 제공자 신뢰성의 요인 분석을 통해, 궁극적으로 이용자 만족도 제고 방안을 탐구하여 서머리 콘텐츠 시장의 활성화에 기여하고자 하였다. 이에 유튜브 서머리 콘텐츠를 이용해 본 경험이 있는 202명을 대상으로 설문조사를 실시하였고, 분석 결과 모든 몰입 세부속성에 유의한 영향을 주는 요인은 오락성인 것으로 나타났으며, 완전성과 독창성은 일부 몰입 속성에만 유의한 영향을 주는 것으로 나타났다. 그리고 간결성과 정보성은 유의한 영향을 주지 않은 것으로 나타났다. 본 연구는 유튜브 서머리 콘텐츠의 특성을 정의하고, 방향성을 제시하였다는 점에서 의의를 가진다.
https://doi.org/10.15207/JKCS.2021.12.2.035 인용 PDF KSCI

검색결과 1,180건 처리시간 0.022초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)