• 제목/요약/키워드: Nearest neighbor search

검색결과 120건 처리시간 0.022초

고차원 벡터 공간에서 k-최근접 검색에 관한 분석 (Analysis of k-Nearest Neighbor Search in High-Demensional Vector Spaces)

  • 최승락;곽태영;신봉근;이윤준;김명호
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 1998년도 가을 학술발표논문집 Vol.25 No.2 (1)
    • /
    • pp.191-193
    • /
    • 1998
  • 지금까지 제시된 최근접 질의 알고리즘은다소간의 cklms 있으나 기본적으로 질의 점과 MBR간의 최소거리에 기반한 분기와 한정 기법을 이용하고 있다. 그러나 차원이 증가함에 따라 질의 구와 겹치는 노드가 급속히 증가하기 때문에 최근접 질의 알고리즘의 성능은 매우 비효율적이다. 이러한 문제를 해결하기 위해서 MBR 간의 중첩을 줄이고 MBR 내에 가급적 많은 점을 포함할 수 있는 다양한 다차원 색인 구조가 제시도 되었다. 그러나 우리의 실험에 의하면 이러한 방법이 근본적인 해결책이 되지 못함을 알 수 있다. 고차원 백터 공간 모델이 가지는 문제로써 임의의 질의 점으로부터 모든 데이터 점들까지의 거리가 차원이 올라감에 따라 유사해지는 현상 때문에 비효율적인 성능이 나옴을 본 논문에서 지적한다.

A METHOD OF IMAGE DATA RETRIEVAL BASED ON SELF-ORGANIZING MAPS

  • Lee, Mal-Rey;Oh, Jong-Chul
    • Journal of applied mathematics & informatics
    • /
    • 제9권2호
    • /
    • pp.793-806
    • /
    • 2002
  • Feature-based similarity retrieval become an important research issue in image database systems. The features of image data are useful to discrimination of images. In this paper, we propose the highspeed k-Nearest Neighbor search algorithm based on Self-Organizing Maps. Self-Organizing Maps (SOM) provides a mapping from high dimensional feature vectors onto a two-dimensional space. The mapping preserves the topology of the feature vectors. The map is called topological feature map. A topological feature map preserves the mutual relations (similarity) in feature spaces of input data. and clusters mutually similar feature vectors in a neighboring nodes. Each node of the topological feature map holds a node vector and similar images that is closest to each node vector. In topological feature map, there are empty nodes in which no image is classified. We experiment on the performance of our algorithm using color feature vectors extracted from images. Promising results have been obtained in experiments.

응용문제에서 보로노이 다이어그램과 파워 다이어그램의 사용성 비교 (Using Voronoi Diagram and Power Diagram in Application Problems)

  • 김동욱
    • 산업경영시스템학회지
    • /
    • 제35권4호
    • /
    • pp.235-243
    • /
    • 2012
  • The Voronoi diagram of spheres and power diagram have been known as powerful tools to analyze spatial characteristics of weighted points, and these structures have variety range of applications including molecular spatial structure analysis, location based optimization, architectural design, etc. Due to the fact that both diagrams are based on different distance metrics, one has better usability than another depending on application problems. In this paper, we compare these diagrams in various situations from the user's viewpoint, and show the Voronoi diagram of spheres is more effective in the problems based on the Euclidean distance metric such as nearest neighbor search, path bottleneck locating, and internal void finding.

Shape Feature Extraction technique for Content-Based Image Retrieval in Multimedia Databases

  • Kim, Byung-Gon;Han, Joung-Woon;Lee, Jaeho;Haechull Lim
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 ITC-CSCC -2
    • /
    • pp.869-872
    • /
    • 2000
  • Although many content-based image retrieval systems using shape feature have tried to cover rotation-, position- and scale-invariance between images, there have been problems to cover three kinds of variance at the same time. In this paper, we introduce new approach to extract shape feature from image using MBR(Minimum Bounding Rectangle). The proposed method scans image for extracting MBR information and, based on MBR information, compute contour information that consists of 16 points. The extracted information is converted to specific values by normalization and rotation. The proposed method can cover three kinds of invariance at the same time. We implemented our method and carried out experiments. We constructed R*_tree indexing structure, perform k-nearest neighbor search from query image, and demonstrate the capability and usefulness of our method.

  • PDF

Voronoi 다이어그램을 이용한 고속 최근접 검색 기법

  • 권동섭;최원익;박명선;이석호
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 1999년도 가을 학술발표논문집 Vol.26 No.2 (1)
    • /
    • pp.3-5
    • /
    • 1999
  • 최근접 검색(nearest neighbor search)을 위해서 대부분의 기존 기법들은 데이터를 특정한 공간 인덱스 구조를 이용하여 인덱싱하고 이 인덱스를 이용하여 질의를 수행하는 방법을 사용하였다. 본 연구에서는 이러한 데이터 자체를 인덱싱하는 방법과는 달리 미리 최근접 질의의 결과가 되는 Vorononi 다이어그램을 생성해두고, 이를 통하여 최근접 검색을 수행하는 VGrid(Voronoi diagram-Grid) 기법을 제안한다. 이 방법은 미리 모든 데이터에 대한 Voronoi 다이어그램을 계산하고 그 결과를 격자(grid)를 이용하여 인덱싱한 다음 최근접 검색 질의가 주어지면 이 격자 인덱스를 이용하여 빠르게 결과를 찾아낸다. 이 방법을 이용하면 처음 인덱스를 생성할 때는 많은 계산 시간이 소모되지만, 일단 인덱스가 구성되고 나면 최근접 검색 질의 처리 시 디스크 접근 회수가 줄기 때문에 기존의 기법에 비해 빠르게 최근접 검색 질의를 수행할 수 있다.

  • PDF

최적화 기법을 통한 제주도권역 강우관측소의 고도별 분포특성 검토 (A study on the distribution characteristics of Jeju Island basin rain gauge by altitude through optimization technique)

  • 김태림;임혁진;김치영
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2023년도 학술발표회
    • /
    • pp.352-352
    • /
    • 2023
  • 본 연구에서는 제주도권역 강우관측소의 고도별 공간분포의 적정성을 평가하기 위한 방안으로 고도별 강우관측소의 최근린지수(Nearest Neighbor Index, NNI)를 산정하고 현재 강우관측소 공간분포의 적정성을 평가하였다. 또한, 제주도권역을 고도에 따라 등면적으로 구분하고, 고도마다 상이한 지형조건을 고려하기 위해 등면적으로 구분된 각 강우관측소의 최대 NNI를 최적화 기법의 하나인 화음탐색법(Harmony Search, HS)을 이용하여 산정하였다. 이와같이 현재 강우관측소설치위치를 기준으로 산정한 NNI와 HS를 이용하여 산정한 최대 NNI의 차이를 바탕으로 지형적인 특성을 고려한 제주도권역 강우관측소 분포를 비교·검토하였다. 그 결과 고도가 높아짐에 따라 강우관측소의 개수가 낮은 고도에 비해 상대적으로 적어 관측소 밀도가 작은 것으로 산정되었다. 향후 제주도권역 강우관측소의 지형적인 특성을 반영한다면 보다 효율적인 제주도권역 강우량관측이 가능할 것으로 판단된다.

  • PDF

Hyperparameter Tuning Based Machine Learning classifier for Breast Cancer Prediction

  • Md. Mijanur Rahman;Asikur Rahman Raju;Sumiea Akter Pinky;Swarnali Akter
    • International Journal of Computer Science & Network Security
    • /
    • 제24권2호
    • /
    • pp.196-202
    • /
    • 2024
  • Currently, the second most devastating form of cancer in people, particularly in women, is Breast Cancer (BC). In the healthcare industry, Machine Learning (ML) is commonly employed in fatal disease prediction. Due to breast cancer's favorable prognosis at an early stage, a model is created to utilize the Dataset on Wisconsin Diagnostic Breast Cancer (WDBC). Conversely, this model's overarching axiom is to compare the effectiveness of five well-known ML classifiers, including Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), K-Nearest Neighbor (KNN), and Naive Bayes (NB) with the conventional method. To counterbalance the effect with conventional methods, the overarching tactic we utilized was hyperparameter tuning utilizing the grid search method, which improved accuracy, secondary precision, third recall, and finally the F1 score. In this study hyperparameter tuning model, the rate of accuracy increased from 94.15% to 98.83% whereas the accuracy of the conventional method increased from 93.56% to 97.08%. According to this investigation, KNN outperformed all other classifiers in terms of accuracy, achieving a score of 98.83%. In conclusion, our study shows that KNN works well with the hyper-tuning method. These analyses show that this study prediction approach is useful in prognosticating women with breast cancer with a viable performance and more accurate findings when compared to the conventional approach.

시멘틱개념과 에지탐지 기반의 적응형 이미지 분류기법 (Adaptive Scene Classification based on Semantic Concepts and Edge Detection)

  • ;;김강석;강상길
    • 지능정보연구
    • /
    • 제15권2호
    • /
    • pp.1-13
    • /
    • 2009
  • 개념 기반 이미지풍경 분류 기법은 데이터베이스에 있는 대량의 이미지 를 카테고리별로 구분하는 많이 적용되는 응용분야이다. 풍경이 속하는 카테고리를 알면 데이터베이스에서 해변, 산, 숲, 필드와 같은 필요한 풍경사진을 찾고자 할 때 불필요한 이미지를 필터링하여 신속하고 정확하게 찾을 수 있다. 본 논문에서는 이미지 분류를 위한 시멘틱 모델링 기반의 적응 세그멘테이션 기법을 제안 한다. 잔디, 물, 하늘과 같은 시멘틱 개념에 따른 이미지를 서브구역으로 나누어 세그멘테이션을 한다. 세그멘테이션은 에지탐색을 이용하고 또한 K-Nearest(K-NN)를 이용하여 세그멘테이션을 한다. 세그멘테이션 과정에서 이미지의 복잡도에 따라 적응적으로 서브구역으로 나눈다. 실험에서는 Vosel과 schiele가 제안한 방법과의 비교를 통해서 정확도면에서 제안된 연구의 우수성을 보여준다.

  • PDF

Vantage Point Metric Index Improvement for Multimedia Databases

  • Chanpisey, Uch;Lee, Sang-Kon Samuel;Lee, In-Hong
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 2011년도 한국컴퓨터종합학술대회논문집 Vol.38 No.1(C)
    • /
    • pp.112-114
    • /
    • 2011
  • On multimedia databases, in order to realize the fast access method, indexing methods for the multidimension data space are used. However, since it is a premise to use the Euclid distance as the distance measure, this method lacks in flexibility. On the other hand, there are metric indexing methods which require only to satisfy distance axiom. Since metric indexing methods can also apply for distance measures other than the Euclid distance, these methods have high flexibility. This paper proposes an improved method of VP-tree which is one of the metric indexing methods. VP-tree follows the node which suits the search range from a route node at searching. And distances between a query and all objects linked from the leaf node which finally arrived are computed, and it investigates whether each object is contained in the search range. However, search speed will become slow if the number of distance calculations in a leaf node increases. Therefore, we paid attention to the candidates selection method using the triangular inequality in a leaf node. As the improved methods, we propose a method to use the nearest neighbor object point for the query as the datum point of the triangular inequality. It becomes possible to make the search range smaller and to cut down the number of times of distance calculation by these improved methods. From evaluation experiments using 10,000 image data, it was found that our proposed method could cut 5%~12% of search time of the traditional method.

문자열 검출을 위한 슬라브 영역 추정 (Slab Region Localization for Text Extraction using SIFT Features)

  • 최종현;최성후;윤종필;구근휘;김상우
    • 전기학회논문지
    • /
    • 제58권5호
    • /
    • pp.1025-1034
    • /
    • 2009
  • In steel making production line, steel slabs are given a unique identification number. This identification number, Slab management number(SMN), gives information about the use of the slab. Identification of SMN has been done by humans for several years, but this is expensive and not accurate and it has been a heavy burden on the workers. Consequently, to improve efficiency, automatic recognition system is desirable. Generally, a recognition system consists of text localization, text extraction, character segmentation, and character recognition. For exact SMN identification, all the stage of the recognition system must be successful. In particular, the text localization is great important stage and difficult to process. However, because of many text-like patterns in a complex background and high fuzziness between the slab and background, directly extracting text region is difficult to process. If the slab region including SMN can be detected precisely, text localization algorithm will be able to be developed on the more simple method and the processing time of the overall recognition system will be reduced. This paper describes about the slab region localization using SIFT(Scale Invariant Feature Transform) features in the image. First, SIFT algorithm is applied the captured background and slab image, then features of two images are matched by Nearest Neighbor(NN) algorithm. However, correct matching rate can be low when two images are matched. Thus, to remove incorrect match between the features of two images, geometric locations of the matched two feature points are used. Finally, search rectangle method is performed in correct matching features, and then the top boundary and side boundaries of the slab region are determined. For this processes, we can reduce search region for extraction of SMN from the slab image. Most cases, to extract text region, search region is heuristically fixed [1][2]. However, the proposed algorithm is more analytic than other algorithms, because the search region is not fixed and the slab region is searched in the whole image. Experimental results show that the proposed algorithm has a good performance.