• 제목/요약/키워드: Similarity sampling

검색결과 122건 처리시간 0.028초

다변량기법을 활용한 용담호 수질측정지점 유사성 연구 (A Study on Measuring the Similarity Among Sampling Sites in Lake Yongdam with Water Quality Data Using Multivariate Techniques)

  • 이요상;권세혁
    • 환경영향평가
    • /
    • 제18권6호
    • /
    • pp.401-409
    • /
    • 2009
  • Multivariate statistical approaches to classify sampling sites with measuring their similarity by water quality data and understand the characteristics of classified clusters have been discussed for the optimal water quality monitering network. For empirical study, data of two years (2005, 2006) at the 9 sampling sites with the combination of 2 depth levels and 7 important variables related to water quality is collected in Yongdam reservoir. The similarity among sampling sites is measured with Euclidean distances of water quality related variables and they are classified by hierarchical clustering method. The clustered sites are discussed with principal component variables in the view of the geographical characteristics of them and reducing the number of measuring sites. Nine sampling sites are clustered as follows; One cluster of 5, 6, and 7 sampling sites shows the characteristic of low water depth and main stream of water. The sites of 2 and 4 are clustered into the same group by characteristics of hydraulics which come from that of main stream. But their changing pattern of water quality looks like different since the site of 2 is near to dam. The sampling sites of 3, 8, and 9 are individually positioned due to the different tributary.

저수지 수질조사 지점간 유사성 분석 (A Study on Measuring the Similarity Among Sampling Sites in Lake)

  • 이요상;고덕구;이현석
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2010년도 학술발표회
    • /
    • pp.957-961
    • /
    • 2010
  • Multivariate statistical approaches to classify sampling sites with measuring their similarity by water quality data. For empirical study, data of two years at the 9 sampling sites with the combination of 2 depth levels and 7 important variables related to water quality is collected in reservoir. The similarity among sampling sites is measured with Euclidean distances of water quality related variables and they are classified by hierarchical clustering method. The clustered sites are discussed with principal component variables in the view of the geographical characteristics of them and reducing the number of measuring sites. Nine sampling sites are clustered as follows; One cluster of 5, 6, and 7 sampling sites shows the characteristic of low water depth and main stream of water. The sites of 2 and 4 are clustered into the same group by characteristics of hydraulics which come from that of main stream. But their changing pattern of water quality looks like different since the site of 2 is near to dam. The sampling sites of 3, 8, and 9 are individually positioned due to the different tributary.

  • PDF

Top-${\kappa}$ 유사도 조인을 위한 샘플링 기반 알고리즘 (A Sampling-based Algorithm for Top-${\kappa}$ Similarity Joins)

  • 박종수
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제41권4호
    • /
    • pp.256-261
    • /
    • 2014
  • Top-${\kappa}$ 유사도 조인 문제는 두 개의 입력 레코드 집합들에서 유사도를 기준한 상위 ${\kappa}$ 개의 레코드 쌍을 찾는 것이다. 샘플링 기법을 이용하여 상위 ${\kappa}$ 개의 유사도 조인 쌍을 반환하는 효율적인 알고리즘을 제안한다. 입력 레코드들의 표본에서 집합 유사도 조인들의 히스토그램을 구성하고, 상위 ${\kappa}$ 개의 조인 쌍을 위한 추정 유사도 한계치를 통계 추론으로 95% 신뢰 구간의 오차 한계 내에서 계산한다. 상위 ${\kappa}$ 개의 유사도 조인을 얻기 위하여 최소-히프 구조를 사용하는 일반 유사도 조인 알고리즘에 이 추정 한계치를 적용한다. 대 용량의 실제 데이터집합에서의 실험결과는 제안된 알고리즘의 좋은 성능을 보여준다.

Comparison of terrestrial insect communities associated with the crabgrass (Digitaria ciliaris) community, Korea

  • Jeong Ho Hwang;Jong-Hak Yun
    • Journal of Ecology and Environment
    • /
    • 제47권4호
    • /
    • pp.250-260
    • /
    • 2023
  • Background: Crabgrass (Digitaria ciliaris, Poaceae) is a globally distributed weed, including in Afro-Eurasia, America, and Australia. As a highly gregarious plant, crabgrass is an important habitat for a diverse array of insects, and a potential habitat for agricultural pests. To compare the insect communities associated with the crabgrass community, insects were sampled using sweep sampling (100 sweeps per sample) at five sites, including Daejeon (Daejeon and Gap rivers), Anseong, Namhae, and Inje, with a focus on the Daejeon River. Results: A total of 5,888 individual insects belonging to eight orders, 42 families, and 115 species were collected from the five sites. Both the number of species and individuals of Hemiptera were the highest at all of the sites. In the present study, 73% of the insect population fed on D. ciliaris as a host plant. The dominant species in the D. ciliaris community was Laodelphax striatellus (Delphacidae), being ubiquitous at all the sites which showed a high abundance of rice pests in the communities and the suitability of D. ciliaris as an alternative host plant for them. The Shannon-Wiener diversity index was highest in Inje on 17 September (2.88), and the Chao1-bc diversity index was highest in the Gap River on 5 September (80). The sampling efficiency of 100 sweep samples (sample coverage) was calculated to be as high as 90%. The results of the samples taken from September to November in the Daejeon River showed that the number of species and individuals decreased gradually over time, and the number of dominant species decreased sharply between September and October. Similarity analysis indicated that sampling dates that were closer together yielded sampled assemblages with higher faunal similarity. In addition, in each sampling, the difference in the minimum temperature during the two-week period prior to sampling and faunal similarities were negatively correlated. Conclusions: This study provides foundational data that could enhance our understanding of insect diversity in D. ciliaris. The data can facilitate ecological conservation and management of Korean grasslands generally, as well as identification of potential pests that may disperse from D. ciliaris communities to nearby farmland.

Improvement of ASIFT for Object Matching Based on Optimized Random Sampling

  • Phan, Dung;Kim, Soo Hyung;Na, In Seop
    • International Journal of Contents
    • /
    • 제9권2호
    • /
    • pp.1-7
    • /
    • 2013
  • This paper proposes an efficient matching algorithm based on ASIFT (Affine Scale-Invariant Feature Transform) which is fully invariant to affine transformation. In our approach, we proposed a method of reducing similar measure matching cost and the number of outliers. First, we combined the Manhattan and Chessboard metrics replacing the Euclidean metric by a linear combination for measuring the similarity of keypoints. These two metrics are simple but really efficient. Using our method the computation time for matching step was saved and also the number of correct matches was increased. By applying an Optimized Random Sampling Algorithm (ORSA), we can remove most of the outlier matches to make the result meaningful. This method was experimented on various combinations of affine transform. The experimental result shows that our method is superior to SIFT and ASIFT.

과제 유사도 측정 개선모형에 관한 실증적 연구 (An Empirical Study on Improvement model for Measuring of Project Similarity)

  • 정옥남;류성열;김종배
    • 디지털콘텐츠학회 논문지
    • /
    • 제12권4호
    • /
    • pp.457-465
    • /
    • 2011
  • 지난 5년간 우리나라 R&D투자는 연평균 12.2%씩 증가하고 있다. 연구개발 중복 투자 방지와 독창성 도출을 위해서는 유사 중복과제 수행의 사전방지가 필요하고, 이를 위해 과제 유사도의 정확도를 개선할 필요가 있다. 본 논문에서는 유사 중복과제 수행의 사전방지를 위한 과제 유사도 측정 개선모형을 제안한다. 과제 유사도 측정 개선모형은 크게 두 단계로 정의된다. 먼저 추출단계에서 Document Vector를 기반으로 한 검색엔진에 연구보고서 초록을 추가한다. 다음은 분석단계에서 과제 키워드에서 복합 키워드 중심으로 생성한 과제의 연구주제망과 항목별 가중치를 활용하여 유사도를 측정한다. 실험결과 과제정보만을 활용한 기존방식보다 연구보고서 초록을 활용한 개선모형의 유사도가 평균 0.19이상 개선되었고, 단순키워드를 활용한 기존방식보다 복합 키워드 기반의 연구주제망과 항목별 가중치를 활용한 개선모형의 유사도가 평균 9.25이상 감소되었다. 연구보고서 초록이 유사도에 영향을 미치고 있고, 복합 키워드 기반의 연구주제망을 활용함으로써 유사도에 대한 정확도를 판단할 수 있는 범위가 확대되는 것을 확인하였다. 또한, 추가된 사항의 폭이 넓으면 넓을수록 유사도의 정확도가 높아지는 것과 과제정보 등 검색대상의 모집단이 클수록 과제 유사도의 정확도가 높아지는 것도 실험을 통해 확인하였다.

수질 관측지점 유사성 측정방법 연구 (Similarity of Sampling Sites by Water Quality)

  • 권세혁;이요상
    • Communications for Statistical Applications and Methods
    • /
    • 제17권1호
    • /
    • pp.39-45
    • /
    • 2010
  • 환경가치가 높아짐에 따라 하천 수질에 대한 관심의 증대로 수질측정망 연구가 최근 활발히 진행되고 있으나 입지환경의 지리적 특성이나 유입량, 유출량, 유량, 유속과 같은 하천 특성 중심 연구이다. 본 연구에서는 상대적으로 연구가 미미한 관측지점의 수질 유사성을 측정하는 방법으로 수질의 시계열 패턴을 고려할 수 있는 상관계수행렬 방법을 제안하고 기존의 주성분점수를 이용한 방법과 비교하였다. 용담댐에서 2년간 조사된 수질관련 데이터를 이용하여 두 방법에 대한 실증분석을 실시하여 관측지점의 지리적 특성에 의해 분류된 결과와 본 연구에서 제안된 방법에 의해 관측지점 유사성을 측정하여 얻은 군집결과가 더 일치함을 보였다.

Design of Solving Similarity Recognition for Cloth Products Based on Fuzzy Logic and Particle Swarm Optimization Algorithm

  • Chang, Bae-Muu
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권10호
    • /
    • pp.4987-5005
    • /
    • 2017
  • This paper introduces a new method to solve Similarity Recognition for Cloth Products, which is based on Fuzzy logic and Particle swarm optimization algorithm. For convenience, it is called the SRCPFP method hereafter. In this paper, the SRCPFP method combines Fuzzy Logic (FL) and Particle Swarm Optimization (PSO) algorithm to solve similarity recognition for cloth products. First, it establishes three features, length, thickness, and temperature resistance, respectively, for each cloth product. Subsequently, these three features are engaged to construct a Fuzzy Inference System (FIS) which can find out the similarity between a query cloth and each sampling cloth in the cloth database D. At the same time, the FIS integrated with the PSO algorithm can effectively search for near optimal parameters of membership functions in eight fuzzy rules of the FIS for the above similarities. Finally, experimental results represent that the SRCPFP method can realize a satisfying recognition performance and outperform other well-known methods for similarity recognition under considerations here.

시퀀스 요소 기반의 유사도를 이용한 시퀀스 데이터 클러스터링 (Mining Clusters of Sequence Data using Sequence Element-based Similarity Measure)

  • 오승준;김재련
    • 한국지능정보시스템학회:학술대회논문집
    • /
    • 한국지능정보시스템학회 2004년도 추계학술대회
    • /
    • pp.221-229
    • /
    • 2004
  • Recently, there has been enormous growth in the amount of commercial and scientific data, such as protein sequences, retail transactions, and web-logs. Such datasets consist of sequence data that have an inherent sequential nature. However, only a few of the existing clustering algorithms consider sequentiality. This study presents a method for clustering such sequence datasets. The similarity between sequences must be decided before clustering the sequences. This study proposes a new similarity measure to compute the similarity between two sequences using a sequence element. Two clustering algorithms using the proposed similarity measure are proposed: a hierarchical clustering algorithm and a scalable clustering algorithm that uses sampling and a k-nearest neighbor method. Using a splice dataset and synthetic datasets, we show that the quality of clusters generated by our proposed clustering algorithms is better than that of clusters produced by traditional clustering algorithms.

  • PDF

신경 회로망과 Log-Polar Sampling 기법을 사용한 항공기 영상의 연상 연식 (Neural-Network and Log-Polar Sampling Based Associative Pattern Recognizer for Aircraft Images)

  • 김종오;김인철;진성일
    • 전자공학회논문지B
    • /
    • 제28B권12호
    • /
    • pp.59-67
    • /
    • 1991
  • In this paper, we aimed to develop associative pattern recognizer based on neural network for aircraft identification. For obtaining invariant feature space description of an object regardless of its scale change and rotation, Log-polar sampling technique recently developed partly due to its similarity to the human visual system was introduced with Fourier transform post-processing. In addition to the recognition results, image recall was associatively performed and also used for the visualization of the recognition reliability. The multilayer perceptron model was learned by backpropagation algorithm.

  • PDF