• Title/Summary/Keyword: Line-Clustering

Search Result 206, Processing Time 0.023 seconds

A Code Clustering Technique for Unifying Method Full Path of Reusable Cloned Code Sets of a Product Family (제품군의 재사용 가능한 클론 코드의 메소드 경로 통일을 위한 코드 클러스터링 방법)

  • Kim, Taeyoung;Lee, Jihyun;Kim, Eunmi
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.1
    • /
    • pp.1-18
    • /
    • 2023
  • Similar software is often developed with the Clone-And-Own (CAO) approach that copies and modifies existing artifacts. The CAO approach is considered as a bad practice because it makes maintenance difficult as the number of cloned products increases. Software product line engineering is a methodology that can solve the issue of the CAO approach by developing a product family through systematic reuse. Migrating product families that have been developed with the CAO approach to the product line engineering begins with finding, integrating, and building them as reusable assets. However, cloning occurs at various levels from directories to code lines, and their structures can be changed. This makes it difficult to build product line code base simply by finding clones. Successful migration thus requires unifying the source code's file path, class name, and method signature. This paper proposes a clustering method that identifies a set of similar codes scattered across product variants and some of their method full paths are different, so path unification is necessary. In order to show the effectiveness of the proposed method, we conducted an experiment using the Apo Games product line, which has evolved with the CAO approach. As a result, the average precision of clustering performed without preprocessing was 0.91 and the number of identified common clusters was 0, whereas our method showed 0.98 and 15 respectively.

Elliptical Clustering with Incremental Growth and its Application to Skin Color Region Segmentation (점증적으로 증가하는 타원형 군집화 : 피부색 영역 검출에의 적용)

  • Lee Kyoung-Mi
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.9
    • /
    • pp.1161-1170
    • /
    • 2004
  • This paper proposes to segment skin color areas using a clustering algorithm. Most of previously proposed clustering algorithms have some difficulties, since they generally detect hyperspherical clusters, run in a batch mode, and predefine a number of clusters. In this paper, we use a well-known elliptical clustering algorithm, an EM algorithm, and modify it to learn on-line and find automatically the number of clusters, called to an EAM algorithm. The effectiveness of the EAM algorithm is demonstrated on a task of skin color region segmentation. Experimental results present the EAM algorithm automatically finds a right number of clusters in a given image without any information on the number. Comparing with the EM algorithm, we achieved better segmentation results with the EAM algorithm. Successful results were achieved to detect and segment skin color regions using a conditional probability on a region. Also, we applied to classify images with persons and got good classification results.

Intrusion detection algorithm based on clustering : Kernel-ART

  • Lee, Hansung;Younghee Im;Park, Jooyoung;Park, Daihee
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2002.05a
    • /
    • pp.109-113
    • /
    • 2002
  • In this paper, we propose a new intrusion detection algorithm based on clustering: Kernel-ART, which is composed of the on-line clustering algorithm, ART (adaptive resonance theory), combining with mercer-kernel and concept vector. Kernel-ART is not only satisfying all desirable characteristics in the context of clustering-based 105 but also alleviating drawbacks associated with the supervised learning IDS. It is able to detect various types of intrusions in real-time by means of generating clusters incrementally.

  • PDF

An Optimized Partner Searching System for B2B Marketplace Applying Clustering Techniques (군집화 기법을 이용한 B2B Marketplace상의 최적 파트너 검색 시스템)

  • Kim Shin-Young;Kim Soo-Young
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2003.05a
    • /
    • pp.572-579
    • /
    • 2003
  • With the expansion of e-commerce, E-marketplace has become one of the most discussed topics in recent years. Limited theoretical works, however, have been done to optimize the practical use of e-marketplace systems. Other potential issues aside, this research has focused on this problem: 'the participants waste too much time, effort and cost to find out their best partner in B2B marketplace.' To solve this problem, this paper proposes a system which provides the user-company with the automated and customized brokering service. The system proposed in this paper assesses the weight on the priorities of a user-company, runs the two-stage clustering algorithm with self-organizing map and K-means clustering technique. Subsequently, the system shows the clustering result and user guide-line. This system enables B2B marketplace to have more efficiency on transaction with smaller pool of partners to be searched.

  • PDF

The Alcock-Paczynski effect via clustering shells

  • Sabiu, Cristiano G.;Lee, Seokcheon;Park, Changbom
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.38 no.2
    • /
    • pp.58.2-58.2
    • /
    • 2013
  • Both peculiar velocities and errors in the assumed redshift-distance relation ("Alcock-Paczynski effect") generate correlations between clustering amplitude and orientation with respect to the line-of-sight. In this talk we propose a novel technique to extract the Alcock-Paczynski, geometric, distortion information from the anisotropic clustering of galaxies in 3-dimensional redshift space while minimizing non-linear clustering and peculiar velocity effects. We capitalize on the recent, large dataset from the Sloan Digital Sky Survey III (SDSS-III), which provides a large comoving sample of the universe out to high redshift. We focus our analysis on the Baryon Oscillation Spectroscopic Survey (BOSS) constant mass (CMASS) sample of 549,005 bright galaxies in the redshift range 0.43

  • PDF

Determination of Bar Code Cross-line Based on Block HOG Clustering (블록 HOG 군집화 기반의 1-D 바코드 크로스라인 결정)

  • Kim, Dong Wook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.7
    • /
    • pp.996-1003
    • /
    • 2022
  • In this paper, we present a new method for determining the scan line and range for vision-based 1-D barcode recognition. This is a study on how to detect valid barcode representative points and directions by applying the DBSCAN clustering method based on block HOG (histogram of gradient) and determine scan lines and barcode crosslines based on this. In this paper, the minimum and maximum search techniques were applied to determine the cross-line range of barcodes based on the obtained scan lines. This can be applied regardless of the barcode size. This technique enables barcode recognition even by detecting only a partial area of the barcode, and does not require rotation to read the code after detecting the barcode area. In addition, it is possible to detect barcodes of various sizes. Various experimental results are presented to evaluate the performance of the proposed technique in this paper.

A Search-Result Clustering Method based on Word Clustering for Effective Browsing of the Paper Retrieval Results (논문 검색 결과의 효과적인 브라우징을 위한 단어 군집화 기반의 결과 내 군집화 기법)

  • Bae, Kyoung-Man;Hwang, Jae-Won;Ko, Young-Joong;Kim, Jong-Hoon
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.3
    • /
    • pp.214-221
    • /
    • 2010
  • The search-results clustering problem is defined as the automatic and on-line grouping of similar documents in search results returned from a search engine. In this paper, we propose a new search-results clustering algorithm specialized for a paper search service. Our system consists of two algorithmic phases: Category Hierarchy Generation System (CHGS) and Paper Clustering System (PCS). In CHGS, we first build up the category hierarchy, called the Field Thesaurus, for each research field using an existing research category hierarchy (KOSEF's research category hierarchy) and the keyword expansion of the field thesaurus by a word clustering method using the K-means algorithm. Then, in PCS, the proposed algorithm determines the category of each paper using top-down and bottom-up methods. The proposed system can be used in the application areas for retrieval services in a specialized field such as a paper search service.

Cosmological constraints using BAO - From spectroscopic to photometric catalogues

  • Sridhar, Srivatsan
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.44 no.1
    • /
    • pp.56.2-56.2
    • /
    • 2019
  • Measurement of the location of the baryon acoustic oscillation (BAO) feature in the clustering of galaxies has proven to be a robust and precise method to measure the expansion of the Universe. The best constraints so far have been provided from spectroscopic surveys because the errors on the redshift obtained from spectroscopy are minimal. This in turn means that the errors along the line-of-sight are reduced and so one can expect constraints on both angular diameter distance $D_A$ and expansion rate $H^{-1}$. But, future surveys will probe a larger part of the sky and go to deeper redshifts, which correspond to more number of galaxies. Analysing each galaxy using spectroscopy, which is a time consuming task, will not be practically possible. So, photometry will be the most convenient way to measure redshifts for future surveys such as LSST, Euclid, etc. The advantage of photometry is measuring the redshift of vast number of galaxies in a single exposure, but the disadvantage are the errors associated with the measured redshifts. Using a wedge approach, wherein the clustering is split into different wedges along the line-of-sight ${\pi}$ and across the line-of-sight ${\sigma}$, we show that the BAO information can be recovered even for photometric catalogues with errors along the line-of-sight. This means that we can get cosmological distance constraints even if we don't have spectroscopic information.

  • PDF

Distance Measures in HMM Clustering for Large-scale On-line Chinese Character Recognition (대용량 온라인 한자 인식을 위한 클러스터링 거리계산 척도)

  • Kim, Kwang-Seob;Ha, Jin-Young
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.9
    • /
    • pp.683-690
    • /
    • 2009
  • One of the major problems that prevent us from building a good recognition system for large-scale on-line Chinese character recognition using HMMs is increasing recognition time. In this paper, we propose a clustering method to solve recognition speed problem and an efficient distance measure between HMMs. From the experiments, we got about twice the recognition speed and 95.37% 10-candidate recognition accuracy, which is only 0.9% decrease, for 20,902 Chinese characters defined in Unicode CJK unified ideographs.

Analysis of the Spatial Structure of Traditional Villages for Revitalization of the Community in Urban Villages (도시마을 커뮤니티 활성화를 위한 전통마을 공간 구조 특성 분석)

  • Moon, Ji-Won;Kim, Joo-Hyun;Ha, Jae-Myung
    • Journal of the Korean housing association
    • /
    • v.19 no.6
    • /
    • pp.85-93
    • /
    • 2008
  • This study analyzes areas, traffic lines and characteristics of block of traditional villages in order to suggest how to build urban village in the way that can solve problems occurring in residential areas these days. The study showed the following results: 1) Traditional villages have definite boundary and entrance, and the community area for the villages is close to the entrance to encourage community activities of villagers. 2) With an access in the form of a blind alley branched from the main road, traditional villages form a small-sized clustering and encourage community activities in a natural way. 3) Formed of block with a pattern of net, blind alley or standing in a line on both sides, traditional villages help residents to form close relations between. These findings suggest that for building desirable urban villages, 1) they should have definite boundary, 2) size and location of community area should be determined in the way to activate community activities of residents, 3) roads inside the village should have branched form rather than standardized check pattern so that small-sized clustering could be formed along the branched inner roads, and 4) clustering in villages should be arranged in a line on both sides or in the form of a blind alley giving consideration to the length and width of roads. The roads should be also of a closed type so that residents could create strong bonds with their neighbors.