• Title/Summary/Keyword: hierarchical clustering method (계층적 군집법)

A Study on Research Topics for Thyroid Cancer in Korea (국내 갑상선암 연구 주제 동향 분석)

  • Yang, Ji-Yeon;Shin, Seung-Hyeok;Heo, Seong-Min;Lee, Tae-Gyeong
    • Proceedings of the Korean Society of Computer Information Conference / 2019.01a / pp.409-410 / 2019
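  • This paper proposes a text-centered approach to identifying research trends on thyroid cancer in Korea. The incidence of thyroid cancer in Korea surged in the 2000s and sparked controversy over overdiagnosis, but self-corrective efforts across various fields have greatly reduced the number of surgical patients. In this study, text mining techniques were used to collect and analyze the keywords and abstracts of thyroid cancer papers registered in DBpia. In the 1980s most papers were case reports, and in the 1990s content on early diagnosis through screening appeared frequently. In the 2000s, discussion of examination methods using various equipment and of the detection of very small cancers increased. In the 2010s, many studies addressed patients' quality of life. Research topics on thyroid cancer have changed markedly over the past several decades, and these results are expected to serve as basic data for future research.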

GC-Tree: A Hierarchical Index Structure for Image Databases (GC-트리 : 이미지 데이타베이스를 위한 계층 색인 구조)

  • 차광호
    • Journal of KIISE:Databases / v.31 no.1 / pp.13-22 / 2004
  • With the proliferation of multimedia data, there is an increasing need to support the indexing and retrieval of high-dimensional image data. Although there have been many efforts, the performance of existing multidimensional indexing methods is not satisfactory in high dimensions. Thus, dimensionality reduction and approximate solution methods were tried to deal with the so-called curse of dimensionality. But these methods are inevitably accompanied by a loss of precision in query results. Therefore, vector approximation-based methods such as the VA-file and the LPC-file were recently developed to preserve the precision of query results. However, the performance of the vector approximation-based methods depends largely on the size of the approximation file, and they lose the advantage of multidimensional indexing methods, which prune much of the search space. In this paper, we propose a new index structure called the GC-tree for efficient similarity search in image databases. The GC-tree is based on a special subspace partitioning strategy that is optimized for clustered high-dimensional images. It adaptively partitions the data space based on a density function and dynamically constructs an index structure. The resultant index structure adapts well to the strongly clustered distribution of high-dimensional images.

Catchment Similarity Assessment Based on Catchment Characteristics of GIS in Geum River Catchments, Korea (금강 유역을 대상으로 한 GIS 기반의 유역의 유사성 평가)

  • Lee, Hyo Sang;Park, Ki Soon;Jung, Sung Heuk;Choi, Seuk Keun
    • Journal of Korean Society for Geospatial Information Science / v.21 no.3 / pp.37-46 / 2013
  • A similarity measure for catchments is essential for regionalization studies, which provide in-depth analysis of hydrological response and flood estimation at ungauged catchments. However, such a measure is often biased toward the selected catchments and is not clearly explained in a hydrological sense. This study applied a hydrological similarity distance measure from the Flood Estimation Handbook to 25 Geum River catchments in Korea. Three catchment characteristics, area (A), annual precipitation (SAAR), and SCS curve number (CN), are used in Euclidean distance measures. Furthermore, six indices of the flow duration curve (FDC) are applied in a clustering analysis in SPSS. The hydrological similarity measures suggest three groups of catchments (H1, H2 and H3), and four catchments are not grouped in this study. The clustering analysis of the FDC yields four groups: F1, F2, F3 and F4. Six of the seven H1 catchments are grouped in F1, while Sangyeogyo is grouped in F2. Four of the six H2 catchments are also grouped in F2, while Cheongju and Guryong are grouped in F1. The H3 catchments are categorized in F1. The authors compare the results (H1, H2 and H3) of the similarity measure based on catchment physical descriptors with the results (F1 and F2) of clustering based on catchment hydrological response. The results of the hydrological similarity measures are supported by the clustering analysis of the FDC. This study shows the potential of hydrological catchment similarity measures in Korea.
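
A Euclidean distance over catchment descriptors, as mentioned in the abstract above, can be sketched roughly as follows. This is only an illustration: the catchment names and descriptor values are hypothetical, and scaling each descriptor by its standard deviation is an assumption, not necessarily the weighting used in the study or in the Flood Estimation Handbook.

```python
import math
from statistics import pstdev

# Hypothetical descriptors per catchment: (area in km^2, SAAR in mm, SCS curve number).
# These values are illustrative only and are not data from the study.
catchments = {
    "C1": (120.0, 1250.0, 72.0),
    "C2": (135.0, 1300.0, 70.0),
    "C3": (900.0, 1100.0, 85.0),
}

# Assumption: scale each descriptor by its standard deviation across catchments
# so that no single unit (km^2, mm, CN) dominates the distance.
scales = [pstdev(column) for column in zip(*catchments.values())]


def similarity_distance(a, b):
    """Euclidean distance between two descriptor tuples after scaling."""
    return math.sqrt(sum(((x - y) / s) ** 2 for x, y, s in zip(a, b, scales)))


def most_similar(target):
    """Return the other catchments ordered from most to least similar to target."""
    others = [name for name in catchments if name != target]
    return sorted(
        others,
        key=lambda name: similarity_distance(catchments[target], catchments[name]),
    )
```

Calling `most_similar("C1")` ranks the remaining catchments by distance, which is the kind of ordering a pooling-group or regionalization step would start from.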

Relationship between Diurnal Patterns of Transit Ridership and Land Use in the Metropolitan Seoul Area (서울 대도시권 하루 시간대별 지하철 통행흐름 패턴과 토지이용과의 관계)

  • Lee, Keum-Sook;Song, Ye-Na;Park, Jong-Soo;Anderson, William P.
    • Journal of the Economic Geographical Society of Korea / v.15 no.1 / pp.26-41 / 2012
  • This study investigates the time-space characteristics of intra-urban passenger flows in the Metropolitan Seoul area. In particular, we analyze the relationships between transit ridership and land use through the use of the subway passenger flow data obtained from the transit transaction databases. For this purpose, the strength of each subway station, i.e., the number of total in-coming and out-going passengers at each station, in the morning, afternoon, and evening, is calculated and visualized, which reflects urban land use patterns. Then the subway stations are classified into four groups via a hierarchical analysis of the in-coming and out-going passenger flows at 353 stations. Each group appears to have characteristic properties according to the region, e.g., residential areas and central business districts. This has been confirmed by the analysis which probes explicitly the relationship between the local socio-economic variables and station groups. This analysis, disclosing the inter-relationship between the subway network and urban land use, may be useful at various stages in urban as well as transportation planning, and provides analytical tools for a wide spectrum of applications ranging from impact evaluation to decision-making and planning support.

Analyzing the Co-occurrence of Endangered Brackish-Water Snails with Other Species in Ecosystems Using Association Rule Learning and Clustering Analysis (연관 규칙 학습과 군집분석을 활용한 멸종위기 기수갈고둥과 생태계 내 종 간 연관성 분석)

  • Sung-Ho Lim;Yuno Do
    • Korean Journal of Ecology and Environment / v.57 no.2 / pp.83-91 / 2024
  • This study utilizes association rule learning and clustering analysis to explore the co-occurrence and relationships within ecosystems, focusing on the endangered brackish-water snail Clithon retropictum, classified as Class II endangered wildlife in Korea. The goal is to analyze co-occurrence patterns between brackish-water snails and other species to better understand their roles within the ecosystem. By examining co-occurrence patterns and relationships among species in large datasets, association rule learning aids in identifying significant relationships. Meanwhile, K-means and hierarchical clustering analyses are employed to assess ecological similarities and differences among species, facilitating their classification based on ecological characteristics. The findings reveal a significant level of relationship and co-occurrence between brackish-water snails and other species. This research underscores the importance of understanding these relationships for the conservation of endangered species like C. retropictum and for developing effective ecosystem management strategies. By emphasizing the role of a data-driven approach, this study contributes to advancing our knowledge on biodiversity conservation and ecosystem health, proposing new directions for future research in ecosystem management and conservation strategies.
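
For readers unfamiliar with association rule learning, the standard support, confidence, and lift metrics for a co-occurrence rule can be sketched as below. The site records and taxon labels (`sp_a`, `sp_b`, `sp_c`) are hypothetical placeholders, not data or species from the study, and the abstract does not state which metrics or thresholds the authors used.

```python
# Hypothetical survey sites, each listing the taxa recorded there (illustrative only).
sites = [
    {"sp_a", "sp_b", "sp_c"},
    {"sp_a", "sp_b"},
    {"sp_a", "sp_c"},
    {"sp_b", "sp_c"},
    {"sp_c"},
]


def support(itemset):
    """Fraction of sites where every taxon in itemset was recorded."""
    return sum(itemset <= site for site in sites) / len(sites)


def rule_metrics(antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent."""
    joint = support(antecedent | consequent)
    confidence = joint / support(antecedent)
    lift = confidence / support(consequent)
    return joint, confidence, lift


rule_support, rule_confidence, rule_lift = rule_metrics({"sp_a"}, {"sp_b"})
```

A lift above 1 indicates that the two taxa co-occur more often than expected if they were independent.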

Analysis of Fish Community according to Habitat in the Woraksan National Park, Korea (월악산국립공원의 서식지에 따른 어류군집 분석)

  • Park, Seung-Chul
    • Korean Journal of Environment and Ecology / v.35 no.5 / pp.490-502 / 2021
  • This study analyzed the current status of the fish fauna and the characteristics of the fish community by habitat in Woraksan National Park, Korea. Spatially balanced sampling selected 20 stations from the major streams of the park, and three surveys were conducted in each season. The habitats were mostly mountain streams (Aa) with large stones and gravel scattered over the streambed. The average altitude of the habitats was 304.4 m, the average water depth was 40.3 cm (less than 1 m in most cases), and the stream order ranged from 3rd to 5th. A principal component analysis of the physical environmental factors by habitat showed that the substrate properties differed according to altitude. The survey identified a total of 2,183 individuals in 16 species belonging to 7 families. The dominant species was Zacco koreanus (86.2%), and the subdominant species was Rhynchocypris oxycephalus (3.8%). Pseudopungtungia tenuicorpa, classified as Class II endangered wildlife, was the first legally protected endangered species found in this survey. Analysis of the rank abundance curve model of the fish community showed the Zipf model at 9 of the 20 stations, the Lognormal model at 3, and the Preemption model at 4. The remaining 4 habitats contained only one species and were not analyzed. A canonical correspondence analysis of the 20 stations and fish species was performed to understand the characteristics of the fish community according to environmental factors. The fish communities were divided according to differences in habitat environment by altitude.

Unsupervised Image Classification through Multisensor Fusion using Fuzzy Class Vector (퍼지 클래스 벡터를 이용하는 다중센서 융합에 의한 무감독 영상분류)

  • 이상훈
    • Korean Journal of Remote Sensing / v.19 no.4 / pp.329-339 / 2003
  • This study proposes a decision-level image fusion approach for unsupervised image classification using images acquired from multiple sensors with different characteristics. For each sensor separately, the proposed method applies an unsupervised image classification scheme based on spatial region-growing segmentation, which makes use of hierarchical clustering, and iteratively computes the maximum likelihood estimates of fuzzy class vectors for the segmented regions with the EM (expectation-maximization) algorithm. The fuzzy class vector is an indicator vector whose elements represent the probabilities that the region belongs to each of the existing classes. The method then combines the classification results of each sensor using the fuzzy class vectors. This approach does not require spatial coregistration between the images of different sensors to be as precise as pixel-level image fusion does. The proposed method was applied to multispectral SPOT and AIRSAR data observed over the north-eastern area of Jeollabuk-do, and the experimental results show that it provides more correct information for the classification than the scheme using an augmented vector technique, which is the most conventional approach to pixel-level image fusion.

An Analysis of The Technological Regime by an Integrated Taxonomy of Region-Industry: Focusing on the Manufacturing Sector of the 2016 Korean Innovation Survey (지역-산업 통합분류법에 의한 국내 기술체제 분석: 2016년 한국기업혁신조사 제조업 부문을 중심으로)

  • Jaepil Han
    • Journal of the Economic Geographical Society of Korea / v.26 no.1 / pp.1-22 / 2023
  • This study proposes an integrated use of region and industry as a way to classify firms' innovation activities by type. Existing studies have determined innovative activities according to the components of the technological regimes and aggregated them by industry classification, but this method cannot fully reflect the heterogeneity within industries in an increasingly sophisticated innovation environment. Therefore, this study divides firms by region and industry and conducts a cluster analysis on the proportion of innovative activities by the components of the technological regimes to derive a total of four innovation types. Using the 2016 Korean Innovation Survey to classify innovation types in the manufacturing industry, we found that innovation activities are concentrated in Seoul, Busan, Incheon, and the Chungnam/Sejong/Daejeon area, with different deviations by region and industry. The aggregation of industrial innovation activities, weighted by corporate activity by region, shows that the level of innovation activity is high in some manufacturing industries, such as petrochemicals and the manufacturing of medical, precision and optical instruments, watches and clocks, but is generally low in other sectors within the manufacturing industry.

Analysis of Food Resources of 20 Endangered Fishes in Freshwater Ecosystems of South Korea using Non-metric Multidimensional Scaling and Network Analysis (비메트릭 다변량 척도법과 네트워크 분석을 통한 멸종위기 국내 담수어류 20종의 먹이원 분석)

  • Ji, Chang Woo;Lee, Dae-Seong;Lee, Da-Yeong;Park, Young-Seuk;Kwak, Ihn-Sil
    • Korean Journal of Ecology and Environment / v.54 no.2 / pp.130-141 / 2021
  • By reviewing previous literature, we analyzed the food sources of 20 out of 29 endangered fish species from freshwater ecosystems in South Korea. A total of 19 studies reported that the food sources of the 20 endangered fish species included 20 phyla, 31 classes, 58 orders, 116 families, and 154 genera. Arthropoda, Insecta, Diptera, and Chironomidae were the most frequently consumed animal food sources at the phylum, class, order, and family levels, respectively. Similarly, Bacillariophyta, Bacillariophyceae, Naviculales, and Cymbellaceae were the most frequently consumed plant food sources. More fish species relied on animal food sources than on plant food sources: 18 of the endangered fish preyed on arthropods, whereas only 6 species consumed Bacillariophyta. To characterize the feeding groups of the 20 fish species, a hierarchical clustering analysis and a non-metric multidimensional scaling analysis were conducted. The fish species were divided into two groups: 1) insectivores and 2) planktivores. A network analysis linking the endangered fishes to their food sources also revealed the same two groups. Based on the network analysis, the highest hub scores among food sources were for macroinvertebrates, including Diptera (0.47), Ephemeroptera (0.42), and Trichoptera (0.38). Niche breadth was used to calculate the diversity of the food sources. Phoxinus phoxinus (0.57) showed the highest food source diversity among the fish species, whereas Iksookimia pacifica (0.01) showed the lowest. This study will be utilized for the conservation and restoration of the endangered fish species.
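
The way a hierarchical clustering step can separate species into feeding groups may be sketched as follows. The abstract does not state the distance measure or linkage used, so the Jaccard distance and average linkage here are assumptions, and the fish labels and diet sets are hypothetical rather than taken from the study.

```python
from itertools import combinations

# Hypothetical presence/absence diets per fish (illustrative only, not study data).
diets = {
    "fish_a": {"diptera", "ephemeroptera", "trichoptera"},
    "fish_b": {"diptera", "ephemeroptera"},
    "fish_c": {"diptera", "trichoptera"},
    "fish_d": {"bacillariophyta", "cladocera"},
    "fish_e": {"bacillariophyta", "cladocera", "copepoda"},
}


def jaccard_distance(a, b):
    """1 minus the share of food items common to both diets."""
    return 1.0 - len(a & b) / len(a | b)


def average_linkage(c1, c2):
    """Mean pairwise distance between the members of two clusters."""
    pairs = [(i, j) for i in c1 for j in c2]
    return sum(jaccard_distance(diets[i], diets[j]) for i, j in pairs) / len(pairs)


def agglomerate(names, n_clusters):
    """Merge the two closest clusters repeatedly until n_clusters remain."""
    clusters = [frozenset([name]) for name in names]
    while len(clusters) > n_clusters:
        c1, c2 = min(combinations(clusters, 2), key=lambda p: average_linkage(*p))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
    return clusters


groups = agglomerate(sorted(diets), 2)
```

With these toy diets, cutting the hierarchy at two clusters separates the insect feeders from the plankton feeders, mirroring the two-group outcome the abstract describes.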

Relationships between Community Unit and Environment Factor in Forest Vegetation of Mt. Dutasan, Pyeongchang-gun (평창 두타산 산림식생의 군집유형과 입지환경요인의 상관관계)

  • Lee, Jeong Eun;Shin, Jae Kwon;Kim, Dong Gap;Yun, Chung Weon
    • Journal of Korean Society of Forest Science / v.106 no.3 / pp.275-287 / 2017
  • The purpose of this study was to classify forest vegetation types and analyze the relationships between the types and environmental factors on Mt. Dutasan. Data were collected from a total of forty-six plots using the Z-M phytosociological method from June to October 2016, and were analyzed for vegetation classification, canopy layer structure, and relationships between vegetation units and environmental factors using coincidence methods. In the vegetation type classification, the Quercus mongolica community group was classified at the top level of the vegetation hierarchy and was divided into the Rhododendron schlippenbachii community and the Betula costata community. The R. schlippenbachii community was divided into the Lychnis cognata group and the R. schlippenbachii typical group. The L. cognata group was subdivided into the Veratrum oxysepalum subgroup and the L. cognata typical subgroup. The B. costata community was divided into the Fraxinus mandshurica group and the Betula schmidtii group. The F. mandshurica group was subdivided into the Weigela subsessilis subgroup and the Cimicifuga heracleifolia subgroup. The forest vegetation was therefore composed of six vegetation units, with two kinds of bisected species groups and fourteen species groups. The analysis of canopy layer structure showed two kinds of structures: monotonous structures in the V. oxysepalum subgroup (vegetation unit 1), the L. cognata typical subgroup (vegetation unit 2), and the W. subsessilis subgroup (vegetation unit 4), and complicated structures in the R. schlippenbachii typical group (vegetation unit 3), the C. heracleifolia subgroup (vegetation unit 5), and the Betula schmidtii group (vegetation unit 6). The vertical layer structure of vegetation unit 5 was the most developed, and vegetation unit 6 had the lowest coverage in the herb layer. According to the correlation between vegetation units and environmental factors, the R. schlippenbachii community (vegetation units 1~3) and the B. costata community (vegetation units 4~6) were separated at an altitude of 1,100 m, middle slope, a slope of twenty degrees, twenty percent bare rock, and a DBH of thirty centimeters in the tree layer. The R. schlippenbachii community (vegetation units 1~3) showed a positive correlation with altitude and topography, and the B. costata community (vegetation units 4~6) showed a negative correlation tendency with them.