• Title/Summary/Keyword: location-based clustering

Search Result 169, Processing Time 0.025 seconds

Vegetation Cover Type Mapping Over The Korean Peninsula Using Multitemporal AVHRR Data (시계열(時系列) AVHRR 위성자료(衛星資料)를 이용한 한반도 식생분포(植生分布) 구분(區分))

  • Lee, Kyu-Sung
    • Journal of Korean Society of Forest Science
    • /
    • v.83 no.4
    • /
    • pp.441-449
    • /
    • 1994
  • The two reflective channels(red and near infrared spectrum) of advanced very high resolution radiometer(AVHRR) data were used to classify primary vegetation cover types in the Korean Peninsula. From the NOAA-11 satellite data archive of 1991, 27 daytime scenes of relatively minimum cloud coverage were obtained. After the initial radiometric calibration, normalized difference vegetation index(NDVI) was calculated for each of the 27 data sets. Four or five daily NDVI data were then overlaid for each of the six months starting from February to November and the maximum value of NDVI was retained for every pixel location to make a monthly composite. The six bands of monthly NDVI composite were nearly cloud free and used for the computer classification of vegetation cover. Based on the temporal signatures of different vegetation cover types, which were generated by an unsupervised block clustering algorithm, every pixel was classified into one of the six cover type categories. The classification result was evaluated by both qualitative interpretation and quantitative comparison with existing forest statistics. Considering frequent data acquisition, low data cost and volume, and large area coverage, it is believed that AVHRR data are effective for vegetation cover type mapping at regional scale.

  • PDF

Spatial Pattern Analysis for Distribution of Migratory Insect Pests at Paddy Field in Jeolla-province (전라도 지역 논벼에서 비래해충 개체군 분포의 공간패턴분석)

  • Park, Taechul;Choe, Hojeong;Jeong, Hyoujin;Jang, Hojung;Kim, Kwang Ho;Park, Jung-Joon
    • Korean journal of applied entomology
    • /
    • v.57 no.4
    • /
    • pp.361-372
    • /
    • 2018
  • Migratory insect pest populations migrate from the southern China to Korea through jet streams. In Korea, 5 major migratory insect species are important, i.e. Nilaparvata lugens, Sogatella furcifera, Laodelphax striatellus, Cnaphalocrocis medinalis and Mythimma separate, which are damages to the major crops, rice. This study was conducted from late July 2016 to early September 2016 and from July 2017 to August 2017 in rice paddy of Jeolla-province. C. medinalis and M. separata collected using pheromone traps, while N. lugens, S. furcifera and L. striatellus collected using 3 methods (visual surveys, sweeping surveys, sticky traps). SADIE (Spatial Analysis by Distance IndicEs) among geostatistics was used to analyze migratory insect pests. SADIE was used to analyze spatial distribution and index of aggregation $I_a$, index of clustering $V_i$, $V_j$ were used to investigate the spatial distribution. Also, the clustering indices were mapped as red-blue plot. C. medinalis and M. separata showed different distribution based on SADIE spatial aggregation analysis and red-blue plot analysis. Initial spatial distributions of L. striatellus and other planthoppers were differed for sampling location and time.

Recommender Systems using Structural Hole and Collaborative Filtering (구조적 공백과 협업필터링을 이용한 추천시스템)

  • Kim, Mingun;Kim, Kyoung-Jae
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.107-120
    • /
    • 2014
  • This study proposes a novel recommender system using the structural hole analysis to reflect qualitative and emotional information in recommendation process. Although collaborative filtering (CF) is known as the most popular recommendation algorithm, it has some limitations including scalability and sparsity problems. The scalability problem arises when the volume of users and items become quite large. It means that CF cannot scale up due to large computation time for finding neighbors from the user-item matrix as the number of users and items increases in real-world e-commerce sites. Sparsity is a common problem of most recommender systems due to the fact that users generally evaluate only a small portion of the whole items. In addition, the cold-start problem is the special case of the sparsity problem when users or items newly added to the system with no ratings at all. When the user's preference evaluation data is sparse, two users or items are unlikely to have common ratings, and finally, CF will predict ratings using a very limited number of similar users. Moreover, it may produces biased recommendations because similarity weights may be estimated using only a small portion of rating data. In this study, we suggest a novel limitation of the conventional CF. The limitation is that CF does not consider qualitative and emotional information about users in the recommendation process because it only utilizes user's preference scores of the user-item matrix. To address this novel limitation, this study proposes cluster-indexing CF model with the structural hole analysis for recommendations. In general, the structural hole means a location which connects two separate actors without any redundant connections in the network. The actor who occupies the structural hole can easily access to non-redundant, various and fresh information. Therefore, the actor who occupies the structural hole may be a important person in the focal network and he or she may be the representative person in the focal subgroup in the network. Thus, his or her characteristics may represent the general characteristics of the users in the focal subgroup. In this sense, we can distinguish friends and strangers of the focal user utilizing the structural hole analysis. This study uses the structural hole analysis to select structural holes in subgroups as an initial seeds for a cluster analysis. First, we gather data about users' preference ratings for items and their social network information. For gathering research data, we develop a data collection system. Then, we perform structural hole analysis and find structural holes of social network. Next, we use these structural holes as cluster centroids for the clustering algorithm. Finally, this study makes recommendations using CF within user's cluster, and compare the recommendation performances of comparative models. For implementing experiments of the proposed model, we composite the experimental results from two experiments. The first experiment is the structural hole analysis. For the first one, this study employs a software package for the analysis of social network data - UCINET version 6. The second one is for performing modified clustering, and CF using the result of the cluster analysis. We develop an experimental system using VBA (Visual Basic for Application) of Microsoft Excel 2007 for the second one. This study designs to analyzing clustering based on a novel similarity measure - Pearson correlation between user preference rating vectors for the modified clustering experiment. In addition, this study uses 'all-but-one' approach for the CF experiment. In order to validate the effectiveness of our proposed model, we apply three comparative types of CF models to the same dataset. The experimental results show that the proposed model outperforms the other comparative models. In especial, the proposed model significantly performs better than two comparative modes with the cluster analysis from the statistical significance test. However, the difference between the proposed model and the naive model does not have statistical significance.

Enhanced ART1 Algorithm for the Recognition of Student Identification Cards of the Educational Matters Administration System on the Web (웹 환경 학사관리 시스템의 학생증 인식을 위한 개선된 ART1 알고리즘)

  • Park Hyun-Jung;Kim Kwang-Baek
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.5 s.37
    • /
    • pp.333-342
    • /
    • 2005
  • This paper proposes a method, which recognizes student's identification card by using image processing and recognition technology and can manage student information on the web. The presented scheme sets up an average brightness as a threshold, based on the brightest Pixel and the least bright one for the source image of the ID card. It is converting to binary image, applies a horizontal histogram, and extracts student number through its location. And, it removes the noise of the student number region by the mode smoothing with 3$\times$3 mask. After removing noise from the student number region, each number is extracted using vertical histogram and normalized. Using the enhanced ART1 algorithm recognized the extracted student number region. In this study, we propose the enhanced ART1 algorithm different from the conventional ART1 algorithm by the dynamical establishment of the vigilance parameter. which shows a tolerance limit of unbalance between voluntary and stored patterns for clustering. The Experiment results showed that the recognition rate of the proposed ART1 algorithm was improved much more than that of the conventional ART1 algorithm. So, we develop an educational matters administration system by using the proposed recognition method of the student's identification card.

  • PDF

Analysis of the taxi telematics history data based on a state diagram (상태도에 기반한 택시 텔레매틱스 히스토리 데이터 분석)

  • Lee, Jung-Hoon;Kwon, Sang-Cheol
    • Journal of Korea Spatial Information System Society
    • /
    • v.10 no.1
    • /
    • pp.41-49
    • /
    • 2008
  • This paper presents a data analysis method for the taxi telematics system which generates a greate deal of location history data. By the record consist of the basic GPS receiver-generated fields, device-added fields such as taxi operation status, and framework-attached fields such as matched link Identifier and position ratio in a link, each taxi can be represented by a state diagram. The transition and the state definition enable us to efficiently extract such information as pick-up time, pick-up distance, dispatch time, and dispatch distance. The analysis result can help to verify the efficiency of a specific taxi dispatch algorithm, while the analysis framework can invite a new challenging service including future traffic estimation, trajectory clustering, and so on.

  • PDF

Analysis of Pattern Change of Real Transaction Price of Apartment in Seoul (서울시 아파트 실거래가의 변화패턴 분석)

  • Kim, Jung Hee
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.22 no.1
    • /
    • pp.63-70
    • /
    • 2014
  • This study is to analyze impact of geography and timing on the real transactions prices of apartment complexes in Seoul using data provided by the Ministry of Land, Infrastructure and Transport. The average real transactions and location data of apartment complex was combined into the GIS data. First, the pattern of apartment real transaction price change by period and by area was analyzed by kriging, the one of the spatial interpolation technique. Second, to analyze the pattern of apartment market price change by administrative district(administrative 'Dong' unit), the average of market price per unit area was calculated and converted to Moran I value, which was used to analyze the clustering level of the real transaction price. Through the analysis, spatial-temporal distribution pattern can be found and the type of change can be forecasted. Therefore, this study can be referred as of the base data research for the housing or local policies. Also, the regional unbalanced apartment price can be presented by analyzing the vertical pattern of the change in the time series and the horizontal pattern of the change based on GIS.

Modeling and Simulation of Scheduling Medical Materials Using Graph Model for Complex Rescue

  • Lv, Ming;Zheng, Jingchen;Tong, Qingying;Chen, Jinhong;Liu, Haoting;Gao, Yun
    • Journal of Information Processing Systems
    • /
    • v.13 no.5
    • /
    • pp.1243-1258
    • /
    • 2017
  • A new medical materials scheduling system and its modeling method for the complex rescue are presented. Different from other similar system, first both the BeiDou Satellite Communication System (BSCS) and the Special Fiber-optic Communication Network (SFCN) are used to collect the rescue requirements and the location information of disaster areas. Then all these messages will be displayed in a special medical software terminal. After that the bipartite graph models are utilized to compute the optimal scheduling of medical materials. Finally, all these results will be transmitted back by the BSCS and the SFCN again to implement a fast guidance of medical rescue. The sole drug scheduling issue, the multiple drugs scheduling issue, and the backup-scheme selection issue are all utilized: the Kuhn-Munkres algorithm is used to realize the optimal matching of sole drug scheduling issue, the spectral clustering-based method is employed to calculate the optimal distribution of multiple drugs scheduling issue, and the similarity metric of neighboring matrix is utilized to realize the estimation of backup-scheme selection issue of medical materials. Many simulation analysis experiments and applications have proved the correctness of proposed technique and system.

Cluster exploration of water pipe leak and complaints surveillance using a spatio-temporal statistical analysis (스캔통계량 분석을 통한 상수도 누수 및 수질 민원 발생 클러스터 탐색)

  • Juwon Lee;Eunju Kim;Sookhyun Nam;Tae-Mun Hwang
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.37 no.5
    • /
    • pp.261-269
    • /
    • 2023
  • In light of recent social concerns related to issues such as water supply pipe deterioration leading to problems like leaks and degraded water quality, the significance of maintenance efforts to enhance water source quality and ensure a stable water supply has grown substantially. In this study, scan statistic was applied to analyze water quality complaints and water leakage accidents from 2015 to 2021 to present a reasonable method to identify areas requiring improvement in water management. SaTScan, a spatio-temporal statistical analysis program, and ArcGIS were used for spatial information analysis, and clusters with high relative risk (RR) were determined using the maximum log-likelihood ratio, relative risk, and Monte Carlo hypothesis test for I city, the target area. Specifically, in the case of water quality complaints, the analysis results were compared by distinguishing cases occurring before and after the onset of "red water." The period between 2015 and 2019 revealed that preceding the occurrence of red water, the leak cluster at location L2 posed a significantly higher risk (RR: 2.45) than other regions. As for water quality complaints, cluster C2 exhibited a notably elevated RR (RR: 2.21) and appeared concentrated in areas D and S, respectively. On the other hand, post-red water incidents of water quality complaints were predominantly concentrated in area S. The analysis found that the locations of complaint clusters were similar to those of red water incidents. Of these, cluster C7 exhibited a substantial RR of 4.58, signifying more than a twofold increase compared to pre-incident levels. A kernel density map analysis was performed using GIS to identify priority areas for waterworks management based on the central location of clusters and complaint cluster RR data.

Interpretation of Soil Catena for Agricultural Soils derived from Sedimentary Rocks (퇴적암 유래 농경지 토양에 대한 카테나 해석)

  • SONN, Yeon-Kyu;LEE, Dong-Sung;KIM, Keun-Tae;HYUN, Byung-Keun;JUN, Hye-Weon;JEON, Sang-Ho
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.20 no.4
    • /
    • pp.1-14
    • /
    • 2017
  • In Korea, the soil series derived from sedimentary rocks are classified into seven soil series of coarse loamy soil such as Dain, Danbug, Dongam, Imdong, Jeomgog, Maryeong, and Yonggog; seventeen soil series of fine loamy soil such as Angye, Anmi, Banho, Bigog, Deoggog, Dogye, Dojeon, Gamgog, Gugog, Jincheon, Maji, Mungyeong, Oggye, Samam, Yanggog, Yeongwol, and Yulgog; six soil series of fine silty soil such as Goryeong, Bonggog, Juggog, Gyeongsan, Yuga, and Yugog; and four soil series of clayey soil such as Mitan, Pyeongan, Pyeongjeon, and Uji. All thirty-four soil series have different drainage rates and topography. However, the soil texture depends on the parent rock. The buffer functions in GIS (Geographic Information System) techniques were used to calculate adjacent soil series from a soil series. The length of the adjacent soil series was adjusted because a side of the buffer area was one meter long. The cluster analysis was conducted using the CCC (Cubic Clustering Criterion) method, in which the number of clusters is calculated based on the individual soil series ratio. Soil survey has been carried out since 1964 as "The reconnaissance soil survey", and 1:5,000 detailed soil survey was completed in 1999 with a five-years plan in Korea. Today, all the soil survey information has been computerized. GIS techniques were used to establish a digital soil map; however, there have not been any studies to interpret pedogenesis using the GIS technique. In this study, the area of the adjacent soil series were obtained using the GIS technique. The area of the adjacent soil series can be calculated based on the information area. The similarities of soil originated from sedimentary rocks were estimated using the length. As a result, the distribution of grain size was different based on the types of sedimentary rocks and the location. The clusters were distinguished into limestone, sandstone, and shale. In addition, the soil derived from shale was divided into red shale and gray shale. This means that quantitative interpretation of the catena and this established method can be used to interpret the relationship between soil series.

Calculation of the Peak-hour Ratio at Urban Railway Stations Reflecting Passenger Demand Pattern and Land Use Inventory - A Case of Seoul - (승객 수요 패턴과 역세권의 토지이용 특성을 반영한 도시철도역 첨두시간 집중률 산정 - 서울시를 대상으로 -)

  • Jang, Sunghoon;Kim, Hyo-Seung;Lee, Chungwon;Kim, Dong-Kyu
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.33 no.4
    • /
    • pp.1581-1589
    • /
    • 2013
  • The aim of this study is to suggest a methodology for calculating the peak-hour ratio of passengers at urban railway stations by reflecting the characteristics of passenger demand patterns and the land use inventory of stations. To achieve this, urban railway stations in Seoul are divided into three groups by using factor analysis and cluster analysis. For each station group, we calculate five and four variables related to the passenger demand patterns and the land use inventory of stations, respectively, as well as the peak-hour ratios of passengers. Among these nine variables, average daily passengers and the location quotient (LQ) index for business services are selected as the classification criteria for station groups based on statistical tests. Using the two variables, a group allocation process is suggested to estimate the peak-hour ratio of passengers for a newly-constructed station. Evaluation results based on thirteen stations show that the proposed methodology produces lower errors than the currently-used guideline does. The results of this study contribute to establishing efficiently construction and operation plans for newly-constructed stations.