• Title/Summary/Keyword: Correlation clustering

Search Result 271, Processing Time 0.035 seconds

Characteristics of health lifestyle patterns by the quantification method (수량화 방법을 이용한 건강행태 유형의 특성에 관한 연구)

  • Lee, Soon-Young;Kim, Seon-Woo
    • Journal of Preventive Medicine and Public Health
    • /
    • v.31 no.1 s.60
    • /
    • pp.72-81
    • /
    • 1998
  • The purpose of this study was to investigate the relation between health behavior patterns and demographic, socio-economic characteristics, health status, health information in Korea. The quantification method through canonical correlation analysis was conducted to the data from Korea National Health Survey in 1995, which consisted of 5,805 persons. The health lifestyle patterns were quantified as good diet lifestyle, passive lifestyle to the negative direction and drinker lifestyle, smoker lifestyle, hedonic lifestyle and fitness lifestyle to the positive direction. The covariate were related to health lifestyle patterns in the order of sex, age, marital status, occupation, health information, economic status, level of physical labour and health status. Characteristics of male, age below 50, married, blue colored worker, no health information, low in economic status, heavy level of physical labour, and poor in health status were positively related to drinker lifestyle, smoker lifestyle, hedonic lifestyle, fitness lifestyle sequentially.

  • PDF

Intrusion Detection on IoT Services using Event Network Correlation (이벤트 네트워크 상관분석을 이용한 IoT 서비스에서의 침입탐지)

  • Park, Boseok;Kim, Sangwook
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.1
    • /
    • pp.24-30
    • /
    • 2020
  • As the number of internet-connected appliances and the variety of IoT services are rapidly increasing, it is hard to protect IT assets with traditional network security techniques. Most traditional network log analysis systems use rule based mechanisms to reduce the raw logs. But using predefined rules can't detect new attack patterns. So, there is a need for a mechanism to reduce congested raw logs and detect new attack patterns. This paper suggests enterprise security management for IoT services using graph and network measures. We model an event network based on a graph of interconnected logs between network devices and IoT gateways. And we suggest a network clustering algorithm that estimates the attack probability of log clusters and detects new attack patterns.

Forecasting of Short-term Wind Power Generation Based on SVR Using Characteristics of Wind Direction and Wind Speed (풍향과 풍속의 특징을 이용한 SVR기반 단기풍력발전량 예측)

  • Kim, Yeong-ju;Jeong, Min-a;Son, Nam-rye
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.5
    • /
    • pp.1085-1092
    • /
    • 2017
  • In this paper, we propose a wind forecasting method that reflects wind characteristics to improve the accuracy of wind power prediction. The proposed method consists of extracting wind characteristics and predicting power generation. The part that extracts the characteristics of the wind uses correlation analysis of power generation amount, wind direction and wind speed. Based on the correlation between the wind direction and the wind speed, the feature vector is extracted by clustering using the K-means method. In the prediction part, machine learning is performed using the SVR that generalizes the SVM so that an arbitrary real value can be predicted. Machine learning was compared with the proposed method which reflects the characteristics of wind and the conventional method which does not reflect wind characteristics. To verify the accuracy and feasibility of the proposed method, we used the data collected from three different locations of Jeju Island wind farm. Experimental results show that the error of the proposed method is better than that of general wind power generation.

Classifier Selection for Efficient Face Recognition (효과적인 얼굴 인식을 위한 인식기 선택)

  • Nam, MIl-Young;Rhee, Phill-Kyu
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.1
    • /
    • pp.453-456
    • /
    • 2005
  • In this paper, we propose method to improve recognition performance using the most effective algorithm selectively after clustering various face data, because recognition performance of each algorithm according to facial attribute is change. The proposed face recognition is divided into two steps. First step is the clustering integrated various data to be optimized in algorithm. Second is that classify input image by a similar cluster, select suitable algorithm and recognize the target. This thesis takes the first step towards the creation of a synthetic classifier fusiontesting environment. The effects of data correlation on three classifier fusion techniques were examined. We proposed fusion method for each recognition algorithm's result. This research explores how the degree of correlation in classification data affects the degree of accuracy in a fusion context.

  • PDF

A Study on Information Expansion of Neighboring Clusters for Creating Enhanced Indoor Movement Paths (향상된 실내 이동 경로 생성을 위한 인접 클러스터의 정보 확장에 관한 연구)

  • Yoon, Chang-Pyo;Hwang, Chi-Gon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.264-266
    • /
    • 2022
  • In order to apply the RNN model to the radio fingerprint-based indoor path generation technology, the data set must be continuous and sequential. However, Wi-Fi radio fingerprint data is not suitable as RNN data because continuity is not guaranteed as characteristic information about a specific location at the time of collection. Therefore, continuity information of sequential positions should be given. For this purpose, clustering is possible through classification of each region based on signal data. At this time, the continuity information between the clusters does not contain information on whether actual movement is possible due to the limitation of radio signals. Therefore, correlation information on whether movement between adjacent clusters is possible is required. In this paper, a deep learning network, a recurrent neural network (RNN) model, is used to predict the path of a moving object, and it reduces errors that may occur when predicting the path of an object by generating continuous location information for path generation in an indoor environment. We propose a method of giving correlation between clustering for generating an improved moving path that can avoid erroneous path prediction that cannot move on the predicted path.

  • PDF

A Study on the Relationship between Skill and Competition Score Factors of KLPGA Players Using Canonical Correlation Biplot and Cluster Analysis (정준상관 행렬도와 군집분석을 응용한 KLPGA 선수의 기술과 경기성적요인에 대한 연관성 분석)

  • Choi, Tae-Hoon;Choi, Yong-Seok
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.3
    • /
    • pp.429-439
    • /
    • 2008
  • Canonical correlation biplot is 2-dimensional plot for investigating the relationship between two sets of variables and the relationship between observations and variables in canonical correlation analysis graphically. In general, biplot is useful for giving a graphical description of the data. However, this general biplot and also canonical correlation biplot do not give some concise interpretations between variables and observations when the number of observations are large. Recently, for overcoming this problem, Choi and Kim (2008) suggested a method to interpret the biplot analysis by applying the K-means clustering analysis. Therefore, in this study, we will apply their method for investigating the relationship between skill and competition score factors of KLPGA players using canonical correlation biplot and cluster analysis.

Nonlinear System Modeling Using Genetic Algorithm and FCM-basd Fuzzy System (유전알고리즘과 FCM 기반 퍼지 시스템을 이용한 비선형 시스템 모델링)

  • 곽근창;이대종;유정웅;전명근
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.6
    • /
    • pp.491-499
    • /
    • 2001
  • In this paper, the scheme of an efficient fuzzy rule generation and fuzzy system construction using GA(genetic algorithm) and FCM(fuzzy c-means) clustering algorithm is proposed for TSK(Takagi-Sugeno-Kang) type fuzzy system. In the structure identification, input data is transformed by PCA(Principal Component Analysis) to reduce the correlation among input data components. And then, a set fuzzy rules are generated for a given criterion by FCM clustering algorithm . In the parameter identification premise parameters are optimally searched by GA. On the other hand, the consequent parameters are estimated by RLSE(Recursive Least Square Estimate) to reduce the search space. From this one can systematically obtain the valid number of fuzzy rules which shows satisfying performance for the given problem. Finally, we applied the proposed method to the Box-Jenkins data and rice taste data modeling problems and obtained a better performance than previous works.

  • PDF

Two-stage Sampling for Estimation of Prevalence of Bovine Tuberculosis (이단계표본추출을 이용한 소결핵병 유병률 추정)

  • Pak, Son-Il
    • Journal of Veterinary Clinics
    • /
    • v.28 no.4
    • /
    • pp.422-426
    • /
    • 2011
  • For a national survey in which wide geographic region or an entire country is targeted, multi-stage sampling approach is widely used to overcome the problem of simple random sampling, to consider both herd- and animallevel factors associated with disease occurrence, and to adjust clustering effect of disease in the population in the calculation of sample size. The aim of this study was to establish sample size for estimating bovine tuberculosis (TB) in Korea using stratified two-stage sampling design. The sample size was determined by taking into account the possible clustering of TB-infected animals on individual herds to increase the reliability of survey results. In this study, the country was stratified into nine provinces (administrative unit) and herd, the primary sampling unit, was considered as a cluster. For all analyses, design effect of 2, between-cluster prevalence of 50% to yield maximum sample size, and mean herd size of 65 were assumed due to lack of information available. Using a two-stage sampling scheme, the number of cattle sampled per herd was 65 cattle, regardless of confidence level, prevalence, and mean herd size examined. Number of clusters to be sampled at a 95% level of confidence was estimated to be 296, 74, 33, 19, 12, and 9 for desired precision of 0.01, 0.02, 0.03, 0.04, 0.05, and 0.06, respectively. Therefore, the total sample size with a 95% confidence level was 172,872, 43,218, 19,224, 10,818, 6,930, and 4,806 for desired precision ranging from 0.01 to 0.06. The sample size was increased with desired precision and design effect. In a situation where the number of cattle sampled per herd is fixed ranging from 5 to 40 with a 5-head interval, total sample size with a 95% confidence level was estimated to be 6,480, 10,080, 13,770, 17,280, 20.925, 24,570, 28,350, and 31,680, respectively. The percent increase in total sample size resulting from the use of intra-cluster correlation coefficient of 0.3 was 22.2, 32.1, 36.3, 39.6, 41.9, 42.9, 42,2, and 44.3%, respectively in comparison to the use of coefficient of 0.2.

Components Clustering for Modular Product Design Using Network Flow Model (네트워크 흐름 모델을 활용한 모듈러 제품 설계를 위한 컴포넌트 군집화)

  • Son, Jiyang;Yoo, Jaewook
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.7
    • /
    • pp.263-272
    • /
    • 2016
  • Modular product design has contributed to flexible product modification and development, production lead time reduction, and increasing product diversity. Modular product design aims to develop a product architecture that is composed of detachable modules. These modules are constructed by maximizing the similarity of components based on physical and functional interaction analysis among components. Accordingly, a systematic procedure for clustering the components, which is a main activity in modular product design, is proposed in this paper. The first phase in this procedure is to build a component-to-component correlation matrix by analyzing physical and functional interaction relations among the components. In the second phase, network flow modeling is applied to find clusters of components, maximizing their correlations. In the last phase, a network flow model formulated with linear programming is solved to find the clusters and to make them modular. Finally, the proposed procedure in this research and its application are illustrated with an example of modularization for a vacuum cleaner.

Semantic Network Analysis on the MIS Research Keywords: APJIS and MIS Quarterly 2005~2009

  • Lee, Sung-Joon;Choi, Jun-Ho;Kim, Hee-Woong
    • Asia pacific journal of information systems
    • /
    • v.20 no.4
    • /
    • pp.25-51
    • /
    • 2010
  • This study compares and contrasts the intellectual development of the MIS field in Korea from 2005 to 2009 to that of international trends by using a keyword co-occurrence network analysis of the two flagship journals: APJIS and MIS Quarterly. From 316 research articles in these two journals, 132 unique and most frequently co-occurred keywords were put into analysis. The results of structural equivalence show a mild correlation between APJIS and MIS Quarterly. The e-commerce, trust, and technology adoption are the high frequency keywords in both journals. In Korea e-learning, purchasing, and recommendation systems turn out to be important keywords while outsourcing, research method, quantitative method, design research, information theory, and empirical research are in average international journals. This connotes that the Korean scholarship tends to focus more on practically oriented topics, but the clustering and relational mapping of research topics in each journal show a mild level of overlap with distinctive orientations due to intrinsic disparities depending on the concerned journals' geographical scopes, namely domestic or global.