• Title/Summary/Keyword: K-means cluster

Search Result 616, Processing Time 0.025 seconds

Keyword Network Analysis for Technology Forecasting (기술예측을 위한 특허 키워드 네트워크 분석)

  • Choi, Jin-Ho;Kim, Hee-Su;Im, Nam-Gyu
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.227-240
    • /
    • 2011
  • New concepts and ideas often result from extensive recombination of existing concepts or ideas. Both researchers and developers build on existing concepts and ideas in published papers or registered patents to develop new theories and technologies that in turn serve as a basis for further development. As the importance of patent increases, so does that of patent analysis. Patent analysis is largely divided into network-based and keyword-based analyses. The former lacks its ability to analyze information technology in details while the letter is unable to identify the relationship between such technologies. In order to overcome the limitations of network-based and keyword-based analyses, this study, which blends those two methods, suggests the keyword network based analysis methodology. In this study, we collected significant technology information in each patent that is related to Light Emitting Diode (LED) through text mining, built a keyword network, and then executed a community network analysis on the collected data. The results of analysis are as the following. First, the patent keyword network indicated very low density and exceptionally high clustering coefficient. Technically, density is obtained by dividing the number of ties in a network by the number of all possible ties. The value ranges between 0 and 1, with higher values indicating denser networks and lower values indicating sparser networks. In real-world networks, the density varies depending on the size of a network; increasing the size of a network generally leads to a decrease in the density. The clustering coefficient is a network-level measure that illustrates the tendency of nodes to cluster in densely interconnected modules. This measure is to show the small-world property in which a network can be highly clustered even though it has a small average distance between nodes in spite of the large number of nodes. Therefore, high density in patent keyword network means that nodes in the patent keyword network are connected sporadically, and high clustering coefficient shows that nodes in the network are closely connected one another. Second, the cumulative degree distribution of the patent keyword network, as any other knowledge network like citation network or collaboration network, followed a clear power-law distribution. A well-known mechanism of this pattern is the preferential attachment mechanism, whereby a node with more links is likely to attain further new links in the evolution of the corresponding network. Unlike general normal distributions, the power-law distribution does not have a representative scale. This means that one cannot pick a representative or an average because there is always a considerable probability of finding much larger values. Networks with power-law distributions are therefore often referred to as scale-free networks. The presence of heavy-tailed scale-free distribution represents the fundamental signature of an emergent collective behavior of the actors who contribute to forming the network. In our context, the more frequently a patent keyword is used, the more often it is selected by researchers and is associated with other keywords or concepts to constitute and convey new patents or technologies. The evidence of power-law distribution implies that the preferential attachment mechanism suggests the origin of heavy-tailed distributions in a wide range of growing patent keyword network. Third, we found that among keywords that flew into a particular field, the vast majority of keywords with new links join existing keywords in the associated community in forming the concept of a new patent. This finding resulted in the same outcomes for both the short-term period (4-year) and long-term period (10-year) analyses. Furthermore, using the keyword combination information that was derived from the methodology suggested by our study enables one to forecast which concepts combine to form a new patent dimension and refer to those concepts when developing a new patent.

A Study on the Satisfaction of Self-Employed (만족도를 이용한 자영업에 관한 연구)

  • Oh, Yu-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.281-296
    • /
    • 2009
  • This study examines the job and life satisfactions of the self-employed. It uses the Korean Labour and Income Panel Study(KLIPS, hereafter) data for 1998 and 2004. We examine the phases of satisfaction and what variables influence satisfaction for both years and compare the results in order to see what changed between the two regimes. We make use of k-means clustering to divide self-employed into similar degrees of satisfaction. As a result, we are able to classify the self-employed into three groups(low, medium and high) both for the two regimes. High groups consists of relatively younger, well-educated, low working dates, higher proportion of woman than other groups. As a result of regression analysis, we have some evidence that women are more satisfied than men for job satisfaction and that the existence of income is more important than the amount of income for life satisfaction. The age, education, satisfaction for working place, and health are significant to both satisfactions.

Lifestyles of Korean Older Adults - Focusing on the consumption pattern and its determinants - (한국노인의 생활양식 분석 : 소비패턴과 그 결정요인을 중심으로)

  • Lee, So-Chung
    • Korean Journal of Social Welfare Studies
    • /
    • v.40 no.3
    • /
    • pp.327-348
    • /
    • 2009
  • The purpose of this study is to understand the diverse lifestyles of Korean older adults by analyzing the consumption pattern of older households and its determinants. The 9th wave of the Korea Labor and Income Panel Study(KLIPS) data was used for analysis. The twenty consumption items provided by the dataset was reduced to thirteen according to the consumption purpose inherent in the item. K-means cluster analysis and multinomial logistic regression was employed to categorize the consumption pattern of older households and to analyze the determinants. The results are as follows. The consumption pattern of Korean older adults was clustered into six distinctive groups named Breadwinner, Leisure-time pursuer, Friendly outgoes, Daily-life survivor, Illness sufferer and Shelter seeker. Breadwinner, Leisure-time pursuer and Friendly outgoes were lifestyles that earn and spend more compared to the other three. Nevertheless, they differed according to the family size, indicating that the parenting burden might have direct influence on the lifestyle of Korean older adults. Older adults without parenting burden and with high level of education and economic capacity were likely to show Friendly outgoes lifestyle. On the other hand, Daily-life survivor, Illness sufferer and Shelter seeker showed lower level of spending, indicating that for those lacking in economic capacity, urgent needs such as medical need or housing need dominates the lifestyle. The results call for adequate custom policies that best fit the needs of older adults.

The Family Network Types and Life Satisfaction of the Rural Elderly (농촌노인의 가족관계망 유형과 생활만족도)

  • Lee, Hae-Ja;Park, Kyung-Ae
    • 한국노년학
    • /
    • v.29 no.1
    • /
    • pp.291-307
    • /
    • 2009
  • This study investigated the family network of elderly and its effects on the subjective life satisfaction in Rural Area. In order to classify the family network, the authors used the analysis technique of social network including to a spouse, children and grandchildren. In addition, the authors described basic characteristics of family network on the family type, interaction frequency, and interaction content. And then family network typified four types by K-means cluster analysis method according to characteristics of family network and examined difference on life satisfaction of the elderly persons according to the type of family network. The major results were as following. First, the elderly did contact his/her children often, emotional support revealed that highest support expectation of elderly. Second, The family network of elderly could be typified four types ; 'relation estranged type', 'children-grandchildren centered type', 'family dependent type', 'couples centered type' and statistically significant difference showed in life satisfaction according to each type. The result, in the 'couples centered type', the life satisfaction was highest; on the contrary, 'relation estranged type', it was lowest. Third, Influencing factors on life satisfaction of the old person were economic conditions, physical conditions, education level, sex, more frequent contacts with grandchild, emotional support expectation of spouses. The results of this study suggest that social welfare political and institutional efforts are needed to improve the relationship between older persons and their children, grandchildren and spouses and life satisfaction of the elderly.

The North Korean Female Refugees' Personality and Psychological Adaptation (여성 새터민의 성격유형에 따른 심리적응)

  • Young Mi Sohn;Sook Jung Kang;Cheong Yeul Park
    • Korean Journal of Culture and Social Issue
    • /
    • v.20 no.1
    • /
    • pp.19-44
    • /
    • 2014
  • This study was conducted to investigate the types of personality of North Korean female refugees, which were extracted from the T-scores of SPFQ(scales of the Sixteen Personality Factor Questionnaire) and psychological adaptation. For this, The data of 158 North Korean female refugees located in Seoul Yangchun-Gu and Gayang-Gu was analyzed. The results were as follows. Firstly, the ratio of over 65T in ego-strength, self-control, social-boldness, anxiety scales and under 34T in abstractedness and openness to change scales was higher than in other scales. Secondly, there were statistically significant differences in personality characteristics based on the demographic variables especially age and the term of residence in South Korea. Thirdly, three distinct groups were extracted from the K-means cluster analysis. The first group was characterized with emotional-unstability and negative emotionality. And the North Korean female refugees in the second group hesitated to enter into and maintain proper relationships with south korean, while they were unlikely to accept norms and rules in South Korea. The third group, characterized by higher emotional stability, ego-strength, and agreeableness, was met normal range in all the scale of SPFQ. Finally, each three groups were showed statistically significant differences in psychological adaptation scales(self-identity and resilience). We expected that these results contributed to explore the psychological and the political plans for North Korean female refugees' settlement in South Korea.

  • PDF

Effect of food-related lifestyle, and SNS use and recommended information utilization on dining out (혼밥 및 외식소비 관련 식생활라이프스타일과 SNS 이용 및 추천정보활용의 영향)

  • Jin A Jang
    • Journal of Nutrition and Health
    • /
    • v.56 no.5
    • /
    • pp.573-588
    • /
    • 2023
  • Purpose: This study aimed to examine social networking service (SNS) use and recommended information utilization (SURU) according to the food-related lifestyles (FRLs) of consumers and analyze how the interaction between the FRL and SURU affects the practice of eating alone and visiting restaurants. Methods: Data on 4,624 adults in their 20s to 50s were collected from the 2021 Consumer Behavior Survey for Food. Statistical methods included factor analysis, K-means cluster analysis, the complex samples general linear model, the complex samples Rao-Scott χ2 test, and the general linear model. Results: The following three factors were extracted from the FRL data: Convenience pursuit, rational consumption pursuit, and gastronomy pursuit, and the subjects were classified into three groups, namely the rational consumption, convenient gastronomy, and smart gourmet groups. An examination of the difference in SURU according to the FRL showed that the smart gourmet group had the highest score. The result of analyzing the effects of the FRL and SURU on eating alone revealed that both the main effect and the interaction effect were significant (p < 0.01, p < 0.001). The higher the SURU, the higher the frequency of eating alone in the convenience pursuit, and gastronomy pursuit groups. The main and interaction effects of the FRL and SURU on the frequency of eating out were also significant (p < 0.01, p < 0.001). In all the FRL groups, the higher the SURU level, the higher the frequency of visiting restaurants. Specifically, the two groups with convenience and gastronomic tendencies showed a steeper increase. Conclusion: This study provides important basic data for research on consumer behavior related to food SNS, market segmentation of restaurant consumers, and development of marketing strategies using SNS in the future.

A Study on the Blood-Letting Therapy in Elementary Questions (("황제내경소문(黃帝內經素問)" 중(中) 사혈(瀉血)에 관한 연구(硏究))

  • Lee, Jun-Geun
    • Journal of the Korean Institute of Oriental Medical Informatics
    • /
    • v.14 no.1
    • /
    • pp.19-42
    • /
    • 2008
  • Blood-Letting Therapy is a rational and ecological medical treatment by which we can heal most of the diseases by removing the static blood which precipitates in the blood vessel and blocks the flowing of blood. And the static blood is the generic term for the injurious, bad, dead and precipitated blood which is blocked the capillary vessel. The Yellow Emperor's Canon of Internal Medicine says that "the patient is treated with drugs internally and stone acupuncture externally. "In the old texts, the blood-letting therapy is mentioned as blood-letting, network vessel pricking, bloodletting, pricking, and arousing pulses etc and it is noted down as the method of network vessel pricking in 'On the Application of Needles' of Spiritual Pivot. Nine-pricking therapy, twelve-pricking therapy and five-pricking therapy are recorded in the methods of network vessel pricking and among them, the method of squeezing blood after pricking the affected part is explained as the network vessel pricking. There are four methods of network vessel pricking, pricking, picking, cluster needling and scatter-pricking and they are fluidly applied to the various symptoms of diseases. In 'On Discriminative Treating for Patients of Different Regions' in Elementary Questions, Ki-baek emphasizes "most of the local people, there are black in skin and loose in striate, and their diseases are mostly of carbuncle kind. It is suitable to treat the disease with stone therapy to prick with stone, so the stone therapy is transmitted from the east. "And in 'On the Corresponding Relation Between the Eum and Yang of Man and All Things' in Elementary Questions, when the Emperor asked Ki-Back, he answered "sthenia means the sthenia of evil, and deficiency means the deficiency of healthy energy. When the blood is sthenic, the evil should be discharged by pricking when out letting the blood; deficiency of vital energy is the asthenia of channels and network vessels, so the energy should drain from the channel which is not deficient, to replenish. "And in this case we can use the methods of 'Breaking out the static bloods', 'driving out the static bloods', blood-letting'. With this we can infer that the blood-letting therapy is made use of the important medical treatments from the ancient times. Especially in referring to the principles of treatment in The Yellow Emperor's Canon of Internal Medicine, it mostly alluded to acupuncture therapies and only eleven times to medicinal treatments. This is to verify that the blood-letting therapy formed the foundation of the medical art. In Dong's Therapy of Acupuncture-Moxibustion and Bloodletting, Dong Kyeong-Chang gave emphasis on the points that there must be extravasated bloods without exception in the serious illnesses which is old, unnatural, accompanying acute pains and so we can revive our body‘s sprit by circulating 'gi' and static blood piled up in the network vessel, regulating the weakness and strength, and controling the disharmony of the internal organs. The blood-letting therapy has effect on the orifice in emergency, such as fore draining, freeing network vessels, harmonizing gi and blood, relieving pain, dispersing swelling and concretion, sedation, resolving toxin as well as strengthening the heart, relieving itching. So it has distinguished effect on all kinds of medical treatment to the modern people. But by the change of social customs and the confucianism of confucius - it is widely spread on the period of North and South Dynasties, 'Wi' and 'Jin' in china and the period of the Three States in korea - The blood-letting therapy which was regarded as the most important medicinal treatment withered rapidly. And Confucius accentuated the importance of our body and all its members, loyalty and filial piety and banned any damage of our body under no circumstances. As a result of it, the therapy of blood-letting had a rapid decrease and barely kept itself in existence in both countries. What is worse, at the period of Japanese colonial rule of korea and our nation's founding of early stage, it has been withered by the high-handed policy to change Oriental Medicine into modern medical science. So the therapy of blood-letting barely kept itself in existence in some Buddhist temples. Another case, it has handed down as a old-fashioned quick fix in folk remedies. But all kinds of the contamination of heavy metals and the misuses of antibiotics are widely spread nowadays, which increased diseases of adult people and incurable diseases as modern society unavoidably made its way into a highly industrial society. To make us more miserable, the western medical science - the antibiotics and surgical operation medical science - already reveals itself into a limit. The necessity of a new medical science which can give a security to the patients who are suffering from the diseases of adult people and the incurable diseases is especially come into the force nowadays. In view of the results after bibliographically studying on the blood-letting Therapy in Elementary Questions of the Yellow Emperor's Canon of Internal Medicine, the blood-letting therapy has acted for the important Oriental medicinal science and has been clarified the prominent effects on the diseases of adult people and the incurable diseases. So it is regarded as an appropriate thing that we lay out a determined theory of the blood-letting therapy and of course prevent the unwanted side effects from inappropriate medicinal treatments, and make full use of clinic by elevating the curative value and that we win back our self-respect of medical treatment which is dominated from the western medical science and ultimately contribute to national medical welfare.

  • PDF

Characteristics of Groundwater Quality for Agricultural Irrigation in Plastic Film House Using Multivariate Analysis (다변량분석법을 이용한 시설재배지 지하수 수질 특성)

  • Kim, Jin-Ho;Choi, Chul-Mann;Lee, Jong-Sik;Yun, Sun-Gang;Lee, Jung-Taek;Cho, Kwang-Rae;Lim, Su-Jung;Choi, Seung-Chul;Lee, Gyeong-Ja;Kwon, Yeu-Seok;Kyung, Ki-Chon;Uhm, Mi-Jeong;Kim, Hee-Kwon;Lee, You-Seok;Kim, Chan-Yong;Lee, Seong-Tae;Ryu, Jong-Su
    • Korean Journal of Environmental Agriculture
    • /
    • v.27 no.1
    • /
    • pp.1-9
    • /
    • 2008
  • The main purpose of this study is to accumulate the fundamental data representing groundwater of plastic film houses by means of water quality and its multivariate statistical analysis. Groundwater samples were collected in every two years since 2000 to 2004 from total 211 sites. According to the result of water quality analysis, ground water quality was suitable for irrigation purpose averagely. Correlation analysis showed that EC was highest positively correlated with $Mg^{2+}$ to 0.810(p<0.01), 0.776(p<0.01) in April and July, respectively. $NO_3-N$ was highest positively correlated with T-N to 0.794(p<0.01) in October. This result shows that it can lead to a different result even in similar case sometimes. Four factors were extracted through factor analysis in April and July, but five factors were extracted in October. The proportions of cumulative variance by the factor were 64.9, 60.2, and 70.7 in April, July, and October, respectively. The first factor was highly related to anions and cations such as $Ca^{2+},\;Mg^{2+},\;Cl^-,\;{SO_4}^{2-}$, and EC in contrast to that of stream water. According to the cluster analysis, 211 sites are classified into four groups. Common type of ground water quality was shown in group A. The pH and $PO_4-P$ were highest in Group B. The anions and cations were highest in Group C. $COD_{Cr}$ was highest in Group D.

Term Mapping Methodology between Everyday Words and Legal Terms for Law Information Search System (법령정보 검색을 위한 생활용어와 법률용어 간의 대응관계 탐색 방법론)

  • Kim, Ji Hyun;Lee, Jong-Seo;Lee, Myungjin;Kim, Wooju;Hong, June Seok
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.137-152
    • /
    • 2012
  • In the generation of Web 2.0, as many users start to make lots of web contents called user created contents by themselves, the World Wide Web is overflowing by countless information. Therefore, it becomes the key to find out meaningful information among lots of resources. Nowadays, the information retrieval is the most important thing throughout the whole field and several types of search services are developed and widely used in various fields to retrieve information that user really wants. Especially, the legal information search is one of the indispensable services in order to provide people with their convenience through searching the law necessary to their present situation as a channel getting knowledge about it. The Office of Legislation in Korea provides the Korean Law Information portal service to search the law information such as legislation, administrative rule, and judicial precedent from 2009, so people can conveniently find information related to the law. However, this service has limitation because the recent technology for search engine basically returns documents depending on whether the query is included in it or not as a search result. Therefore, it is really difficult to retrieve information related the law for general users who are not familiar with legal terms in the search engine using simple matching of keywords in spite of those kinds of efforts of the Office of Legislation in Korea, because there is a huge divergence between everyday words and legal terms which are especially from Chinese words. Generally, people try to access the law information using everyday words, so they have a difficulty to get the result that they exactly want. In this paper, we propose a term mapping methodology between everyday words and legal terms for general users who don't have sufficient background about legal terms, and we develop a search service that can provide the search results of law information from everyday words. This will be able to search the law information accurately without the knowledge of legal terminology. In other words, our research goal is to make a law information search system that general users are able to retrieval the law information with everyday words. First, this paper takes advantage of tags of internet blogs using the concept for collective intelligence to find out the term mapping relationship between everyday words and legal terms. In order to achieve our goal, we collect tags related to an everyday word from web blog posts. Generally, people add a non-hierarchical keyword or term like a synonym, especially called tag, in order to describe, classify, and manage their posts when they make any post in the internet blog. Second, the collected tags are clustered through the cluster analysis method, K-means. Then, we find a mapping relationship between an everyday word and a legal term using our estimation measure to select the fittest one that can match with an everyday word. Selected legal terms are given the definite relationship, and the relations between everyday words and legal terms are described using SKOS that is an ontology to describe the knowledge related to thesauri, classification schemes, taxonomies, and subject-heading. Thus, based on proposed mapping and searching methodologies, our legal information search system finds out a legal term mapped with user query and retrieves law information using a matched legal term, if users try to retrieve law information using an everyday word. Therefore, from our research, users can get exact results even if they do not have the knowledge related to legal terms. As a result of our research, we expect that general users who don't have professional legal background can conveniently and efficiently retrieve the legal information using everyday words.

Ecological Niche of Quercus acutissima and Quercus variabilis (상수리나무와 굴참나무의 생태적 지위에 관한 연구)

  • Kim, Hae-Ran;Jeong, Heon-Mo;Kim, Hyea-Ju;You, Young-Han
    • Korean Journal of Environmental Biology
    • /
    • v.26 no.4
    • /
    • pp.385-391
    • /
    • 2008
  • In Korea, Quercus acutissima distributed in good condition with high nutrients and moisture content, but Quercus variabilis in dry soil or rock habitate. In order to understand this ecological distribution of Q. acutissima and Q. variabilis, we cultivated the seedlings of two oak species treated with light, soil moisture and nutrient gradients each four level, from May to October in glass house. Then we measured the ecological niche breadth and niche overlap of the two species, and analyzed the relationship of competition using cluster analysis and PCA ordination. Ecological niche breadths of Q. acutissima under moisture and nutrient treatments were slightly wider than those under light one. Among 14 characters measured, 6 characters related with length items were wider in all the environmental treatments, but 8 characters connected with weight terms narrower in light treatment. Ecological niche breadths of Q. variabilis under moisture and nutrient treatment were wider than those of light one. Ecological niche of Q. acutissima was wider than those of Q. variabilis in all the environmental treatments. Ecological overlap between two species was higher with a range of 0.87$\sim$0.92, especially higher in soil moisture factor. These results means that Q. acutissima is more competitive than Q. variabilis, especially in soil moisture condition. Two species were ordinated with distinct group based on 9 characters. From these results, it can be explained that what Q. variabilis distributed in bad soil condition is due to the escape strategy, because of its low competitive ability to Q. acutissima in natural communities.