• Title/Summary/Keyword: 분류나무 분석

Search Result 596, Processing Time 0.024 seconds

A Study of on the Method to Select Manufacturing Activities Sensitive to Regional Characteristics by Analyzing the Locational Hierarchy (입지계층분석을 활용한 산업단지 유치 업종 결정에 관한 연구)

  • So, Jin-Kwang;Lee, Hyeon-Joo;Kim, Sun-Woo
    • Land and Housing Review
    • /
    • v.2 no.4
    • /
    • pp.559-568
    • /
    • 2011
  • This study aims at listing up those manufacturing activities sensitive to regional characteristics by analyzing locational hierarchy designed on the urban rank-size rule. This locational hierarchy by manufacturing activities is expected to provide a ground for the proper supply of an industrial complex. The analysis of the locational hierarchy by manufacturing activities can work as a method of observing the characteristics of the distribution of location for each economic activity by analyzing the trend in the change of manufacturing location. Consequently, it can be used to determine the appropriate manufacturing activities for the industrial complex of a particular region. Here, the locational hierarchy is analyzed depending on the base of the basic local government such as Gun(district level) and Si(city level), and manufacturing activities are categorized by Korea Standard Industry Code. Those activities demonstrating growth pattern are Manufacture of Electronic Equipment(KSIC 26), Manufacture of Medical Precision Optical Instruments Watch(KSIC 27), Manufacture of Motor Vehicles (KSIC 30, 31), etc. With proper infrastructures, these activities can be located everywhere. Those sectors on the decline pattern in the locational hierarchy can be summarized as Manufacture of Tobacco Products(KSIC 12), Manufacture of wearing apparel Fur Articles(KSIC 14), etc. Those sectors scattered widely in the locational hierarchy are Manufacture of Food Products(KSIC 10), Manufacture of Coke Petroleum Products(KSIC 19), Manufacture of Chemical Products(KSIC 20), Manufacture of Electronic Equipment(KSIC 26). These particular manufacturing activities can be operated in those regions in a sufficient supply of unskilled workers regardless of proper infrastructures. Those activities that have a tendency to reconcentrate on larger cities are Manufacture of Textiles(KSIC 13), Manufacture of Wearing Apparel Clothing Fur Articles(KSIC 14), Manufacture of Other Transport Equiptmen(KSIC 31). In most cases, these sectors tend to favor their existing agglomerated areas and concentrate around large cities. Therefore, it is inefficient to promote these sectors in small or medium-sized cities or underdeveloped regions. The establishment of developmental strategies of an industrial complex can gain greater competitiveness by observing such characteristics of the locational hierarchy.

Development of Sentiment Analysis Model for the hot topic detection of online stock forums (온라인 주식 포럼의 핫토픽 탐지를 위한 감성분석 모형의 개발)

  • Hong, Taeho;Lee, Taewon;Li, Jingjing
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.187-204
    • /
    • 2016
  • Document classification based on emotional polarity has become a welcomed emerging task owing to the great explosion of data on the Web. In the big data age, there are too many information sources to refer to when making decisions. For example, when considering travel to a city, a person may search reviews from a search engine such as Google or social networking services (SNSs) such as blogs, Twitter, and Facebook. The emotional polarity of positive and negative reviews helps a user decide on whether or not to make a trip. Sentiment analysis of customer reviews has become an important research topic as datamining technology is widely accepted for text mining of the Web. Sentiment analysis has been used to classify documents through machine learning techniques, such as the decision tree, neural networks, and support vector machines (SVMs). is used to determine the attitude, position, and sensibility of people who write articles about various topics that are published on the Web. Regardless of the polarity of customer reviews, emotional reviews are very helpful materials for analyzing the opinions of customers through their reviews. Sentiment analysis helps with understanding what customers really want instantly through the help of automated text mining techniques. Sensitivity analysis utilizes text mining techniques on text on the Web to extract subjective information in the text for text analysis. Sensitivity analysis is utilized to determine the attitudes or positions of the person who wrote the article and presented their opinion about a particular topic. In this study, we developed a model that selects a hot topic from user posts at China's online stock forum by using the k-means algorithm and self-organizing map (SOM). In addition, we developed a detecting model to predict a hot topic by using machine learning techniques such as logit, the decision tree, and SVM. We employed sensitivity analysis to develop our model for the selection and detection of hot topics from China's online stock forum. The sensitivity analysis calculates a sentimental value from a document based on contrast and classification according to the polarity sentimental dictionary (positive or negative). The online stock forum was an attractive site because of its information about stock investment. Users post numerous texts about stock movement by analyzing the market according to government policy announcements, market reports, reports from research institutes on the economy, and even rumors. We divided the online forum's topics into 21 categories to utilize sentiment analysis. One hundred forty-four topics were selected among 21 categories at online forums about stock. The posts were crawled to build a positive and negative text database. We ultimately obtained 21,141 posts on 88 topics by preprocessing the text from March 2013 to February 2015. The interest index was defined to select the hot topics, and the k-means algorithm and SOM presented equivalent results with this data. We developed a decision tree model to detect hot topics with three algorithms: CHAID, CART, and C4.5. The results of CHAID were subpar compared to the others. We also employed SVM to detect the hot topics from negative data. The SVM models were trained with the radial basis function (RBF) kernel function by a grid search to detect the hot topics. The detection of hot topics by using sentiment analysis provides the latest trends and hot topics in the stock forum for investors so that they no longer need to search the vast amounts of information on the Web. Our proposed model is also helpful to rapidly determine customers' signals or attitudes towards government policy and firms' products and services.

Analyzing Dynamics of Korean Housing Market Using Causal Loop Structures (주택시장의 동태성 분석을 위한 시스템 사고의 적용에 관한 연구 - 인과순환지도를 중심으로 -)

  • Shin Hye-Sung;Sohn Jeong-Rak;Kim Jae-Jun
    • Korean Journal of Construction Engineering and Management
    • /
    • v.6 no.3 s.25
    • /
    • pp.144-155
    • /
    • 2005
  • Since 1950s, the Korean housing market has continually experienced the chronicle lack of housing stock because of lower housing investment in comparison with a population explosion, prompt urbanization and rapid restructuring of family. The Korean housing market have thus been driven not by the pricing model by housing demand-supply chain but by the Korean housing policies focusing on the increase of housing supply and the living stability of the middle or low-income bracket. After all, repetitive economic vicious circle of housing price and the increase of unsold apartments aggravated the malfunction of the Korean housing market. Meanwhile, the Korean construction firms have exacerbated their profitability. Such terrible situations are mainly triggered by the Korean construction firms that weighed on the short-term profits and quick response of the government policy alterations rather than the prospect of housing market Therefore, this research focusing on the dynamics of housing market identified and classified the demand and supply elements that consist not only of housing system structures but also of the environmental elements that affect the structures. Based on the system thinking and traditional theory of consumer's choice, the interactions of these elements were constructed as a causal loop diagram that explains the mutual influences among housing subsystems with feedback loops. This paper describes and discusses about the causes of the dynamic changes in the Korean housing market. This study would help housing suppliers, including housing developers, construction firms, etc., to form a more comprehensive understanding on the fundamental issues that constitute the Korean housing market and thereby increasing their long term as well as minimizing the risk involved in the housing supply businesses.

A Study on the Interaction between Online Public Benefit Projects and Users: Alipay's ANT FOREST Focuses on Analysis (온라인 공익 프로젝트와 사용자의 상호작용관계에 관한 연구: 알리페이의 앤트 프레스트를 분석중심으로)

  • Zhao, Xiaolong;Lian, Zexu
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.8
    • /
    • pp.513-521
    • /
    • 2020
  • Launched in August, 2016, the online public benefit project ANT FOREST has planted more than one hundred million trees in desertification areas and is currently continuing on with its activities. It is a fruit of online communications network development, and the public benefit project based on this puts more emphasis on the spirits of public interest rather than the investments of public services, unlike traditional public benefit activities. Hence the purpose of this study is to figure out the interaction between the users supporting the online public benefit and the public benefit progress online. The study was divided into 4 stages in order to find out the interaction, key factors for users to continue to support online public interest. First, preceeding studies on online public benefit will be reviewed to understand the characteristics of online public benefit. Second, determine the public benefit nature of ANT FOREST and investigate the project progress. Third, review the usage rate of ANT FOREST and categorize the properties of users. Fourth, interview was conducted to direct the interaction between the online public benefit project and the user. In conclusion, the online public benefit project completes the public benefit process through the user, the operator, and the supporter, the important factor connecting the energy connecting the process in cyber space and the public benefit activity in reality is the sense of participation, and the user continues the public benefit project through this sense of participation.

A Study on the Construction Specification and Quality Assurance Criteria in Clay Paver (점토바닥벽돌의 품질 및 시공기준 연구)

  • Park, Dae-Gun;Lee, Sang-Yum;Kim, Kyoon-Tai
    • Korean Journal of Construction Engineering and Management
    • /
    • v.11 no.6
    • /
    • pp.111-121
    • /
    • 2010
  • As the customer's interest for sidewalk block in the street or apartment complex is increasing, the materials of block which had been a concrete block exclusively are varied to clay paver, native rock and wood etc. Especially, the sales volume of clay paver which is environment-friendly and ergonomic is dramatically increasing every year with two digits growth rate, however, many problems like "Edge Cracking" "Freezing Breakage" "Bending Breakage" "Joint Gap" are happening frequently within a couple of hours after installation due to the durabilities. Because of the characteristics of Ceramic products, clay pavers are very easy to be broken when they are bumped against each other. In addition, they are relatively fragile by a freezing expansion breakage when exposed to water due to hydrophilic property as well as the intensity and absorptance of the products are varied with small difference from the production process such as production equipment and process control. Therefore, it costs a lot of money to repair the breakdown unless production and installation is carried out according to the strict criteria of the quality control. In this study, the symptoms of breakdown frequently happened in clay paver are classified by each type and finally the solution for this problem in the production of brick, installation and criteria of quality control through compressive strength and absorptance test is suggested.

Distributional Patterns of Understory Vegetation at Mt. Geumdae's Protected Area for Forest Genetic Resources (금대봉 산림유전자원보호림의 하층식생 분포양상)

  • Chun, Seung-Hoon;Lee, Hyung-Sook;Lim, Jong-Hwan
    • Journal of Korean Society of Forest Science
    • /
    • v.98 no.3
    • /
    • pp.339-350
    • /
    • 2009
  • This study was carried out to investigate distributional condition of rare plants and useful plant resources, and to verify distributional patterns of understory vegetation associated with the upper layer's vegetation structure. Total 59 families, 160 genera, 218 kinds of vascular plants were identified at the study site including 6 rare plants designated by Korea Forest Service (Lloydia triflora Bak., Trillium kamtschaticum Pall., Lilium distichum Nakai, Anemone koraiensis Nakai, Iris odaesanensis Y.N. Lee, Viola diamantica Nakai). Twenty three species of useful plant resources were also identified at the site; 8 of them showed clustered distributions and the others were prone to scatter. Actual vegetation of this study area consisted of one natural community dominated by Quercus mongolica Fisch. and three disturbed communities of Larix kaempferi (Lamb.) Carriere, Abies holophylla Max. and/or a herbaceous vegetation resulting from forest removal and strong wind of mountain top. This classification was strongly supported by cluster analysis based on the surveyed plot data. Distributional patterns of understory vegetation within forest stand were somewhat related to overstory vegetation structure, but showed a different tendency according to site condition, species composition, and competitive pressure among understory vegetation. Therefore, in order to protect the important understory components as forest genetic resources, forest treatments such as density control of overstory should be implanted based on understanding of impact on understory's dynamics and growing condition.

Symbolism of the Plants Depicted in the Flower Wall of Jagyeongjeon at Gyeongbokgung (경복궁 자경전 꽃담에 나타난 화훼식물과 상징성)

  • Kwon, Min-Hyeong;Song, In-Jung;Pak, Chun-Ho
    • Journal of agriculture & life science
    • /
    • v.46 no.2
    • /
    • pp.75-82
    • /
    • 2012
  • This is a study on the flower pattern artwork of the west wall of the Jagyeongjeon in Gyeongbokgung to find out the type of plants and flowers represented and their symbolism. The research was conducted from July 2010 to March 2011 and the artwork classified on the basis of its horticultural traits. A number was assigned to each pattern for analysis: No. 1 is Prunus mume, No. 2 is Prunus persica, No. 3 is Paeonia suffruticosa, No. 4 is Punica granatum, No. 5 and 6 is Dendranthema grandiflora, No. 7 is Rhododendron mucronu and No. 8 is Phyllostachys bambusoides. These 8 flower patterns symbolize longevity and fecundity and their presense around the Jagyeongjeon helped to bestow good fortune on the royal family so that they might live long lives and bear many children. 4 artworks symbolize longevity, 2 artworks symbolize integrity and 1 artwork symbolizes wealth and happiness. There is also symbolism of the need to have constancy in a royal household even during secular change. Out of the 8 artworks, the imagery of a bird and a moon is represented only once, but the image of a butterfly is represented five times in the surrounding elements. The bird and butterfly symbolise freedom and happiness from free love. Women in the palace are like a butterfly wanted to be like love as a freedom and have a free and open relationship like a butterfly. But a harmonious relationship between the royal family wanted to have a symbolic meaning that could be seen of the symbolistic. Based on the "Yangwhasorok"only plants with the highest values, from the 1st and 2nd grades, were used in the artwork of the west wall of the Jagyeongjeon.

Convergence Study in Development of Severity Adjustment Method for Death with Acute Myocardial Infarction Patients using Machine Learning (머신러닝을 이용한 급성심근경색증 환자의 퇴원 시 사망 중증도 보정 방법 개발에 대한 융복합 연구)

  • Baek, Seol-Kyung;Park, Hye-Jin;Kang, Sung-Hong;Choi, Joon-Young;Park, Jong-Ho
    • Journal of Digital Convergence
    • /
    • v.17 no.2
    • /
    • pp.217-230
    • /
    • 2019
  • This study was conducted to develop a customized severity-adjustment method and to evaluate their validity for acute myocardial infarction(AMI) patients to complement the limitations of the existing severity-adjustment method for comorbidities. For this purpose, the subjects of KCD-7 code I20.0 ~ I20.9, which is the main diagnosis of acute myocardial infarction were extracted using the Korean National Hospital Discharge In-depth Injury survey data from 2006 to 2015. Three tools were used for severity-adjustment method of comorbidities : CCI (charlson comorbidity index), ECI (Elixhauser comorbidity index) and the newly proposed CCS (Clinical Classification Software). The results showed that CCS was the best tool for the severity correction, and that support vector machine model was the most predictable. Therefore, we propose the use of the customized method of severity correction and machine learning techniques from this study for the future research on severity adjustment such as assessment of results of medical service.

A Study on a God tree of Chosun Distorted in Chosun-Gersu-Nosu-Myungmok-Ji (조선거수노수명목지에 왜곡되어 있는 조선의 신목에 관한 고찰)

  • Park, Chan-Woo;Ahn, Chang-Ho;Kim, Se-Chang
    • Journal of Korean Society of Forest Science
    • /
    • v.108 no.3
    • /
    • pp.372-381
    • /
    • 2019
  • This study was conducted to find proof for the hypothesis that the God tree of Chosun has been misrepresented in Chosun-Gersu-Nosu-Myungmokji (CGNM). The following results were obtained. First, it was established that 64 species and 3170 trees were recorded in CGNM. An old, big tree is classified as a God tree if linked to it there are testimonies and legends about divine elements, and it is classified as a Noble tree if linked to it there are testimonies and legends of historical elements. In total, 2632 trees of eight species were analyzed, from the Zelkova serrata, which has the greatest number of trees, to the eighth most frequent, Abies holophylla. The means of diameter at breast height (DBH), height, and age of the God and the Noble trees were calculated for each of the eight species. In seven out of eight species, the DBH and age of the Noble tree were more than those of the God tree. In addition, the height of the Noble tree was more than that of the God tree in six out of eight species. The fact that the God tree is smaller than the Noble tree, contrary to the common expectation that the Noble tree is a small size tree, was confirmed. This hypothesis was proved by the data gathered. Second, the Japanese Government-General of Korea has pursued a policy to defeat the village ritual based on the God tree being linked with superstition. For such a policy, the God tree should be small and unattractive, and it would have been good for the tree to be superstitious. The CGNM was created as explanatory material or evidence for distorting the sacredness of the God tree of Chosun. Third, CGNM compiled a chronological order of DBH data to make it easy to explain the fabricated facts that the God tree of Chosun is smaller and dwarfed compared to the Noble tree.

The Detection of Online Manipulated Reviews Using Machine Learning and GPT-3 (기계학습과 GPT3를 시용한 조작된 리뷰의 탐지)

  • Chernyaeva, Olga;Hong, Taeho
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.4
    • /
    • pp.347-364
    • /
    • 2022
  • Fraudulent companies or sellers strategically manipulate reviews to influence customers' purchase decisions; therefore, the reliability of reviews has become crucial for customer decision-making. Since customers increasingly rely on online reviews to search for more detailed information about products or services before purchasing, many researchers focus on detecting manipulated reviews. However, the main problem in detecting manipulated reviews is the difficulties with obtaining data with manipulated reviews to utilize machine learning techniques with sufficient data. Also, the number of manipulated reviews is insufficient compared with the number of non-manipulated reviews, so the class imbalance problem occurs. The class with fewer examples is under-represented and can hamper a model's accuracy, so machine learning methods suffer from the class imbalance problem and solving the class imbalance problem is important to build an accurate model for detecting manipulated reviews. Thus, we propose an OpenAI-based reviews generation model to solve the manipulated reviews imbalance problem, thereby enhancing the accuracy of manipulated reviews detection. In this research, we applied the novel autoregressive language model - GPT-3 to generate reviews based on manipulated reviews. Moreover, we found that applying GPT-3 model for oversampling manipulated reviews can recover a satisfactory portion of performance losses and shows better performance in classification (logit, decision tree, neural networks) than traditional oversampling models such as random oversampling and SMOTE.