• Title/Summary/Keyword: analyzing

Search Result 28,628, Processing Time 0.061 seconds

Visualizing the Results of Opinion Mining from Social Media Contents: Case Study of a Noodle Company (소셜미디어 콘텐츠의 오피니언 마이닝결과 시각화: N라면 사례 분석 연구)

  • Kim, Yoosin;Kwon, Do Young;Jeong, Seung Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.89-105
    • /
    • 2014
  • After emergence of Internet, social media with highly interactive Web 2.0 applications has provided very user friendly means for consumers and companies to communicate with each other. Users have routinely published contents involving their opinions and interests in social media such as blogs, forums, chatting rooms, and discussion boards, and the contents are released real-time in the Internet. For that reason, many researchers and marketers regard social media contents as the source of information for business analytics to develop business insights, and many studies have reported results on mining business intelligence from Social media content. In particular, opinion mining and sentiment analysis, as a technique to extract, classify, understand, and assess the opinions implicit in text contents, are frequently applied into social media content analysis because it emphasizes determining sentiment polarity and extracting authors' opinions. A number of frameworks, methods, techniques and tools have been presented by these researchers. However, we have found some weaknesses from their methods which are often technically complicated and are not sufficiently user-friendly for helping business decisions and planning. In this study, we attempted to formulate a more comprehensive and practical approach to conduct opinion mining with visual deliverables. First, we described the entire cycle of practical opinion mining using Social media content from the initial data gathering stage to the final presentation session. Our proposed approach to opinion mining consists of four phases: collecting, qualifying, analyzing, and visualizing. In the first phase, analysts have to choose target social media. Each target media requires different ways for analysts to gain access. There are open-API, searching tools, DB2DB interface, purchasing contents, and so son. Second phase is pre-processing to generate useful materials for meaningful analysis. If we do not remove garbage data, results of social media analysis will not provide meaningful and useful business insights. To clean social media data, natural language processing techniques should be applied. The next step is the opinion mining phase where the cleansed social media content set is to be analyzed. The qualified data set includes not only user-generated contents but also content identification information such as creation date, author name, user id, content id, hit counts, review or reply, favorite, etc. Depending on the purpose of the analysis, researchers or data analysts can select a suitable mining tool. Topic extraction and buzz analysis are usually related to market trends analysis, while sentiment analysis is utilized to conduct reputation analysis. There are also various applications, such as stock prediction, product recommendation, sales forecasting, and so on. The last phase is visualization and presentation of analysis results. The major focus and purpose of this phase are to explain results of analysis and help users to comprehend its meaning. Therefore, to the extent possible, deliverables from this phase should be made simple, clear and easy to understand, rather than complex and flashy. To illustrate our approach, we conducted a case study on a leading Korean instant noodle company. We targeted the leading company, NS Food, with 66.5% of market share; the firm has kept No. 1 position in the Korean "Ramen" business for several decades. We collected a total of 11,869 pieces of contents including blogs, forum contents and news articles. After collecting social media content data, we generated instant noodle business specific language resources for data manipulation and analysis using natural language processing. In addition, we tried to classify contents in more detail categories such as marketing features, environment, reputation, etc. In those phase, we used free ware software programs such as TM, KoNLP, ggplot2 and plyr packages in R project. As the result, we presented several useful visualization outputs like domain specific lexicons, volume and sentiment graphs, topic word cloud, heat maps, valence tree map, and other visualized images to provide vivid, full-colored examples using open library software packages of the R project. Business actors can quickly detect areas by a swift glance that are weak, strong, positive, negative, quiet or loud. Heat map is able to explain movement of sentiment or volume in categories and time matrix which shows density of color on time periods. Valence tree map, one of the most comprehensive and holistic visualization models, should be very helpful for analysts and decision makers to quickly understand the "big picture" business situation with a hierarchical structure since tree-map can present buzz volume and sentiment with a visualized result in a certain period. This case study offers real-world business insights from market sensing which would demonstrate to practical-minded business users how they can use these types of results for timely decision making in response to on-going changes in the market. We believe our approach can provide practical and reliable guide to opinion mining with visualized results that are immediately useful, not just in food industry but in other industries as well.

A Study on Effects of the vocal psychotherapy upon Self-Consciousness (성악심리치료활동을 통한 자기의식 변화에 관한 연구)

  • Lee, Hyun Joo
    • Journal of Music and Human Behavior
    • /
    • v.4 no.2
    • /
    • pp.66-83
    • /
    • 2007
  • The purpose of this study is to learn both effects of the vocal psychotherapy on the self-consciousness and the variety of the self-consciousness on the vocal psychotherapy in return. The research for this study was performed to three subjects who were students of E university, Seoul, ten times for sixty minutes. The subjects were all volunteers for the advertisement on a music-therapy program searching for them on the web site of E university. The vocal psychotherapy program consists of four steps and each of them consists of two to four short terms again. Both before and after the experiment, examinations on self-consciousness were done to recognize the change of the subjects' self-consciousness which would be caused by the vocal psychotherapy activity. After every short term, the subjects were asked to write reports to closely analyze the change of self-consciousness according to the terms and the variety of the subjects. The effect of the vocal psychotherapy activity on the changes of scores in the self-consciousness examination is the first thing to point out on this study. There appeared some personal varieties on the total scores of the examination and scores of some sub-categories. Especially, there were different scores on the private self-consciousness, the public self-consciousness, and the social anxiety between before and after performing the vocal psychotherapy program. Subject A, who had got the best score of all on the scope of the private self-consciousness, showed the steepest decrease on the very scope. On the contrary, the subject showed decrease of scores of the public self-consciousness and the social anxiety in the relatively little rate. Subject B, who had got the highest score of the three on the public self-consciousness, showed the steepest decrease on that of all scopes and showed no difference on the social anxiety scope. In the case of the last one, subject C, who had relatively low scores on the private and public self-consciousness than the others, the private self-consciousness score increased but the public self-consciousness and the social anxiety scores decreased. The changes of the scores of each questions were examined in order to see possible other changes that had not been exposed on the changes of the total and sub-categories scores. As a result of that, of all twenty-eight questions, there were changes about one to two points. Subject A showed the difference with thirteen questions, subject B with sixteen and subject C with nineteen questions. The rate of change of subject C was relatively small but more questions changed and the change of score was wider than the others. Considering all those results, It can be possibly said that the vocal psychotherapy affects the changes of the scores of sub-categories in self-consciousness examination. The next thing to point out on this study is the change of recognition that was exposed on the subjects' report after every short term of the program. As a result of the close analyzing, according to the short terms and variety of self-consciousness, recognizing the way express subjects themselves by voice and recognizing their own voices appeared to be different. How much they cared about others and why they did so were also different. According to the self reports, subject A cared much about her inner thought and emotion and tended to concentrate herself as a social object. There appeared some positive emotional experiments such as emotional abundance and art curiosities on her reports but at the same time some negative emotions such as state-trait anxiety and neuroticism also appeared. Subject B, who showed high scores on the private and public self-consciousness like subject A, had a similar tendency that concentrates on herself as a social object but she showed more social anxiety than subject A. Subject C got relatively lower points in self-consciousness examination, tended to care about herself, and had less negative emotions such as state-trait anxiety than other subjects. Also, with terms going on, she showed changes in the way of caring about her own voice and others. This study has some unique significances in helping people who have problems caused by self-estimation activated with self-consciousness, using voices closely related to one's own self, performing the vocal skills discipline to solve the technical problems. Also, this study has a potentiality that the vocal psychotherapy activity can be effectively used as a way affects the mental health and developing personality.

  • PDF

Evaluation of the Usefulness of Restricted Respiratory Period at the Time of Radiotherapy for Non-Small Cell Lung Cancer Patient (비소세포성 폐암 환자의 방사선 치료 시 제한 호흡 주기의 유용성 평가)

  • Park, So-Yeon;Ahn, Jong-Ho;Suh, Jung-Min;Kim, Yung-Il;Kim, Jin-Man;Choi, Byung-Ki;Pyo, Hong-Ryul;Song, Ki-Won
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.24 no.2
    • /
    • pp.123-135
    • /
    • 2012
  • Purpose: It is essential to minimize the movement of tumor due to respiratory movement at the time of respiration controlled radiotherapy of non-small cell lung cancer patient. Accordingly, this Study aims to evaluate the usefulness of restricted respiratory period by comparing and analyzing the treatment plans that apply free and restricted respiration period respectively. Materials and Methods: After having conducted training on 9 non-small cell lung cancer patients (tumor n=10) from April to December 2011 by using 'signal monitored-breathing (guided- breathing)' method for the 'free respiratory period' measured on the basis of the regular respiratory period of the patents and 'restricted respiratory period' that was intentionally reduced, total of 10 CT images for each of the respiration phases were acquired by carrying out 4D CT for treatment planning purpose by using RPM and 4-dimensional computed tomography simulator. Visual gross tumor volume (GTV) and internal target volume (ITV) that each of the observer 1 and observer 2 has set were measured and compared on the CT image of each respiratory interval. Moreover, the amplitude of movement of tumor was measured by measuring the center of mass (COM) at the phase of 0% which is the end-inspiration (EI) and at the phase of 50% which is the end-exhalation (EE). In addition, both observers established treatment plan that applied the 2 respiratory periods, and mean dose to normal lung (MDTNL) was compared and analyzed through dose-volume histogram (DVH). Moreover, normal tissue complication probability (NTCP) of the normal lung volume was compared by using dose-volume histogram analysis program (DVH analyzer v.1) and statistical analysis was performed in order to carry out quantitative evaluation of the measured data. Results: As the result of the analysis of the treatment plan that applied the 'restricted respiratory period' of the observer 1 and observer 2, there was reduction rate of 38.75% in the 3-dimensional direction movement of the tumor in comparison to the 'free respiratory period' in the case of the observer 1, while there reduction rate was 41.10% in the case of the observer 2. The results of measurement and comparison of the volumes, GTV and ITV, there was reduction rate of $14.96{\pm}9.44%$ for observer 1 and $19.86{\pm}10.62%$ for observer 2 in the case of GTV, while there was reduction rate of $8.91{\pm}5.91%$ for observer 1 and $15.52{\pm}9.01%$ for observer 2 in the case of ITV. The results of analysis and comparison of MDTNL and NTCP illustrated the reduction rate of MDTNL $3.98{\pm}5.62%$ for observer 1 and $7.62{\pm}10.29%$ for observer 2 in the case of MDTNL, while there was reduction rate of $21.70{\pm}28.27%$ for observer 1 and $37.83{\pm}49.93%$ for observer 2 in the case of NTCP. In addition, the results of analysis of correlation between the resultant values of the 2 observers, while there was significant difference between the observers for the 'free respiratory period', there was no significantly different reduction rates between the observers for 'restricted respiratory period. Conclusion: It was possible to verify the usefulness and appropriateness of 'restricted respiratory period' at the time of respiration controlled radiotherapy on non-small cell lung cancer patient as the treatment plan that applied 'restricted respiratory period' illustrated relative reduction in the evaluation factors in comparison to the 'free respiratory period.

  • PDF

Quality of Life and Its Related Factors of Radiation Therapy Cancer Patients (방사선 치료를 받은 암환자의 삶의 질과 관련요인)

  • Shin, Ryung-Mi;Jung, Won-Seok;Oh, Byeong-Cheon;Jo, Jun-Young;Kim, Gi-Chul;Choi, Tae-Gyu;Lee, Sok-Goo
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.23 no.1
    • /
    • pp.21-29
    • /
    • 2011
  • Purpose: The purpose of this master's thesis is to utilize basic data in order to improve the quality of life of cancer patients who received radiation therapy after analysing related factors that influence patient's quality of life and obtaining information about physical, mental problems of patients. Materials and Methods: By using a structured questionnaire about various characteristics and forms of support, I carried out a survey targeting 107 patients that experienced radiation therapy at a university hospital in the Daejeon metropolitan area from July 15 to August 15, 2010 and analysed the factors influencing quality of life. Results: In case of pain due to disease, 65.15 and painless 81.87 showed a high grade quality of life. As body weight decreases, the quality of life become lower. When the grade of quality of life according to economic characteristics was compared, all items except treatment period showed a difference (P=0.000). When the score of social support, family support, medical support and self-esteem was low, the mark of quality of life showed respectively 61.71, 68.77, 71.31, and 69.39 on the basis of 128 points. When the score of support form was high, the mark of quality of life showed 90.47, 83.29, 90.40, and 90.36 (P<0.05). When analyzing the correlation between social support, family support, medical support and self-esteem and the degree of quality of life, social support was 0.768, family support 0.596, medical support 0.434, self-esteem 0.516. They indicated the correlation of meaningful quantity statistically (P<0.01). The factors that improved the quality of life were married state, having a job and painless status. As monthly income increases, the quality of life was also much improved (P<0.05). Among the factors related to quality of life, social support and medical support and higher self-esteem scores of the quality of life score increased 0.979 point, 0.508 points and 1.667 point, respectively. Conclusion: In conclusion, the quality of life of cancer patients that received radiation treatment is related to social support, medical support and self esteem. Self-esteem is an important factor that influenced quality of life, so if government offers works that doesn't affect patient's health, they are a useful method that maximize self-esteem and lessen their financial burden at the same time. Along with these policies, the developments of the attention of medical and the program for cancer patient's family are needed for the purpose of improving quality of life of cancer patients. Lastly, medical team, patients and family have to cooperate in harmony to overcome difficulties of cancer patients.

  • PDF

The Impact of Conflict and Influence Strategies Between Local Korean-Products-Selling Retailers and Wholesalers on Performance in Chinese Electronics Distribution Channels: On Moderating Effects of Relational Quality (중국 가전유통경로에서 한국제품 현지 판매업체와 도매업체간 갈등 및 영향전략이 성과에 미치는 영향: 관계 질의 조절효과)

  • Chun, Dal-Young;Kwon, Joo-Hyung;Lee, Guo-Ming
    • Journal of Distribution Research
    • /
    • v.16 no.3
    • /
    • pp.1-32
    • /
    • 2011
  • I. Introduction: In Chinese electronics industry, the local wholesalers are still dominant but power is rapidly swifting from wholesalers to retailers because in recent foreign big retailers and local mass merchandisers are growing fast. During such transient period, conflicts among channel members emerge important issues. For example, when wholesalers who have more power exercise influence strategies to maintain status, conflicts among manufacturer, wholesaler, and retailer will be intensified. Korean electronics companies in China need differentiated channel strategies by dealing with wholesalers and retailers simultaneously to sell more Korean products in competition with foreign firms. For example, Korean electronics firms should utilize 'guanxi' or relational quality to form long-term relationships with whloesalers instead of power and conflict issues. The major purpose of this study is to investigate the impact of conflict, dependency, and influence strategies between local Korean-products-selling retailers and wholesalers on performance in Chinese electronics distribution channels. In particular, this paper proposes effective distribution strategies for Korean electronics companies in China by analyzing moderating effects of 'Guanxi'. II. Literature Review and Hypotheses: The specific purposes of this study are as follows. First, causes of conflicts between local Korean-products-selling retailers and wholesalers are examined from the perspectives of goal incongruence and role ambiguity and then effects of these causes are found out on perceived conflicts of local retailers. Second, the effects of dependency of local retailers upon wholesalers are investigated on local retailers' perceived conflicts. Third, the effects of non-coercive influence strategies such as information exchange and recommendation and coercive strategies such as threats and legalistic pleas exercised by wholesalers are explored on perceived conflicts by local retailers. Fourth, the effects of level of conflicts perceived by local retailers are verified on local retailers' financial performance and satisfaction. Fifth, moderating effects of relational qualities, say, 'quanxi' between wholesalers and retailers are analyzed on the impact of wholesalers' influence strategies on retailers' performances. Finally, moderating effects of relational qualities are examined on the relationship between conflicts and performance. To accomplish above-mentioned research objectives, Figure 1 and the following research hypotheses are proposed and verified. III. Measurement and Data Analysis: To verify the proposed research model and hypotheses, data were collected from 97 retailers who are selling Korean electronic products located around Central and Southern regions in China. Covariance analysis and moderated regression analysis were employed to validate hypotheses. IV. Conclusion: The following results were drawn using structural equation modeling and hierarchical moderated regression. First, goal incongruence perceived by local retailers significantly affected conflict but role ambiguity did not. Second, consistent with conflict spiral theory, the level of conflict decreased when retailers' dependency increased toward wholesalers. Third, noncoercive influence strategies such as information exchange and recommendation implemented by wholesalers had significant effects on retailers' performance such as sales and satisfaction without conflict. On the other hand, coercive influence strategies such as threat and legalistic plea had insignificant effects on performance in spite of increasing the level of conflict. Fourth, 'guanxi', namely, relational quality between local retailers and wholesalers showed unique effects on performance. In case of noncoercive influence strategies, 'guanxi' did not play a role of moderator. Rather, relational quality and noncoercive influence strategies can serve as independent variables to enhance performance. On the other hand, when 'guanxi' was well built due to mutual trust and commitment, relational quality as a moderator can positively function to improve performance even though hostile, coercive influence strategies were implemented. Fifth, 'guanxi' significantly moderated the effects of conflict on performance. Even if conflict arises, local retailers who form solid relational quality can increase performance by dealing with dysfunctional conflict synergistically compared with low 'quanxi' retailers. In conclusion, this study verified the importance of relational quality via 'quanxi' between local retailers and wholesalers in Chinese electronic industry because relational quality could cross out the adverse effects of coercive influence strategies and conflict on performance.

  • PDF

A Basic Study on the Euryale ferox Salisbury for Introduction in Garden Pond - Focusing on the Flora and Vegetation - (정원내 가시연꽃(Euryale ferox Salisbury) 도입을 위한 기초연구 - 식물상과 식생을 중심으로 -)

  • Lee, Suk-Woo;Rho, Jae-Hyun;Oh, Hyun-Kyung
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.34 no.1
    • /
    • pp.83-96
    • /
    • 2016
  • Through the research and analysis on the vegetation environment, flora of habitats through documentary and field studies over 14 habitats of Euryale ferox Salisbury within Jeollabukdo, with the objective of acquiring the basic data for forming an environment based on plantation of reservoirs that are composed with Euryale ferox, the following results were obtained. 1. The entire flora of the 14 habitats appeared to be 79 families, 211 genus, 298 species, two subspecies, 30 varieties and six forma, thus, a total of 336 taxa was confirmed. Among these, emergent water plants appeared to compose 17 taxa, floating-leaved plants to compose seven taxa including Euryale ferox floating plants to compose five taxa and submerged water plants to compose two taxa. As a result of analyzing the similarity only over the water plants. The lowest similarity rate appeared between Gamdong Reservoir and Aedang Reservoir, as the similarity rate between the two regions appeared to be 0% as a result of the analysis. Floating-leaved plants, lotuses and caltrops, appeared to be equally inhabiting in Hanseongji at Jeongeup and Seoknam Reservoir at Gochang, which showed the highest similarity rate, in addition to Euryale ferox. 2. When examining the appearance frequency of aquatic plants per growth type, Actinostemma lobatum and Phragmites communis, in addition to Euryale ferox each appeared 11 times, showing a high frequency of 78.6% and Trapa japonica, which is a floating-leaved water plant, appeared ten times(71.4%) and Zizania latifolia appeared eight times(57.1%). In addition, the appearance rate appeared to be high in the order of Persicaria thunbergii, Leersia sayanuka, Ceratophyllum demersum, Echinochloa crusgalli var. oryzicola, Scirpus maritimus, and Nelumbo nucifera. 3. The rare plants discovered in the Euryale ferox habitats pursuant to the IUCN evaluation standards was confirmed to be composed of five taxa, with three taxa including the least concerned species(LC), Melothria japonica at Yanggok Reservoir, Hydrocharis dubia at Myeongdeokji and Ottelia alismoides at Daewi Reservoir, in addition to vulnerable species(VU), Utricularia vulgaris at Sangpyeong Reservoir, along with Euryale ferox. 4. Most of the group or community types of the natural habitats of Euryale ferox appeared to be the Euryale ferix community' and the Daewi Reservoir of Gunsan was defined as caltrop + Euryale ferox + Nymphoides indica community. The green coverage ratio of Euryale ferox per natural habitats showed a considerably huge deviation from 0.03 to 36.50 and as the average green coverage ratio was appropriated as 9.8, it can be considered that maintaining the green coverage ratio of Euryale ferox in a 10% level would be advisable when forming a reservoir with Euryale ferox as the key composition species. 5. The vegetation community nearby the natural habitats of Euryale ferox per research subject area appeared to be composed of three Leersia japonica communities, two communities each for Zizania latifolia community and Trapa japonica community and one community each for Nelumbo nucifera community, Nymphoides peltata + Typha orientalis community, Trapa japonica + Nelumbo nucifera community, Hydrocharis dubia community, Leersia japnica + Paspalum distichum var. indutum community and Euryale ferox + Trapa japonica community, showing a slight difference depending on the location conditions of each reservoir. Thus, this result may be suggested as a guideline to apply when allocating the vegetation ratio and the types of floating-leaved plants upon planting plants in reservoirs with Euryale ferox as the main companion species.

Strategy for Store Management Using SOM Based on RFM (RFM 기반 SOM을 이용한 매장관리 전략 도출)

  • Jeong, Yoon Jeong;Choi, Il Young;Kim, Jae Kyeong;Choi, Ju Choel
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.93-112
    • /
    • 2015
  • Depending on the change in consumer's consumption pattern, existing retail shop has evolved in hypermarket or convenience store offering grocery and daily products mostly. Therefore, it is important to maintain the inventory levels and proper product configuration for effectively utilize the limited space in the retail store and increasing sales. Accordingly, this study proposed proper product configuration and inventory level strategy based on RFM(Recency, Frequency, Monetary) model and SOM(self-organizing map) for manage the retail shop effectively. RFM model is analytic model to analyze customer behaviors based on the past customer's buying activities. And it can differentiates important customers from large data by three variables. R represents recency, which refers to the last purchase of commodities. The latest consuming customer has bigger R. F represents frequency, which refers to the number of transactions in a particular period and M represents monetary, which refers to consumption money amount in a particular period. Thus, RFM method has been known to be a very effective model for customer segmentation. In this study, using a normalized value of the RFM variables, SOM cluster analysis was performed. SOM is regarded as one of the most distinguished artificial neural network models in the unsupervised learning tool space. It is a popular tool for clustering and visualization of high dimensional data in such a way that similar items are grouped spatially close to one another. In particular, it has been successfully applied in various technical fields for finding patterns. In our research, the procedure tries to find sales patterns by analyzing product sales records with Recency, Frequency and Monetary values. And to suggest a business strategy, we conduct the decision tree based on SOM results. To validate the proposed procedure in this study, we adopted the M-mart data collected between 2014.01.01~2014.12.31. Each product get the value of R, F, M, and they are clustered by 9 using SOM. And we also performed three tests using the weekday data, weekend data, whole data in order to analyze the sales pattern change. In order to propose the strategy of each cluster, we examine the criteria of product clustering. The clusters through the SOM can be explained by the characteristics of these clusters of decision trees. As a result, we can suggest the inventory management strategy of each 9 clusters through the suggested procedures of the study. The highest of all three value(R, F, M) cluster's products need to have high level of the inventory as well as to be disposed in a place where it can be increasing customer's path. In contrast, the lowest of all three value(R, F, M) cluster's products need to have low level of inventory as well as to be disposed in a place where visibility is low. The highest R value cluster's products is usually new releases products, and need to be placed on the front of the store. And, manager should decrease inventory levels gradually in the highest F value cluster's products purchased in the past. Because, we assume that cluster has lower R value and the M value than the average value of good. And it can be deduced that product are sold poorly in recent days and total sales also will be lower than the frequency. The procedure presented in this study is expected to contribute to raising the profitability of the retail store. The paper is organized as follows. The second chapter briefly reviews the literature related to this study. The third chapter suggests procedures for research proposals, and the fourth chapter applied suggested procedure using the actual product sales data. Finally, the fifth chapter described the conclusion of the study and further research.

Improved Social Network Analysis Method in SNS (SNS에서의 개선된 소셜 네트워크 분석 방법)

  • Sohn, Jong-Soo;Cho, Soo-Whan;Kwon, Kyung-Lag;Chung, In-Jeong
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.4
    • /
    • pp.117-127
    • /
    • 2012
  • Due to the recent expansion of the Web 2.0 -based services, along with the widespread of smartphones, online social network services are being popularized among users. Online social network services are the online community services which enable users to communicate each other, share information and expand human relationships. In the social network services, each relation between users is represented by a graph consisting of nodes and links. As the users of online social network services are increasing rapidly, the SNS are actively utilized in enterprise marketing, analysis of social phenomenon and so on. Social Network Analysis (SNA) is the systematic way to analyze social relationships among the members of the social network using the network theory. In general social network theory consists of nodes and arcs, and it is often depicted in a social network diagram. In a social network diagram, nodes represent individual actors within the network and arcs represent relationships between the nodes. With SNA, we can measure relationships among the people such as degree of intimacy, intensity of connection and classification of the groups. Ever since Social Networking Services (SNS) have drawn increasing attention from millions of users, numerous researches have made to analyze their user relationships and messages. There are typical representative SNA methods: degree centrality, betweenness centrality and closeness centrality. In the degree of centrality analysis, the shortest path between nodes is not considered. However, it is used as a crucial factor in betweenness centrality, closeness centrality and other SNA methods. In previous researches in SNA, the computation time was not too expensive since the size of social network was small. Unfortunately, most SNA methods require significant time to process relevant data, and it makes difficult to apply the ever increasing SNS data in social network studies. For instance, if the number of nodes in online social network is n, the maximum number of link in social network is n(n-1)/2. It means that it is too expensive to analyze the social network, for example, if the number of nodes is 10,000 the number of links is 49,995,000. Therefore, we propose a heuristic-based method for finding the shortest path among users in the SNS user graph. Through the shortest path finding method, we will show how efficient our proposed approach may be by conducting betweenness centrality analysis and closeness centrality analysis, both of which are widely used in social network studies. Moreover, we devised an enhanced method with addition of best-first-search method and preprocessing step for the reduction of computation time and rapid search of the shortest paths in a huge size of online social network. Best-first-search method finds the shortest path heuristically, which generalizes human experiences. As large number of links is shared by only a few nodes in online social networks, most nods have relatively few connections. As a result, a node with multiple connections functions as a hub node. When searching for a particular node, looking for users with numerous links instead of searching all users indiscriminately has a better chance of finding the desired node more quickly. In this paper, we employ the degree of user node vn as heuristic evaluation function in a graph G = (N, E), where N is a set of vertices, and E is a set of links between two different nodes. As the heuristic evaluation function is used, the worst case could happen when the target node is situated in the bottom of skewed tree. In order to remove such a target node, the preprocessing step is conducted. Next, we find the shortest path between two nodes in social network efficiently and then analyze the social network. For the verification of the proposed method, we crawled 160,000 people from online and then constructed social network. Then we compared with previous methods, which are best-first-search and breath-first-search, in time for searching and analyzing. The suggested method takes 240 seconds to search nodes where breath-first-search based method takes 1,781 seconds (7.4 times faster). Moreover, for social network analysis, the suggested method is 6.8 times and 1.8 times faster than betweenness centrality analysis and closeness centrality analysis, respectively. The proposed method in this paper shows the possibility to analyze a large size of social network with the better performance in time. As a result, our method would improve the efficiency of social network analysis, making it particularly useful in studying social trends or phenomena.

Construction of Event Networks from Large News Data Using Text Mining Techniques (텍스트 마이닝 기법을 적용한 뉴스 데이터에서의 사건 네트워크 구축)

  • Lee, Minchul;Kim, Hea-Jin
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.183-203
    • /
    • 2018
  • News articles are the most suitable medium for examining the events occurring at home and abroad. Especially, as the development of information and communication technology has brought various kinds of online news media, the news about the events occurring in society has increased greatly. So automatically summarizing key events from massive amounts of news data will help users to look at many of the events at a glance. In addition, if we build and provide an event network based on the relevance of events, it will be able to greatly help the reader in understanding the current events. In this study, we propose a method for extracting event networks from large news text data. To this end, we first collected Korean political and social articles from March 2016 to March 2017, and integrated the synonyms by leaving only meaningful words through preprocessing using NPMI and Word2Vec. Latent Dirichlet allocation (LDA) topic modeling was used to calculate the subject distribution by date and to find the peak of the subject distribution and to detect the event. A total of 32 topics were extracted from the topic modeling, and the point of occurrence of the event was deduced by looking at the point at which each subject distribution surged. As a result, a total of 85 events were detected, but the final 16 events were filtered and presented using the Gaussian smoothing technique. We also calculated the relevance score between events detected to construct the event network. Using the cosine coefficient between the co-occurred events, we calculated the relevance between the events and connected the events to construct the event network. Finally, we set up the event network by setting each event to each vertex and the relevance score between events to the vertices connecting the vertices. The event network constructed in our methods helped us to sort out major events in the political and social fields in Korea that occurred in the last one year in chronological order and at the same time identify which events are related to certain events. Our approach differs from existing event detection methods in that LDA topic modeling makes it possible to easily analyze large amounts of data and to identify the relevance of events that were difficult to detect in existing event detection. We applied various text mining techniques and Word2vec technique in the text preprocessing to improve the accuracy of the extraction of proper nouns and synthetic nouns, which have been difficult in analyzing existing Korean texts, can be found. In this study, the detection and network configuration techniques of the event have the following advantages in practical application. First, LDA topic modeling, which is unsupervised learning, can easily analyze subject and topic words and distribution from huge amount of data. Also, by using the date information of the collected news articles, it is possible to express the distribution by topic in a time series. Second, we can find out the connection of events in the form of present and summarized form by calculating relevance score and constructing event network by using simultaneous occurrence of topics that are difficult to grasp in existing event detection. It can be seen from the fact that the inter-event relevance-based event network proposed in this study was actually constructed in order of occurrence time. It is also possible to identify what happened as a starting point for a series of events through the event network. The limitation of this study is that the characteristics of LDA topic modeling have different results according to the initial parameters and the number of subjects, and the subject and event name of the analysis result should be given by the subjective judgment of the researcher. Also, since each topic is assumed to be exclusive and independent, it does not take into account the relevance between themes. Subsequent studies need to calculate the relevance between events that are not covered in this study or those that belong to the same subject.

Antioxidant Properties of the Lotus Leaf Powder Content of Cheongpomuk (연잎 분말 첨가량에 따른 청포묵의 항산화 특성)

  • Moon, Jong-Hee;Hong, Ki-Woon;Yoo, Seung Seok
    • Culinary science and hospitality research
    • /
    • v.22 no.7
    • /
    • pp.112-130
    • /
    • 2016
  • In this study the moisture content and chromaticity of fresh made lotus leaf powder added Cheongpomuk to utilize various efficacy of lotus leaf for processed food, as well as chromaticity, moisture content change, texture, total phenolic compound content, DPPH radical scavenging ability and preference of lotus leaf powder added Cheongpomuk with different storage period have been measured and analyzed. From the texture of lotus leaf powder added mung bean as per the storage period, the hardness of fresh Cheongpomuk were $0.38g/cm^2$ from control group, $0.40g/cm^2$ from CCD 1% group, $0.42g/cm^2$ from CCD 3% group, $0.37g/cm^2$ from CCD 5% group, $0.42g/cm^2$ from GGD 1% group, $0.39g/cm^2$ from GGD 3% group, $0.35g/cm^2$ from GGD 5% group, $0.39g/cm^2$ from JLD 1% group, $0.33g/cm^2$ from JLD 3% group, and $0.32g/cm^2$ from JLD 5% group. It has shown that JLD 5% group was the lowest, while CCD 3% group and GGD 1% group were the highest, and there were significant differences among sample groups. For DPPH radical scavenging ability, that of GLD 5% group was 22 times higher than that of control group. In addition, the tendency was increasing by increasing the adding rate of lotus leaf powder though there was some tolerance among sample groups. For total phenolic compound content, that of control group was 6.65 mg CE/100 g, and others were 7.48 mg CE/100 g from CCD 1% group, 15.82 mg CE/100 g from CCD 3% group, 20.15 mg CE/100 g from CCD 5% group, 15.55mg CE/100 g from GGD 1% group, 23.02 mg CE/100 g from GGD 3%, 26.95 mg CE/100 g from GGD 5% group, 3.92 mg CE/100 g from JLD 1% group, 16.72 mg CE/100 g from JLD 3%, and 26.58 mg CE/100 from JLD 5% group. From the analyzing result of responses for color and scent, taste, elasticity, and total preference of lotus leaf powder added Cheongpomuk between two panel groups, there was significant difference for the color, higher from professional cooking instructor group, but there were no significant difference between two groups for all other factors among professional cooking instructors and cooking department students. According to the results, it is expected that various functional foods can be developed by utilizing lotus leaf powder, depending on the growth condition and cultural environment of each region by adding 3% of lotus leaf powder, would be the most suitable recipe for Cheongpomuk.