Search | Korea Science

Development and Validation of the Letter-unit based Korean Sentimental Analysis Model Using Convolution Neural Network (회선 신경망을 활용한 자모 단위 한국형 감성 분석 모델 개발 및 검증)

Sung, Wonkyung;An, Jaeyoung;Lee, Choong C.
- The Journal of Society for e-Business Studies
- /
- v.25 no.1
- /
- pp.13-33
- /
- 2020
This study proposes a Korean sentimental analysis algorithm that utilizes a letter-unit embedding and convolutional neural networks. Sentimental analysis is a natural language processing technique for subjective data analysis, such as a person's attitude, opinion, and propensity, as shown in the text. Recently, Korean sentimental analysis research has been steadily increased. However, it has failed to use a general-purpose sentimental dictionary and has built-up and used its own sentimental dictionary in each field. The problem with this phenomenon is that it does not conform to the characteristics of Korean. In this study, we have developed a model for analyzing emotions by producing syllable vectors based on the onset, peak, and coda, excluding morphology analysis during the emotional analysis procedure. As a result, we were able to minimize the problem of word learning and the problem of unregistered words, and the accuracy of the model was 88%. The model is less influenced by the unstructured nature of the input data and allows for polarized classification according to the context of the text. We hope that through this developed model will be easier for non-experts who wish to perform Korean sentimental analysis.
https://doi.org/10.7838/jsebs.2020.25.1.013 인용 PDF KSCI

Korean Media Partisanship in the Report on THAAD Rumor Network and Frame Analysis (사드 루머(THAAD rumor) 보도에 나타난 한국 언론의 정파성 네트워크 분석과 프레임 분석을 중심으로)

Hong, Juhyun;Son, Young Jun
- Korean journal of communication and information
- /
- v.84
- /
- pp.152-188
- /
- 2017
This study stereotyped the media on the basis of ideological inclinations and media types and explored the news coverage through word analysis, network analysis, and frame analysis. There was no difference between conservative media and progressive media in terms of the amount of news. The conservative mainstream media considered the THAAD rumor as an unnecessary misunderstanding and a rumor based conflict of the south-south. The progressive mainstream media mentioned much about Hwang Gyoan, external influences, and lies and highlighted the government's opinion that there was external influence that spread a vicious rumor. Conservative media mentioned on the bringing about social disturbance and in case of progressive media mentioned social disturbance, and progressive media mentioned the responsibility of government and the attitude of conservative media about the diffusion of the rumor. In conclusion the press framed the THAAD rumor on the basis of their ideological inclinations instead of the role of journalist.
PDF

The Diffusion of Rumor Via Twitter : The Diffusion Trend and the User Interactivity in the Korea-U.S. FTA Case (트위터를 통한 루머의 확산 과정 연구: 한미 FTA 관련 루머의 자극성에 따른 의견 확산 추이와 이용자의 상호작용성을 중심으로)

Hong, Ju-Hyun;Yun, Hae-Jin
- Korean journal of communication and information
- /
- v.66
- /
- pp.59-86
- /
- 2014
This study explored how rumor is diffused via Twitter and how the characteristics of rumor affect the interactivity among users in the Korea-U.S. FTA case. A key word search located three issues as major ones related to the Korea-U.S. FTA: appendectomy myth, collapse of health insurance, and increases in medicine prices. The arousal of rumor has two dimensions: fact and expression. The fact arousal was the highest in the issue of 'appendectomy myth', and the expression arousal the highest in 'increases in medicine prices'. The rumor diffusion took the 'explosive wave' in the issue of appendectomy myth, the 'latent wave' in the issue of increase in medicine prices, and the 'repetitive wave' in the issue of collapse of health insurance. Correlation analyses revealed a high correlation between the arousal intensity of rumor and the user interactivity in the issue of collapse of health insurance. The study showed that Twitter took a role of diffusing negative messages about the Korea-U.S. FTA. Results implies that government officials and journalists pay attention to Twitter for sensing the public opinion when building policies and managing crises.
PDF

A Study of Diaphoretic Therapy[汗法] in "Yumunsachin(儒門事親)" ("유문사친(儒門事親)"의 '한법(汗法)'에 관한 연구)

Kim, Ki-Wook;Park, Hyun-Kuk;Jung, Kyung-Ho
- Journal of Korean Medical classics
- /
- v.21 no.1
- /
- pp.1-11
- /
- 2008
Jangjahwa(張子和) was influenced by "Hwangjenaegyeong(黃帝內經)" and Yuhagan(劉河間)'s theory, and other classics. Also, his clinical experience was helpful to theorize his thought. Thus, he improved medical theory by combining previous medical theory and his own experience. The essence of his thought is the importance of pathogenic Gi[邪氣] as the cause of disease and is Sambeop(三法) of Hantoha(汗吐下) as the methodology for removing pathogenic Gi[邪氣] away. He regarded pathogenic Gi as the cause of disease, and eliminated pathogenic Gi for the remedy. Namely, Sambeop(三法) of Hantoha(汗吐下) was selected as the best efficient method for driving pathogenic Gi away. Sambeop of Jangjahwa(張子和) have different meaning from previous one. Traditionally, Diaphoretic Therapy[汗法] was regarded as therapy for exogenous disease[外感病], and its effect was regarded as Balhanhaepyo(發汗解表). Emetic therapy[吐法] was throwing up Dameumsuksik(痰飮宿食) of stomach and above diaphragm. Purgation therapy[下法] means Tongbyeon(通便), Hajeok(下積), Sasil(瀉實), Chuksu(逐水) were regarded as therapy for Yangmyeongsiljeung(陽明實證) of Sanghan(傷寒). He submitted a new extensive concept of Sambeop adding traditional one, and expanded the application range of Sambeop. All methods, can cause circulation of Gihyeol(氣血) by opening the 'Hyeonbu(玄府)', like Moxibution therapy[灸薰], Steaming[蒸], Washing[洗],Heat therapy[慰], Cauterization[烙], Acupuncture therapy[鍼刺], Stone needling, Physical and breathing exercise[導引], Massage[按摩] were regarded as Diaphoretic Therapy[汗法]. Especially, he thought that Diaphoretic Therapy and venesection[瀉血] have same medical implication. If we examine the process of pushing out pathogenic Gi[邪氣] by means of Sambeop(三法), we can find the intermediation, that is circulation of Gihyeol(氣血). Its meaning is implied in the word of 'opening Hyeonbu(玄府)'. He thought that the circulation of Gihyeol(氣血) is the key to control health. Gihyeol(氣血) was circulated well under the physiological balance, but it was not circulated well under the invasion of pathogenic Gi[邪氣]. In other words, pathogenic Gi is the immediate cause of bad circulation of Gihyeol(氣血) and disease. Naturally, the doctor must remove pathogenic Gi that cause bad circulation for healing by means of Sambeop(三法). In my opinion, because the ultimate goal of Jangjahwa(張子和) was circulation of Gihyeol(氣血) by removing pathogenic Gi[邪氣], the concept of Sarnbeop(三法) could be expanded.
PDF

Social Big Data-based Co-occurrence Analysis of the Main Person's Characteristics and the Issues in the 2016 Rio Olympics Men's Soccer Games (소셜 빅데이터 기반 2016리우올림픽 축구 관련 이슈 및 인물에 대한 연관단어 분석)

Park, SungGeon;Lee, Soowon;Hwang, YoungChan
- 한국체육학회지인문사회과학편
- /
- v.56 no.2
- /
- pp.303-320
- /
- 2017
This paper seeks to better understand the focal issues and persons related to Rio Olympic soccer games through social data science and analytics. This study collected its data from online news articles and comments specific to KOR during the Olympic football games. In order to investigate the public interests for each game and target persons, this study performed the co-occurrence words analysis. Then after, the study applied the NodeXL software to perform its visualization of the results. Through this application and process, the study found several major issues during the Rio Olympic men's football game including the following: the match between KOR and PIJ, KOR player Heungmin Son, commentator Young-Pyo Lee, sportscaster Woo-Jong Jo. The study also showed the general public opinion expressed positive words towards the South Korean national football team during the Rio Olympics, though there existed negative words as well. Furthermore the study revealed positive attitude towards the commentators and casters. In conclusion, the way to increase the public's interest in big sporting events can be achieved by providing the following: contents that include various professional sports analysis, a capable domain expert with thorough preparation, a commentator and/or caster with artistic sense as well as well-spoken, explanatory power and so on. Multidisciplinary research combined with sports science, social science, information technology and media can contribute to a wide range of theoretical studies and practical developments within the sports industry.
https://doi.org/10.23949/kjpe.2017.03.56.2.21 인용

Investigation of the listening environment for lower grade students in elementary school using subjective tests (주관적 평가법을 이용한 초등학교 저학년 교실의 청취환경 조사)

Park, Chan-Jae;Haan, Chan-Hoon
- The Journal of the Acoustical Society of Korea
- /
- v.40 no.3
- /
- pp.201-212
- /
- 2021
The present study was conducted as a pilot investigation to suggest the standards of acoustic performance for classrooms suitable for incomplete hearing people such as children under 9 years of age. Subjective evaluations such as questionnaire and speech intelligibility test were conducted to 264 students at two elementary schools in Cheong-ju in order to analyze the characteristics of the listening environment in the classrooms of the lower grades in elementary school. The survey was undertaken with a total of 264 students at two elementary schools in Cheong-ju, and investigated their satisfaction with the classroom listening environment. As a result, students responded that the most helpful information type for understanding class content is the voice of teacher. In addition, the volume of the current teacher's voice is normal, and the level of clarity is highly satisfactory. As for the acoustic performance of the classroom, the opinion that the noise was normal and the reverberation was very short was found to be dominant in overall satisfaction with the listening environment. Meanwhile, as a result of speech intelligibility test using the word list selected for the lower grade students of elementary school, it could be inferred that the longitudinal axis distance from the sound source in the case of 8-year-olds is a factor that affects speech recognition.
https://doi.org/10.7776/ASK.2021.40.3.201 인용 PDF KSCI

Text Mining Analysis of Media Coverage of Maritime Sports: Perceptions of Yachting, Rowing, and Canoeing (텍스트마이닝을 활용한 해양스포츠에 대한 언론 보도기사 분석: 요트, 조정, 카누를 중심으로)

Ji-Hyeon Kim;Bo-Kyeong Kim
- Journal of the Korean Society of Marine Environment & Safety
- /
- v.29 no.6
- /
- pp.609-619
- /
- 2023
This study aimed to investigate the formation of the social perception of domestic maritime sports using text mining analysis of keywords and topics from domestic media coverage over the past 10 years related to representative maritime sports, including yachting, rowing, and canoeing. The results are as follows: First, term frequency (TF) and word cloud analyses identified the top keywords: "maritime," "competition," "experience," "tourism," "world," "yachting," "canoeing," "leisure," and "participation." Second, semantic network analysis revealed that yachting was correlated with terms like "maritime," "industry," "competition," "leisure," "tourism," "boat," "facilities," and "business"; rowing with terms like "competition" and "Chungju"; and canoeing with terms like "maritime," "competition," "experience," "leisure," and "tourism." Third, topic modeling analysis indicated that yachting, rowing, and canoeing are perceived as elite sports and maritime leisure sports. However, the perception of these sports has been demonstrated to have little impact on society, public opinion, and social transformation. In summary, when considering these results comprehensively, it can be concluded that yachting and canoeing have gradually shifted from being perceived as elite sports to essential elements of the maritime leisure industry. Contrariwise, rowing remains primarily associated with elite sports, and its popularization as a maritime leisure sport appears limited at this time.
https://doi.org/10.7837/kosomes.2023.29.6.609 인용 PDF

Text Mining and Association Rules Analysis to a Self-Introduction Letter of Freshman at Korea National College of Agricultural and Fisheries (2) (한국농수산대학 신입생 자기소개서의 텍스트 마이닝과 연관규칙 분석 (2))

Joo, J.S.;Lee, S.Y.;Kim, J.S.;Shin, Y.K.;Park, N.B.
- Journal of Practical Agriculture & Fisheries Research
- /
- v.22 no.2
- /
- pp.99-114
- /
- 2020
In this study we examined the topic analysis and correlation analysis by text mining from the self introduction letter of freshman at Korea National College of Agriculture and Fisheries(KNCAF) in 2020. The analysis items of the 3rd question were and the 4th question were the motivation for applying to college, the academic plan and the career plan. The text mining to the 3rd question showed that the frequency of 'friends' was overwhelmingly high, followed by keywords such as 'thought', 'time', 'opinion', 'activity', and 'club'. In the 4th question, keyword frequency such as 'thought', 'agriculture', 'KNCAF', 'farm', 'father' was high. The result of association rules analysis for each question showed that the relationship with the highest support level, which means the frequency and importance of the rule, was the {friend} <=> {thought}, {thought} <=> {KNCAF}. The confidence level of a correlation between keywords was the highest in the rules of {teacher}=>{friend}, {agriculture, KNCAF}=>{thought}. Also the lift level that indicates the closeness of two words was the highest in the rules of {friend} <=> {teacher}, {knowledge} <=> {professional}. These keywords are found to play a very important roles in analyzing betweenness centrality and analyzing degree centrality between keywords. The results of frequency analysis and association analysis were visualized with word cloud and correlation graphs to make it easier to understand all the results.
https://doi.org/10.23097/JPAF.2020.22(2):99 인용 PDF

Latent topics-based product reputation mining (잠재 토픽 기반의 제품 평판 마이닝)

Park, Sang-Min;On, Byung-Won
- Journal of Intelligence and Information Systems
- /
- v.23 no.2
- /
- pp.39-70
- /
- 2017
Data-drive analytics techniques have been recently applied to public surveys. Instead of simply gathering survey results or expert opinions to research the preference for a recently launched product, enterprises need a way to collect and analyze various types of online data and then accurately figure out customer preferences. In the main concept of existing data-based survey methods, the sentiment lexicon for a particular domain is first constructed by domain experts who usually judge the positive, neutral, or negative meanings of the frequently used words from the collected text documents. In order to research the preference for a particular product, the existing approach collects (1) review posts, which are related to the product, from several product review web sites; (2) extracts sentences (or phrases) in the collection after the pre-processing step such as stemming and removal of stop words is performed; (3) classifies the polarity (either positive or negative sense) of each sentence (or phrase) based on the sentiment lexicon; and (4) estimates the positive and negative ratios of the product by dividing the total numbers of the positive and negative sentences (or phrases) by the total number of the sentences (or phrases) in the collection. Furthermore, the existing approach automatically finds important sentences (or phrases) including the positive and negative meaning to/against the product. As a motivated example, given a product like Sonata made by Hyundai Motors, customers often want to see the summary note including what positive points are in the 'car design' aspect as well as what negative points are in thesame aspect. They also want to gain more useful information regarding other aspects such as 'car quality', 'car performance', and 'car service.' Such an information will enable customers to make good choice when they attempt to purchase brand-new vehicles. In addition, automobile makers will be able to figure out the preference and positive/negative points for new models on market. In the near future, the weak points of the models will be improved by the sentiment analysis. For this, the existing approach computes the sentiment score of each sentence (or phrase) and then selects top-k sentences (or phrases) with the highest positive and negative scores. However, the existing approach has several shortcomings and is limited to apply to real applications. The main disadvantages of the existing approach is as follows: (1) The main aspects (e.g., car design, quality, performance, and service) to a product (e.g., Hyundai Sonata) are not considered. Through the sentiment analysis without considering aspects, as a result, the summary note including the positive and negative ratios of the product and top-k sentences (or phrases) with the highest sentiment scores in the entire corpus is just reported to customers and car makers. This approach is not enough and main aspects of the target product need to be considered in the sentiment analysis. (2) In general, since the same word has different meanings across different domains, the sentiment lexicon which is proper to each domain needs to be constructed. The efficient way to construct the sentiment lexicon per domain is required because the sentiment lexicon construction is labor intensive and time consuming. To address the above problems, in this article, we propose a novel product reputation mining algorithm that (1) extracts topics hidden in review documents written by customers; (2) mines main aspects based on the extracted topics; (3) measures the positive and negative ratios of the product using the aspects; and (4) presents the digest in which a few important sentences with the positive and negative meanings are listed in each aspect. Unlike the existing approach, using hidden topics makes experts construct the sentimental lexicon easily and quickly. Furthermore, reinforcing topic semantics, we can improve the accuracy of the product reputation mining algorithms more largely than that of the existing approach. In the experiments, we collected large review documents to the domestic vehicles such as K5, SM5, and Avante; measured the positive and negative ratios of the three cars; showed top-k positive and negative summaries per aspect; and conducted statistical analysis. Our experimental results clearly show the effectiveness of the proposed method, compared with the existing method.
https://doi.org/10.13088/jiis.2017.23.2.039 인용 PDF KSCI

Building a Korean Sentiment Lexicon Using Collective Intelligence (집단지성을 이용한 한글 감성어 사전 구축)

An, Jungkook;Kim, Hee-Woong
- Journal of Intelligence and Information Systems
- /
- v.21 no.2
- /
- pp.49-67
- /
- 2015
Recently, emerging the notion of big data and social media has led us to enter data's big bang. Social networking services are widely used by people around the world, and they have become a part of major communication tools for all ages. Over the last decade, as online social networking sites become increasingly popular, companies tend to focus on advanced social media analysis for their marketing strategies. In addition to social media analysis, companies are mainly concerned about propagating of negative opinions on social networking sites such as Facebook and Twitter, as well as e-commerce sites. The effect of online word of mouth (WOM) such as product rating, product review, and product recommendations is very influential, and negative opinions have significant impact on product sales. This trend has increased researchers' attention to a natural language processing, such as a sentiment analysis. A sentiment analysis, also refers to as an opinion mining, is a process of identifying the polarity of subjective information and has been applied to various research and practical fields. However, there are obstacles lies when Korean language (Hangul) is used in a natural language processing because it is an agglutinative language with rich morphology pose problems. Therefore, there is a lack of Korean natural language processing resources such as a sentiment lexicon, and this has resulted in significant limitations for researchers and practitioners who are considering sentiment analysis. Our study builds a Korean sentiment lexicon with collective intelligence, and provides API (Application Programming Interface) service to open and share a sentiment lexicon data with the public (www.openhangul.com). For the pre-processing, we have created a Korean lexicon database with over 517,178 words and classified them into sentiment and non-sentiment words. In order to classify them, we first identified stop words which often quite likely to play a negative role in sentiment analysis and excluded them from our sentiment scoring. In general, sentiment words are nouns, adjectives, verbs, adverbs as they have sentimental expressions such as positive, neutral, and negative. On the other hands, non-sentiment words are interjection, determiner, numeral, postposition, etc. as they generally have no sentimental expressions. To build a reliable sentiment lexicon, we have adopted a concept of collective intelligence as a model for crowdsourcing. In addition, a concept of folksonomy has been implemented in the process of taxonomy to help collective intelligence. In order to make up for an inherent weakness of folksonomy, we have adopted a majority rule by building a voting system. Participants, as voters were offered three voting options to choose from positivity, negativity, and neutrality, and the voting have been conducted on one of the largest social networking sites for college students in Korea. More than 35,000 votes have been made by college students in Korea, and we keep this voting system open by maintaining the project as a perpetual study. Besides, any change in the sentiment score of words can be an important observation because it enables us to keep track of temporal changes in Korean language as a natural language. Lastly, our study offers a RESTful, JSON based API service through a web platform to make easier support for users such as researchers, companies, and developers. Finally, our study makes important contributions to both research and practice. In terms of research, our Korean sentiment lexicon plays an important role as a resource for Korean natural language processing. In terms of practice, practitioners such as managers and marketers can implement sentiment analysis effectively by using Korean sentiment lexicon we built. Moreover, our study sheds new light on the value of folksonomy by combining collective intelligence, and we also expect to give a new direction and a new start to the development of Korean natural language processing.
https://doi.org/10.13088/jiis.2015.21.2.49 인용 PDF KSCI

Search Result 102, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)