• Title/Summary/Keyword: index development


Emoticon by Emotions: The Development of an Emoticon Recommendation System Based on Consumer Emotions (Emoticon by Emotions: 소비자 감성 기반 이모티콘 추천 시스템 개발)

  • Kim, Keon-Woo;Park, Do-Hyung
    • Journal of Intelligence and Information Systems, v.24 no.1, pp.227-252, 2018
  • The evolution of instant communication has mirrored the development of the Internet, and messenger applications are among the most representative instant communication technologies. In messenger applications, senders use emoticons to supplement the emotions conveyed in the text of their messages, because communication that is not face-to-face makes it difficult for senders to convey their emotions to recipients. Emoticons have long been used as symbols that indicate the mood of the speaker. At present, however, emoticon use is evolving into a means of conveying the psychological states of consumers who want to express individual characteristics and personality quirks while communicating their emotions to others. The significance of this phenomenon is shown by the fact that companies such as KakaoTalk, Line, and Apple have entered the emoticon business and that sales of related content are expected to increase gradually. Nevertheless, despite the development of emoticons themselves and the growth of the emoticon market, no suitable emoticon recommendation system has yet been developed. Even KakaoTalk, a messenger application that commands more than 90% of the domestic market in South Korea, merely groups emoticons by popularity, recency, or brief category. Consumers therefore face the inconvenience of constantly scrolling to locate the emoticons they want. An emoticon recommendation system would improve consumer convenience and satisfaction and increase the sales revenue of companies that sell emoticons. To recommend appropriate emoticons, it is necessary to quantify the emotions that consumers perceive in emoticons. Such quantification makes it possible to analyze the characteristics and emotions felt by consumers who used similar emoticons, which in turn facilitates emoticon recommendations. One way to quantify emoticon use is metadata-ization.
Metadata-ization is a means of structuring or organizing unstructured and semi-structured data to extract meaning. By structuring unstructured emoticon data through metadata-ization, we can easily classify emoticons based on the emotions consumers want to express. To determine the precise emotions of emoticons, we had to consider detailed sub-expressions: not only the seven common emotional adjectives but also the metaphorical expressions that appear only in Korean, as established by previous studies of emotion that focus on emoticon characteristics. We therefore collected the detailed sub-expressions of emotion based on "Shape", "Color", and "Adumbration". Moreover, to design a highly accurate recommendation system, we considered both emoticon-technical indexes and emoticon-emotional indexes. We identified 14 features as emoticon-technical indexes and selected 36 emotional adjectives. The 36 emotional adjectives consisted of contrasting pairs, which we reduced to 18, and we measured the 18 emotional adjectives using 40 emoticon sets randomly selected from the top-ranked emoticons in the KakaoTalk shop. We surveyed 277 consumers in their mid-twenties who had experience purchasing emoticons; we recruited them online and asked each to evaluate five different emoticon sets. After data acquisition, we conducted a factor analysis of the emoticon-emotional factors and extracted four factors, which we named "Comic", "Softness", "Modernity", and "Transparency". We analyzed both the relationship between the indexes and consumer attitude and the relationship between emoticon-technical indexes and emoticon-emotional factors. Through this process, we confirmed that the emoticon-technical indexes did not directly affect consumer attitudes but influenced them indirectly, with the emoticon-emotional factors acting as mediators.
The results of the analysis revealed the mechanism consumers use to evaluate emoticons: the emoticon-technical indexes affected the emoticon-emotional factors, and the emoticon-emotional factors affected consumer satisfaction. We therefore designed the emoticon recommendation system using only the four emoticon-emotional factors, with a recommendation method that calculates the Euclidean distance over each factor's emotion score. To increase the accuracy of the emoticon recommendation system, we compared the emotional patterns of selected emoticons with those of the recommended emoticons; the emotional patterns corresponded in principle. We verified the emoticon recommendation system by testing prediction accuracy: the predictions were 81.02% accurate in the first result, 76.64% in the second, and 81.63% in the third. This study developed a methodology that can be used in various fields, both academically and practically. We expect that the emoticon recommendation system we designed will increase emoticon sales for companies that conduct business in this domain and make consumer experiences more convenient. In addition, this study serves as an important first step in the development of an intelligent emoticon recommendation system. The emotional factors proposed in this study could be collected in an emotional library that serves as an emotion index for evaluating new emoticons when they are released. Moreover, by combining the accumulated emotional library with company sales data, sales information, and consumer data, companies could develop hybrid recommendation systems that would bolster convenience for consumers and serve as intellectual assets that companies could strategically deploy.
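The distance-based recommendation step described in this abstract can be illustrated with a minimal sketch. It assumes each emoticon set is represented by four factor scores (Comic, Softness, Modernity, Transparency); the set names, the scores, and the `recommend` helper are hypothetical and are not taken from the paper.

```python
import math

# Hypothetical factor scores per emoticon set, in the order
# (Comic, Softness, Modernity, Transparency). Values are made up.
EMOTICON_FACTORS = {
    "set_a": (4.2, 2.1, 3.0, 2.5),
    "set_b": (4.0, 2.4, 3.1, 2.7),
    "set_c": (1.5, 4.5, 2.0, 3.8),
    "set_d": (2.0, 4.1, 2.2, 3.5),
}


def euclidean(p, q):
    """Euclidean distance between two equal-length score vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def recommend(selected, catalog, k=2):
    """Return the k emoticon sets closest to the selected one."""
    target = catalog[selected]
    others = [
        (name, euclidean(target, scores))
        for name, scores in catalog.items()
        if name != selected
    ]
    others.sort(key=lambda item: item[1])  # nearest first
    return [name for name, _ in others[:k]]


print(recommend("set_a", EMOTICON_FACTORS, k=1))
```

A nearest-neighbor lookup like this treats all four factors as equally weighted, which is an assumption of the sketch rather than something the abstract states.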

Development of a Stock Trading System Using M & W Wave Patterns and Genetic Algorithms (M&W 파동 패턴과 유전자 알고리즘을 이용한 주식 매매 시스템 개발)

  • Yang, Hoonseok;Kim, Sunwoong;Choi, Heung Sik
    • Journal of Intelligence and Information Systems, v.25 no.1, pp.63-83, 2019
  • Investors prefer to look for trading points based on the shapes shown in a price chart rather than on complex analysis such as corporate intrinsic value analysis or technical auxiliary index analysis. However, pattern analysis is difficult and has been computerized less than users need. In recent years, many studies have examined stock price patterns using various machine learning techniques, including neural networks, in the field of artificial intelligence (AI). In particular, advances in information technology have made it easier to analyze huge volumes of chart data to find patterns that can predict stock prices. Although short-term price forecasting performance has improved, long-term forecasting power remains limited, so these methods are used for short-term trading rather than long-term investment. Other studies have focused on mechanically and accurately identifying patterns that earlier technology could not recognize, but this approach can be weak in practice because whether the patterns found are suitable for trading is a separate matter. When such studies find a meaningful pattern, they locate a point that matches the pattern and then measure performance after n days, assuming that the stock was bought at that point. Because this approach calculates hypothetical returns, the results can diverge considerably from reality. Whereas existing research tries to find patterns with stock price prediction power, this study proposes to define the patterns first and to trade when a pattern with a high probability of success appears. The M & W wave patterns published by Merrill (1980) are simple because they can be distinguished by five turning points. Although some of these patterns have been reported to have price predictability, no performance in the actual market has been reported. The simplicity of a pattern consisting of five turning points has the advantage of reducing the cost of increasing pattern recognition accuracy.
In this study, the 16 upward-reversal patterns and 16 downward-reversal patterns are reclassified into ten groups so that the system can implement them easily. Only one pattern with a high success rate is selected per group for trading. Patterns that had a high probability of success in the past are likely to succeed in the future, so we trade when such a pattern occurs. The measurement reflects a realistic situation because it assumes that both the buy and the sell have been executed. We tested three ways to calculate the turning points. The first, the minimum-change-rate zig-zag method, removes price movements below a certain percentage and then calculates the vertices. In the second, the high-low line zig-zag method, a high price that meets the n-day high price line is taken as the peak price, and a low price that meets the n-day low price line is taken as the valley price. In the third, the swing wave method, a central high price that is higher than the n high prices on its left and right is taken as the peak price, and a central low price that is lower than the n low prices on its left and right is taken as the valley price. The swing wave method was superior to the other methods in the test results. We interpret this to mean that trading after confirming that the pattern is complete is more effective than trading while the pattern is still incomplete. Because the number of cases was too large in this simulation, it was virtually impossible to find patterns with high success rates directly, so genetic algorithms (GA) were the most suitable solution. We also performed the simulation using walk-forward analysis (WFA), which separates the test section from the application section, so that we could respond appropriately to market changes. In this study, we optimize at the level of the stock portfolio, because optimizing the variables for each individual stock carries a risk of over-optimization.
Therefore, we set the number of constituent stocks to 20 to increase the effect of diversification while avoiding over-optimization. We tested the KOSPI market by dividing it into six categories. In the results, the small-cap stock portfolio was the most successful and the high-volatility stock portfolio was the second best. This shows that some price volatility is needed for patterns to take shape, but that the highest volatility is not necessarily the best.
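The swing wave rule for turning points, as worded in this abstract, can be sketched as follows. This is an illustrative reading only: the function name `swing_points`, the strict inequalities, and the sample prices are assumptions, and the authors' actual implementation may differ.

```python
def swing_points(highs, lows, n=2):
    """Find swing peaks and valleys.

    A bar is a peak if its high exceeds the n highs on each side,
    and a valley if its low is below the n lows on each side.
    Returns a list of (index, kind, price) tuples.
    """
    points = []
    for i in range(n, len(highs) - n):
        left_highs, right_highs = highs[i - n:i], highs[i + 1:i + n + 1]
        left_lows, right_lows = lows[i - n:i], lows[i + 1:i + n + 1]
        if highs[i] > max(left_highs) and highs[i] > max(right_highs):
            points.append((i, "peak", highs[i]))
        elif lows[i] < min(left_lows) and lows[i] < min(right_lows):
            points.append((i, "valley", lows[i]))
    return points


# Made-up daily high and low prices for illustration.
highs = [10, 11, 13, 11, 10, 9, 10, 12, 11, 10]
lows = [9, 10, 12, 10, 9, 7, 9, 11, 10, 9]
print(swing_points(highs, lows, n=2))
```

Because a point is confirmed only after n later bars are known, this method identifies a turning point with a delay, which fits the abstract's remark that trading after the pattern is complete worked better.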

Study on Development of Acute Index Through Peripheral Blood Test in Total Body Irradiation Patients (방사선 전신조사 환자의 말초혈액 검사를 통한 급성장애 지표개발에 관한 연구)

  • Park, Young-Hwan
    • The Journal of Korean Society for Radiation Therapy, v.13 no.1, pp.91-103, 2001
  • The effect of high-dose irradiation on peripheral blood was investigated in 18 randomly selected patients who underwent total body irradiation with 10 MV X-rays for bone marrow transplantation. Changes in peripheral blood values (WBC, segmented neutrophils, lymphocytes, RBC, hemoglobin, hematocrit, GOT, GPT) were observed for 25 days after irradiation and compared with the values before irradiation in these patients and with those of 32 controls. The WBC count after radiation therapy fell to $0.22{\times}10^3{\pm}0.19{\times}10^3/mm^3$, which was $8.63\%$ of the value before radiation therapy ($2.51{\times}10^3{\pm}1.29{\times}10^3/mm^3$) and only $6.35\%$ of the control value ($7.17{\times}10^3{\pm}1.46{\times}10^3/mm^3$), a statistically significant decrease (P<0.01, $r^2$=0.9151). In the WBC differential count, segmented neutrophils decreased to $14.17{\pm}21.60\%$, which was $20.53\%$ of the value before irradiation ($69.00{\pm}25.60\%$) and $24.39\%$ of the control value ($58.09{\pm}7.62\%$) (P<0.05, $r^2$=0.6316). The lymphocyte fraction rose to $79.91{\pm}27.30\%$, 3.94 times the value before irradiation ($20.29{\pm}21.15\%$) and 2.39 times the control value ($33.46{\pm}6.79\%$) (P<0.05, $r^2$=0.7337). The RBC count changed little, with $3.18{\times}10^6{\pm}0.41{\times}10^6/mm^3$ before irradiation and $4.66{\times}10^6{\pm}0.43{\times}10^6/mm^3$ in the controls, and the difference was not statistically significant (P>0.05). In addition, the hemoglobin level was lower than before irradiation and lower than in the controls, but neither difference was statistically significant (P>0.05). Similarly, the hematocrit was lower than before irradiation and lower than in the controls, without statistical significance (P>0.05).
For GOT and GPT, the values after irradiation differed significantly from both the values before irradiation and the control values (P<0.05). Thus, among the peripheral blood values, the WBC count decreased drastically for a certain period after high-dose total body irradiation, and the levels of segmented neutrophils and lymphocytes decreased significantly after irradiation in the differential count. The results of the present study suggest the possibility of developing an index for the early identification of patients who may develop acute radiation impairment, using the routine peripheral blood tests of patients exposed to high-dose irradiation.


Development of New 4D Phantom Model in Respiratory Gated Volumetric Modulated Arc Therapy for Lung SBRT (폐암 SBRT에서 호흡동조 VMAT의 정확성 분석을 위한 새로운 4D 팬텀 모델 개발)

  • Yoon, KyoungJun;Kwak, JungWon;Cho, ByungChul;Song, SiYeol;Lee, SangWook;Ahn, SeungDo;Nam, SangHee
    • Progress in Medical Physics, v.25 no.2, pp.100-109, 2014
  • In stereotactic body radiotherapy (SBRT), the accurate localization of treatment sites must be guaranteed despite the respiratory motion of patients, and many studies have addressed this topic. In this letter, we propose a new verification method that simulates the real respiratory motion of heterogeneous treatment regions in order to investigate the accuracy of volumetric modulated arc therapy (VMAT) for lung SBRT. Based on the CT images of lung cancer patients, lung phantoms were fabricated with a 3D printer to fit into the $QUASAR^{TM}$ respiratory motion phantom. The phantom was bisected so that 2D dose distributions could be measured by inserting EBT3 film. To verify the dose calculation accuracy under heterogeneous conditions, a homogeneous plastic phantom was also used. Two dose calculation algorithms, the Analytical Anisotropic Algorithm (AAA) and AcurosXB (AXB), were applied in the plan dose calculation. To evaluate the accuracy of treatment under respiratory motion, we analyzed the gamma index between the plan dose and the film dose measured under various motion conditions: a static target, and a moving target with and without gating. The CT number of the GTV region was 78 HU for the real patient and 92 HU for the homemade lung phantom. With the 3%/3 mm criteria, the gamma pass rates between the plan dose calculated by AAA and the film doses measured in the heterogeneous lung phantom under respiratory motion were 88% with gated and 78% with non-gated beam delivery. In the static case, the gamma pass rate was 95%. In all cases with the homogeneous phantom, the gamma pass rates were above 99%. With the AcurosXB algorithm, gamma pass rates above 98% were achieved for the heterogeneous phantom and above 99% for the homogeneous phantom. Because the respiratory amplitude was relatively small and the breathing pattern had a longer exhale phase than inhale phase, the gamma pass rates with the 3%/3 mm criteria did not differ significantly among the motion conditions.
In this study, a new phantom model for 4D dose distribution verification, using patient-specific lung phantoms that move in real breathing patterns, was successfully implemented. The evaluation also showed that the model can verify both the dose distributions delivered under more realistic conditions and the accuracy of dose calculation.
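To make the 3%/3 mm gamma criterion concrete, here is a simplified one-dimensional sketch of a gamma pass rate. It assumes global normalization to the maximum reference dose and a brute-force search over sample points; the function name, the sample profile, and these choices are illustrative assumptions, not the analysis software used in the study, which worked on 2D film distributions.

```python
import math


def gamma_pass_rate(positions_mm, ref_dose, eval_dose,
                    dose_tol=0.03, dist_tol_mm=3.0):
    """Percentage of reference points with gamma <= 1 (1D, global norm).

    For each reference point, gamma is the minimum over all evaluated
    points of sqrt((dx / dist_tol)^2 + (dD / (dose_tol * Dmax))^2).
    """
    d_max = max(ref_dose)  # assumption: global normalization
    passed = 0
    for x_r, d_r in zip(positions_mm, ref_dose):
        gamma = min(
            math.sqrt(((x_e - x_r) / dist_tol_mm) ** 2
                      + ((d_e - d_r) / (dose_tol * d_max)) ** 2)
            for x_e, d_e in zip(positions_mm, eval_dose)
        )
        if gamma <= 1.0:
            passed += 1
    return 100.0 * passed / len(ref_dose)


# Made-up dose profile sampled every 1 mm.
positions = [0.0, 1.0, 2.0, 3.0, 4.0]
plan = [1.0, 2.0, 3.0, 2.0, 1.0]
print(gamma_pass_rate(positions, plan, plan))
```

A point passes either because the dose agrees within 3% at the same position or because a matching dose lies within 3 mm, which is why small positional shifts from limited respiratory amplitude may leave the pass rate largely unchanged.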

Comparison of Chemical Abstracts and CA Condensates in Literature Searching (문헌검색에 있어서 Chemical Abstracts와 CA Condensates의 비교)

  • Robert, B.E.
    • Journal of Information Management, v.9 no.1, pp.21-25, 1976
  • In March 1975, four and a half years of the Chemical Abstracts indexes were compared with CA Condensates, which is available online. Searching both databases together is the most effective approach, but, as the examples show, searching CA Condensates is more practical. CHEMCON and CHEM7071, the online versions installed at System Development Corp. (SDC), were compared with the Chemical Abstracts indexes. Most Chemical Abstracts users are familiar with the printed Chemical Abstracts issues and the cumulative indexes but are probably less familiar with CA Condensates. CA Condensates holds bibliographic information in machine-readable form and is indexed in accordance with Chemical Abstracts, so it takes the form most familiar to us, like the index at the back of each weekly Chemical Abstracts issue. Although Chemical Abstracts is the database currently in use, this paper defines both the Index and Condensates as databases. Once Condensates became commercially available from the Chemical Abstracts Service in the United States, several information centers began offering a current-awareness service (SDI) that batch-processes user profiles to search the weekly magnetic tapes for the latest information. Some information centers collect past magnetic tapes and also offer retrospective literature searches in batch mode. Except for those who handle the magnetic tapes directly, most people still know little about Condensates. Retrospective literature searching is somewhat expensive, and browsing haphazardly through the literature is not practical. Searching the weekly indexes for a combination of two or more concepts or substances is difficult and impractical. It is sometimes easier to look at all the citations listed under a given term and judge their relevance from the abstracts. The availability of Condensates for interactive online searching has brought many changes. It is now possible to retrieve only the documents needed, and any item can be fully indexed. The troublesome delay of several hours to several days between starting a search and receiving its results in a batch system can now usually be cut to just a few minutes. Unlike in a batch system, an inaccurate or inadequate search strategy can be corrected immediately. Compared with sequential batch searching, random-access online searching has clear and definite advantages. As CA Condensates came into frequent use, its true value became a matter of debate. The indexing of CA Condensates is clearly less systematic and less thorough than that of the printed Chemical Abstracts issues or the cumulative indexes. Moreover, because the two databases overlap considerably, one must decide whether searching both is worthwhile.
Several papers comparing CA Condensates with other databases have been published, and CA Condensates has generally been found to be the inferior database. Buckley cited an example (on the preparation of Terramycin) in which the Chemical Abstracts indexes yielded better references than CA Condensates. The Search Center at the University of Georgia found that CA Condensates performed worse than the CA Integrated Subject File. Various other forms of CA Condensates have also been compared. Michaels published a paper comparing online searches of CA Condensates with manual searches of the indexes in the weekly Chemical Abstracts issues, and Prewitt compared two commercial versions of CA Condensates stored online. The Amoco Research Center also compared search results from CA Condensates and the Chemical Abstracts indexes and found examples favoring CA Condensates, examples favoring the indexes, and examples in which the two were effectively equivalent. In March 1975, at least four years of CA Condensates and the indexes (Vols 72-79, 1970-1973) were compared. Author and general subject searches were compared using Vol 80 (Jan-June, 1974). CA Condensates is usually inconvenient for searching for specific compounds; the example given by Buckley is typical. However, other types of search examples (corporate author, patent assignee, individual author, general and specific compounds, and reaction types) illustrate the advantages of CA Condensates for practical searching. In the following examples, CHEMCON and CHEM7071 are online implementations of CA Condensates.


A study on OHIP-14 and EQ-5D of residents in some rural areas (일부 농촌지역 주민들의 OHIP-14와 EQ-5D에 관한 연구)

  • Lee, Eun-Gyeong;Park, Jeong-Hee;Park, Jeong-Ran;Park, Jae-Yong
    • Journal of Korean society of Dental Hygiene, v.11 no.2, pp.197-211, 2011
  • Objectives: Using OHIP-14 and EQ-5D, this study surveyed residents of farming communities to identify the factors that influence oral health-related quality of life and health-related quality of life, and to establish the importance of oral health, in order to improve adults' oral health and, through it, their quality of life. Methods: The survey was conducted from July 17 to August 16, 2010, among 600 residents of Goryeong-gun, Gyeongsangbuk-do, aged over 40. The data were analyzed with the Mann-Whitney U test, the Kruskal-Wallis test, and hierarchical multiple regression using SPSS for Windows version 18.0. Results: 1. By general characteristics, oral health-related quality of life (OHIP-14) and health-related quality of life (EQ-5D) were lower in the following groups: women (p=0.004, p<0.001), older residents (p<0.001, p<0.001), those with less education (p<0.001, p<0.001), those with lower average spending money (p<0.001, p<0.001), those with more chronic diseases (p<0.001, p<0.001), those who drank less (p=0.012, p=0.008), and those with lower perceived oral health and health status (p<0.001, p<0.001); non-smoking was associated with lower quality of life only on EQ-5D (p<0.001). 2. By oral health behavior, oral health-related quality of life and health-related quality of life were lower in the following groups: those with no periodic oral check-ups (p<0.001, p<0.001), those with less experience of oral health education (p<0.001, p<0.001), and those using the horizontal tooth-brushing method (p<0.001, p<0.001); a lower frequency of tooth-brushing was associated with lower quality of life only on OHIP-14 (p=0.042).
By oral health status and subjective oral symptoms, oral health-related quality of life and health-related quality of life were lower in the following groups: those with fewer than 20 existing teeth (p<0.001, p<0.001), those with more than 9 missing teeth (p<0.001, p=0.044), those with a DMFT (decayed, missing, and filled teeth) index above 18 (p<0.001, p<0.001), denture wearers (p<0.001, p<0.001), edentulous residents (p<0.001, p=0.002), those with xerostomia (p<0.001, p<0.001), and those with chewing discomfort (p<0.001, p<0.001). 3. The factors affecting OHIP-14 were gender, age, perceived oral health status, perceived health status, number of existing teeth, dental status, xerostomia, and chewing discomfort, and the explanatory power of the final model was 48.7%. EQ-5D was associated with gender, age, presence of chronic disease, perceived health status, xerostomia, chewing discomfort, and oral health-related quality of life, and the explanatory power of the final model was 42.9%. Conclusions: To improve the quality of life of rural residents, oral health needs to be improved or maintained by increasing the retention of existing teeth and preventing tooth loss. This is considered to require better access to dental clinics, a shift from a treatment-centered to a prevention-centered health care system, the development of oral health education programs, and oral health care policies that sustain continuous oral health care.

Development of Sentiment Analysis Model for the hot topic detection of online stock forums (온라인 주식 포럼의 핫토픽 탐지를 위한 감성분석 모형의 개발)

  • Hong, Taeho;Lee, Taewon;Li, Jingjing
    • Journal of Intelligence and Information Systems, v.22 no.1, pp.187-204, 2016
  • Document classification based on emotional polarity has become an important emerging task owing to the explosion of data on the Web. In the big data age, there are too many information sources to consult when making decisions. For example, when considering travel to a city, a person may look for reviews through a search engine such as Google or on social networking services (SNSs) such as blogs, Twitter, and Facebook. The emotional polarity of positive and negative reviews helps the user decide whether to make the trip. Sentiment analysis of customer reviews has become an important research topic as data mining technology has been widely adopted for text mining of the Web. Sentiment analysis has been used to classify documents through machine learning techniques such as decision trees, neural networks, and support vector machines (SVMs). It is used to determine the attitude, position, and sensibility of people who write articles about various topics published on the Web. Regardless of their polarity, emotional reviews are very helpful material for analyzing customers' opinions. Through automated text mining techniques, sentiment analysis helps reveal quickly what customers really want. It applies text mining techniques to text on the Web to extract the subjective information the text contains. In this study, we developed a model that selects hot topics from user posts on a Chinese online stock forum by using the k-means algorithm and a self-organizing map (SOM). In addition, we developed a detection model that predicts hot topics by using machine learning techniques such as logit, decision trees, and SVM.
We employed sentiment analysis to develop our model for the selection and detection of hot topics from the Chinese online stock forum. The sentiment analysis calculates a sentiment value for a document by comparing and classifying its content against a polarity sentiment dictionary (positive or negative). The online stock forum was an attractive site because of its information about stock investment. Users post numerous texts about stock movements, analyzing the market in light of government policy announcements, market reports, reports from economic research institutes, and even rumors. We divided the forum's topics into 21 categories for sentiment analysis, and 144 topics were selected from these 21 categories. The posts were crawled to build a positive and negative text database. After preprocessing the text from March 2013 to February 2015, we ultimately obtained 21,141 posts on 88 topics. An interest index was defined to select the hot topics, and the k-means algorithm and SOM produced equivalent results on this data. We developed decision tree models to detect hot topics with three algorithms: CHAID, CART, and C4.5. The results of CHAID were inferior to those of the others. We also employed SVM to detect hot topics from the negative data. The SVM models were trained with the radial basis function (RBF) kernel, with parameters chosen by grid search. Detecting hot topics through sentiment analysis gives investors the latest trends and hot topics in the stock forum, so that they no longer need to search the vast amount of information on the Web. Our proposed model also helps to determine rapidly customers' signals or attitudes toward government policy and firms' products and services.
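The dictionary-based scoring step mentioned in this abstract can be illustrated with a minimal sketch. The word lists, the whitespace tokenizer, and the normalized score formula below are all assumptions made for illustration; the study used a polarity dictionary for Chinese forum posts, and its exact scoring rule is not given in the abstract.

```python
# Hypothetical polarity dictionary (English words for readability).
POSITIVE = {"rise", "gain", "bullish", "profit"}
NEGATIVE = {"fall", "loss", "bearish", "crash"}


def sentiment_score(post):
    """Return a polarity score in [-1, 1] for one post.

    Score = (positive hits - negative hits) / (all hits);
    a post with no dictionary words scores 0.0.
    """
    tokens = post.lower().split()  # assumption: whitespace tokenization
    pos = sum(token in POSITIVE for token in tokens)
    neg = sum(token in NEGATIVE for token in tokens)
    if pos + neg == 0:
        return 0.0
    return (pos - neg) / (pos + neg)


print(sentiment_score("Bullish gain expected despite one loss"))
```

Scores like this could then be aggregated per topic and fed to a clustering or classification step; that wiring is left out here because the abstract does not describe it in detail.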

Comparison of Natural Flow Estimates for the Han River Basin Using TANK and SWAT Models (TANK 모형과 SWAT 모형을 이용한 한강유역의 자연유출량 산정 비교)

  • Kim, Chul-Gyum;Kim, Nam-Won
    • Journal of Korea Water Resources Association, v.45 no.3, pp.301-316, 2012
  • Two models, TANK and SWAT (Soil and Water Assessment Tool), were compared in simulating natural flows in the areas upstream of Paldang Dam in the Han River basin, in order to understand the limitations of TANK and to review the applicability and capability of SWAT. Simulation results from previous research were used for the comparison. For the calibrated watersheds (Chungju Dam and Soyanggang Dam), both models gave promising results for daily flows, with Nash-Sutcliffe model efficiencies of around 0.8. TANK simulated observations during some peak flood seasons better than SWAT but performed poorly during dry seasons; in particular, its simulated flows did not fall below a certain value. This can be explained by TANK having been calibrated for relatively large flows rather than small ones. SWAT results agreed relatively well with observed flows except for some flood flows, and the simulated inflows at Paldang Dam, which take into account discharges from the upstream dams, matched observations with a model efficiency of around 0.9. This supports the applicability of SWAT, with higher accuracy, for predicting natural flows without dam operation or artificial water use and for assessing flow variations before and after dam development. The two models were also compared for other watersheds, namely Pyeongchang-A, Dalcheon-B, Seomgang-B, Inbuk-A, Hangang-D, and Hongcheon-A, to which the calibrated TANK parameters were applied. The results were similar to those for the calibrated watersheds: apart from some flood flows, TANK simulated smaller flows poorly and again stayed above a certain value in dry seasons. This indicates that applying TANK may involve critical uncertainties in estimating low flows, which are used as an important index in water resources planning and management.
Therefore, in order to reflect the complex physical characteristics of Korean watersheds and to manage water resources efficiently as land use and water use change with urbanization or climate change, it is necessary to use a physically based watershed model such as SWAT rather than an existing conceptual lumped model such as TANK.
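The Nash-Sutcliffe model efficiency cited in this abstract follows a standard formula, one minus the ratio of the squared simulation error to the variance of the observations about their mean. A minimal sketch is shown below; the function name and the sample flow values are made up for illustration.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - sum((o - s)^2) / sum((o - mean_o)^2).

    1.0 means a perfect match; 0.0 means the simulation is no better
    than always predicting the mean of the observations.
    """
    mean_obs = sum(observed) / len(observed)
    error = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    variance = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - error / variance


# Made-up daily flows (m^3/s) for illustration.
observed_flow = [1.0, 2.0, 3.0]
print(nash_sutcliffe(observed_flow, [1.0, 2.0, 3.0]))
print(nash_sutcliffe(observed_flow, [2.0, 2.0, 2.0]))
```

Because the errors are squared, the statistic is dominated by large flows, which is consistent with the abstract's observation that a model can score around 0.8 overall and still reproduce dry-season low flows poorly.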

A Study of the Psychosomatic Self-Reported Symptom of Dental Hygiene Students (일부 치위생과 재학생의 심신 자각증상에 관한 연구)

  • Kwon, Soon-Suk;Moon, Hee-Jung
    • Journal of dental hygiene science, v.12 no.4, pp.413-421, 2012
  • The main purpose of this study is to present practical data for developing health-enhancing programs for dental hygiene students, based on an investigation of their psychosomatic self-reported symptoms. We surveyed randomly selected dental hygiene students currently enrolled in the Gyeonggi-do and Gangwon-do regions and analyzed the data collected from 432 respondents. The results are as follows: 1. Mental complaints (19.37) scored higher than physical complaints (17.53); among the psychosomatic symptom items, mental instability scored the highest (21.85), whereas mouth and anus scored the lowest (14.59). 2. By religion, statistically significant differences were found in physical complaints (p<.01), mental complaints (p<.05), multiple subjective symptoms (p<.01), digestive organs (p<.01), aggressiveness (p<.01), nervousness (p<.01), eye and skin (p<.05), and mental instability (p<.05). 3. By living conditions, statistically significant differences were found in physical complaints (p<.05), mental complaints (p<.01), depression (p<.001), irregular life (p<.001), multiple subjective symptoms (p<.01), lie scale (p<.01), mouth and anus (p<.05), and digestive organs (p<.05). 4. By regular health check-ups, statistically significant differences were found in mental complaints (p<.05), depression (p<.01), multiple subjective symptoms (p<.05), and mental instability (p<.05).

Selecting the Optimal Research Time for Forest Birds Census in Each Season (산새류의 계절별 적정 조사시간 선정 연구)

  • Kim, Mi-Jeong;Lee, Soo-Dong;Kim, Ji-Suk
    • Korean Journal of Environment and Ecology, v.27 no.2, pp.219-229, 2013
  • This study was performed to suggest effective survey times in each season for surveys of wild birds living in forests. The study area is the Mulgun-ri village forest in Samdong-myeon, Namhae-gun. To assess the suitability of the site, the existing land use, existing vegetation, vegetation structure, and other attributes of the study area and its surroundings were identified. To determine adequate survey times, surveys were repeated at hourly intervals for 3 days, based on the seasonal sunrise and sunset times. The study area is connected to a forest, so forest birds can move into it, and it provides various habitats and feeding areas. It is also an appropriate area for wild bird surveys and resembles a natural forest in its well-developed layer structure and its distribution of Zelkova serrata, Aphananthe aspera, and other species. The 105 species observed in the study area were categorized, and the forest birds among them were classified. After the hourly peak values were selected for each season, hourly species richness, diversity, and similarity index were analyzed and compared with the numbers of species and individuals observed. As a result, 7:00-11:00 was the most effective period in spring, with 8:00-9:00 the best hour. In summer, 6:00-9:00 was the most appropriate period, when the species observed were most similar to the overall species composition. In fall, 7:00-11:00 (from 30-60 minutes after sunrise), when wild birds move actively, was found to be favorable for observation, and 8:00-9:00 was the most appropriate survey time because every analysis showed its best values then. In winter, 7:00-12:00 was the most effective period and 10:00-11:00 the best hour, but similar results can be expected at other hours because the hourly deviation was small except for the hour before sunrise. In all four seasons, the period 30-60 minutes after sunrise was judged appropriate for surveying the wild bird community in the study area.
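The hourly diversity and similarity comparison described in this abstract can be sketched as follows. The abstract does not name the specific indices, so this example assumes the Shannon diversity index and the Sørensen similarity index, two common choices; the species names and counts are made up.

```python
import math


def shannon_diversity(counts):
    """Shannon index H' = -sum(p_i * ln p_i) over species with count > 0."""
    total = sum(counts.values())
    return -sum(
        (c / total) * math.log(c / total)
        for c in counts.values()
        if c > 0
    )


def sorensen_similarity(species_a, species_b):
    """Sørensen index 2|A ∩ B| / (|A| + |B|) between two species lists."""
    a, b = set(species_a), set(species_b)
    return 2 * len(a & b) / (len(a) + len(b))


# Hypothetical counts for one survey hour and for the whole day.
hour_8 = {"great tit": 5, "brown-eared bulbul": 5}
whole_day = {"great tit": 12, "brown-eared bulbul": 9, "oriental turtle dove": 4}

print(len(hour_8))  # species richness as a simple species count
print(shannon_diversity(hour_8))
print(sorensen_similarity(hour_8, whole_day))
```

Comparing each hour's species list against the whole-day list in this way gives one number per hour, so the hour whose composition is closest to the full community can be picked out.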