• Title/Summary/Keyword: 수준점

Search Result 5,933, Processing Time 0.038 seconds

Development of a complex failure prediction system using Hierarchical Attention Network (Hierarchical Attention Network를 이용한 복합 장애 발생 예측 시스템 개발)

  • Park, Youngchan;An, Sangjun;Kim, Mintae;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.4
    • /
    • pp.127-148
    • /
    • 2020
  • The data center is a physical environment facility for accommodating computer systems and related components, and is an essential foundation technology for next-generation core industries such as big data, smart factories, wearables, and smart homes. In particular, with the growth of cloud computing, the proportional expansion of the data center infrastructure is inevitable. Monitoring the health of these data center facilities is a way to maintain and manage the system and prevent failure. If a failure occurs in some elements of the facility, it may affect not only the relevant equipment but also other connected equipment, and may cause enormous damage. In particular, IT facilities are irregular due to interdependence and it is difficult to know the cause. In the previous study predicting failure in data center, failure was predicted by looking at a single server as a single state without assuming that the devices were mixed. Therefore, in this study, data center failures were classified into failures occurring inside the server (Outage A) and failures occurring outside the server (Outage B), and focused on analyzing complex failures occurring within the server. Server external failures include power, cooling, user errors, etc. Since such failures can be prevented in the early stages of data center facility construction, various solutions are being developed. On the other hand, the cause of the failure occurring in the server is difficult to determine, and adequate prevention has not yet been achieved. In particular, this is the reason why server failures do not occur singularly, cause other server failures, or receive something that causes failures from other servers. In other words, while the existing studies assumed that it was a single server that did not affect the servers and analyzed the failure, in this study, the failure occurred on the assumption that it had an effect between servers. In order to define the complex failure situation in the data center, failure history data for each equipment existing in the data center was used. There are four major failures considered in this study: Network Node Down, Server Down, Windows Activation Services Down, and Database Management System Service Down. The failures that occur for each device are sorted in chronological order, and when a failure occurs in a specific equipment, if a failure occurs in a specific equipment within 5 minutes from the time of occurrence, it is defined that the failure occurs simultaneously. After configuring the sequence for the devices that have failed at the same time, 5 devices that frequently occur simultaneously within the configured sequence were selected, and the case where the selected devices failed at the same time was confirmed through visualization. Since the server resource information collected for failure analysis is in units of time series and has flow, we used Long Short-term Memory (LSTM), a deep learning algorithm that can predict the next state through the previous state. In addition, unlike a single server, the Hierarchical Attention Network deep learning model structure was used in consideration of the fact that the level of multiple failures for each server is different. This algorithm is a method of increasing the prediction accuracy by giving weight to the server as the impact on the failure increases. The study began with defining the type of failure and selecting the analysis target. In the first experiment, the same collected data was assumed as a single server state and a multiple server state, and compared and analyzed. The second experiment improved the prediction accuracy in the case of a complex server by optimizing each server threshold. In the first experiment, which assumed each of a single server and multiple servers, in the case of a single server, it was predicted that three of the five servers did not have a failure even though the actual failure occurred. However, assuming multiple servers, all five servers were predicted to have failed. As a result of the experiment, the hypothesis that there is an effect between servers is proven. As a result of this study, it was confirmed that the prediction performance was superior when the multiple servers were assumed than when the single server was assumed. In particular, applying the Hierarchical Attention Network algorithm, assuming that the effects of each server will be different, played a role in improving the analysis effect. In addition, by applying a different threshold for each server, the prediction accuracy could be improved. This study showed that failures that are difficult to determine the cause can be predicted through historical data, and a model that can predict failures occurring in servers in data centers is presented. It is expected that the occurrence of disability can be prevented in advance using the results of this study.

Effects of Some Physico-Chemical Conditions of Sioil on Growth and Ionic Balance of the Tobacco Plant (Nicotiana Tabacum L.) I. Effect of Acidity(pH), Moisture(pF) and Anions (Cl-, SO4-) in Soil on Grwth and Ionic Balance of Tobacco (토양(土壤)의 몇가지 이화학적조건(理化學的條件)이 연초(煙草)의 생육(生育) 및 이온평형(平衡)에 미치는 영향(影響) I. 토양(土壤)의 pH, pF와 음(陰)이온(Cl-, SO4-)이 연초(煙草)의 생육(生育) 및 이온평형(平衡)에 미치는 영향(影響))

  • Kim, Jai-Jong;Cho, Seong-Jin
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.14 no.3
    • /
    • pp.117-129
    • /
    • 1981
  • An experiment with the tobacco plant was conducted in the pots. A sandy humic soil was used with 2 levels of pH, 3.5 and 5.8 with 2 kinds of anions, Cl as $NH_4Cl$ and $SO_4$ as $(NH_4)_2SO_4$, and with 4 levels of pF, 1.5, 2.0, 2.5, and 3.5. The pH-treatment created different N-forms; $NH_4$ at low pH(3.5) and $NO_3$ at high pH (5.8). The results are summarized as follows: 1. At low pH (3.5) with high concentration of $NH_4$ given as $NH_4Cl$, the high content of $NH_4$ and Cl in tobacco resulted in plants suffering from $NH_4$ and Cl toxicity as well as Mn toxicity. As a result of these toxicity, an extremly abnormal growth of tobacco was clearly appeared. In the tobacco grown at low pH with $NH_4$ given as $(NH_4)_2SO_4$, a large amount of the $NH_4$ uptake developed Mg and Ca deficiencies. $NH_4-N$, which had been applied to the soil of high pH (5.8), was almost completely transformed into $NO_3-N$ by nitrification and, on this low acidic soil, the plants were all healthy regardless of Cl or $SO_4$ added together with $NH_4-N$. However, dry matter production was higher and maturity faster when $SO_4$ was used as anion than when Cl was used. 2. High moisture content in soil, to some extent, is necessary for a good development and growth of the tobacco plant. Phosphate uptake seemed to be limited at higher moisture stress. The dry matter yield of tops and roots of tobacco were in the order of pF 1.8 > 2.1 > 2.6 > 3.6, respectively. 3. Data of chemical analysis and dry matter yields of tops and roots showed that the tobacco plant followed the normal (C-A) concept. In the normal growth of plants, the carboxylate content of tops was quite comparable to the estimated (C-A) values. If $NH_4$ content of plants remains in quite high quantities, it must be analysed and taken into consideration for the (C-A) calculation. Al is not transported toward tops in toxic amounts due to its high immobility, it mostly stay in or on the roots, probably due to precipitation as a aolt. When Al is present in high quantities, it has to be considered into the (C-A) calculation.

  • PDF

Public Sentiment Analysis of Korean Top-10 Companies: Big Data Approach Using Multi-categorical Sentiment Lexicon (국내 주요 10대 기업에 대한 국민 감성 분석: 다범주 감성사전을 활용한 빅 데이터 접근법)

  • Kim, Seo In;Kim, Dong Sung;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.45-69
    • /
    • 2016
  • Recently, sentiment analysis using open Internet data is actively performed for various purposes. As online Internet communication channels become popular, companies try to capture public sentiment of them from online open information sources. This research is conducted for the purpose of analyzing pulbic sentiment of Korean Top-10 companies using a multi-categorical sentiment lexicon. Whereas existing researches related to public sentiment measurement based on big data approach classify sentiment into dimensions, this research classifies public sentiment into multiple categories. Dimensional sentiment structure has been commonly applied in sentiment analysis of various applications, because it is academically proven, and has a clear advantage of capturing degree of sentiment and interrelation of each dimension. However, the dimensional structure is not effective when measuring public sentiment because human sentiment is too complex to be divided into few dimensions. In addition, special training is needed for ordinary people to express their feeling into dimensional structure. People do not divide their sentiment into dimensions, nor do they need psychological training when they feel. People would not express their feeling in the way of dimensional structure like positive/negative or active/passive; rather they express theirs in the way of categorical sentiment like sadness, rage, happiness and so on. That is, categorial approach of sentiment analysis is more natural than dimensional approach. Accordingly, this research suggests multi-categorical sentiment structure as an alternative way to measure social sentiment from the point of the public. Multi-categorical sentiment structure classifies sentiments following the way that ordinary people do although there are possibility to contain some subjectiveness. In this research, nine categories: 'Sadness', 'Anger', 'Happiness', 'Disgust', 'Surprise', 'Fear', 'Interest', 'Boredom' and 'Pain' are used as multi-categorical sentiment structure. To capture public sentiment of Korean Top-10 companies, Internet news data of the companies are collected over the past 25 months from a representative Korean portal site. Based on the sentiment words extracted from previous researches, we have created a sentiment lexicon, and analyzed the frequency of the words coming up within the news data. The frequency of each sentiment category was calculated as a ratio out of the total sentiment words to make ranks of distributions. Sentiment comparison among top-4 companies, which are 'Samsung', 'Hyundai', 'SK', and 'LG', were separately visualized. As a next step, the research tested hypothesis to prove the usefulness of the multi-categorical sentiment lexicon. It tested how effective categorial sentiment can be used as relative comparison index in cross sectional and time series analysis. To test the effectiveness of the sentiment lexicon as cross sectional comparison index, pair-wise t-test and Duncan test were conducted. Two pairs of companies, 'Samsung' and 'Hanjin', 'SK' and 'Hanjin' were chosen to compare whether each categorical sentiment is significantly different in pair-wise t-test. Since category 'Sadness' has the largest vocabularies, it is chosen to figure out whether the subgroups of the companies are significantly different in Duncan test. It is proved that five sentiment categories of Samsung and Hanjin and four sentiment categories of SK and Hanjin are different significantly. In category 'Sadness', it has been figured out that there were six subgroups that are significantly different. To test the effectiveness of the sentiment lexicon as time series comparison index, 'nut rage' incident of Hanjin is selected as an example case. Term frequency of sentiment words of the month when the incident happened and term frequency of the one month before the event are compared. Sentiment categories was redivided into positive/negative sentiment, and it is tried to figure out whether the event actually has some negative impact on public sentiment of the company. The difference in each category was visualized, moreover the variation of word list of sentiment 'Rage' was shown to be more concrete. As a result, there was huge before-and-after difference of sentiment that ordinary people feel to the company. Both hypotheses have turned out to be statistically significant, and therefore sentiment analysis in business area using multi-categorical sentiment lexicons has persuasive power. This research implies that categorical sentiment analysis can be used as an alternative method to supplement dimensional sentiment analysis when figuring out public sentiment in business environment.

An Expert System for the Estimation of the Growth Curve Parameters of New Markets (신규시장 성장모형의 모수 추정을 위한 전문가 시스템)

  • Lee, Dongwon;Jung, Yeojin;Jung, Jaekwon;Park, Dohyung
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.4
    • /
    • pp.17-35
    • /
    • 2015
  • Demand forecasting is the activity of estimating the quantity of a product or service that consumers will purchase for a certain period of time. Developing precise forecasting models are considered important since corporates can make strategic decisions on new markets based on future demand estimated by the models. Many studies have developed market growth curve models, such as Bass, Logistic, Gompertz models, which estimate future demand when a market is in its early stage. Among the models, Bass model, which explains the demand from two types of adopters, innovators and imitators, has been widely used in forecasting. Such models require sufficient demand observations to ensure qualified results. In the beginning of a new market, however, observations are not sufficient for the models to precisely estimate the market's future demand. For this reason, as an alternative, demands guessed from those of most adjacent markets are often used as references in such cases. Reference markets can be those whose products are developed with the same categorical technologies. A market's demand may be expected to have the similar pattern with that of a reference market in case the adoption pattern of a product in the market is determined mainly by the technology related to the product. However, such processes may not always ensure pleasing results because the similarity between markets depends on intuition and/or experience. There are two major drawbacks that human experts cannot effectively handle in this approach. One is the abundance of candidate reference markets to consider, and the other is the difficulty in calculating the similarity between markets. First, there can be too many markets to consider in selecting reference markets. Mostly, markets in the same category in an industrial hierarchy can be reference markets because they are usually based on the similar technologies. However, markets can be classified into different categories even if they are based on the same generic technologies. Therefore, markets in other categories also need to be considered as potential candidates. Next, even domain experts cannot consistently calculate the similarity between markets with their own qualitative standards. The inconsistency implies missing adjacent reference markets, which may lead to the imprecise estimation of future demand. Even though there are no missing reference markets, the new market's parameters can be hardly estimated from the reference markets without quantitative standards. For this reason, this study proposes a case-based expert system that helps experts overcome the drawbacks in discovering referential markets. First, this study proposes the use of Euclidean distance measure to calculate the similarity between markets. Based on their similarities, markets are grouped into clusters. Then, missing markets with the characteristics of the cluster are searched for. Potential candidate reference markets are extracted and recommended to users. After the iteration of these steps, definite reference markets are determined according to the user's selection among those candidates. Then, finally, the new market's parameters are estimated from the reference markets. For this procedure, two techniques are used in the model. One is clustering data mining technique, and the other content-based filtering of recommender systems. The proposed system implemented with those techniques can determine the most adjacent markets based on whether a user accepts candidate markets. Experiments were conducted to validate the usefulness of the system with five ICT experts involved. In the experiments, the experts were given the list of 16 ICT markets whose parameters to be estimated. For each of the markets, the experts estimated its parameters of growth curve models with intuition at first, and then with the system. The comparison of the experiments results show that the estimated parameters are closer when they use the system in comparison with the results when they guessed them without the system.

The Study of Establishing the Multi-pass Eurasian Railroads (유라시아 철도의 다중경로 구축에 관한 연구)

  • Hahm, Beom-Hee;Huh, Nam-Kyun;Hurr, Hee-Young
    • Korean Business Review
    • /
    • v.21 no.2
    • /
    • pp.137-170
    • /
    • 2008
  • This study is presenting the logistics strategy in the international logistics markets which makes competition and corporation among north-east Asian countries to establishing the multi-pass Eurasian railroads. The countries located in north-east area of Eurasia like China, Japan, Russia and Korea are paying higher costs and disutility to the transportations and communications due to repeated conflicts and confrontations causes from the politic problems. They are being used surface transportation for most of all logistics between Europe and Asia except special merchandises because of characteristic of cargo to be air, the Silk Road remains vestige only which was main logistic passage to this area since BC. So far the Trans-Siberian Railway is being used by Russia mostly as north of Eurasian transport because of difficulties of service. The Trans-China Railway built in 1992 is not accomplishing as a international logistic passages. It is expected to take a long lead time because of characteristic of resource development and poor logistic infrastructure to the countries like Uzbekistan, double landlocked country, Mongolia and Azerbaijan, the countries do not be adjacent to the sea, even they have great economic jump-up plans through the development of their own resources. The Shanghai Cooperation Organization(SCO) start to sail officially in 2001 is constructed with China, Russia, Tadzhikistan, Kyrgyzstan, Kazakhstan and Uzbekistan as regular members of 6 countries and Mongolia, India, Pakistan, Afghanistan and Iran as observers 5 countries. It is started as a military alliance to protect terror, but now, it is expended to cooperate with the traffic, transportation, trade and share of energies. The Russia is doing their best to activate TSR as a government target to developnorth area equivalently, and economic develop of far-east Siberia. And also it is agreed provisionally to improve and repair of rail road between Nahjin and Hassan to connect TSR and TKR( Trans-Korea Railroad) by Russia, North Korea and South Korea with Russian's aggressive efforts. The development plan of this area is over lapped with GTI(Greater Tumen Initiative) promoted by UNDP, and is a cooperated project by 5 countries of South Korea, Mongolia, China, Russia and North Korea, subject to review the appropriation of energy, tour, environment, rail road connection between Mongolia and China and establishing a ferry route to north-east Asia. It is Japanese situation to pay attention to Russia and China even they have been supplying large-scope of infrastructure in Mongol area without any charges, target to get East Asia Main Rail Road to connect Mongolia and Zalubino of Russia. In case of the program for the Denuclearization of North Korea is not creeping, it will be accelerated to connect the TKR and TSR, TKR and TCR by somehow attending United States, including developing program promoted by UN ESCAP. As the result, Korean peninsular will continue the central role of competition and cooperation as in the past, now and future of north-east Asia, as of geographical-economics and geographical-politics whether it is requested or not wanted by neighbor countries.

  • PDF

A Study on the Expression of Connexin 43 in the Experimental Tooth Movement of Rat (백서의 실험적 치아이동시 connexin 43의 발현에 관한 연구)

  • Lim, Jeong-Hyeon;Kang, Kyung-Hwa;Lee, Jong-Jin;Kim, Eun-Cheol;Kim, Sang-Cheol
    • The korean journal of orthodontics
    • /
    • v.31 no.5 s.88
    • /
    • pp.525-534
    • /
    • 2001
  • Bone remodeling in response to force requires coordinated actions of osteoblasts, osteoclasts, osteocytes, and periodontal ligament cells. Coordination among these cells may be mediated, in part, by cell-to-cell communication via gap junctions. This study was designed to evaluate the expression of gap junction, connection 43 In periodontal tissue during the experimental movement of rat's incisors, by LSAB(labelled streptavidine biotin) immunohistochemical staining for connexin 43. Twenty seven Sprague-Dawley rats were divided into a control group(3 rats), and 6 experimental groups(24 rats) where 75g of force was applied from helical springs across the maxillary incisors. Rats of experimental groups were sacrificed at 12 hours, 1, 4, 7, 14 and 28 days after force application, respectively. And the tissues of a control group and experimental groups were studied immunohistochemically. The results were as follows : 1. In control group, the expression of connexin 43 was rare in gingiva, dentin, cementum, periodontal ligament, and bone cells. 2. In experimental group, the expression of connexin 43 was increased in pulp, periodontal ligament, osteoblasts, and osteoclasts, comparing to that in control. And it was rare in gingiva, dentin, and odontoblasts regardless of the duration of force application, which was not different from that of control group. 3. The expression of connexin 43 in pulp of experimental group began to increase in 4-day after force application and got to the highest degree at 7-day. And it decreased after 14-day to be similar to that of control group at 28-day. 4. The expression of connexin 43 in periodontal ligament was noted in small capillaries adjacent to alveolar bone, showing higher intensity of immunolabelling after 4-day And it was stronger in the pressure side than in tension side of periodontal ligament. After 7-day, decrease in connexin 43 expression was observed. 5. The expression of connexin 43 in alveolar bone began to increase 1-day, reached to the highest degree at 4-day, and decreased at 7-day. And the expression in osteoclasts was more than that in osteoblasts or osteocyte at 7-day.

  • PDF

A Study on the Legislative Guidelines for Airline Consumer Protection (항공소비자 보호제도의 입법방향)

  • Lee, Chang-Jae
    • The Korean Journal of Air & Space Law and Policy
    • /
    • v.32 no.1
    • /
    • pp.3-51
    • /
    • 2017
  • From a historical point of view, while the Warsaw Convention was passed in 1924 to regulate the unified judicial responsibility in the global air transportation industry, protection of airline consumers was somewhat lacking in protecting air carriers. In principle, the air carrier does not bear any obligation or liability when the aircraft is not operated normally due to natural disasters such as typhoon or heavy snowfall. However, in recent years, in developed countries such as the US and Europe, there has been a movement in which regulates the air carriers' obligation to protect their passengers even if there is no misconduct or negligence. Furthermore, the legislation of such advanced countries imposes an obligation on the airlines to compensate the loss separately from damages in case the abnormal operation of the aircraft is not caused by force majeure but caused by their negligence. Under this historical and international context, Korea is also modifying the system of aviation consumer protection by referring to other foreign legislation. However, when compared with foreign countries, our norm has a few drawbacks. First, the airline's protection or care obligations are mixed with the legal liability for damages in the provision, which seems to be due to the lack of understanding of the airline's passenger protection obligation. The liability for damages, which is governed by the International Convention or the Commercial Act, shall be determined by judging the cause of the airline's liability in respect of the damage of the individual passenger in the course of the air transportation. However, the duty to care and the burden for compensation shall be granted to all passengers who feel uncomfortable with the abnormal operation regardless of the cause of the accident. Also, our compensation system for denied boarding due to oversale is too low compared to the case of foreign countries, and setting the compensation amount range differently based on the time for the re-routing is somewhat unclear. Regarding checked-baggage claim, it will be necessary to refund the fee only from the fact that the baggage is delayed without asking whether there is any damage occurred from the delayed baggage. This is the content of the duty to care, which is different from the current Commercial Act or the international convention, in which responsibility is different depending on whether the airline takes all the necessary measures in order to prevent delaying of the baggage. The content of force majeure, which is a requirement for exemption from the obligation to care passengers on the airplane, shall be reconsidered. Maintenance for safe navigation is not considered to be included in force majeure, and connection to airplanes, airport conditions are disputable. According to the EC Regulation, if the cause of the abnormal operation of the airline is force majeure, the airline's compensation obligation is exempted but the duty to care of airline company is still meaningful. Furthermore, even if the main role of aviation consumer protection is on an airline, it is the responsibility of government agencies to supervise the fulfillment of such protection obligations. Therefore, it is necessary for the Korean government to actively take measures such as enforcing incentives for airlines that faithfully fulfill their obligation to care and imposed penalties on the contrary.

  • PDF

Proposal of Establishing a New International Space Agency for Mining the Natural Resources in the Moon, Mars and Other Celestial Bodies

  • Kim, Doo-Hwan
    • The Korean Journal of Air & Space Law and Policy
    • /
    • v.35 no.2
    • /
    • pp.313-374
    • /
    • 2020
  • The idea of creating a new International Space Agency (ISA) is only my academic and practical opinion. It is necessary for us to establish ISA as an international organization for the efficient and rapid exploitation of natural resources in the moon, Mars and other celestial bodies. The establishment of ISA as a new international regime is based on the Article 11, 5 and Article 18 of the 1979 Moon Agreement. In order to establish as a preliminary procedure, it needs to make a "Draft for the Convention on the Establishment of an International Space Agency" among the space-faring countries. In this paper, I was examined the domestic space legislation in the United States, Luxembourg, European Space Agency, China, Japan, the Republic of Korea as well as space exploration and planning of the moons, Mars, Asteroids, Venus, Jupiter, Saturn, Titan and Other Celestial Bodies. The creation of an ISA would lead to a strengthening of the cooperation needed essentially by the developed countries towards joint and cooperative undertakings in space and would act as a catalyst for the space exploration and exploitation of the moon, Mars and other celestial bodies. It will be managed effectively and centrally the exploitation and exploitation of space the natural resources, technology, manpower and finances as an independent organization in order to get the benefit of the space developed countries by ISA. It is desirable and necessary for us to establish ISA in order to promote cooperation in space policy, law, science technology and industry among the space developed countries in the near future. The establishment of the ISA will be promoted the international cooperation among the space-faring countries in exploration and exploitations of the natural resources in the moon and other celestial bodies. I would propose the "Draft for the Convention for the Establishment of an International Space Agency." in refering the "Convention for the Establishment of a European Space Agency." This "Draft for the Convention Convention for the Establishment of an ISA" must pass the abovementioned "Draft for the Convention" by two-third majority of Diplomatic Conference in the UNCOPUOS. Finally, a very important point is need that a political drive at the highest level and a solemn statement by heads of state of the space devloped countries including the United Nations for the space exploitation of the medium and long term. It should be noted that this political drive will be necessary not only to set up the organization, but also during a subsequent period. It is desirable and necessary for us to establish the ISA in order to develop the space industry, to strengthen friendly relations and to promote research cooperation among the space-faring countries based on the new ideology and creative ideas. If the heads of the superpowers including the United Nations will be agreed to establish ISA at a summit conference, 1 am sure that it is possible to establish an ISA in the near future.

Relationship between Broca Index of Late School-Aged Children and Their Mothers' Eating, Cooking, and Exercise Habit (어머니의 식습관, 요리습관 및 운동습관과 학령기 후기 아동의 Broca 체질량지수와의 상관관계 연구)

  • Lee, Hyerim;Lee, Kyoung-Eun;Ko, Kwang Suk;Hong, Eunah
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.45 no.10
    • /
    • pp.1488-1496
    • /
    • 2016
  • The purposes of this study were to analyze mothers' eating, cooking, and exercise habits based on their demographic characteristics and to examine the relationship between those habits and their late school-aged children's Broca index. A total of 393 questionnaires were mailed to the mothers of late school-aged children who registered at four elementary schools in the Seoul area, of which 159 participants (40.0%) completed questionnaires. Statistical data analyses were performed using SPSS/Win 21.0 for descriptive statistics, t-test ANOVA, and Pearson's regression coefficient. There was a statistically significant difference in mothers' cooking habit (F=3.920, P=0.022) and exercise habit (F=3.211, P=0.043) according to their educational level. Interestingly, 82.4% of mothers had a Broca index of less than 90% of normal body mass level. A significant positive correlation of Broca index between mothers and their late school-aged children (r=0.345, P<0.001) indicated that children whose mothers had a low body mass level also tended to have a low body mass level. In this study, late school-aged children's Broca index was not significantly related with mother's eating (r=-0.072, P=0.367) or exercise habits (r=-0.010, P=0.897) but was significantly related with their mother's cooking habits (r=-0.157, P=0.048). Considering there are few studies examining the impacts of mother's cooking habits on their children's appropriate body mass, the results suggest that developing an effective educational program to cultivate mothers' healthy cooking habits to improve school-aged children's health status is very important. The findings of this study provide important data that could be used when developing health education programs tailored to the multi-dimensional impacts of mothers' life habits on their last school-aged children's developmental health status.

A Study of 'Emotion Trigger' by Text Mining Techniques (텍스트 마이닝을 이용한 감정 유발 요인 'Emotion Trigger'에 관한 연구)

  • An, Juyoung;Bae, Junghwan;Han, Namgi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.69-92
    • /
    • 2015
  • The explosion of social media data has led to apply text-mining techniques to analyze big social media data in a more rigorous manner. Even if social media text analysis algorithms were improved, previous approaches to social media text analysis have some limitations. In the field of sentiment analysis of social media written in Korean, there are two typical approaches. One is the linguistic approach using machine learning, which is the most common approach. Some studies have been conducted by adding grammatical factors to feature sets for training classification model. The other approach adopts the semantic analysis method to sentiment analysis, but this approach is mainly applied to English texts. To overcome these limitations, this study applies the Word2Vec algorithm which is an extension of the neural network algorithms to deal with more extensive semantic features that were underestimated in existing sentiment analysis. The result from adopting the Word2Vec algorithm is compared to the result from co-occurrence analysis to identify the difference between two approaches. The results show that the distribution related word extracted by Word2Vec algorithm in that the words represent some emotion about the keyword used are three times more than extracted by co-occurrence analysis. The reason of the difference between two results comes from Word2Vec's semantic features vectorization. Therefore, it is possible to say that Word2Vec algorithm is able to catch the hidden related words which have not been found in traditional analysis. In addition, Part Of Speech (POS) tagging for Korean is used to detect adjective as "emotional word" in Korean. In addition, the emotion words extracted from the text are converted into word vector by the Word2Vec algorithm to find related words. Among these related words, noun words are selected because each word of them would have causal relationship with "emotional word" in the sentence. The process of extracting these trigger factor of emotional word is named "Emotion Trigger" in this study. As a case study, the datasets used in the study are collected by searching using three keywords: professor, prosecutor, and doctor in that these keywords contain rich public emotion and opinion. Advanced data collecting was conducted to select secondary keywords for data gathering. The secondary keywords for each keyword used to gather the data to be used in actual analysis are followed: Professor (sexual assault, misappropriation of research money, recruitment irregularities, polifessor), Doctor (Shin hae-chul sky hospital, drinking and plastic surgery, rebate) Prosecutor (lewd behavior, sponsor). The size of the text data is about to 100,000(Professor: 25720, Doctor: 35110, Prosecutor: 43225) and the data are gathered from news, blog, and twitter to reflect various level of public emotion into text data analysis. As a visualization method, Gephi (http://gephi.github.io) was used and every program used in text processing and analysis are java coding. The contributions of this study are as follows: First, different approaches for sentiment analysis are integrated to overcome the limitations of existing approaches. Secondly, finding Emotion Trigger can detect the hidden connections to public emotion which existing method cannot detect. Finally, the approach used in this study could be generalized regardless of types of text data. The limitation of this study is that it is hard to say the word extracted by Emotion Trigger processing has significantly causal relationship with emotional word in a sentence. The future study will be conducted to clarify the causal relationship between emotional words and the words extracted by Emotion Trigger by comparing with the relationships manually tagged. Furthermore, the text data used in Emotion Trigger are twitter, so the data have a number of distinct features which we did not deal with in this study. These features will be considered in further study.