• Title/Summary/Keyword: higher order accuracy

Search Result 789, Processing Time 0.028 seconds

KNU Korean Sentiment Lexicon: Bi-LSTM-based Method for Building a Korean Sentiment Lexicon (Bi-LSTM 기반의 한국어 감성사전 구축 방안)

  • Park, Sang-Min;Na, Chul-Won;Choi, Min-Seong;Lee, Da-Hee;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.219-240
    • /
    • 2018
  • Sentiment analysis, which is one of the text mining techniques, is a method for extracting subjective content embedded in text documents. Recently, the sentiment analysis methods have been widely used in many fields. As good examples, data-driven surveys are based on analyzing the subjectivity of text data posted by users and market researches are conducted by analyzing users' review posts to quantify users' reputation on a target product. The basic method of sentiment analysis is to use sentiment dictionary (or lexicon), a list of sentiment vocabularies with positive, neutral, or negative semantics. In general, the meaning of many sentiment words is likely to be different across domains. For example, a sentiment word, 'sad' indicates negative meaning in many fields but a movie. In order to perform accurate sentiment analysis, we need to build the sentiment dictionary for a given domain. However, such a method of building the sentiment lexicon is time-consuming and various sentiment vocabularies are not included without the use of general-purpose sentiment lexicon. In order to address this problem, several studies have been carried out to construct the sentiment lexicon suitable for a specific domain based on 'OPEN HANGUL' and 'SentiWordNet', which are general-purpose sentiment lexicons. However, OPEN HANGUL is no longer being serviced and SentiWordNet does not work well because of language difference in the process of converting Korean word into English word. There are restrictions on the use of such general-purpose sentiment lexicons as seed data for building the sentiment lexicon for a specific domain. In this article, we construct 'KNU Korean Sentiment Lexicon (KNU-KSL)', a new general-purpose Korean sentiment dictionary that is more advanced than existing general-purpose lexicons. The proposed dictionary, which is a list of domain-independent sentiment words such as 'thank you', 'worthy', and 'impressed', is built to quickly construct the sentiment dictionary for a target domain. Especially, it constructs sentiment vocabularies by analyzing the glosses contained in Standard Korean Language Dictionary (SKLD) by the following procedures: First, we propose a sentiment classification model based on Bidirectional Long Short-Term Memory (Bi-LSTM). Second, the proposed deep learning model automatically classifies each of glosses to either positive or negative meaning. Third, positive words and phrases are extracted from the glosses classified as positive meaning, while negative words and phrases are extracted from the glosses classified as negative meaning. Our experimental results show that the average accuracy of the proposed sentiment classification model is up to 89.45%. In addition, the sentiment dictionary is more extended using various external sources including SentiWordNet, SenticNet, Emotional Verbs, and Sentiment Lexicon 0603. Furthermore, we add sentiment information about frequently used coined words and emoticons that are used mainly on the Web. The KNU-KSL contains a total of 14,843 sentiment vocabularies, each of which is one of 1-grams, 2-grams, phrases, and sentence patterns. Unlike existing sentiment dictionaries, it is composed of words that are not affected by particular domains. The recent trend on sentiment analysis is to use deep learning technique without sentiment dictionaries. The importance of developing sentiment dictionaries is declined gradually. However, one of recent studies shows that the words in the sentiment dictionary can be used as features of deep learning models, resulting in the sentiment analysis performed with higher accuracy (Teng, Z., 2016). This result indicates that the sentiment dictionary is used not only for sentiment analysis but also as features of deep learning models for improving accuracy. The proposed dictionary can be used as a basic data for constructing the sentiment lexicon of a particular domain and as features of deep learning models. It is also useful to automatically and quickly build large training sets for deep learning models.

Development of New Variables Affecting Movie Success and Prediction of Weekly Box Office Using Them Based on Machine Learning (영화 흥행에 영향을 미치는 새로운 변수 개발과 이를 이용한 머신러닝 기반의 주간 박스오피스 예측)

  • Song, Junga;Choi, Keunho;Kim, Gunwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.67-83
    • /
    • 2018
  • The Korean film industry with significant increase every year exceeded the number of cumulative audiences of 200 million people in 2013 finally. However, starting from 2015 the Korean film industry entered a period of low growth and experienced a negative growth after all in 2016. To overcome such difficulty, stakeholders like production company, distribution company, multiplex have attempted to maximize the market returns using strategies of predicting change of market and of responding to such market change immediately. Since a film is classified as one of experiential products, it is not easy to predict a box office record and the initial number of audiences before the film is released. And also, the number of audiences fluctuates with a variety of factors after the film is released. So, the production company and distribution company try to be guaranteed the number of screens at the opining time of a newly released by multiplex chains. However, the multiplex chains tend to open the screening schedule during only a week and then determine the number of screening of the forthcoming week based on the box office record and the evaluation of audiences. Many previous researches have conducted to deal with the prediction of box office records of films. In the early stage, the researches attempted to identify factors affecting the box office record. And nowadays, many studies have tried to apply various analytic techniques to the factors identified previously in order to improve the accuracy of prediction and to explain the effect of each factor instead of identifying new factors affecting the box office record. However, most of previous researches have limitations in that they used the total number of audiences from the opening to the end as a target variable, and this makes it difficult to predict and respond to the demand of market which changes dynamically. Therefore, the purpose of this study is to predict the weekly number of audiences of a newly released film so that the stakeholder can flexibly and elastically respond to the change of the number of audiences in the film. To that end, we considered the factors used in the previous studies affecting box office and developed new factors not used in previous studies such as the order of opening of movies, dynamics of sales. Along with the comprehensive factors, we used the machine learning method such as Random Forest, Multi Layer Perception, Support Vector Machine, and Naive Bays, to predict the number of cumulative visitors from the first week after a film release to the third week. At the point of the first and the second week, we predicted the cumulative number of visitors of the forthcoming week for a released film. And at the point of the third week, we predict the total number of visitors of the film. In addition, we predicted the total number of cumulative visitors also at the point of the both first week and second week using the same factors. As a result, we found the accuracy of predicting the number of visitors at the forthcoming week was higher than that of predicting the total number of them in all of three weeks, and also the accuracy of the Random Forest was the highest among the machine learning methods we used. This study has implications in that this study 1) considered various factors comprehensively which affect the box office record and merely addressed by other previous researches such as the weekly rating of audiences after release, the weekly rank of the film after release, and the weekly sales share after release, and 2) tried to predict and respond to the demand of market which changes dynamically by suggesting models which predicts the weekly number of audiences of newly released films so that the stakeholders can flexibly and elastically respond to the change of the number of audiences in the film.

A Study on Hepatomegaly and Facial Telangiectasia in a Group of the Insured (간종대(肝腫大)와 안면모세혈관확장(顔面毛細血管擴張)의 보험의학적연구(保險醫學的硏究))

  • Im, Young-Hoon
    • The Journal of the Korean life insurance medical association
    • /
    • v.4 no.1
    • /
    • pp.110-132
    • /
    • 1987
  • A study on hepatomegaly detected by abdominal palpation, and facial telangiectasia in a total of 3,418 insured persons medically examined at the Honam Medical Room of Dong Bang Life Insurance Company Ltd. from February, 1984 to August, 1985 was undertaken. The results were as follows: 1) Hepatomegaly was found in 383 cases(27.5%) among the 1,395 insureds of male and in 163 cases(8.1%) among the 2,023 insureds of female. The difference of incidence of hepatomegaly between all males and females showed statistical significance(p<0.001). In each age group, the incidence of hepatomegaly in :nale was higher than that in female. The incidence of hepatomegaly in each age group in male increased cnosiderably with age; it showed 11.6%,16.2%, 42.6% and 52.9% from second to sixth decade in order, thereafter in seventh decade it decreased to 26.7%, While the incidence of hepatomegaly in female increased slightly in each age group. 2) Facial telangiectasia was found in 318 cases(22.8%) among all males and in 157 cases(7.8%) among all females. The difference of incidence of telangiectasia between all males and females showed statistical significance(p<0.001). In each age group, the incidence of telangiectasia in male was higher than that in female, except of second decade. The incidence of facial telangiectasia in each age group in male increased considerably with age; while it increased slightly in female. 3) Facial telangiectasia accompanied by hepatomegaly was found in 235 cases(61.4%) among 383 cases of hepatomegaly in male and in 69 cases(42.3%) among 163 cases of hepatomegaly in female. The difference of incidence of telangiectasia between males and females show ed statistical significance(p<0.001). 4) Facial telangiectasia without spider angiomata accompanied by hepatomegaly was found in 201 cases(52.5%) among 383 cases of hepatomegaly in all males and in 67 casgs(41.4%) among 163 cases of hepatomegaly in all females; facial spider angiomata accompanied by hepatomegaly was found in 34 cases(8.9%) among 383 cases of hepatomegaly in all males and in 2 cases(1.2%) among 163 cases of hepatomegaly in all females. 5) Abnormal SGOT activity was found in 19 cases(7.9%) among 242 cases of hepatomegaly in all males and in one case(1.5%) among 67 cases of hepatomegaly in all females. The difference of incidence of abnormal SGOT activity showed statistical significance(p<0.001). The incidence of abnormal SGOT activity by the size of hepatomegaly, that is, palpated <1 finger's breadth, <2 fingers' breadth and ${\geqq}2$ fingers' breadth, revealed 2.2%, 6.0% and 60.0% respectively in all males, while abnormal SGOT activity was found only one case in fifth decade among 67 cases of hepatomegaly in all females. 6) In ordinary medical examination(the insured amount is low) abnormal SGOT activity was found in 7 cases(4.8%) among 146 cases of hepatomegaly palpated $1\frac{1}{2}$ fingers' breadth and under, while it was not found in 37 cases of the same sized hepatomegaly in all females. Above mentioned 7 cases are thought to be very significant because 7 cases occupy 35% in 20 cases of abnormal SGOT activity with hepatomegaly. 7) Abnormal SGOT activity was found in 12 cases(4.4%) among 273 cases of hepatomegaly of "not firm" consistency, while it was found in 8 cases(22.2%) among 36 cases of hepatomegaly of "firm" consistency. The difference of incidence of abnormal SGOT activity showed statistical significance(p<0.05). 8) Abnormal SGOT activity was found in 5 cases(17.9%) among 28 cases of spider angiomata with hepatomegaly, while it was found in 10 cases(7.3%) among 166 cases of telangiectasia without spider angiomata with hepatomegaly. Owing to a small number of cases, statistical significance was not recognized, but the incidence of abnormal SGOT activity in spider angiomata cases with hepatomegaly is apt to be higher than that in telangiectasia cases without spider angiomata with hepatomegaly. 9) The incidence of abnormal SGOT activity is apt to be higher with age in male group; abnormal SGOT activity was not found among 4 cases of hepatomegaly in second decade and it was 3.8% in third decade, 4.5% in fourth decade, 9.3% in fifth decade, 17.5% in sixth decade and 33.3% in seventh decade, while the incidence of it was only one case among 67 cases in all females. 10) It is believed that the performance of liver function test to the subjects with hepatomegaly even in ordinary medical examination(the insured amount is low) will give considerable contribution for medical selection of hepatomegaly risk. 11) Age of the insured(young or old), presence of facial telangiectasia or spider angiomata especially and their severity, and consistency of enlarged liver(firm or not) should be considered to increase accuracy in evaluating hepatomegaly risk.

  • PDF

The Evaluation of Reliability for the Combined Refractive Power of Overlapping Trial Lenses (중첩된 시험렌즈의 합성굴절력에 대한 신뢰도 평가)

  • Lee, Hyung Kyun;Kim, So Ra;Park, Mijung
    • Journal of Korean Ophthalmic Optics Society
    • /
    • v.20 no.3
    • /
    • pp.263-276
    • /
    • 2015
  • Purpose: The current study aimed to evaluate the reliability for the combined refractive power when a spherical lens and a cylindrical lens were overlapped in a trial frame. Methods: The refractive powers, central thickness and peripheral thickness of spherical trial lenses and cylindrical lenses with negative power were measured. The combined refractive power of the spherical and cylindrical lenses was measured by auto lens meter. Measurement was repeated by changing the insertion order, and their results were further compared with the calculated combined refractive power. Results: There was no correlation between the variation of central and peripheral thickness in trial lenses and that of the lens power. Among 79 trial lenses, 3 trial lenses wasn't met the international standard. The refractive power calculated by Gullstrand's formula that could compensate vertex distance had smaller difference with the estimated power when compared with that calculated by thin lens formula however, it was significantly different from the estimated power. The refractive powers were generally apparent regardless of the insertion order of a spherical lens and a cylindrical lens: thin lens formula > actual measurements > Gullstrand's formula. The error was only found in cylindrical power calculated by Gullstrand's formula when inserted a spherical lens inside and a cylindrical lens outside however, the error was found in both of cylindrical and spherical powers calculated by Gullstrand's formula when inserted as a opposite order. By comparing actual measurements of equivalent spherical power, the accuracy was higher and the possibility of over-correction was lower when inserted a spherical lens inside and a cylindrical lens outside. Conclusions: From the results, those were revealed that the combined refractive power is influenced by the factors other than the vertex distance and the refractive power varies in accordance with the insertion order of a spherical lens and a cylindrical lens. Thus, it can be suggested that the establishment of standard for these is neccesaty.

Study on Labeling Efficiency of $^{99m}Tc$-HMPAO ($^{99m}Tc$-HMPAO 표지효율에 대한 고찰)

  • Hyeon, Jun Ho;Lim, Hyeon Jin;Kim, Ha Kyun;Cho, Seong Uk;Kim, Jin Eui
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.16 no.2
    • /
    • pp.131-134
    • /
    • 2012
  • Purpose : The labeling efficiency of radiopharmaceuticals in nuclear medicine is important in terms of accuracy and reliability of the examination. Usually $^{99m}Tc$-HMPAO used for brain SPECT scan is chemically unstable since lots of impurities are existing. Therefore, occurrence of loss of labeling efficiency is easy to appear. In this paper, labeling and use of $^{99m}Tc$-HMPAO should be helpful through experiments on factors affecting the labeling efficiency of $^{99m}Tc$-HMPAO. Materials and Methods : Domestic HMPAO vials (Dong-A) used for brain SPECT scan were tested. Domestic Samyeong Generator 55.5 GBq (1,500 mCi), TLC measurement sets (ITLC-SG, butanone, saline, TLC chamber) and radio-TLC scanner (Advantest, Bioscan) were used. In the first experiment, after eluting generator at 1, 8, 16, 24, 28 hours apart, each eluted $^{99m}Tc$-pertechnetate were labeled with HMPAO and the labeling efficiency was measured. In the second experiment, after eluting $^{99m}Tc$-pertechnetate from a generator, $^{99m}Tc$-pertechnetate was drawn at 0, 1, 3, 6 hours. And each drawn $^{99m}Tc$-pertechnetate were labeled with HMPAO for measuring labeling efficiency. In the third experiment, labeling efficiency was measured at 0, 0.5, 3, 5, 7 hours after labeling $^{99m}Tc$-HMPAO. Results : In the first experiment, measured values were appeared 95.05, 94.64, 94.94, 95.64, 96.76% in passing order of time. In the second experiment, measured values were appeared 94.38, 94.23, 93.26, 91.03% in passing order of time. In the third experiment, measured values were appeared 95.76, 94.17, 88.19, 83.6, 76.86% in passing order of time. Conclusion : In the first experiment of this paper, labeling efficiency of $^{99m}Tc$-HMPAO labeled with $^{99m}Tc$-pertechnetate eluted after 24 hours from first elution. Additional experiments will be needed to discuss for usability. In the second experiment, the labeling efficiency was slightly decreased in chronological order, but it was measured higher than 90%. Also, additional experiments will be needed to discuss for usability. In the third experiment, the labeling efficiency was decreased considerably. Especially, within 3 hours after the labeling is recommended to use $^{99m}Tc$-HMPAO

  • PDF

Implementation of Man-made Tongue Immobilization Devices in Treating Head and Neck Cancer Patients (두 경부 암 환자의 방사선치료 시 자체 제작한 고정 기구 유용성의 고찰)

  • Baek, Jong-Geal;Kim, Joo-Ho;Lee, Sang-Kyu;Lee, Won-Joo;Yoon, Jong-Won;Cho, Jeong-Hee
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.20 no.1
    • /
    • pp.1-9
    • /
    • 2008
  • Purpose: For head and neck cancer patients treated with radiation therapy, proper immobilization of intra-oral structures is crucial in reproducing treatment positions and optimizing dose distribution. We produced a man-made tongue immobilization device for each patient subjected to this study. Reproducibility of treatment positions and dose distributions at air-and-tissue interface were compared using man-made tongue immobilization devices and conventional tongue-bites. Materials and Methods: Dental alginate and putty were used in producing man-made tongue immobilization devices. In order to evaluate reproducibility of treatment positions, all patients were CT-simulated, and linac-gram was repeated 5 times with each patient in the treatment position. An acrylic phantom was devised in order to evaluate safety of man-made tongue immobilization devices. Air, water, alginate and putty were placed in the phantom and dose distributions at air-and-tissue interface were calculated using Pinnacle (version 7.6c, Phillips, USA) and measured with EBT film. Two different field sizes (3$\times$3 cm and 5$\times$5 cm) were used for comparison. Results: Evaluation of linac grams showed reproducibility of a treatment position was 4 times more accurate with man-made tongue immobilization devices compared with conventional tongue bites. Patients felt more comfortable using customized tongue immobilization devices during radiation treatment. Air-and-tissue interface dose distributions calculated using Pinnacle were 7.78% and 0.56% for 3$\times$3 cm field and 5$\times$5 cm field respectively. Dose distributions measured with EBT (international specialty products, USA) film were 36.5% and 11.8% for 3$\times$3 cm field and 5$\times$5 cm field respectively. Values from EBT film were higher. Conclusion: Using man-made tongue immobilization devices made of dental alginate and putty in treatment of head and neck cancer patients showed higher reproducibility of treatment position compared with using conventional mouth pieces. Man-made immobilization devices can help optimizing air-and-tissue interface dose distributions and compensating limited accuracy of radiotherapy planning systems in calculating air-tissue interface dose distributions.

  • PDF

Isogeometric Shape Sensitivity Analysis in Generalized Curvilinear Coordinate Systems (일반 곡면 좌표계에서 구현된 아이소-지오메트릭 형상 설계민감도 해석)

  • Ha, Youn Doh;Yoon, Minho;Cho, Seonho
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.25 no.6
    • /
    • pp.497-504
    • /
    • 2012
  • Finite element analysis is to approximate a geometry model developed in computer-aided design(CAD) to a finite element model, thus the conventional shape design sensitivity analysis and optimization using the finite element method have some difficulties in the parameterization of geometry. However, isogeometric analysis is to build a geometry model and directly use the functions describing the geometry in analysis. Therefore, the geometric properties can be embedded in the NURBS basis functions and control points so that it has potential capability to overcome the aforementioned difficulties. In this study, the isogeometric structural analysis and shape design sensitivity analysis in the generalized curvilinear coordinate(GCC) systems are discussed for the curved geometry. Representing the higher order geometric information, such as normal, tangent and curvature, yields the isogeometric approach to be the best way for generating exact GCC systems from a given CAD geometry. The developed GCC isogeometric structural analysis and shape design sensitivity analysis are verified to show better accuracy and faster convergency by comparing with the results obtained from the conventional isogeometric method.

Mixed Mode Analysis using Two-step Extension Based VCCT in an Inclined Center Crack Repaired by Composite Patching (복합재료 팻칭에 의한 중앙경사균열에서 2단계 확장 가상균열닫힘법을 사용한 혼합모우드해석)

  • Ahn, Jae-Seok;Woo, Kwang-Sung
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.32 no.1A
    • /
    • pp.11-18
    • /
    • 2012
  • This paper deals with the numerical determination of the stress intensity factors of cracked aluminum plates under the mixed mode of $K_I$ and $K_{II}$ in glass-epoxy fiber reinforced composites. For the stress intensity factors, two different models are reviewed such as VCCT and two-step extension method. The p-convergent partial layerwise model is adopted to determine the fracture parameters in terms of energy release rates and stress intensity factors. The p-convergent approach is based on the concept of subparametric element. In assumed displacement field, strain-displacement relations and 3-D constitutive equations of a layer are obtained by combination of 2-D and 1-D higher-order shape functions. In the elements, Lobatto shape functions and Gauss-Lobatto technique are employed to interpolate displacement fields and to implement numerical quadrature. Using the models and techniques considered, effects of composite laminate configuration according to inclined angles and adhesive properties on the performance of bonded composite patch are investigated. In addition to these, the out-of-plane bending effect has been investigated across the thickness of patch repaired laminate plates due to the change of neutral axis. The present model provides accuracy and simplicity in terms of stress intensity factors, stress distribution, number of degrees of freedom, and energy release rates as compared with previous works in literatures.

Case Report on Improvement of Reproduction Rate in Hanwoo Farms (한우 농장별 번식기록 분석을 통한 번식률 제고 사례 연구)

  • Kim, Ui Hyung;Chung, Ki Yong;Lee, Seung Hwan;Ryu, Il Sun;Kang, Hee Seol
    • Journal of Embryo Transfer
    • /
    • v.29 no.1
    • /
    • pp.7-12
    • /
    • 2014
  • This work was conducted to study the improvement of reproduction rate from the breeding data collected from four farms from January 2007 to October 2010. The average numbers of service per conception were 1) A farm $1.7{\pm}0.1$ times, 2) B farm $1.5{\pm}0.1$ times, 3) C farm $1.5{\pm}0.1$ times, 4) D farm $1.4{\pm}0.1$ times. The average days from calving to conception was $77.4{\pm}4.8$ days in A farm, $150.8{\pm}11.2$ days in B farm, $90.4{\pm}4.5$ days in C farm, and $71.4{\pm}2.5$ days in D farm. Number of artificial insemination (AI) service per conception was higher at the 30 days before first AI ($2.1{\pm}0.2$ times) than at the 31 days after first AI, and the days from calving to conception were shorter at the 90 days before first AI than at the 91 days after first AI. After timed AI (TAI) treatment, the pregnancy rate was 60.3% for the 58 cows with reproductive disorder. In order to improvement of reproduction rates, the farms has to improve the accuracy of estrus detection, pregnancy diagnosis, check-up for reproductive health, and control of day for first AI periods after calving. The result suggests that farmers need the careful management and reproductive examination of farm animals to improve of reproductive efficiency.

Extraction of Sternocleidomastoid Muscle for Ultrasound Images of Cervical Vertebrae (경추 초음파 영상에서 흉쇄유돌근 추출)

  • Kim, Kwang-Baek
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.11
    • /
    • pp.2321-2326
    • /
    • 2011
  • Cervical vertebrae are a complex structure and an important part of human body connecting the head and the trunk. In this paper, we propose a method to extract sternocleidomastoid muscle from ultrasonography images of cervical vertabrae automatically. In our method, Region of Interests(ROI) is extracted first from an ultrasonography image after removing unnecessary auxiliary information such as metrics. Then we apply Ends-in search stretching algorithm in order to enhance the contrast of brightness. Average binarization is then applied to those pixels which its brightness is sufficiently large. The noise part is removed by image processing algorithms. After extracting fascia encloses sternocleidomastoid muscle, target muscle object is extracted using the location information of fascia according to the number of objects in the fascia. When only one object is to be extracted, we search downward first to extract the target muscle area and then search from right to left to extract the area and merge them. If there are two target objects, we extract first from the upper-bound of higher object to the lower-bound of lower object and then remove the fascia of the target object area. Smearing technique is used to restore possible loss of the fat area in the process. The thickness of sternocleidomastoid muscle is then calculated as the maximum thickness of those extracted objects. In this experiment with 30 real world ultrasonography images, the proposed method verified its efficacy and accuracy by health professionals.