• Title/Summary/Keyword: Korean text classification


Trends of Studies in Korean Journal of Acupuncture (대한경락경혈학회지 연구동향)

  • Song, Jichung;Hwang, Seongyeon;Ahn, Sunghoon;Eom, Dongmyung
    • Korean Journal of Acupuncture / v.33 no.1 / pp.1-11 / 2016
  • Objectives: To understand the character of a person or an object, we trace its past. The Korean Journal of Acupuncture is one of the most significant journals in the field of acupuncture research in Korea. To understand research trends in this field, I took the Korean Journal of Acupuncture as the subject of study. Methods: I evaluated and classified the titles of all 713 articles from vol. 17 (2000) to vol. 32 (2015). Results: 1. Experimental Research: Pharmaco-acupuncture studies made up the major portion of the studies on acupuncture, moxibustion, chiropractic, devices, and so on. 2. Bibliographical Research and Basic Theory Research: Meridian and acupoint studies made up the major portion of the studies on article reviews and textbooks themselves. Meridian and acupuncture studies also made up the major portion of the studies on Qigong, pulse, anatomy, and so on. 3. Clinical Research: Acupuncture studies made up the major portion of the studies on moxibustion, chiropractic, devices, complex treatments, and so on. 4. Others: Studies on diagnosis and measurement devices made up the major portion of the studies on acupuncture, laser, pulsing devices, and so on. There were also surveys of patients' perceptions and of medical services, and evaluations of the utility and effects of diagnostic measurement. Conclusions: With these results, I hope that researchers will consider scope and subject when they submit articles to the Korean Journal of Acupuncture.

The prediction of the stock price movement after IPO using machine learning and text analysis based on TF-IDF (증권신고서의 TF-IDF 텍스트 분석과 기계학습을 이용한 공모주의 상장 이후 주가 등락 예측)

  • Yang, Suyeon;Lee, Chaerok;Won, Jonggwan;Hong, Taeho
    • Journal of Intelligence and Information Systems / v.28 no.2 / pp.237-262 / 2022
  • There has been a growing interest in IPOs (Initial Public Offerings) due to the profitable returns that IPO stocks can offer to investors. However, IPOs can be speculative investments that may involve substantial risk as well because shares tend to be volatile, and the supply of IPO shares is often highly limited. Therefore, it is crucially important that IPO investors are well informed of the issuing firms and the market before deciding whether to invest or not. Unlike institutional investors, individual investors are at a disadvantage since there are few opportunities for individuals to obtain information on the IPOs. In this regard, the purpose of this study is to provide individual investors with the information they may consider when making an IPO investment decision. This study presents a model that uses machine learning and text analysis to predict whether an IPO stock price would move up or down after the first 5 trading days. Our sample includes 691 Korean IPOs from June 2009 to December 2020. The input variables for the prediction are three tone variables created from IPO prospectuses and quantitative variables that are either firm-specific, issue-specific, or market-specific. The three prospectus tone variables indicate the percentage of positive, neutral, and negative sentences in a prospectus, respectively. We considered only the sentences in the Risk Factors section of a prospectus for the tone analysis in this study. All sentences were classified into 'positive', 'neutral', and 'negative' via text analysis using TF-IDF (Term Frequency - Inverse Document Frequency). Measuring the tone of each sentence was conducted by machine learning instead of a lexicon-based approach due to the lack of sentiment dictionaries suitable for Korean text analysis in the context of finance. 
For this reason, the training set was created by randomly selecting 10% of the sentences from each prospectus, and each sentence in the training set was classified manually after being read in person. Then, based on the training set, a Support Vector Machine model was utilized to predict the tone of sentences in the test set. Finally, the machine learning model calculated the percentages of positive, neutral, and negative sentences in each prospectus. To predict the price movement of an IPO stock, four different machine learning techniques were applied: Logistic Regression, Random Forest, Support Vector Machine, and Artificial Neural Network. According to the results, models that use quantitative variables using technical analysis and prospectus tone variables together show higher accuracy than models that use only quantitative variables. More specifically, the prediction accuracy was improved by 1.45 percentage points in the Random Forest model, 4.34 percentage points in the Artificial Neural Network model, and 5.07 percentage points in the Support Vector Machine model. After testing the performance of these machine learning techniques, the Artificial Neural Network model using both quantitative variables and prospectus tone variables was the model with the highest prediction accuracy, at 61.59%. The results indicate that the tone of a prospectus is a significant factor in predicting the price movement of an IPO stock. In addition, the McNemar test was used to verify whether the difference between the models was statistically significant. The model using only quantitative variables and the model using both the quantitative variables and the prospectus tone variables were compared, and it was confirmed that the predictive performance improved significantly at a 1% significance level.
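
A minimal sketch of the TF-IDF weighting mentioned in the abstract above, in plain Python. The abstract does not give the exact TF-IDF variant, tokenizer, or SVM settings used, so the formula here (term frequency times log(N / document frequency)), the function name `tf_idf`, and the example sentences are all illustrative assumptions, not the authors' implementation.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical, pre-tokenized sentences (not taken from the study's data).
sentences = [
    ["risk", "of", "loss", "is", "high"],
    ["risk", "of", "delay", "is", "low"],
    ["growth", "is", "strong"],
]
weights = tf_idf(sentences)
```

Under this variant, a term that occurs in every sentence (here "is") gets weight zero, while a term confined to one sentence (here "loss") gets the largest weight in that sentence. In a pipeline like the one described, vectors of such weights would be the features passed to a classifier such as a Support Vector Machine; that step is not shown here.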

Study on the Meaning of Yin-Yang and Sasang in the "Huangdineijing" ("황제내경(黃帝內經)"의 '음양(陰陽)'과 '태양(太陽).소양(少陽).소음(少陰).태음(太陰)'의 의미 고찰)

  • Lee, Ok Youn;Jung, Yun Im;Bae, Go Eun;Kwon, Young Kyu
    • Journal of Physiology & Pathology in Korean Medicine / v.28 no.6 / pp.577-584 / 2014
  • The purpose of this study was to examine how the terms Yin-Yang and Sasang are categorized in the book "Huangdineijing". To investigate how the terms are used, we reviewed the passages in which they are expressed in the form [(Yin/Yang) within (Yin/Yang)] or [Sasang]. We found three forms of expression: [(Yin/Yang) within (Yin/Yang)], [Sasang], and [(Sasang) within (Yin/Yang)]. Two paragraphs of [(Yin/Yang) within (Yin/Yang)] were found in one chapter, two paragraphs of [(Sasang)] were found in two chapters, and three paragraphs of [(Sasang) within (Yin/Yang)] were found in three chapters. We found five types of relation between [(Yin/Yang) within (Yin/Yang)], [Sasang], and the five phases in "Huangdineijing", as follows: (1) Yang within Yin, lesser Yang, and wood; (2) Yang within Yang, greater Yang, and fire; (3) ( ), ( ), and extreme Yin; (4) Yin within Yang, lesser Yin, and metal; and (5) Yin within Yin, greater Yin, and water. For [(Yin/Yang) within (Yin/Yang)] and [(Sasang) within (Yin/Yang)], the classification criteria for Yin-Yang were brightness, abdomen/back, or lumbar. The order of Sasang in the description form of [Sasang] or [(Sasang) within (Yin/Yang)] in "Siqi Tiaoshen Dalun" and "Liu Jie ZangXiang Theory" was lesser Yang, greater Yang, greater Yin, and lesser Yin, which is based on the meridian system or a plant-shaped change order. We discussed the results and their implications for the analysis of medical classics, taking into account previous studies on Yin-Yang theories in "Huangdineijing".

A Study on Information Services of Korean Literature Houses (국내 문학관 웹사이트의 정보 제공 개선 방안 연구)

  • Choi, Seongyeon;Seong, Heehye;Han, Jiyoon;Lee, Hye-Eun
    • Journal of the Korean BIBLIA Society for library and Information Science / v.32 no.3 / pp.265-284 / 2021
  • This study presents improvement plans based on an examination of how Korean literature house websites provide information services. Seventy-nine of the eighty-eight members of the Korean Literature House Association were studied; the nine that had not built websites were excluded. Three core elements (website style, literary works information, and writer information), together with thirteen sub-elements, were derived from previous studies. It was found that 90% of the literature houses were operating websites, but that the classification criteria for literary works and the cataloguing rules were not unified, and that literature information was not provided sufficiently. This study therefore suggests improvement plans such as support for building websites, developing cataloguing guidelines for literature houses, providing more full-text literature, and providing information about literary works and writers.

A study on the indications of Five Viscera Source Point Acupuncture extended from Taegeuk Acupuncture : Focused on Yeoungchu(靈樞) (태극침법(太極鍼法)의 확장형인 오장원혈침법(五臟原穴鍼法)의 적응증 연구 - "황제내경(黃帝內經).영추(靈樞)"를 중심으로 -)

  • Moh, Han Young;Lim, Gyo-Min;Baek, Jin-Ung
    • Journal of Korean Medical classics / v.25 no.4 / pp.123-147 / 2012
  • Objective: Taking Five Viscera Source Point Acupuncture as the target acupuncture treatment for standardization, this study was conducted, as a first step, to sort out the indications of each acupuncture remedy, which are among the most important factors in acupuncture treatment, based on Yeoungchu. Method: This study selected only the contents related to indications of the five viscera, by extracting the relevant sentences from Yeoungchu using the search words Liver (Liver Meridian, First Yin), Heart (Pericardium, Heart Meridian, Second Yin), Spleen (Spleen Meridian, Third Yin), Lung (Lung Meridian, Third Yin), and Kidney (Kidney Meridian, Second Yin). Result & Conclusion: 1. We selected and extracted text related to liver disease from Chapter 16, heart (pericardium) disease from Chapter 16, spleen disease from Chapter 19, lung disease from Chapter 17, and kidney disease from Chapter 17 of Yeoungchu. 2. The basic theory of applying Five Viscera Source Point Acupuncture to diseases of the five viscera is first to sort the diseases according to their state (i.e. deficiency or excess), then to drain the source point of the appropriate viscus in case of excess, or to supplement the source point of the appropriate viscus in case of deficiency. 3. For the correct application of Five Viscera Source Point Acupuncture, the classification of the disease, and not only the judgement on its state, must be presented systematically and synthetically in combination with the Four Examinations. Follow-up studies therefore need to be conducted.

The Major Technology Distribution Analysis of Domestic Defense Companies in Naval Ships based on Patent Information Data (함정 분야 방산업체 주요 기술 분포 분석)

  • Kim, Jang-Eun
    • Journal of the Korea Academia-Industrial cooperation Society / v.21 no.7 / pp.625-637 / 2020
  • When deciding on naval ship weapon system acquisition for national policy and market economy activities, decision makers can determine policy based on the current level, concentration, and utilization of technology. To support this, decision makers can apply an analysis of major common technology fields using patent data. Patent data of domestic mobile carriers can be collected through the Korea Intellectual Property Rights Information System of the Korean Intellectual Property Office. In this way, we collected 14,964 patents covering 352 International Patent Classification (IPC) types. Based on these data, we performed three analysis processes (SNA, PCA, ARIMA, Text Mining) and extracted 58 IPC types from SNA and 7 IPC types from PCA. Based on the analysis results, we confirmed that 7 IPC types (B63B, H01M, F03D, B01D, H02K, B23K, H01H) make up the major common technology distribution of domestic defense companies.

An Analysis on the Elementary 2nd·3rd Students' Problem Solving Ability in Addition and Subtraction Problems with Natural Numbers (초등학교 2·3학년 학생들의 자연수의 덧셈과 뺄셈에 대한 문제해결 능력 분석)

  • Jeong, So Yun;Lee, Dae Hyun
    • Education of Primary School Mathematics / v.19 no.2 / pp.127-142 / 2016
  • The purpose of this study was to examine students' problem solving ability according to the numeric expression and the semantic types of addition and subtraction word problems. To this end, we analyzed the addition and subtraction calculation ability and the word problem solving ability of selected 2nd-grade (118) and 3rd-grade (109) students. We reached the following conclusions. When the students took the survey assessing their ability to solve numerical expressions and word problems, the correct answer rates for the result-unknown problems were higher than those for the change-unknown or start-unknown problems. In the result-unknown addition word problems, which often appear in textbooks, the correct answer rate for the change add-into situation was higher than that for the part-part-whole situation. Likewise, in the result-unknown subtraction word problems that often appear in textbooks, the correct answer rate for the change take-away situation was the highest. This was probably because the students had frequently experienced similar situations in the textbooks. We found that the students' formal calculation ability was a precondition for successful word problem solving, but not a sufficient condition for it.

Quantification of Schedule Delay Risk of Rain via Text Mining of a Construction Log (공사일지의 텍스트 마이닝을 통한 우천 공기지연 리스크 정량화)

  • Park, Jongho;Cho, Mingeon;Eom, Sae Ho;Park, Sun-Kyu
    • KSCE Journal of Civil and Environmental Engineering Research / v.43 no.1 / pp.109-117 / 2023
  • Schedule delays present a major risk factor, as they can adversely affect construction projects through increased construction costs, claims from a client, and/or a decrease in construction quality when stages are trimmed to catch up on lost time. Risk management has been conducted according to the importance and priority of schedule delay risk, but quantification of the depth of schedule delay risk tends to be inadequate due to limitations in data collection. Therefore, this research used the BERT (Bidirectional Encoder Representations from Transformers) language model to convert the contents of a construction log, which comprised unstructured data, into WBS (Work Breakdown Structure)-based structured data, and to form a model for the classification and quantification of risk. The process was applied to eight highway construction sites, and 75 cases of rain schedule delay risk were obtained from 8 out of 39 detailed kinds of work. Through a K-S test, a significant probability distribution was derived for four kinds of work, and the risk impact was compared. The process presented in this study can be used to derive various schedule delay risks in construction projects and to quantify their depth.
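
The K-S (Kolmogorov-Smirnov) test mentioned in the abstract above compares a sample against a candidate probability distribution. A minimal sketch of the one-sample test statistic follows. The abstract does not say which distributions were fitted, so the exponential candidate, the function names, and the delay values are hypothetical and only illustrate the technique.

```python
import math


def ks_statistic(sample, cdf):
    """One-sample K-S statistic: largest gap between the empirical CDF and cdf."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs, start=1):
        f = cdf(x)
        # The empirical CDF jumps from (i - 1) / n to i / n at x; check both sides.
        d = max(d, i / n - f, f - (i - 1) / n)
    return d


def exponential_cdf(rate):
    """CDF of an exponential distribution (an assumed candidate, not from the paper)."""
    def cdf(x):
        return 1.0 - math.exp(-rate * x) if x >= 0 else 0.0
    return cdf


# Hypothetical rain-delay durations in days (not data from the study).
delays = [1.0, 2.0, 2.0, 3.0, 5.0, 8.0]
rate = len(delays) / sum(delays)  # rate estimated as 1 / sample mean
d_stat = ks_statistic(delays, exponential_cdf(rate))
```

A small statistic means the candidate distribution tracks the data closely. Turning it into a p-value requires the K-S distribution, and the usual critical values need adjustment when the parameters are estimated from the same sample, as they are in this sketch.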

A Study on the Direction of Reading and Information Service through Analysis of Digital Reading and Information Literacy Competencies Evaluation Items: Focusing on PIAAC and PISA (디지털 독서 및 정보 리터러시 평가 문항 분석을 통한 독서 및 정보 서비스의 방향 탐색 - PIAAC와 PISA를 중심으로 -)

  • Park, Juhyeon
    • Journal of the Korean Society for Library and Information Science / v.52 no.3 / pp.61-89 / 2018
  • The purpose of this study is to analyze the items related to digital reading and information literacy measured by PIAAC and PISA, to examine the measurement contents and methods of these literacy items, and to derive implications for the reading and information services provided by librarians at public libraries and by teacher librarians. To answer the questions measuring digital reading literacy and digital information literacy, respondents commonly needed ICT skills as well as cognitive strategies. However, the digital reading literacy items emphasized the ability to comprehend and think critically about texts, whereas the digital information literacy items emphasized the ability to use ICT skills, navigate, and evaluate whether or not to read the retrieved text. Librarians and teacher librarians need to encourage readers to read and to provide customized competency improvement programs that reflect the performance results and characteristics of a particular group. It is also necessary to improve and develop the library environment so that library users can understand and use the library search system and the Korean decimal classification.

Clinical Practice Guideline for Taeeumin Disease of Sasang Constitutional Medicine: Esophagus Cold-based Exterior Cold (Wiwansuhan-pyohan) disease (태음인체질병증 임상진료지침: 표병)

  • Choi, Ae-Ryun;Shin, Mi-Ran;Lee, Eui-Ju
    • Journal of Sasang Constitution and Immune Medicine / v.27 no.1 / pp.42-56 / 2015
  • Objectives: This research presents a Clinical Practice Guideline (CPG) for Taeeumin Disease of Sasang Constitutional Medicine (SCM): Esophagus Cold-based Exterior Cold (Wiwansuhan-pyohan) disease. This CPG was developed by a nationwide expert committee consisting of SCM professors. Methods: First, literature related to SCM, such as Donguisusebowon, Text book of SCM, Clinical Guidebook of SCM, and Fundamental research to standardize diagnosis of Sasang Constitutional Medicine, was collected and organized. Second, journals related to clinical trials or Human complementary medicine of SCM were searched. Finally, 7 articles were selected and included in the CPG for Esophagus Cold-based Exterior Cold (Wiwansuhan-pyohan) disease. Results & Conclusions: The CPG for Esophagus Cold-based Exterior Cold (Wiwansuhan-pyohan) disease in Taeeumin Disease includes the classification, definition, and standard symptoms of each pattern. Esophagus Cold-based Exterior Cold (Wiwansuhan-pyohan) disease consists of two aspects: Esophagus-Cold (Wiwanhan) and Esophagus-Cold Lung-Dry (Wiwanhan-paejo) symptomatology. Esophagus-Cold (Wiwanhan) symptomatology is classified into mild and moderate patterns by severity. The mild pattern of Esophagus-Cold (Wiwanhan) symptomatology is classified into the Supraspinal Exterior (Baechu-pyo) initial pattern and the Wheezing-Dyspnea (Hyocheon) pattern. The moderate pattern of Esophagus-Cold (Wiwanhan) symptomatology is classified into the Cold-reversal (Hanguel) pattern and the Cold-reversal (Hanguel) advanced pattern. Esophagus-Cold Lung-Dry (Wiwanhan-paejo) symptomatology is classified into severe and critical patterns by severity. The severe pattern of Esophagus-Cold Lung-Dry (Wiwanhan-paejo) symptomatology is classified into the Dry-Cold (Johan) pattern and the Dry-Cold (Johan) advanced pattern. The critical pattern of Esophagus-Cold Lung-Dry (Wiwanhan-paejo) symptomatology consists of the Dry-Cold (Johan) intense pattern (Eumhyeol-mogal handa pattern).