• 제목/요약/키워드: Text Collection

검색결과 302건 처리시간 0.026초

Analysis of Infertility Keywords in the Largest Domestic Mom Cafe Bulletin Board in Korea Using Text Mining

  • Sangmin Lee
    • 인터넷정보학회논문지
    • /
    • 제24권4호
    • /
    • pp.137-144
    • /
    • 2023
  • The purpose of this study is to examine consumers' perceptions of domestic infertility support policies based on infertility-related keywords and the trends of their changes. To this end, Momsholic, a mom cafe which has the most active infertility-related bulletin boards on Naver, was selected as the analysis target, and 'infertility' was selected as a keyword for data search. The data was collected for three months. In addition, network analysis and visualization were performed using R for data collection and analysis, and cross-validation was attempted using the NetDraw function of 'textom 1.0' and the UCINET6 program. As a result of the analysis, the main keywords were cost, artificial insemination, in vitro fertilization, freezing, harvest, ovulation, and how much. Next, looking at the central value of the degree of connection, it was found that the degree of connection between the words cost, cost, how much, problem, public health center, and artificial insemination was high. According to the results of this study, women who visit mom cafes due to infertility in Korea are more interested in the cost. It is believed to be closely related to infertility treatment as well as in vitro fertilization and egg freezing. Therefore, by examining keywords related toinfertility, it has academic significance in that it is possible to identify major factors that end users are interested in. Furthermore, it is possible to redefine the guidelines for domestic infertility support policies by presenting infertility support policies that reflect the factors of interest of end consumers.

Topic Modeling of Korean Newspaper Articles on Aging via Latent Dirichlet Allocation

  • Lee, So Chung
    • Asian Journal for Public Opinion Research
    • /
    • 제10권1호
    • /
    • pp.4-22
    • /
    • 2022
  • The purpose of this study is to explore the structure of social discourse on aging in Korea by analyzing newspaper articles on aging. The analysis is composed of three steps: first, data collection and preprocessing; second, identifying the latent topics; and third, observing yearly dynamics of topics. In total, 1,472 newspaper articles that included the word "aging" within the title were collected from 10 major newspapers between 2006 and 2019. The underlying topic structure was analyzed using Latent Dirichlet Allocation (LDA), a topic modeling method widely adopted by text mining academics and researchers. Seven latent topics were generated from the LDA model, defined as social issues, death, private insurance, economic growth, national debt, labor market innovation, and income security. The topic loadings demonstrated a clear increase in public interest on topics such as national debt and labor market innovation in recent years. This study concludes that media discourse on aging has shifted towards more productivity and efficiency related issues, requiring older people to be productive citizens. Such subjectivation connotes a decreased role of the government and society by shifting the responsibility to individuals not being able to adapt successfully as productive citizens within the labor market.

풍석(楓石) 서유구(徐有榘)의 『임원경제지(林園經濟志)』 「인제지(仁濟志)」 '탕액운휘(湯液韻彙)'와 처방 제형에 대한 연구 - '방(方)'을 중심으로 - (A Study on the 'Tangaek-Unhoei(湯液韻彙)' Index of Herbal Medicine in the Inje-Ji(仁濟志) of the Imwon-Gyeongje-Ji(林園經濟志), by Seo-Yugu(徐有榘) Focusing on 'Fang(方)')

  • 전종욱
    • 대한한의학원전학회지
    • /
    • 제36권4호
    • /
    • pp.25-40
    • /
    • 2023
  • Objectives : This paper studies the Tangaek-Unhoei(湯液韻彙) index of herbal medicine in the Inje-Ji(仁濟志) of the Imwon-Gyeongje-Ji(林園經濟志), which contains about 4,800 formulas. Created by 19th-century Joseon scholar Seo, Yugu, it not only lists the formulas according to their names, but also provides index by topic, which enabled the collection and effective application of massive medical information. Methods : We quantitatively examined the nearly 4,800 herbal medicines in the Tangaek-Unhoei and their categorization. Any uncommon or particular categorization was examined further by analyzing the original text. Results & Conclusions : The prescriptions contained in the Inje-Ji are categorized under 26 headings. They are listed according to the 106 units of the Chinese character dictionary and organized by double headings. This unique index makes it easy to browse the contents of such a vast book containing massive medicinal knowledge. In addition, the fifty or so remedies called 'Fang(方)' exemplify the author's attitude toward medicinal knowledge, which is both rational and inclusive. This is an attitude that should be recognized beyond tradition.

토픽모델링을 활용한 해운물류 뉴스 분석 (Analysis of Shipping and Logistics News Articles using Topic Modeling)

  • 윤희영;곽일엽
    • 무역학회지
    • /
    • 제46권4호
    • /
    • pp.61-76
    • /
    • 2021
  • This study focuses on three logistics-related news (Logistics Newspaper, Korea Shipping Gadget, and Korea Shipping Newspaper) in order to present changes in logistics issues, centering on Corona 19, which has recently had the greatest impact in the world. For data collection, two-year news articles in 2019 and 2020 (title, article, content, date, article classification, article URL) were collected through web crawling (using Python's BeautifulSoup, requests module) on the homepages of three representative logistics-related media companies. As for the data analysis methods, fundamental statistical analysis, Latent Dirichlet Allocation (LDA) for topic modeling, and Scattertext were performed. The analysis results were as follows. First, among the three news media related to logistics, the Korea Shipping Newspaper was carrying out the most active media activities. Second, through topic modeling with LDA, eight logistics-related topics were identified, and keywords and significant issues of each topic were presented. Third, the keywords were visually expressed through Scattertext. This is the first study to present changes in the logistics field, focusing on articles from representative logistics-related media in 2019 and 2020. In particular, 2019 and 2020 can be divided into before and after the outbreak of Corona 19, which has had a great impact not only on the logistics field but also on our lives as a whole. For future work, a multi-faceted approach is required, such as comparative studies of logistics issues between countries or presenting implications based on long-term time-series articles.

대한민국 정권별 아동복지정책 관련 뉴스 기사 분석: K-평균 군집 분석 (Analysis of News Articles on Child Welfare Policies in South Korea: K-Means Clustering)

  • 김은주;김성광;박빛나
    • 동서간호학연구지
    • /
    • 제29권2호
    • /
    • pp.185-195
    • /
    • 2023
  • Purpose: The purpose of this study is to analyze changes of child welfare policies and provide insights based on the collection and classification of newspaper articles. Methods: Articles related to child welfare policies were collected from 1990, during the Kim, Young-sam administration, to May 9, 2022, under the Moon, Jae-in administration. K-Means clustering and keyword Term Frequency-Inverse Document Frequency analysis were utilized to cluster and analyze newspaper articles with similar themes. Results: The administrations of Kim, Young-sam, Kim, Dae-jung, Roh, Moo-hyun, and Park, Geun-hye were classified into two clusters, and the Lee, Myung-bak and Moon, Jae-in administrations were classified into three clusters. Conclusion: South Korea's child welfare policies have focused on ensuring the safety and healthy development of children through diverse policies initiatives over the years. However, challenges related to child protection and child abuse persist. This requires additional resources and budget allocation. It is important to establish a comprehensive support system for children and families, including comprehensive nursing support.

A Study on User Perception of Tourism Platform Using Big Data

  • Se-won Jeon;Sung-Woo Park;Youn Ju Ahn;Gi-Hwan Ryu
    • International journal of advanced smart convergence
    • /
    • 제13권1호
    • /
    • pp.108-113
    • /
    • 2024
  • The purpose of this study is to analyze user perceptions of tourism platforms through big data. Data were collected from Naver, Daum, and Google as big data analysis channels. Using semantic network analysis with the keyword 'tourism platform,' a total of 29,265 words were collected. The collection period was set for two years, from August 31, 2021, to August 31, 2023. Keywords were analyzed for connected networks using TexTom and Ucinet programs for social network analysis. Keywords perceived by tourism platform users include 'travel,' 'diverse,' 'online,' 'service,' 'tourists,' 'reservation,' 'provision,' and 'region.' CONCOR analysis revealed four groups: 'platform information,' 'tourism information and products,' 'activation strategies for tourism platforms,' and 'tourism destination market.' This study aims to expand and activate services that meet the needs and preferences of users in the tourism field, as well as platforms tailored to the changing market, based on user perception, current status, and trend data on tourism platforms.

"침구갑을경(鍼灸甲乙經)"의 침구문헌적(鍼灸文獻的) 특징(特徵)에 관한 연구(硏究) (A Study of Acupuncture Documentary Characteristics of "Chimgugapelgyeong(鍼灸甲乙經)")

  • 김정호;김기욱;박현국
    • 대한한의학원전학회지
    • /
    • 제22권1호
    • /
    • pp.35-59
    • /
    • 2009
  • The acupuncture documentary characteristics of the "Chimgugapeulgyeong" can be summarized into 7 parts such as the following. 1. After Imeok(林億)'s revised edition of the "Gapeulgyeong(甲乙經)" was printed during the Song dynasty, there were no reprints during the Southern Song, Geum(金) and Won(元) eras, and the first printed edition that remains today is the 'Uihakyukgyeong edition[醫學六經本]' published by Omyeonhak(吳勉學) during the Mallyeok(萬曆) era of the Myeong(明) dynasty. This publication was put into the "Uitongjeongmaek(醫統正脈)" collection in the 29th year of the Manlleok(萬曆) era(1601). Most of the remaining copies have been restored during the Cheong dynasty at bookstores, and we can see that much was restored because of damage and missing characters. Also, the 'Namgyeokcho edition[藍格抄本]' and 'Yukgyeong edition[六經本]' of the Myeong dynasty do not come from the same original document, which allows the correction of the former in many places. However, this edition was not copied well, so the order of contents is different, and there are many mistakes. The 'Sagojeonseo edition[四庫全書本]' and the 'Gajeong edition[嘉靖本]', which Yeounsu(余云岫) quoted from, coincide with each other, making them worth much reference. So, the "Gapeulgyeong" and 'Yukgyeong edition' should be seen as the original, with the 'Myeongcho edition[明抄本]' as the main revision, and the 'Sago edition[四庫本]' as a reference edition. The so-called 'Chojeongtong edition(鈔正統本)' has many problems and marks of forgery, so therefore cannot be used in revising the "Gapeulgyeong" through comparison. 2. The table of contents[序例] in the front of the current edition was in the original edition and was not added by Imeok. The structure of sentences quoted by medical books before the Song dynasty coincide with this 'table of contents'. The "Gapeulgyeong" of the Song dynasty also coincide with the 'table of contents' but the edition remaining differs much from this 'table of contents' so it was edited or erased by people from future generations, especially after the Song dynasty. 3. The remaining edition of "Gapeulgyeong" consists of at least 4 parts. The original edited by Hwangbomil(皇甫謐), annotations added by medicinal practitioners before the Song dynasty, Imeok's revisionary annotations during the Song dynasty, and annotations after the Song dynasty. 4. Expressions such as 'Somun says[素問曰]' 'Gugwon says[九卷曰]' and explanatory annotations like 'Hae says[解曰]' are old writings from the original text and were not added by someone later. 5. Almost all of the 'Double lined small letter annotations[雙行小字注文]' of the 'Yukgyoeng edition' was by people during the Song dynasty. 6. There are many omitted and wrong letters in the remaining edition and there are also many places where future generations edited and supplemented the text. The table of contents differ greatly from the original text. 7. The medical books that quote "Gapeulgyeong" a lot are "Cheongeumyobang(千金要方)", "Oedaebiyobang(外臺秘要方)", "Seongjaechongrok(聖濟總錄)", "Chimgujasaenggyeong(鍼灸資生經)", "Yuyusinseo(幼幼新書)", and "Uihakgangmok(醫學綱目)" and such. However, the method used in using the text differs between the medical books, so the quotation from the same book comes from a quotation used by a doctor from a different era in one("Cheongeumyobang"), or the quotation was taken from each medical book("Chimgujasaenggyeong") or the quotation was all taken from another book("Yuyusinseo"). The reason we need to know about this problem properly is because we must use medical books that quote the original text of the "Gapeulgyeong" when we are looking for text that we can use to revise through comparison.

  • PDF

두사경(杜思敬)의 "제생발수(濟生拔粹)"에 수록된 침구의적(鍼灸醫籍)에 관한 문헌 (A Study on the documentary characteristics of acupuncture and moxibustion recorded in Dusagyeong(杜思敬)'s "Jesaengbalsu(濟生拔粹)")

  • 김정호;김기욱;박현국
    • 대한한의학원전학회지
    • /
    • 제22권2호
    • /
    • pp.71-83
    • /
    • 2009
  • The documentary characteristics of acupuncture and moxibustion recorded in Dusagyeong(杜思敬)'s".Jesaengbalsu(濟生拔粹)" can be summarized into 3 major parts: 1. "Gyeolgo-ungichimbeop(潔古雲岐鍼法)" and "Dutaesachimbeop(竇太師鍼法)" 1) "Gyeolgo-ungichimbeop" was edited by Dusagyeong of the Won dynasty, and was recorded in "Jesaengbalsu". Du was influenced by his teacher Heohyeong(許衡) and followed Janggyeolgo(張潔古) and his son Jangbyeok(張璧), and collected his work "Chimgu-pyeon(鍼灸篇)" for Jang and named it "Gyeolgo-ungichimbeop", and took the content from the medical book of Jang and his student Wang-haejang(王海藏). (2) "Jesaengbalsu"'s original edition exists today. The "Gyeolgo-ungichimbeop" listed in "Jesaengbalsu"'s index contain two collections, the first collection being "Gyeolgo-ungichimbeop" and the second collection being "Dutaesachimbeop(竇太師鍼法)" (3) Gyeolgo(潔古)、Un-gija(雲岐子)'s acupuncture methods can be seen in Un-gija "Bomyeongjipryuyo(保命集類要)" and Wanghaejang "Chasananji(此事難知)". (4) The related acupuncture methods are 'Non-gyeong-rak-yeongsubosabeop(論經絡迎隨補瀉法)', 'Gyeong-rakchwiwonbeop(經絡取原法)', 'Jeopgyeongbeop(接經法)', and 'Sang-hanyeolbyeongjabeop(傷寒熱病刺法)' (5) Du's edition of the entire text of 'Gyeolgojajetongbeop(潔古刺諸痛法)' 'Jasimtongjehyeol(刺心痛諸穴)' and the first half of 'Jeopgyeongbeop(接經法)' is all recorded in "Somunbyeonggigi-uibomyeongjip(素問病機氣宜保命集)". The existing "Somunbyeonggigi-uibomyeongjip" is a combination of the unfinished posthumous work of Yuwanso(劉完素), "Gi-ui(氣宜)" and "Byeonggi(病機)" with works such as Jangwonso(張元素)'s '"Bomyeongseo(保命書)"'. (6) Of the titles "Gyeolgo-ungichimbeop" and "Dutaesachimbeop", the 14$\sim$19th chapters "Dutaesachimbeop" should be concentrated at the end of the chapter, and the 16th chapter that Du added was put after chapter 14 "Yujujiyobu(流注指要賦)", and chapters 20, 21 should be put in "Gyeolgoungichimbeop" after chapter 13. 2. "Chimgyeongjeok-yeongjip(鍼經摘英集)" (1) "Chimgyeongjeok-yeongjip" is a collection of the acupuncture and moxibustion contents of medical books from the Geum and Won dynasties that Dusagyeong collected and organized during the Won dynasty, which is consisted of 5 chapters : "Guchimshik(九鍼式)", "Jeolyangchwisuhyeolbeop(折量取腧穴法)", "Bosabeop(補瀉法)", "Yongchimhoheupbeop(用鍼呼吸法)", "Chibyeongjik-ralgyeol(治病直剌訣)". (2) First, the contents. The nine acupuncture needles[九鍼] listed in "Guchimshik(九鍼式)" is the first existing document recording to systematically illustrate the 'nine classical needles' in drawing and text form which reflects the forms of the needles of the era. Second, "Jeolyangchwisuhyeolbeop(折量取腧穴法)" has the same basic way of measuring points [量穴法] as Wang-yuil's "Dong-insuhyeolchimgudo-gyeong(銅人腧穴鍼灸圖經)" and the same point selection rules as "Jeonyeongbang(全嬰方)". Third, in "Bosabeop(補瀉法)", "Somun(素問)" and Janggyeolgo's "Yeongsubosabeop(迎隨補瀉法)" is put together. Fourth, in "Yongchimhoheupbeop(用鍼呼吸法)", the cold and heat supplementation and draining [寒熱補瀉] method that combines breathing with inner and outer rotation[外 內撚] is recorded. Fifth, "Chi-byeongjik-ralgyeol(治病直剌訣)" is the main part of "Chimgyeongjeok-yeongjip(鍼經摘英集)" listing 69 acupuncture treatments reflecting Du's scholastic ideas on aspects such as syndrome differentiation[辨證], needling method and type of needle[鍼具]. (3) The content of this book was quoted by "Bojebang Chimgumun(普濟方 鍼灸門)" and when Gomu compiled "Chimguchwiyeong", he put the acupuncture treatments for the main indications of the disease patterns[鍼方主治病證] of this book in the related main indications of acupuncture points[腧穴主治證], which influenced books on acupuncture points there after. 3. "Chimgyeongjeolyo(鍼經節要)" (1) Consists of 1 volume. The original title of this book is "Dong-insuhyeolchimgudo-gyeong (銅人腧穴鍼灸圖經)" and the author is Wang-yuil of the Northern Song dynasty, written in the 4th year of the Cheonseong(天聖) era of the Song dynasty(1026). (2) Dusagyeong selected the contents on pathology of the 12 meridians in volume one and two, the introduction and five transport points[五輸穴] in volume 5 of "Dong-indo-gyeong(銅人圖經)" and named it "Chimgyeongjeolyo." During the Won dynasty it was recorded in "Jesaengbalsu".

  • PDF

조선간본(朝鮮刊本) 『유향신서(劉向新序)』의 서지·문헌 연구 (A Bibliographical and Literary Research on the Xinxu(新序) of the Published edition in Joseon)

  • 류승현;민관동
    • 비교문화연구
    • /
    • 제51권
    • /
    • pp.257-257
    • /
    • 2018
  • 조선간본(朝鮮刊本) "유향신서(劉向新序)"는 이극돈(李克墩)이 1492년경 출간하도록 한 판본이다. 현존하는 조선간본(朝鮮刊本) "유향신서(劉向新序)"는 민관동에 의해 계명대학교 소장본 2종(上冊)과 김준식(金俊植) 집안(이하에서는 '후조당(後彫堂)'으로 약칭)의 소장본(下冊) 그리고 분실된 최재석(崔在石) 소장본과 김용기(金用基) 소장본이 발굴되었다. 필자는 이외에 한국학중앙연구원(하책(下冊)) 경기대학교(하책(下冊)) 일본국회도서관(완질) 아단문고(상책(上冊)) 성암고서박물관(하책(下冊)) 소장본의 존재를 확인하였다. 본 논문은 위의 판본들 중 원문을 확인할 수 있는 판본인 계명대 한국학중앙연구원 경기대 후조당(後彫堂) 일본국회도서관 소장본을 대상으로 조선간본(朝鮮刊本)의 특징들을 연구하였다. 상책(上冊)의 경우, 계명대 귀중본은 '초각본(初刻本)'이고 고본은 '초각본(初刻本)'의 제69~70면과 제71~72면을 보각(補刻)한 '보각본(補刻本)'이다, 또한 일본국회도서관 소장본도 고본과 동일한 면이 보각(補刻)된 '보각본(補刻本)'이다. 하책(下冊)의 경우, 한국학중앙연구원과 경기대 소장본은 '초각본(初刻本)'이고, 후조당(後彫堂)과 일본국회도서관 소장본은 제9~10면 제63~64면 제87~88 제107~108면을 상책(上冊)과 마찬가지로 해당 면을 보각(補刻)한 '보각본(補刻本)'이다. 현존 판본들을 비교해보면, 상책(上冊)의 경우 현존하는 판본으로는 2회에 걸쳐 인출되었음만 단정할 수 있고, 하책(下冊)의 경우에는 현존하는 4종의 판본들은 3회에 걸쳐 인쇄가 이루어졌음을 확정할 수 있다. 필자는 이어서 현존 판본들을 바탕으로 실제 문헌에 대한 연구를 진행하여 조선간본(朝鮮刊本)의 특징을 도출하였다. 먼저 권수제(卷首題) 권미제(卷尾題)와 문단의 형식을 논술하였고, 그리고 본문은 원칙적으로 자수(字數)가 '11행(行)18자(字)'로 되어있으나, 실제 판본 상에는 '18자(字)'가 아닌 경우들을 찾아내 해당 면(面)과 행(行)의 자수(字數)를 표로 제시하였으며, 또한 소자주(小字註)가 쌍행(雙行)으로 된 경우를 고찰하였다. 그 다음으로는 조선간본(朝鮮刊本)에는 원문에 빈칸이 나타나는데 해당부분과 해당 글자를 모두 밝혔으며, 마지막으로 조선간본(朝鮮刊本)의 '오탈자(誤脫字)'를 찾아내어 해당부분을 명기하고 오류의 이유를 구체적으로 분석하였다.

지자체 사이버 공간 안전을 위한 금융사기 탐지 텍스트 마이닝 방법 (Financial Fraud Detection using Text Mining Analysis against Municipal Cybercriminality)

  • 최석재;이중원;권오병
    • 지능정보연구
    • /
    • 제23권3호
    • /
    • pp.119-138
    • /
    • 2017
  • 최근 SNS는 개인의 의사소통뿐 아니라 마케팅의 중요한 채널로도 자리매김하고 있다. 그러나 사이버 범죄 역시 정보와 통신 기술의 발달에 따라 진화하여 불법 광고가 SNS에 다량으로 배포되고 있다. 그 결과 개인정보를 빼앗기거나 금전적인 손해가 빈번하게 일어난다. 본 연구에서는 SNS로 전달되는 홍보글인 비정형 데이터를 분석하여 어떤 글이 금융사기(예: 불법 대부업 및 불법 방문판매)와 관련된 글인지를 분석하는 방법론을 제안하였다. 불법 홍보글 학습 데이터를 만드는 과정과, 데이터의 특성을 고려하여 입력 데이터를 구성하는 방안, 그리고 판별 알고리즘의 선택과 추출할 정보 대상의 선정 등이 프레임워크의 주요 구성 요소이다. 본 연구의 방법은 실제로 모 지방자치단체의 금융사기 방지 프로그램의 파일럿 테스트에 활용되었으며, 실제 데이터를 가지고 분석한 결과 금융사기 글을 판정하는 정확도가 사람들에 의하여 판정하는 것이나 키워드 추출법(Term Frequency), MLE 등에 비하여 월등함을 검증하였다.