• Title/Summary/Keyword: Sub-text

Search Result 199, Processing Time 0.031 seconds

Research Trend Analysis in Fashion Design Studies in Korea using Topic Modeling (토픽모델링을 이용한 국내 패션디자인 연구동향 분석)

  • Jang, Namkyung;Kim, Min-Jeong
    • Journal of Digital Convergence
    • /
    • v.15 no.6
    • /
    • pp.415-423
    • /
    • 2017
  • This study explored research trends by investigating articles published in the Journal of Korean Society of Fashion Design from 2001 through 2015. English key words and abstracts were analyzed using text mining and topic modeling techniques. The findings are as followings. By the text mining technique, 183 core terms, appeared more than 30 times, were derived from 7137 words used in total 338 articles' key words and abstracts. 'Fashion' and 'design' showed the highest frequency rate. After that, the well-received topic modeling technique, LDA, was applied to the collected data sets. Several distinct sub-research domains strongly tied with the previous fashion design field, except for topics such as fashion brand marketing and digital technology, were extracted. It was observed that there are the growing and declining trends in the research topics. Based on findings, implication, limitation, and future research questions were presented.

Inferring Disease-related Genes using Title and Body in Biomedical Text (생물학 문헌 데이터의 제목과 본문을 이용한 질병 관련 유전자 추론 방법)

  • Kim, Jeongwoo;Kim, Hyunjin;Yeo, Yunku;Shin, Mincheol;Park, Sanghyun
    • KIISE Transactions on Computing Practices
    • /
    • v.23 no.1
    • /
    • pp.28-36
    • /
    • 2017
  • After the genome projects of the 90s, a vast number of gene studies have been stored in online databases. By using these databases, several biological relationships can be inferred. In this study, we proposed a method to infer disease-gene relationships using title and body in biomedical text. The title was used to extract hub genes from data in the literature; whereas, the body of the literature was used to extract sub genes that are related to hub genes. Through these steps, we were able to construct a local gene-network for each report in the literature. By integrating the local gene-networks, we then constructed a global gene-network. Subsequent analyses of the global gene-network allowed inference of disease-related genes with high rank. We validated the proposed method by comparing with previous methods. The results indicated that the proposed method is a meaningful approach to infer disease-related genes.

A Study on Smishing Block of Android Platform Environment (안드로이드 플랫폼 환경에서의 스미싱 차단에 관한 연구)

  • Lee, Si-Young;Kang, Hee-Soo;Moon, Jong-Sub
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.24 no.5
    • /
    • pp.975-985
    • /
    • 2014
  • As financial transactions with a smartphone has become increasing, a myriad of security threats have emerged against smartphones. Among the many types of security threats, Smishing has evolved to be more sophisticated and diverse in design. Therefore, financial institutions have recommended that users doesn't install applications with setting of "Unknown sources" in the system settings menu and install application which detects Smishing. Unfortunately, these kind of methods come with their own limitations and they have not been very effective in handling Smishing. In this paper, we propose a systematic method to detect Smishing, in which the RIL(Radio Interface Layer) collects a text message received and then, checks if message databases stores text message in order to determine whether Smishing malware has been installed on the system. If found, a system call (also known as a hook) is used to block the outgoing text message generated by the malware. This scheme was found to be effective in preventing Smishing as found in our implementation.

Features of the Rural Revitalization Projects in Jang-su County Using LDA Topic Analysis of News Data - Focused on Keyword of Tourism and Livelihood - (뉴스데이터의 LDA 토픽 분석을 통한 장수군 농촌지역 활성화 사업의 특징 - 관광·생활 키워드를 중심으로 -)

  • Kim, Young-Jin;Son, Yong-hoon
    • Journal of Korean Society of Rural Planning
    • /
    • v.24 no.4
    • /
    • pp.69-80
    • /
    • 2018
  • In this study, we typified the project for revitalizing the rural area through text analysis using news data, and analyzed the main direction and characteristics of the project. In order to examine the factors emphasized among the issues related to the revitalization of rural areas, we used news data related to 'tourism' and 'livelihood', which are the main keyword of the project to promote rural areas. In the analysis, text mining techniques were used. Topic modeling was conducted on LDA techniques for major projects in 'tourism' and 'livelihood' keyword. Based on this, this study typified the projects that are carried out for the activation of rural areas by topic. As a result of the analysis, it was fount that the topics included in the project were distributed in 11 sub-types(Tourism Promotion, Regional Specialization, Local Festival, Development of Regional Scale, Urban and Rural Exchange, Agricultural Support, Community Forest Management, Improve the Settlement Environment, General Welfare Service, Low Class Support, Others). The characteristics of the rural revitalization projects were examined, and it was confirmed that domestic projects were carried out by tourism-oriented projects. To summarize, the government is making projects to revitalize rural areas through related ministries. Within the structure where the project is spreading to the region, a lot of projects are being carried out. It is understood that the tourism and welfare oriented projects are being carried out in the revitalization project of the domestic rural area. Therefore, in order to achieve the goal of rural revitalization, it is believed that it will be effective to carry out a balanced project to improve the settlement environment of the residents.

A Named Entity Recognition Model in Criminal Investigation Domain using Pretrained Language Model (사전학습 언어모델을 활용한 범죄수사 도메인 개체명 인식)

  • Kim, Hee-Dou;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.2
    • /
    • pp.13-20
    • /
    • 2022
  • This study is to develop a named entity recognition model specialized in criminal investigation domains using deep learning techniques. Through this study, we propose a system that can contribute to analysis of crime for prevention and investigation using data analysis techniques in the future by automatically extracting and categorizing crime-related information from text-based data such as criminal judgments and investigation documents. For this study, the criminal investigation domain text was collected and the required entity name was newly defined from the perspective of criminal analysis. In addition, the proposed model applying KoELECTRA, a pre-trained language model that has recently shown high performance in natural language processing, shows performance of micro average(referred to as micro avg) F1-score 98% and macro average(referred to as macro avg) F1-score 95% in 9 main categories of crime domain NER experiment data, and micro avg F1-score 98% and macro avg F1-score 62% in 56 sub categories. The proposed model is analyzed from the perspective of future improvement and utilization.

BEHIND CHICKEN RATINGS: An Exploratory Analysis of Yogiyo Reviews Through Text Mining (치킨 리뷰의 이면: 텍스트 마이닝을 통한 리뷰의 탐색적 분석을 중심으로)

  • Kim, Jungyeom;Choi, Eunsol;Yoon, Soohyun;Lee, Youbeen;Kim, Dongwhan
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.11
    • /
    • pp.30-40
    • /
    • 2021
  • Ratings and reviews, despite their growing influence on restaurants' sales and reputation, entail a few limitations due to the burgeoning of reviews and inaccuracies in rating systems. This study explores the texts in reviews and ratings of a delivery application and discovers ways to elevate review credibility and usefulness. Through a text mining method, we concluded that the delivery application 'Yogiyo' has (1) a five-star oriented rating dispersion, (2) a strong positive correlation between rating factors (taste, quantity, and delivery) and (3) distinct part of speech and morpheme proportions depending on review polarity. We created a chicken-specialized negative word dictionary under four main topics and 20 sub-topic classifications after extracting a total of 367 negative words. We provide insights on how the research on delivery app reviews should progress, centered on fried chicken reviews.

A Model of Natural Language Information Retrieval Using Main Keywords and Sub-keywords (주 키워드와 부 키워드를 이용한 자연언어 정보 검색 모델)

  • Kang, Hyun-Kyu;Park, Se-Young
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.12
    • /
    • pp.3052-3062
    • /
    • 1997
  • An Information Retrieval (IR) is to retrieve relevant information that satisfies user's information needs. However a major role of IR systems is not just the generation of sets of relevant documents, but to help determine which documents are most likely to be relevant to the given requirements. Various attempts have been made in the recent past to use syntactic analysis methods for the generation of complex construction that are essential for content identification in various automatic text analysis systems. Unfortunately, it is known that methods based on syntactic understanding alone are not sufficiently powerful to Produce complete analyses of arbitrary text samples. In this paper, we present a document ranking method based on two-level ranking. The first level is used to retrieve the documents, and the second level to reorder the retrieved documents. The main keywords used in the first level can be defined as nouns and/or compound nouns that possess good document discrimination powers. The sub-keywords used in the second level can be also defined as adjectives, adverbs, and/or verbs that are not main keywords, and function words. An empirical study was conducted from a Korean encyclopedia with 23,113 entries and 161 Korean natural language queries collected by end users. 850% of the natural language queries contained sub-keywords. The two-level document ranking methods provides significant improvement in retrieval effectiveness over traditional ranking methods.

  • PDF

A Study on Attire and Accessories as Recorded in the Imwon Sipyukji - Focusing on Boksik Jigu - (『임원십육지(林園十六志)』에 나타난 복식(服飾)에 대한 연구(硏究) - 복식지구(服飾之具)를 중심(中心)으로 -)

  • Chang, Sook-Whan
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.12 no.1
    • /
    • pp.35-49
    • /
    • 2010
  • The Imwon Sipyukji of this study was compiled by Seo Yugu (1764~1845), a famous agronomical scholar of the late eighteenth century. The contents of this book are divided into sixteen chapters related to all the important parts of rural home life ranging from daily routines to social life covering the agro-industry and the six skills of manners, music, archery, calligraphy, mathematics and horseback riding. Seomyongji, one of the sixteen chapters, covers all that is necessary for living a rural existence such as house-building, clothing adornments and transportation as well as how to make and use daily household items. The contents of the Boksik Jigu sub-section in the Sumyongji chapter consist of eight large units covering men's and women's clothing, bedding and pillows, sewing tools, belt and shoes accompanying the attire and storage for clothes. These eight are further subdivided into 65 items, each warranting a detailed explanation. My study will translate the original Chinese text of Boksik Jigu into Korean. This sub-section in the Seomyongji chapter will facilitate an investigation into the information contained therein on attire and accessories.

  • PDF

A study on the ideological structure of palace space in Josun period (조선시대 궁궐공간의 관념적 구성에 관한 연구)

  • 김영모;최기수
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.25 no.4
    • /
    • pp.141-157
    • /
    • 1998
  • It has been general view to Josun palace space that the Kyongbok palace, a main palace, is arranged with symmetric geometrical composition principles and, unlikely main palace, sub-palaces such as Changduk, Changkyong and Kyonghee palace are placed in organic structure adapted to natural land form. With that view, there are no common factors to be considered between these palace, main and sub palace, in composting principles of the space. In this study, because of same ideological period, although there is external difference of that palaces, that common ideological principles are projected to these two palaces types through compositing space is assumed. On this hypostasis, this study has been focused on finding the ideological principles projected to these palace space commonly. As result of study, some of them are considered as common principles; Firstly, they are arranged in the text of contents through the way of naming to building, enterence and so on. The second point is ; it is viewed that the Oheung and symmetric arrangement method based on Oheung are used in compositing of palace space. The third is ; through analizing central space of Kyongbok palace, it is analized that oneness composition principles, which are based on the theory of Umyangheong, are projected to different palace space commonly.

  • PDF

The Block Segmentation and Extraction of Layout Information In Document (문서의 영역분리와 레이아웃 정보의 추출)

  • 조용주;남궁재찬
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.17 no.10
    • /
    • pp.1131-1146
    • /
    • 1992
  • In this paper, we suggest a new algorithm applied to the segmentation of published documents to obtain constituent and layout information of document. Firstly, we begin the process of blocking and labeling on a 300dpi scanned document. Secondly, we classify the blocked document by individual sub-regions. Thirdly, we group sub-regions into graphic areas and text areas. Finally, we extract information for layout recognition by using the data. From an experiment on papers of an academic society, we obtain the above 98% of region classification rate and extraction rate of information for the layout recognition.

  • PDF