• Title/Summary/Keyword: 텍스트 연구

Search Result 3,471, Processing Time 0.032 seconds

A Named Entity Recognition Model in Criminal Investigation Domain using Pretrained Language Model (사전학습 언어모델을 활용한 범죄수사 도메인 개체명 인식)

  • Kim, Hee-Dou;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.2
    • /
    • pp.13-20
    • /
    • 2022
  • This study is to develop a named entity recognition model specialized in criminal investigation domains using deep learning techniques. Through this study, we propose a system that can contribute to analysis of crime for prevention and investigation using data analysis techniques in the future by automatically extracting and categorizing crime-related information from text-based data such as criminal judgments and investigation documents. For this study, the criminal investigation domain text was collected and the required entity name was newly defined from the perspective of criminal analysis. In addition, the proposed model applying KoELECTRA, a pre-trained language model that has recently shown high performance in natural language processing, shows performance of micro average(referred to as micro avg) F1-score 98% and macro average(referred to as macro avg) F1-score 95% in 9 main categories of crime domain NER experiment data, and micro avg F1-score 98% and macro avg F1-score 62% in 56 sub categories. The proposed model is analyzed from the perspective of future improvement and utilization.

Korean Information Summary System for National R&D Projcet Information Summary (국가R&D과제정보 요약을 위한 한국어 정보요약 시스템)

  • Lee, Jong-Won;Kim, Tae-Hyun;Shin, Dong-Gu;Jo, Woo-Seung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.72-74
    • /
    • 2022
  • The National Science and Technology Knowledge Information Service (NTIS) provides information on national R&D projects. Project information consists of meta-information such as 'project name', 'project performance institution', 'research manager name', and text explaining projects such as 'research goal', 'research content', and 'expected effect'. There is a problem that it takes a lot of time to find the desired project information by checking all of the "research goals" or "research contents" in the list of results of searching for 1 million project information. To solve this problem, this paper proposes a project information summary system that summarizes the parts consisting of long texts within the national R&D project information. By analyzing the linguistic characteristics of the Korean language, a preprocessor was built and a project information summary model based on natural language processing technology was developed to process preprocessed text information. Through this, project information composed of long sentences is provided in a compressed and summarized form, which will help users to easily and quickly infer the overall content with the summary information alone.

  • PDF

The Design and Evaluation of a Diagonally Splitted Column to Improve Text Readability on a Small Screen (소형 스크린 상에서의 텍스트 가독성 향상을 위한 대각분할 칼럼 디자인과 평가)

  • Kim Yeon-Ji;Lee Woo-Hun
    • Archives of design research
    • /
    • v.19 no.4 s.66
    • /
    • pp.51-60
    • /
    • 2006
  • Nowadays, reading text from screens is prevailing in everyday life. The advent of mobile information devices such as a cellular phone, PDA, and e-book reader facilitates us to enjoy various text-based contents any time and anywhere. Most studies comparing screen and paper readability show that screens are less readable than paper. Furthermore, the decrease of line length and number of lines that can be displayed on the screen of mobile information devices deteriorate text readability. This study investigated parameters affecting text readability on small screens and designed a new text layout to improve readability. We suggested a diagonally splitted layout of rectangular column, which is supposed to facilitate eye movement to trace text flow with ease. The experiment comparing readability between a traditional rectangular column and a diagonally splitted column was conducted. The result of experiment revealed that there is no significant difference between the two text layouts in terms of subjective satisfaction of reading task and a level of comprehension. However, in the screen size of $4000mm^2\;and\;8000mm^2$, reading speed was increased 18.9% and 34.0% respectively from a traditional rectangular column to a diagonally splitted column. We conducted a consecutive experiment to scrutinize the cause that improved the performance in readability task remarkably. The readability of text in a traditional rectangular column was compared with a left triangular column and a right triangular column in the condition of $4000mm^2/3:1$ ratio screen. The performance measurements revealed that participants read 21.1% and 67.6% faster respectively with the left triangular column and right triangular column than with the rectangular column. In consequence, the improvement of readability in the diagonally splitted column was attributed mainly to the increase of reading speed in the right triangular column. This research verified that the diagonally splitted column improve text readability on a small screen and this result is expected to make a contribution to designing an efficient text layout for mobile information devices

  • PDF

A Study on the Textuality of China's Wuyi-Gugok, the Origin of Gugok-Wonlim -Focus on the Tradition Process to Korea - (구곡원림의 원류, 중국 무이구곡(武夷九曲)의 텍스트성 -국내 전승(傳承) 과정을 중심으로 -)

  • Rho, Jae-Hyun
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.36 no.6
    • /
    • pp.66-80
    • /
    • 2009
  • This paper attempts to investigate how the cultural phenomena associated with 'Wuyi-Doga(武夷棹歌)' and 'Wuyi-Gugok (武夷九曲)' was introduced to Joseon. The icon and code of 'Gugok' cultural text which was observed in the process of transmitting the culture through repetition and imitation were examined. With regard to research methodology, an 'analysis and discussion framework' was designed based on the literature review, field survey and the seven textuality criteria proposed by Dressier. Then the textuality of 'Wuyi-Gugok' was analyzed in terms of the dependent relation of text, the relationship between the creator and user, repetition, imitation and the spread process. Since ZhouHee(朱熙)'s 'Wuyi-Doga' and 'Wuyi-Gugok' were introduced to Joseon through literature and paintings, they became a part of the cultural Phenomena with unprecedented popularity. As a result, a great number of imitations can be found. In addition, governors would even take care of political affairs in a scenic mountain valley as described in this literature. Regardless of the writer's intentiot 'Gugok' settled in Joseon as new culture in harmony with Taoism and Sung COnfucianism. In other words, Joseon's Gugok-Wonlim(九曲園林) accepted the nature-appreciation aesthetic consciousness in 'Wuyi-Doga' and 'Wuyi-Gugok' on the basis of Taoism and Sung Confucianism. In terms of the text-based dependent relation only, however, the geographical coherence was somewhat loosened while the Gugok Culture that was dependent on Taoism or elegance in life dominated the internal structure of the textuality. Meantime, the internal factors that dominated the textuality of 'Wdyi-Gugok' were interpreted as 1) 'Aesthetics of Bending, Water Whirls', 2) 'Territoriality Expression Carve letters,' 3) 'Cultural Landscape seeing through the Speculation of Meaning,' 4) 'The Pursuit of Oddness and Presentationism' and 5) 'Transcendental Landscape of Taoism and Topos.'

Analyzing the Study Trends of 'Sense of Place' Using Text Mining Techniques (텍스트마이닝 기법을 활용한 국내외 장소성 관련 연구동향 분석)

  • Lee, Ina;Kim, Hea-Jin
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.30 no.2
    • /
    • pp.189-209
    • /
    • 2019
  • Main Path Analysis (MPA) is one of the text mining techniques that extracts the core literature that contributes knowledge transfer based on citation information in the literature. This study applied various text mining techniques to abstract of the paper related with sense-of-place, which is published at Korea and abroad from 1990 to 2018 so that could discuss in a macro perspective. The main path analysis results showed that from 1990, overseas research on sense-of-place has been carried out in the order of personal identity, public land management, environmental education and urban development-related areas. Also, by using the network analysis, this study found that sense-of-place was discussed at various levels in Korea, including urban development, culture, literature, and history. On the other hand, it has been found that there are few topic changes in international studies, and that discussions on health, identity, landscape and urban development have been going on steadily since the 1990s. This study has implications that it presents a new perspective of grasping the overall flow of relevant research.

Understanding the Evaluation of Quality of Experience for Metaverse Services Utilizing Text Mining: A Case Study on Roblox (텍스트마이닝을 활용한 메타버스 서비스의 경험 품질 평가의 이해: 로블록스 사례 연구)

  • Minjun Kim
    • Journal of Service Research and Studies
    • /
    • v.13 no.4
    • /
    • pp.160-172
    • /
    • 2023
  • The metaverse, derived from the fusion of "meta" and "universe," encompasses a three-dimensional virtual realm where avatars actively participate in a range of political, economic, social, and cultural activities. With the recent development of the metaverse, the traditional way of experiencing services is changing. While existing studies have mainly focused on the technological advancements of metaverse services (e.g., scope of technological enablers, application areas of technologies), recent studies are focusing on evaluating the quality of experience (QoE) of metaverse services from a customer perspective. This is because understanding and analyzing service characteristics that determine QoE from a customer perspective is essential for designing successful metaverse services. However, relatively few studies have explored the customer-oriented approach for QoE evaluation thus far. This study conducted an online review analysis using text mining to overcome this limitation. In particular, this study analyzed 227,332 online reviews of the Roblox service, known as a representative metaverse service, and identified points for improving the Roblox service based on the analysis results. As a result of the study, nine service features that can be used for QoE evaluation of metaverse services were derived, and the importance of each feature was estimated through relationship analysis with service satisfaction. The importance estimation results identified the "co-experience" feature as the most important. These findings provide valuable insights and implications for service companies to identify their strengths and weaknesses, and provide useful insights to gain an advantage in the changing metaverse service environment.

The Effects of Cultural Factors in Tourists' Restaurant Satisfaction: Using Text Mining and Online Reviews (문화적 요인이 관광객의 음식점 만족도에 미치는 영향: 텍스트 마이닝과 온라인 리뷰를 활용하여)

  • Jiajia Meng;Gee-Woo Bock;Han-Min Kim
    • Information Systems Review
    • /
    • v.25 no.1
    • /
    • pp.145-164
    • /
    • 2023
  • The proliferation of online reviews on dining experiences has significantly affected consumers' choices of restaurants, especially overseas. Food quality, service, ambiance, and price have been identified as specific attributes for the choice of a restaurant in prior studies. In addition to these four representative attributes, cultural factors, which may also significantly impact the choice of a restaurant for tourists, in particular, have not received much attention in previous studies. This study employs the text mining technique to analyze over 10,000 online reviews of 76 Korean restaurants posted by Chinese tourists on dianping.com to explore the influence of cultural factors on the consumer's choice of restaurants in the overseas travel context. The findings reveal that "Hallyu (Korean Wave)" influences Chinese tourists' dining experiences in Korea and their satisfaction. Moreover, Korean food-related words, such as cold noodle, bibimbap, rice cake, pig trotters, and kimchi stew, appeared across all the review topics. Our findings contribute to the existing tourism and hospitality literature by identifying the critical role of cultural factors on consumers', especially tourists', satisfaction with the choice of a restaurant using text mining. The findings also provide practical guidance to restaurant owners in Korea to attract more Chinese tourists.

Text Mining-Based Emerging Trend Analysis for the Aviation Industry (항공산업 미래유망분야 선정을 위한 텍스트 마이닝 기반의 트렌드 분석)

  • Kim, Hyun-Jung;Jo, Nam-Ok;Shin, Kyung-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.65-82
    • /
    • 2015
  • Recently, there has been a surge of interest in finding core issues and analyzing emerging trends for the future. This represents efforts to devise national strategies and policies based on the selection of promising areas that can create economic and social added value. The existing studies, including those dedicated to the discovery of future promising fields, have mostly been dependent on qualitative research methods such as literature review and expert judgement. Deriving results from large amounts of information under this approach is both costly and time consuming. Efforts have been made to make up for the weaknesses of the conventional qualitative analysis approach designed to select key promising areas through discovery of future core issues and emerging trend analysis in various areas of academic research. There needs to be a paradigm shift in toward implementing qualitative research methods along with quantitative research methods like text mining in a mutually complementary manner. The change is to ensure objective and practical emerging trend analysis results based on large amounts of data. However, even such studies have had shortcoming related to their dependence on simple keywords for analysis, which makes it difficult to derive meaning from data. Besides, no study has been carried out so far to develop core issues and analyze emerging trends in special domains like the aviation industry. The change used to implement recent studies is being witnessed in various areas such as the steel industry, the information and communications technology industry, the construction industry in architectural engineering and so on. This study focused on retrieving aviation-related core issues and emerging trends from overall research papers pertaining to aviation through text mining, which is one of the big data analysis techniques. In this manner, the promising future areas for the air transport industry are selected based on objective data from aviation-related research papers. In order to compensate for the difficulties in grasping the meaning of single words in emerging trend analysis at keyword levels, this study will adopt topic analysis, which is a technique used to find out general themes latent in text document sets. The analysis will lead to the extraction of topics, which represent keyword sets, thereby discovering core issues and conducting emerging trend analysis. Based on the issues, it identified aviation-related research trends and selected the promising areas for the future. Research on core issue retrieval and emerging trend analysis for the aviation industry based on big data analysis is still in its incipient stages. So, the analysis targets for this study are restricted to data from aviation-related research papers. However, it has significance in that it prepared a quantitative analysis model for continuously monitoring the derived core issues and presenting directions regarding the areas with good prospects for the future. In the future, the scope is slated to expand to cover relevant domestic or international news articles and bidding information as well, thus increasing the reliability of analysis results. On the basis of the topic analysis results, core issues for the aviation industry will be determined. Then, emerging trend analysis for the issues will be implemented by year in order to identify the changes they undergo in time series. Through these procedures, this study aims to prepare a system for developing key promising areas for the future aviation industry as well as for ensuring rapid response. Additionally, the promising areas selected based on the aforementioned results and the analysis of pertinent policy research reports will be compared with the areas in which the actual government investments are made. The results from this comparative analysis are expected to make useful reference materials for future policy development and budget establishment.

A study of the creation mechanism of exclusion against the immigrant (이주민 배제 생성 기제에 대한 연구 -상층부 연구접근-)

  • Kim, Young Sook
    • Korean Journal of Social Welfare Studies
    • /
    • v.44 no.2
    • /
    • pp.5-33
    • /
    • 2013
  • This study is to analyze the creation mechanism of exclusion and discrimination against the immigrant. The author approached studying up and used life history study method. Ten of anti-multiculturists participated this study. Data were collected by in-depth interview. The text of individual life history were analyzed by Mandelbaum(1973). The author analyzed the dimension of life, turning point and adaptation. The result as follows; I presented ① Plan of oneness ground, ② Searching new Sacrifice goat, ③ Transference of a inferiority complex for the mechanism of exclusion and discrimination against immigrant. Finally I proposed 「cross cultural education」, 「native participated integration program」, 「establishment of the strongpoint center for adjustment between native and immigrant and up bringing the professional in community.

Audience Cognitive Reconstruction of the Extended Meaning of Complex Mechanism Text : For Communication Education using Story Media Expressions (복합기제 텍스트의 확장 의미에 대한 수용자의 인지적 재구성 : 서사적 미디어 표현을 활용한 의사소통 교육을 위해)

  • Lim, Ji-Won
    • Journal of Korea Entertainment Industry Association
    • /
    • v.15 no.7
    • /
    • pp.137-143
    • /
    • 2021
  • This discussion can be said to be a qualitative study on the possibility of linking communication education for college students and literacy education for Korean language-linked educators based on the theory of interpretation of cognitive meaning of media text containing complex mechanisms. The implicit meaning of media content expression used as an interactive communication strategy will be accepted as a multilateral interpretation according to the individual learner's cognitive environment. If so, how is the general media content meaning intended by the content creator being accepted? These doubts are the starting point for discussion. To solve the problem, I leaned on the experimental pragmatic methodology of cognitive aesthetics and applied a model of relevance of cognitive linguistics to connect learners' creative cognitive environment and present content to find a contrast. As a result of the discussion, it was possible to establish a basic framework for learners to express their subjectivity and creative thinking that could connect the cognitive environment and present content themselves. In particular, active and positive learners also revealed direct descriptive expressions to build a new cognitive environment, such as suggesting a third alternative to argue the ability to question produced media texts and the validity of the meaning implied in the text. In the future, since media text containing complex mechanisms is an indirect and persuasive communication behavior that occurs easily through various media in modern society, the universal communication principle of reliable conversation between media text creators and audiences should exist.