• Title/Abstract/Keywords: Keyword Trends

Search results: 446 items (processing time 0.025 s)

A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model

  • 조원진;노상규;윤지영;박진수
    • Asia pacific journal of information systems / Vol. 21, No. 1 / pp. 103-122 / 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful. For this reason, some online documents are accompanied by a list of keywords specified by the authors to guide users and facilitate the filtering process. A set of keywords is thus often considered a condensed version of the whole document and plays an important role in document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could also benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, the implementation itself is the obstacle: manually assigning keywords to all documents is a daunting, even impractical, task because it is extremely tedious and time-consuming and requires a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are two main approaches to achieving this aim: the keyword assignment approach and the keyword extraction approach. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given vocabulary, and the aim is to match its terms to the texts. In other words, the keyword assignment approach seeks to select the words from a controlled vocabulary that best describe a document.
Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. In the latter approach, on the other hand, the aim is to extract keywords according to their relevance in the text without a prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted using supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document into positive or negative examples. Several systems, such as Extractor and Kea, were developed using the keyword extraction approach. The most indicative words in a document are selected as keywords for that document; as a result, keyword extraction is limited to terms that appear in the document. Therefore, keyword extraction cannot generate implicit keywords that are not included in a document. According to Turney's experimental results, about 64% to 90% of keywords assigned by the authors can be found in the full text of an article. Conversely, this means that 10% to 36% of the keywords assigned by the authors do not appear in the article and cannot be generated by keyword extraction algorithms. Our preliminary experiment also shows that 37% of keywords assigned by the authors are not included in the full text. This is why we decided to adopt the keyword assignment approach. In this paper, we propose a new approach to automatic keyword assignment, namely IVSM (Inverse Vector Space Model). The model is based on the vector space model, a conventional information retrieval model that represents documents and queries as vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets.
The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating the keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: an IVSM system for a Web-based community service and a stand-alone IVSM system. The first is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers and has been tested on a number of academic papers, including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. In our experiment, the precisions of IVSM applied to the Web-based community service and to academic journals were 0.75 and 0.71, respectively. Both systems perform much better than baseline systems that generate keywords based on simple probability. IVSM also shows performance comparable to that of Extractor, a representative system of the keyword extraction approach developed by Turney. As the number of electronic documents increases, we expect that the IVSM proposed in this paper can be applied to many electronic documents in Web-based communities and digital libraries.
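The five steps listed in this abstract can be illustrated with a minimal Python sketch. The abstract does not say how a keyword set is represented, so the sketch assumes each candidate keyword comes with a weighted term vector; the `KEYWORD_VECTORS` data, the `tokenize` preprocessing, and all function names are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter

# Hypothetical controlled vocabulary: each candidate keyword is described by
# a weighted term vector (assumed to come from training documents).
KEYWORD_VECTORS = {
    "information retrieval": {"search": 3.0, "query": 2.0, "documents": 1.0},
    "logistics": {"port": 3.0, "shipping": 2.0, "cargo": 1.0},
    "text mining": {"documents": 2.0, "keywords": 2.0, "text": 1.0},
}

def tokenize(text):
    """Step 2: lowercase and split into alphabetic tokens (a stand-in for real preprocessing)."""
    return re.findall(r"[a-z]+", text.lower())

def norm(vec):
    """Steps 1 and 3: Euclidean length of a sparse vector stored as {term: weight}."""
    return math.sqrt(sum(w * w for w in vec.values()))

def cosine(a, b):
    """Step 4: cosine similarity between two sparse vectors (0.0 if either is empty)."""
    na, nb = norm(a), norm(b)
    if na == 0 or nb == 0:
        return 0.0
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    return dot / (na * nb)

def assign_keywords(document, keyword_vectors, top_n=2):
    """Step 5: return up to top_n keywords with the highest non-zero similarity."""
    doc_vec = Counter(tokenize(document))  # term-frequency vector of the target document
    scored = [(cosine(doc_vec, vec), kw) for kw, vec in keyword_vectors.items()]
    scored.sort(key=lambda s: (-s[0], s[1]))  # best score first, ties broken by name
    return [kw for score, kw in scored[:top_n] if score > 0]

if __name__ == "__main__":
    doc = "Search engines return documents for each query."
    print(assign_keywords(doc, KEYWORD_VECTORS))
```

In this toy run the phrase "information retrieval" never occurs in the document text yet can still be assigned, which is the kind of implicit keyword the abstract says extraction-based methods cannot produce.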

Research on major technology trends in the field of financial security through Korea and foreign patent data analysis

  • 채호근;이주연
    • 디지털융복합연구 / Vol. 18, No. 6 / pp. 53-63 / 2020
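  • With the rapid spread of information and communication media such as the Internet, smart devices, and IoT, electronic financial transactions are growing quickly, but as a by-product, threats to financial security such as personal information leaks and hacking are also increasing. The importance of financial security is therefore rising, yet Korea's financial security technology remains relatively weak compared with that of leading countries in the field; for example, Active-X is still in use. This study compares major technology trends using IPC classification frequency analysis, keyword frequency analysis, and keyword network analysis of Korean and foreign patent data related to financial security, in order to suggest the main directions for developing the field in Korea. In conclusion, recent domestic and foreign trends appear to focus on developing technologies related to smart-device-based electronic financial services. Going forward, by analyzing paper data, which reflect research that precedes commercialization-stage technology, and mapping financial security research trends onto the technology trend results, the findings are intended to serve as base data for financial security technology development.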

A Research Analysis of QR code based on big data in Korea

  • Lee, Eun-ji;Kim, Soo Kyun
    • 한국컴퓨터정보학회논문지 / Vol. 26, No. 9 / pp. 189-200 / 2021
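  • Information technology and smartphone technology have been developing rapidly. As data has grown, we have entered the era of big data. With the recent arrival of the contactless ("untact") era, QR codes have become closely woven into daily life. The purposes of this study are, first, to review previous research on "QR Code" and analyze keywords by field; second, to conduct word cloud analysis and network analysis on frequently occurring "QR Code" keywords for data visualization from a big data perspective; and third, to suggest research directions on "QR Code" for future researchers. The analysis showed, first, that research is on an increasing trend and that QR codes are used in diverse fields. Second, the frequent-keyword analysis yielded broadly similar results overall, with some differences by field and by year. Third, the visualization of frequent keywords agreed with the frequent-keyword analysis. The practical implications of these findings are as follows. First, "QR Code" needs to be studied as a means of conveying information rather than from a technical perspective. Second, "QR Code" is developing in ways that reflect social trends and issues. Through these theoretical and practical implications, we aim to offer strategic direction for QR codes.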

Analysis of Research Trends in Tax Compliance using Topic Modeling

  • 강민조;백평구
    • 한국콘텐츠학회논문지 / Vol. 22, No. 1 / pp. 99-115 / 2022
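  • The purpose of this study is to broaden the horizon of taxation studies as a convergent discipline by organizing the flow of research on tax compliance, taxpayer awareness, and faithful tax payment (hereafter "tax compliance"), a representative research topic in taxation that is pursued across the social sciences. To analyze Korean journal articles on tax compliance comprehensively from an interdisciplinary perspective, topic modeling was applied as a form of text mining. Following a flow of data collection, keyword preprocessing, and topic model analysis, the study sought to identify latent research topics from the tax compliance keywords that authors registered for a total of 347 articles. First, in the keyword analysis, keywords such as tax audit, tax avoidance, and the faithful-filing confirmation system ranked among the top five by raw frequency and also among the top five by TF-IDF value, which accounts for a keyword's relative importance. The keyword tax evasion, by contrast, did not stand out by raw frequency but ranked among the top keywords by TF-IDF value. Second, topic modeling yielded eight latent research topics: (1) tax fairness and the deterrence of tax offenses; (2) the ideals of tax law and the validity of tax policy; (3) the substance-over-form taxation principle and the securing of tax claims; (4) tax compliance costs and tax administration services; (5) the self-assessment system and tax professionals; (6) tax climate and strategic tax behavior; (7) the multifaceted nature of tax behavior and differential compliance intentions; and (8) tax information systems and efficient management of tax sources. By surveying, across disciplinary boundaries, the diverse perspectives on the topic of tax compliance, this study sought to create opportunities for interdisciplinary communication and to offer practical implications for building a rational tax system.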
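The contrast this abstract draws between raw frequency and TF-IDF can be shown with a small Python sketch. The abstract does not give the exact weighting formula, so the sketch assumes one common variant (term frequency within a paper's keyword list times log(N/df), summed over papers). The keyword lists are invented toy data and do not reproduce the paper's results.

```python
import math
from collections import Counter

def raw_frequency(docs):
    """Count how many times each keyword occurs across all keyword lists."""
    return Counter(kw for doc in docs for kw in doc)

def tfidf_totals(docs):
    """Sum TF-IDF per keyword over all documents.

    Assumed variant: tf = count / list length, idf = ln(N / df).
    """
    n = len(docs)
    df = Counter(kw for doc in docs for kw in set(doc))  # document frequency
    totals = Counter()
    for doc in docs:
        for kw, count in Counter(doc).items():
            tf = count / len(doc)
            totals[kw] += tf * math.log(n / df[kw])
    return totals

# Hypothetical author-keyword lists, one list per paper.
DOCS = [
    ["compliance", "penalty", "fairness"],
    ["compliance", "penalty", "trust"],
    ["compliance", "trust", "fairness"],
    ["compliance", "amnesty"],
]

if __name__ == "__main__":
    print(raw_frequency(DOCS).most_common())
    print(sorted(tfidf_totals(DOCS).items(), key=lambda kv: -kv[1]))
```

With this toy data, "amnesty" is the least frequent keyword but gets the highest TF-IDF total, while "compliance", which appears in every list, drops to zero. This is the same kind of reordering the abstract reports for a keyword that was not prominent by raw frequency.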

An analysis of domestic research trends on elderly environment planning

  • 이연숙;이소영;김미선;이정화;곽윤정
    • KIEAE Journal / Vol. 7, No. 2 / pp. 77-85 / 2007
  • Korean society is expected to become an aged society more rapidly than any other country because of its low birthrate and rising life expectancy. The growing number of elderly people and the social problems of an aging society have prompted an increase in research on elderly environments. Elderly housing facilities and living conditions are significantly related to the quality of life of older persons. The purpose of this study is to systematically analyze empirical studies on the physical environments of the elderly in Korea, to identify research streams and understand their social background, and to suggest future research problems. For this study, content analysis was conducted. Articles from four peer-reviewed academic journals published from 1986 to 2005 were the units of analysis. The articles were systematically selected by keyword search in library database systems. As a result, research trends were defined across 4 periods. The major trends that appeared were growth in the quantity of research, expansion to interior design features for older persons, and more facility types for dependent elderly (assisted living facilities, facilities for elderly with dementia, long-term care facilities). These results offer some directions and implications for elderly facility planning and development.

Research Trends of Oral History in Korea: Focusing on Domestic Academic Journals

  • 이재영;정연경
    • 한국기록관리학회지 / Vol. 18, No. 3 / pp. 25-47 / 2018
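  • This study analyzed trends in Korean oral history research from 1991 to the present in order to map the research landscape surrounding oral history and to provide baseline data for the direction of future oral history research. To this end, a keyword search for "oral history" (구술사) was run in the Research Information Sharing Service (RISS) of the Korea Education and Research Information Service (KERIS), and a total of 439 Korean oral history research articles from 1991 to 2018 were selected for study. These were analyzed from multiple angles by period, in terms of article output and major journals, researchers and research funding, research fields and content, and international cooperation. Finally, implications were drawn from the results, and directions for follow-up research were suggested.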

Overseas Research Trends Related to 'Research Ethics' Using LDA Topic Modeling

  • YANG, Woo-Ryeong;YANG, Hoe-Chang
    • 연구윤리 / Vol. 3, No. 1 / pp. 7-11 / 2022
  • Purpose: The purpose of this study is to derive clues about the development direction of research ethics, which has recently become a social issue in Korea, and its areas of interest by examining overseas research trends. Research design, data and methodology: We collected 2,760 articles from scienceON that include 'research ethics' in the paper. For analysis, frequency analysis, word clouds, keyword association analysis, and LDA topic modeling were used. Results: Many of the papers were published in medical, bio, pharmaceutical, and nursing journals, and interest in the topic has been increasing continuously. Word frequency analysis confirmed many words from medical fields, such as health, clinical, and patient. Topic modeling extracted 7 topics, such as ethical policy development and human clinical ethics. Conclusions: We found that overseas research trends on research ethics relate to more basic aspects than those in Korea. This means that a fundamental approach to ethics and the application of strict standards can become the basis for cultivating overall ethical awareness. Therefore, academic discussions are needed on the application of strict standards for publishing ethics and for conducting research in various fields where community awareness and social consensus are necessary for overall ethical awareness.

Analysis of University Unification Education Research Trends Using Text Network Analysis and Topic Modeling

  • Do-Young LEE
    • 웰빙융합연구 / Vol. 6, No. 4 / pp. 27-31 / 2023
  • Purpose: This study analyzed papers identified by entering the two keywords 'unification education' and 'university' during research from 2013 to 2022 in order to identify trends and key concepts in unification education research at domestic universities. Research design, data, and methodology: The study analyzed 224 papers, excluding those on primary, middle, and high school unification education, as well as unrelated and duplicate papers. The analysis included developing a co-occurrence network of keywords, utilizing topic modeling to categorize research types, and confirming visualizations such as word clouds and sociograms. Results: In the final analysis, the research identified 1,500 keywords, with notable ones like 'Korea,' 'education,' 'unification.' Centrality analysis, measuring influence through connected keywords, revealed that 'Korea,' 'education,' 'north,' and 'unification' held significant positions. Keywords with high centrality compared to their frequency included 'learning,' 'development,' 'training,' 'peace,' and 'language,' in that order. Conclusions: This study investigated trends and structures in university-level unification education by analyzing papers identified with the keywords 'unification education' and 'university.' The use of keyword network analysis aimed to elucidate patterns and structures in university-level unification education. The significance of the study lies in offering foundational data for future research directions in the field of unification education at universities.
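The co-occurrence network and centrality analysis described in this abstract can be sketched in a few lines of Python. The abstract does not name the centrality measure used, so degree centrality (the share of other keywords a keyword is directly linked to) is assumed here. The keyword lists and function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

def cooccurrence_network(docs):
    """Build weighted undirected edges: two keywords are linked each time
    they appear in the same paper's keyword list."""
    edges = defaultdict(int)
    for doc in docs:
        for a, b in combinations(sorted(set(doc)), 2):
            edges[(a, b)] += 1
    return edges

def degree_centrality(edges):
    """Degree centrality: number of distinct neighbours divided by (n - 1)."""
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(nb) / (n - 1) for node, nb in neighbors.items()}

# Hypothetical keyword lists, one per paper.
DOCS = [
    ["education", "unification", "peace"],
    ["education", "korea"],
    ["education", "unification"],
    ["peace", "language"],
]

if __name__ == "__main__":
    net = cooccurrence_network(DOCS)
    for node, score in sorted(degree_centrality(net).items(), key=lambda kv: -kv[1]):
        print(f"{node}: {score:.2f}")
```

In this toy network "peace" occurs in only two lists but is as central as "education", which occurs in three. That illustrates how a keyword can have high centrality relative to its frequency.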

A Study on the Change of Tourism Marketing Trends through Big Data

  • Se-won Jeon;Gi-Hwan Ryu
    • International journal of advanced smart convergence / Vol. 13, No. 2 / pp. 166-171 / 2024
  • Recently, there has been an increasing trend in the role of social media in tourism marketing. We analyze changes in tourism marketing trends using tourism marketing keywords through social media networks. The aim is to understand marketing trends based on the analyzed data and effectively create, maintain, and manage customers, as well as efficiently supply tourism products. Data was collected using web data from platforms such as Naver, Google, and Daum through TexTom. The data collection period was set for one year, from December 1, 2022, to December 1, 2023. The collected data, after undergoing refinement, was analyzed as keyword networks based on frequency analysis results. Network visualization and CONCOR analysis were conducted using the Ucinet program. The top words in frequency were 'tourists,' 'promotion,' 'travel,' and 'research.' Clusters were categorized into four: tourism field, tourism products, marketing, and motivation for visits. Through this, it was confirmed that tourism marketing is being conducted in various tourism sectors such as MICE, medical tourism, and conventions. Utilizing digital marketing via online platforms, tourism products are promoted to tourists, and unique tourism products are developed to increase city branding and tourism demand through integrated tourism content. We identify trends in tourism marketing, providing tourists with a positive image and contributing to the activation of local tourism.

Patent Analysis for Noise Barrier and Noise Reducing Device

  • 조준호;고효인;김흥섭
    • 한국철도학회:학술대회논문집 / Proceedings of the 2010 Spring Conference of the Korean Society for Railway / pp. 1975-1981 / 2010
  • In this study, patent trends for noise barriers and noise-reducing devices were analyzed to support the development of an adaptive noise barrier suited to the transmission characteristics of railway noise. Using a patent search engine, a keyword search was performed for Korean patents after 1980. The details of the first 667 patents were reviewed to extract core (i.e., key) patents. From this review, 70 patents were finally compiled into a database. From the analysis of these core patents, system requirements for the development of a noise-reducing device were obtained.
