• Title/Summary/Keyword: 주제어 파악

Search Result 158, Processing Time 0.027 seconds

Cloud storage-based intelligent archiving system applying automatic document summarization (문서 자동요약 기술을 적용한 클라우드 스토리지 기반 지능적 아카이빙 시스템)

  • Yoo, Kee-Dong
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.17 no.3
    • /
    • pp.59-68
    • /
    • 2012
  • Zero client-based cloud storage technology is gaining much interest as a tool to centralized management of organizational documents nowadays. Besides the well-known cloud storage's defects such as security and privacy protection, users of the zero client-based cloud storage point out the difficulty in browsing and selecting the storage category because of its diversity and complexity. To resolve this problem, this study proposes a method of intelligent document archiving by applying an algorithm-based automatic topic identification technology. Without user's direct definition of category to store the working document, the proposed methodology and prototype enable the working documents to be automatically archived into the predefined categories according to the extracted topic. Based on the proposed ideas, more effective and efficient centralized management of electronic documents can be achieved.

Investigating Major Topics Through the Analysis of Depression-related Facebook Group Posts (페이스북 그룹 게시물 분석을 통한 우울증 관련 주제에 대한 고찰)

  • Zhu, Yongjun;Kim, Donghun;Lee, Changho;Lee, Yongjeong
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.53 no.4
    • /
    • pp.171-187
    • /
    • 2019
  • The study aims to analyze the posts of depression-related Facebook groups to understand major topics discussed by group users. Specifically, the purpose of the study is to identify the topics and keywords of the posts to understand what users discuss about depression. Depression is a mental disorder that is somewhat sensitive in the online community, which is characterized by accessibility, openness and anonymity. The researchers have implemented a natural language-based data analysis framework that includes components ranging from Facebook data collection to the automated extraction of topics. Using the framework, we collected and analyzed 885 posts created in the past one year from the largest Facebook depression group. To derive more complete and accurate topics, we combined both automated and manual (e.g., stop words removal, topic size determination) methods. Results indicate that users discuss a variety of topics including depression in general, human relations, mood and feeling, depression symptoms, suicide, medical references, family and etc.

Accessibility of Electronic Publishing Technology Trends (전자출판의 접근성 관련 기술 동향 분석 - 특허정보분석을 중심으로-)

  • Jang, Bo-Seong;Nam, Young-Jun
    • Proceedings of the Korean Society for Information Management Conference
    • /
    • 2014.08a
    • /
    • pp.115-118
    • /
    • 2014
  • 본 연구는 특허정보의 제목 및 초록에 나타난 장애인을 위한 전자출판의 접근성 기술 동향을 분석하고자 관련 유효특허를 수집 분석하였다. 분석내용은 특허 출원 빈도, IPC 주제분류, 전자출판 접근성 기술 분류 기준에 의한 세부 기술별 출원 동향, 주제어 출현 빈도 등을 파악하였다. 연구결과 장애인의 전자출판 접근성 분야는 출원 건수가 증가하는 전형적인 성장기 단계이며, IPC 주제분류는 섹션G(물리학), 섹션H(전기)영역에서 주로 특허가 출원되고 있으며 세부 기술별 출원 동향은 문자를 점자로 변환 압축 및 관련 보조공학기기 개발 특허 등이 중심적으로 나타났다. 출현 빈도수를 기준으로 한 상위 주제어는 braille, display, device, tactile, character, control 순으로 나타났다.

  • PDF

A Study on the Service Innovation using SNS (SNS를 이용한 서비스 혁신 방법에 관한 연구)

  • Lee, Jong-Chan;Lee, Won-Young
    • Journal of IKEEE
    • /
    • v.20 no.3
    • /
    • pp.235-240
    • /
    • 2016
  • In this study, we use the data collected from Twitter, as an SNS(Social Networking Service), for service innovation. This data was collected and processed by Flume. The data set in May 2016 was 4,766 and 15,543 from company S and company X, respectively. We were able to figure out the emotional atmosphere of the two companies through the sentiment analysis(SA) and to find out about the vertical relationship through the bibliometric analysis(BA). Furthermore, we were able to grasp the horizontal relationship through the social network analysis(SNA). It was concluded that SNS was worth while to derive an innovative item.

A Study on the Social and Cultural Characteristics of Web Queries (웹 검색질의어 분석을 통한 사회·문화적 특성에 관한 연구)

  • Kim, Seong-Hee
    • Journal of Information Management
    • /
    • v.42 no.4
    • /
    • pp.155-174
    • /
    • 2011
  • This study aims to focus on classifying the search engine queries according to web query topic and the different user intents behind web queries. First, we classified 10,000 web query data set by topic. The results showed that there was significant differences in interesting topics across time. Also, we categorized 500 popular queries in web search engine as informational, navigational, or transactional. As a result, 82 percent of web queries are informational in nature, with about 10.8 percent for navigational and 7.2 percent for transactional. This results will help establish the policy to provide internet contents based on user's intent and also find out the social and cultural characteristics.

A Study on the Research Trends in International Trade using Social Network Analysis (사회연결망 분석을 활용한 무역 분야 연구동향 분석)

  • Lee, Jee-Hoon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.11
    • /
    • pp.465-476
    • /
    • 2020
  • This study used social network analysis to analyze trends and the knowledge structure of research in international trade. To this end, 4,840 keywords were extracted and analyzed from 1,797 papers contained in the Journal of Int'l Trade and Industry Studies, the Korea Trade Review, and the Journal of Korea Trade from 2003 to 2019. The results reveal that the distribution of keywords in the trade studies, as with other intellectual networks, followed a power-law distribution. Some differences were observed in the top 20 keywords across journals, with total factor productivity, economic growth, and Korea-US FTA ranking high only in the Journal of Int'l Trade and Industry Studies. Global value chain and trust emerged as a topic that attracted new researchers' attention in the 2011-2019 period. Interest in E-Trade, WTO, and internationalization has declined in recent years. The conventional international trade research trend analyses have predominantly featured qualitative analysis by descriptive method in general, but this study is meaningful in that it employs quantitative analysis using social network analysis techniques.

A Comparative Study on the Structures of Indexing Languages between LC Subject Headings and Thesaurus (LC주제명표목표와 시소러스의 색인어 구조 비교연구)

  • 김주성;김태수
    • Proceedings of the Korean Society for Information Management Conference
    • /
    • 1995.08a
    • /
    • pp.111-114
    • /
    • 1995
  • 전산환경에서 유용한 색인도구로서의 통제어휘집을 구성하는 원칙과 방법을 제시하고자 전조합색인용 통제어휘집인 LC주제명표목표의 표목구조와 후조합색인용 통제어휘집인 시소러스의 용어구조를 비교하였다. 주제명표목표에서 사용되는 도치표목, 전치사로 연결된 표목, 접속사로 연결된 표목, 세목을 가진 표목을 시소러스에서 사용되는 색인구조와 비교분석 하였다. 주제명표목표가 참조구조를 시소러스체제로 변환시켰을 때 나타나는 문제점도 파악하였다.

  • PDF

Research Trend Analysis in Fashion Design Studies in Korea using Topic Modeling (토픽모델링을 이용한 국내 패션디자인 연구동향 분석)

  • Jang, Namkyung;Kim, Min-Jeong
    • Journal of Digital Convergence
    • /
    • v.15 no.6
    • /
    • pp.415-423
    • /
    • 2017
  • This study explored research trends by investigating articles published in the Journal of Korean Society of Fashion Design from 2001 through 2015. English key words and abstracts were analyzed using text mining and topic modeling techniques. The findings are as followings. By the text mining technique, 183 core terms, appeared more than 30 times, were derived from 7137 words used in total 338 articles' key words and abstracts. 'Fashion' and 'design' showed the highest frequency rate. After that, the well-received topic modeling technique, LDA, was applied to the collected data sets. Several distinct sub-research domains strongly tied with the previous fashion design field, except for topics such as fashion brand marketing and digital technology, were extracted. It was observed that there are the growing and declining trends in the research topics. Based on findings, implication, limitation, and future research questions were presented.

Topical Clustering of Documents using Helmholtz Machines with Competitive Units (Competitive Unit을 사용한 Helmholtz Machine에 의한 문서 클러스터링)

  • 장정호;장병탁
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.04b
    • /
    • pp.292-294
    • /
    • 2001
  • 문서 클러스터링은 정보검색 시스템에서 검색과정의 효율성을 향상시키기 위해서 많이 사용된다. 기존의 K-means 클러스터링과 같은 거리-기반 접근 방법은 거리에 대한 척도를 정해야 하는 문제가 있고, 또한 전체 자질 공간에서 지역적 특성에 민감하기 때문에 문서 내에 노이즈가 존재할 경우 만족스러운 결과를 내지 못할 수 있다. 그리고 기본적으로 문서 데이터는 희소성(sparseness)을 가기 때문에 정규 분포를 가정한 mixture 모델을 적용하기도 어려움이 있다. 본 논문에서는 Helmoholtz machine에 의한 문서 클러스터링 방법을 제안한다. 제안되는 방법에서는 하나의 문서를 어떤 내재적인 요인(factor)들의 다양한 결합에 의한 결과로 가정하는데, 이 때의 요인은 주제어 집합 또는 적어도 의미적으로 유사한 단어들의 집합이다. 그리고 기본적으로 Helmholtz machine은 이진 데이터를 다루는데, 텍스트 문서에 나타나는 단어들의 빈도를 고려하기 위해 수정된 Helmholtz machine을 제시한다. TREC-8 adhoe 데이터와 20 Newsgroup 문서 집합에 대한 클러스터링 실험 결과, 제안된 방법이 K-means 알고리즘에 비해 우수한 성능을 보였으며 주제어 추출을 통해 문서 집합의 전체 내용 파악을 용이하게 하는 특성이 있었다.

  • PDF

Keyword Analysis of Two SCI Journals on Rock Engineering by using Text Mining (텍스트 마이닝을 이용한 암반공학분야 SCI논문의 주제어 분석)

  • Jung, Yong-Bok;Park, Eui-Seob
    • Tunnel and Underground Space
    • /
    • v.25 no.4
    • /
    • pp.303-319
    • /
    • 2015
  • Text mining is one of the branches of data mining and is used to find any meaningful information from the large amount of text. In this study, we analyzed titles and keywords of two SCI journals on rock engineering by using text mining to find major research area, trend and associations of research fields. Visualization of the results was also included for the intuitive understanding of the results. Two journals showed similar research fields but different patterns in the associations among research fields. IJRMMS showed simple network, that is one big group based on the keyword 'rock' with a few small groups. On the other hand, RMRE showed a complex network among various medium groups. Trend analysis by clustering and linear regression of keyword - year frequency matrix provided that most of the keywords increased in number as time goes by except a few descending keywords.