• 제목/요약/키워드: Text mining analysis

검색결과 1,187건 처리시간 0.021초

온라인 해킹 불법 시장 분석: 데이터 마이닝과 소셜 네트워크 분석 활용 (An Analysis of Online Black Market: Using Data Mining and Social Network Analysis)

  • 김민수;김희웅
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제29권2호
    • /
    • pp.221-242
    • /
    • 2020
  • Purpose This study collects data of the recently activated online black market and analyzes it to present a specific method for preparing for a hacking attack. This study aims to make safe from the cyber attacks, including hacking, from the perspective of individuals and businesses by closely analyzing hacking methods and tools in a situation where they are easily shared. Design/methodology/approach To prepare for the hacking attack through the online black market, this study uses the routine activity theory to identify the opportunity factors of the hacking attack. Based on this, text mining and social network techniques are applied to reveal the most dangerous areas of security. It finds out suitable targets in routine activity theory through text mining techniques and motivated offenders through social network analysis. Lastly, the absence of guardians and the parts required by guardians are extracted using both analysis techniques simultaneously. Findings As a result of text mining, there was a large supply of hacking gift cards, and the demand to attack sites such as Amazon and Netflix was very high. In addition, interest in accounts and combos was in high demand and supply. As a result of social network analysis, users who actively share hacking information and tools can be identified. When these two analyzes were synthesized, it was found that specialized managers are required in the areas of proxy, maker and many managers are required for the buyer network, and skilled managers are required for the seller network.

텍스트 마이닝을 통한 해외건설공사 입찰정보 분석 - 해외건설공사의 입찰자 질의(Bidder Inquiry) 정보를 대상으로 - (Construction Bid Data Analysis for Overseas Projects Based on Text Mining - Focusing on Overseas Construction Project's Bidder Inquiry)

  • 이지희;이준성;손정욱
    • 한국건설관리학회논문집
    • /
    • 제17권5호
    • /
    • pp.89-96
    • /
    • 2016
  • 건설 프로젝트에서 생산되는 대부분의 데이터는 텍스트 기반의 비정형 데이터이다. 계약서, 시방서, RFi 등 수많은 텍스트 문서들을 효과적으로 분석하기 위해서는 텍스트 마이닝과 같은 비정형 텍스트 데이터 분석 방법이 필요하다. 이에 본 연구에서는 과거에 수행되었던 해외건설공사 프로젝트의 입찰 관련 문서들을 대상으로 텍스트 마이닝을 실시하였으며, 그 결과 빈출단어의 유형, 단어들 간의 연관관계, 문서들의 토픽 유형들에 대한 파악이 가능하였다. 본 연구는 텍스트 마이닝을 활용한 해외건설공사 입찰 정보 분석을 통해 비정형 텍스트 데이터를 효과적으로 분석할 수 있는 방안을 제시하였다는 점에서 의의가 있으며, 향후 관련 분야 연구를 확장시킬 수 있는 기반을 마련할 수 있을 것이라 기대한다.

텍스트 마이닝을 이용한 암반공학분야 SCI논문의 주제어 분석 (Keyword Analysis of Two SCI Journals on Rock Engineering by using Text Mining)

  • 정용복;박의섭
    • 터널과지하공간
    • /
    • 제25권4호
    • /
    • pp.303-319
    • /
    • 2015
  • 텍스트 형태의 자료에서 유용한 정보를 추출하는 텍스트 마이닝 기법은 데이터 마이닝의 한 분야이다. 본 연구에서는 암반공학 분야의 대표적인 국제 학술지인 IJRMMS과 RMRE에 2001년 이후 게재된 논문의 제목과 주요어를 대상으로 텍스트 마이닝 기법을 적용하여 주요 연구 동향과 시계열 트렌드, 연구 분야 상관관계 등을 파악하였으며 이를 이해하기 쉽도록 가시화하였다. 분석 결과 주요 연구 분야는 두 학술지 모두 유사하였으나 연관관계 분석 결과 IJRMMS의 경우 'rock'을 기반으로 1개의 큰 그룹과 소규모 그룹이 형성된 반면 RMRE는 중규모의 그룹이 형성되고 이 그룹 간에 연결이 형성되는 구조가 나타났다. 또한 시계열 자료로 변환하여 군집 분석과 각 주제어의 기울기 자료로 분석한 결과 일부 하강 주제어들이 있었으나 양적인 측면에서 차이가 있을 뿐 대부분 논문 수가 증가하는 것으로 나타났다.

국내 소비자의 일본 패션제품에 대한 정치적 소비 연구 (Korean Consumers' Political Consumption of Japanese Fashion Products)

  • 최영현;이규혜
    • 한국의류학회지
    • /
    • 제44권2호
    • /
    • pp.295-309
    • /
    • 2020
  • In 2019, Japan announced trade regulations against Korean products; consequently, the sales of Japanese products in Korea dropped due to a Korean consumers' boycott. This study measured the Korean consumers' political consumption behavior toward Japanese fashion products. Unstructured text data from online media sources and consumer posted sources such as blog and SNS were collected. Text mining techniques and semantic network analysis were used to process unstructured data. This study used text mining techniques and semantic network analysis to process data. The results identified boycotting Japanese fashion products and buycotting alternative products and Korean brands due to consumers' political consumption. Two brand cases were investigated in detail. Online text data before and after the political action were compared and significant changes in consumption as well as emotional expressions were identified. Product related industry sectors were identified in terms of the political consumption of fashion: liquor, automobile and tourism industry sectors were closely linked to the fashion sector in terms of boycotting. More "boycott" and "buycott" fashion brands (reflected in consumer attitudes and feelings) were detected in consumer driven texts than in media driven sources.

도서 정보 및 본문 텍스트 통합 마이닝 기반 사용자 맞춤형 도서 큐레이션 시스템 (Personalized Book Curation System based on Integrated Mining of Book Details and Body Texts)

  • 안희정;김기원;김승훈
    • Journal of Information Technology Applications and Management
    • /
    • 제24권1호
    • /
    • pp.33-43
    • /
    • 2017
  • The content curation service through big data analysis is receiving great attention in various content fields, such as film, game, music, and book. This service recommends personalized contents to the corresponding user based on user's preferences. The existing book curation systems recommended books to users by using bibliographic citation, user profile or user log data. However, these systems are difficult to recommend books related to character names or spatio-temporal information in text contents. Therefore, in this paper, we suggest a personalized book curation system based on integrated mining of a book. The proposed system consists of mining system, recommendation system, and visualization system. The mining system analyzes book text, user information or profile, and SNS data. The recommendation system recommends personalized books for users based on the analysed data in the mining system. This system can recommend related books using based on book keywords even if there is no user information like new customer. The visualization system visualizes book bibliographic information, mining data such as keyword, characters, character relations, and book recommendation results. In addition, this paper also includes the design and implementation of the proposed mining and recommendation module in the system. The proposed system is expected to broaden users' selection of books and encourage balanced consumption of book contents.

텍스트마이닝을 이용한 한국응급구조학회지 중심단어 분석 (Analysis of key words published with the Korea Society of Emergency Medical Services journal using text mining)

  • 권찬양;양현모
    • 한국응급구조학회지
    • /
    • 제24권1호
    • /
    • pp.85-92
    • /
    • 2020
  • Purpose: The purpose of this study was to analyze the English abstract key words found within the Korea Society of Emergency Medical Services journal using text mining techniques to determine the adherence of these terms with Medical Subject Headings (MeSH) and identify key word trends. Methods: We analyzed 212 papers that were published from 2012 to 2019. R software, web scraping, and frequency analysis of key words were conducted using R's basic and text mining packages. Additionally, the Word Clouds package was used for visualization. Results: The average number of key words used per study was 3.9. Word cloud visualization revealed that CPR was most prominent in the first half and emergency medical technician was most frequently used during the second half. There were a total of 542 (64.9%) words that exactly matched the MeSH listed words. A total of 293 (35%) key words did not match MeSH listed words. Conclusion: Researchers should obey submission rules. Further, journals should update their respective submission rules. MeSH key words that are frequently cited should be suggested for use.

텍스트 마이닝 처리로 품질경영학회지 연구동향 분석 (Analysis of Research Trends in Journal of Korean Society for Quality Management by Text Mining Processing)

  • 이상복
    • 품질경영학회지
    • /
    • 제47권3호
    • /
    • pp.597-613
    • /
    • 2019
  • Purpose: The purpose of this study is to analyze the trend of quality research by analyzing the entire JKSQM(Journal of the Korean Society for Quality Management). Methods: This study is to analyze the frequency of words used in the abstract of the all JKSQM by applying the text mining processing. We use wordcrowd among text mining techniques. Results: 22 words of high frequency were presented in the abstract of the paper published in the JKSQM for 42 years. The frequency of words was shown on a 10 year basis, and the four important words were plotted on a change graph for each Vol. Frequent words of each Vol. are added in the appendix. Conclusion: The main research results are as follows. First, there has been no significant change in research trends over the last 40 years. Second, the early SQC words have been widely used, and since 1990, many words such as service-oriented words have been used, indicating a change in the times. Third, the use of the words of the 4th industrial revolution since 2010 is weak. In the above analysis, the trend of quality research in Korea is within the quality category and can be considered conservative. Now, it is expected that everything will be changed in the period of the 4th Industrial Revolution, and it is time to study the direction of quality in Korea.

텍스트마이닝을 활용한 중고거래 안전결제 실태분석 (Analysis of the Safety Payment in Second-hand Transactions Using Text Mining)

  • 김은지;김범수
    • 문화기술의 융합
    • /
    • 제9권4호
    • /
    • pp.529-536
    • /
    • 2023
  • 한국의 중고거래 시장은 꾸준한 성장을 보이고 있지만, 중고거래 사기의 발생 건수와 피해 금액도 함께 늘어나고 있다. 2021년 기준 중고거래 시장 규모는 24조원이지만 동시에 사기 피해 금액은 3,606억 원에 달한다. 중고거래 플랫폼은 개인 간 거래 사기를 방지하기 위해 안전거래 결제 시스템을 마련했다. 그러나 안전결제 시스템을 악용한 신종 사기 수법이 생겨나고 있어 중고거래 안전결제도 사기로부터 안전하다고 볼 수 없다. 이에 본 연구는 텍스트마이닝을 활용하여 중고거래 안전결제 서비스의 사기 방지를 위해 생긴 안전결제 시스템의 실태를 파악하고, 이를 텍스트마이닝과 네트워크 분석으로 분석하여 안전결제 시스템의 개선 방안을 제안한다.

텍스트마이닝을 활용한 건설현장 재해 유형별 예방 대책 분석 (Analysis of Prevention Methods by Type of Construction Disaster Using Text Mining Techniques)

  • 조규필;이명도;신윤석;김백중
    • 한국재난정보학회 논문집
    • /
    • 제20권1호
    • /
    • pp.13-19
    • /
    • 2024
  • 연구목적: 본 연구는 텍스트 마이닝 기법을 활용하여 유형별 건설재해의 원인을 도출하여 중대재해 사고의 예방대책 마련을 위한 주요 요소를 파악하는 것을 목적으로 한다. 연구방법: 국내 건설분야의 중대재해 사례를 분석한 데이터베이스를 기반으로 예방대책과 원인을 텍스트 마이닝 기법으로 분석하고, 분석 내용을 시각적으로 표현하였다. 연구결과: 이 시각적 데이터는 중요도에 따라 공종별 중대 재해 예방에 필요한 요소의 파악을 용이하게 한다. 결론: 본 연구의 결과는 건설현장 중대재해와 관련하여 예방대책 마련 시 고려되어야 할 요소 및 요소간 명확한 상관관계의 파악에 기여할 것으로 사료된다.

빅데이터 분석 기반의 오피니언 마이닝을 이용한 정보화 사업 평가 분석 (An Analysis of IT Proposal Evaluation Results using Big Data-based Opinion Mining)

  • 김홍삼;김종수
    • 산업경영시스템학회지
    • /
    • 제41권1호
    • /
    • pp.1-10
    • /
    • 2018
  • Current evaluation practices for IT projects suffer from several problems, which include the difficulty of self-explanation for the evaluation results and the improperly scaled scoring system. This study aims to develop a methodology of opinion mining to extract key factors for the causal relationship analysis and to assess the feasibility of quantifying evaluation scores from text comments using opinion mining based on big data analysis. The research has been performed on the domain of publicly procured IT proposal evaluations, which are managed by the National Procurement Service. Around 10,000 sets of comments and evaluation scores have been gathered, most of which are in the form of digital data but some in paper documents. Thus, more refined form of text has been prepared using various tools. From them, keywords for factors and polarity indicators have been extracted, and experts on this domain have selected some of them as the key factors and indicators. Also, those keywords have been grouped into into dimensions. Causal relationship between keyword or dimension factors and evaluation scores were analyzed based on the two research models-a keyword-based model and a dimension-based model, using the correlation analysis and the regression analysis. The results show that keyword factors such as planning, strategy, technology and PM mostly affects the evaluation result and that the keywords are more appropriate forms of factors for causal relationship analysis than the dimensions. Also, it can be asserted from the analysis that evaluation scores can be composed or calculated from the unstructured text comments using opinion mining, when a comprehensive dictionary of polarity for Korean language can be provided. This study may contribute to the area of big data-based evaluation methodology and opinion mining for IT proposal evaluation, leading to a more reliable and effective IT proposal evaluation method.