• Title/Summary/Keyword: keyword frequency analysis

Search Result 316, Processing Time 0.024 seconds

A Study on Establishing a Market Entry Strategy for the Satellite Industry Using Future Signal Detection Techniques (미래신호 탐지 기법을 활용한 위성산업 시장의 진입 전략 수립 연구)

  • Sehyoung Kim;Jaehyeong Park;Hansol Lee;Juyoung Kang
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.249-265
    • /
    • 2023
  • Recently, the satellite industry has been paying attention to the private-led 'New Space' paradigm, which is a departure from the traditional government-led industry. The space industry, which is considered to be the next food industry, is still receiving relatively little attention in Korea compared to the global market. Therefore, the purpose of this study is to explore future signals that can help determine the market entry strategies of private companies in the domestic satellite industry. To this end, this study utilizes the theoretical background of future signal theory and the Keyword Portfolio Map method to analyze keyword potential in patent document data based on keyword growth rate and keyword occurrence frequency. In addition, news data was collected to categorize future signals into first symptom and early information, respectively. This is utilized as an interpretive indicator of how the keywords reveal their actual potential outside of patent documents. This study describes the process of data collection and analysis to explore future signals and traces the evolution of each keyword in the collected documents from a weak signal to a strong signal by specifically visualizing how it can be used through the visualization of keyword maps. The process of this research can contribute to the methodological contribution and expansion of the scope of existing research on future signals, and the results can contribute to the establishment of new industry planning and research directions in the satellite industry.

Multi-Dimensional Keyword Search and Analysis of Hotel Review Data Using Multi-Dimensional Text Cubes (다차원 텍스트 큐브를 이용한 호텔 리뷰 데이터의 다차원 키워드 검색 및 분석)

  • Kim, Namsoo;Lee, Suan;Jo, Sunhwa;Kim, Jinho
    • Journal of Information Technology and Architecture
    • /
    • v.11 no.1
    • /
    • pp.63-73
    • /
    • 2014
  • As the advance of WWW, unstructured data including texts are taking users' interests more and more. These unstructured data created by WWW users represent users' subjective opinions thus we can get very useful information such as users' personal tastes or perspectives from them if we analyze appropriately. In this paper, we provide various analysis efficiently for unstructured text documents by taking advantage of OLAP (On-Line Analytical Processing) multidimensional cube technology. OLAP cubes have been widely used for the multidimensional analysis for structured data such as simple alphabetic and numberic data but they didn't have used for unstructured data consisting of long texts. In order to provide multidimensional analysis for unstructured text data, however, Text Cube model has been proposed precently. It incorporates term frequency and inverted index as measurements to search and analyze text databases which play key roles in information retrieval. The primary goal of this paper is to apply this text cube model to a real data set from in an Internet site sharing hotel information and to provide multidimensional analysis for users' reviews on hotels written in texts. To achieve this goal, we first build text cubes for the hotel review data. By using the text cubes, we design and implement the system which provides multidimensional keyword search features to search and to analyze review texts on various dimensions. This system will be able to help users to get valuable guest-subjective summary information easily. Furthermore, this paper evaluats the proposed systems through various experiments and it reveals the effectiveness of the system.

A study on the current status of DIY clothing products related to fabric using text mining (텍스트마이닝을 활용한 패브릭 관련 DIY 의류 상품 현황 연구)

  • Eun-Hye Lee;Ha-Eun Lee;Jeong-Wook Choi
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.25 no.2
    • /
    • pp.111-122
    • /
    • 2023
  • This study aims to collect Big Data related to DIY clothing, analyze the results on a year-by-year basis, understand consumers' perceptions, the status, and reality of DIY clothing. The reference period for the evaluation of DIY clothing trends was set from 2012 to 2022. The data in this study was collected and analyzed using Textom, a Big Data solution program certified as a Good Software by the Telecommunications Technology Association (TTA). For the analysis of fabric-related DIY products, the keyword was set to "DIY clothing", and for data cleansing following collection, the "Espresso K" module was employed. Also, via data collection on a year-by-year basis, a total of 11 lists were generated and the collected data was analyzed by period. The following are the findings of this study's data collection on DIY clothing. The total number of keywords collected over a period of ten years on search engines "Naver" and "Google" between January 1, 2012 and December 31, 2022 was 16,315, and data trends by period indicate a continuous upward trend. In addition, a keyword analysis was conducted to analyze TF-IDF (Term Frequency-Inverse Document Frequency), a statistical measure that reflects the importance of a word within data, and the relationship with N-gram, an analysis of the correlation concerning the relationship between words. Using these results, it was possible to evaluate the popularity and growing tendency of DIY clothing products in conjunction with the evolving social environment, as well as the desire to explore DIY trends among consumers. Therefore, this study is valuable in that it provides preliminary data for DIY clothing research by analyzing the status and reality of DIY products, and furthermore, contributes to the development and production of DIY clothing.

A Study on Change in Perception of Community Service and Demand Prediction based on Big Data

  • Chun-Ok, Jang
    • International Journal of Advanced Culture Technology
    • /
    • v.10 no.4
    • /
    • pp.230-237
    • /
    • 2022
  • The Community Social Service Investment project started as a state subsidy project in 2007 and has grown very rapidly in quantitative terms in a short period of time. It is a bottom-up project that discovers the welfare needs of people and plans and provides services suitable for them. The purpose of this study is to analyze using big data to determine the social response to local community service investment projects. For this, data was collected and analyzed by crawling with a specific keyword of community service investment project on Google and Naver sites. As for the analysis contents, monthly search volume, related keywords, monthly search volume, search rate by age, and gender search rate were conducted. As a result, 10 items were found as related keywords in Google, and 3 items were found in Naver. The overall results of Google and Naver sites were slightly different, but they increased and decreased at almost the same time. Therefore, it can be seen that the community service investment project continues to attract users' interest.

Keyword Analysis of Two SCI Journals on Rock Engineering by using Text Mining (텍스트 마이닝을 이용한 암반공학분야 SCI논문의 주제어 분석)

  • Jung, Yong-Bok;Park, Eui-Seob
    • Tunnel and Underground Space
    • /
    • v.25 no.4
    • /
    • pp.303-319
    • /
    • 2015
  • Text mining is one of the branches of data mining and is used to find any meaningful information from the large amount of text. In this study, we analyzed titles and keywords of two SCI journals on rock engineering by using text mining to find major research area, trend and associations of research fields. Visualization of the results was also included for the intuitive understanding of the results. Two journals showed similar research fields but different patterns in the associations among research fields. IJRMMS showed simple network, that is one big group based on the keyword 'rock' with a few small groups. On the other hand, RMRE showed a complex network among various medium groups. Trend analysis by clustering and linear regression of keyword - year frequency matrix provided that most of the keywords increased in number as time goes by except a few descending keywords.

Analysis of dieting practices in 2016 using big data (빅데이터를 통한 2016년의 다이어트 실태 분석)

  • Jung, Eun-Jin;Chang, Un-Jae;Jo, Kyungae
    • Korean Journal of Food Science and Technology
    • /
    • v.51 no.2
    • /
    • pp.176-181
    • /
    • 2019
  • The aim of this study was to analyze dieting practices and tendencies in 2016 using big data. The keywords related to diet were collected from the portal site Naver and analyzed through simple frequency, N-gram, keyword network, and analysis of seasonality. The results showed that exercise had the highest frequency in simple frequency analysis. However, diet menu appeared most frequently in N-gram analysis. In addition, analysis of seasonality showed that the interest of subjects in diet increased steadily from February to July and peaked in October 2016. The monthly frequency of the keyword highfat diet was highest in October, because that showed the 'Low Carbohydrate High Fat' TV program. Although diet showed a certain pattern on a yearly basis, the emergence of new trendy diets in mass media also affects the pattern of diet. Therefore, it is considered that continuous monitoring and analysis of diet is needed rather than periodic monitoring.

A Study on the Research Trend of Elementary Environmental Education through an Analysis of the Network of Author Keywords (저자 키워드 네트워크 분석을 통한 초등 환경교육의 연구 동향 탐색)

  • Kim, Dong-Ryeul
    • Journal of Korean Elementary Science Education
    • /
    • v.36 no.2
    • /
    • pp.113-128
    • /
    • 2017
  • This study aims to investigate the research trend of elementary environmental education. Thus, author keywords were extracted from a total of 197 academic these related to elementary environmental education during two different periods when detailed goals were applied to the 2007 and 2009 revised curriculums respectively, and then this study analyzed the network of author keywords. The results of this study can be summarized as below. Firstly, as a result of analyzing the frequency of author keywords from academic theses related to elementary environmental education, this study discovered 369 author keywords from the period when detailed goals were applied to 2009 revised curriculum. Out of them, it was found that the keyword, 'climate change education', showed the highest frequency, followed by 'environmental literacy' and 'environmental perception', except such central keywords as 'environmental education' and 'elementary school student'. From the period when detailed goals were applied to the 2007 revised curriculum, a total of 394 author keywords were discovered, and the keyword, 'environmental literacy', showed the highest frequency, followed by 'environmental perception' and 'ESD (education for sustainable development)'. Secondly, as a result of analyzing the network of author keywords, this study found out that in the total number of network connections, average connection degree, density and clique, the period when detailed goals were applied to the 2007 revised curriculum was somewhat higher than the period when detailed goals were applied to the 2009 revised curriculum. As a result of analyzing the centrality of author keywords, this study found out that during both the periods, 'environmental perception' and 'environmental literacy' were high in degree centrality and betweenness centrality, except such central keywords as 'environmental education' and 'elementary school student'. As a result of analyzing the components of author keywords as sub-networks, this study discovered 9 components from the period when detailed goals were applied to the 2009 revised curriculum and 6 components from the period when detailed goals were applied to the 2007 revised curriculum. During both the periods, the largest component was composed of keywords high in degree centrality and betweenness centrality.

Web Site Keyword Selection Method by Considering Semantic Similarity Based on Word2Vec (Word2Vec 기반의 의미적 유사도를 고려한 웹사이트 키워드 선택 기법)

  • Lee, Donghun;Kim, Kwanho
    • The Journal of Society for e-Business Studies
    • /
    • v.23 no.2
    • /
    • pp.83-96
    • /
    • 2018
  • Extracting keywords representing documents is very important because it can be used for automated services such as document search, classification, recommendation system as well as quickly transmitting document information. However, when extracting keywords based on the frequency of words appearing in a web site documents and graph algorithms based on the co-occurrence of words, the problem of containing various words that are not related to the topic potentially in the web page structure, There is a difficulty in extracting the semantic keyword due to the limit of the performance of the Korean tokenizer. In this paper, we propose a method to select candidate keywords based on semantic similarity, and solve the problem that semantic keyword can not be extracted and the accuracy of Korean tokenizer analysis is poor. Finally, we use the technique of extracting final semantic keywords through filtering process to remove inconsistent keywords. Experimental results through real web pages of small business show that the performance of the proposed method is improved by 34.52% over the statistical similarity based keyword selection technique. Therefore, it is confirmed that the performance of extracting keywords from documents is improved by considering semantic similarity between words and removing inconsistent keywords.

Exploring Future Signals for Mobile Payment Services - A Case of Chinese Market - (모바일 결제 서비스에 대한 미래신호 예측 - 중국시장을 대상으로 -)

  • Bin Xuan;Seung Ik Baek
    • Journal of Service Research and Studies
    • /
    • v.13 no.1
    • /
    • pp.96-107
    • /
    • 2023
  • The objective of this study is to explore future issues that Chinese users, who have the highest mobile payment service usage rate in the world, will be most interested in. For this purpose, after collecting text data from a Chinese SNS site, it classifies major keywords into 4 types of future signals by using Keyword Emergence Map (KEM) and Keyword Issue Map (KIM). Furthermore, to understand the four types of signals in detail, it performs the qualitative analysis on text related to each signal keyword. As a result, it finds that the strong signal, which is rapidly growing in keyword appearance frequency during this research period, includes the keywords related to the daily life of Chinese people, such as buses, subways, and household account books. Additionally, it find that the signal that appears frequently now, but with a low increase rate, includes various services that can replace cash payment, such as hongbao (cash payment) and bank cards. The weak signal and latent signal, which appear less often than other two signals, includes the keywords related to promotion events or changes in service regulations. Its result shows that the mobile payment services greatly have changed user's daily life beyond providing convenience. Furthermore, it shows that, in the Chinese market, in which card payment is not common, the mobile payment services have the great potential to completely replace cash payment.

An Analysis on the Trends and Issues of Convergence Technology Research (네트워크 분석을 통한 국내 융합기술 연구동향 분석)

  • Lim, Jung-Yeon
    • Journal of Internet of Things and Convergence
    • /
    • v.4 no.1
    • /
    • pp.23-29
    • /
    • 2018
  • The purpose of study was to analyze the trends of 2005 to 2018 revised 'convergence technology research' through text network analysis using NetMiner4.0 program. Data analysis was conducted by using keyword analysis, centrality analysis of 653 authors' keyword from 177 journals. The results of the study are as follows. First, Research on Converging Technology has been studied steadily over the past 13 years in Department of Industry Convergence. Second, the results of the search term frequency analysis show that the 'convergence technology', 'technology convergence', 'convergence', 'design', 'convergence education', 'STEAM', 'convergence research' were used as the main keywords of convergence technology research. Third, Community analysis results show that five communities have been classified five categories according to the characteristics of the search terms 'only IT', 'Cultural industry utilizing Convergence contents', 'Technology innovation and research analysis' And patent development'. Based on these results, we proposed the future directions of convergence technology research.