• Title/Summary/Keyword: keyword-based analysis

Search Result 633, Processing Time 0.026 seconds

Unstructured Data Processing Using Keyword-Based Topic-Oriented Analysis (키워드 기반 주제중심 분석을 이용한 비정형데이터 처리)

  • Ko, Myung-Sook
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.6 no.11
    • /
    • pp.521-526
    • /
    • 2017
  • Data format of Big data is diverse and vast, and its generation speed is very fast, requiring new management and analysis methods, not traditional data processing methods. Textual mining techniques can be used to extract useful information from unstructured text written in human language in online documents on social networks. Identifying trends in the message of politics, economy, and culture left behind in social media is a factor in understanding what topics they are interested in. In this study, text mining was performed on online news related to a given keyword using topic - oriented analysis technique. We use Latent Dirichiet Allocation (LDA) to extract information from web documents and analyze which subjects are interested in a given keyword, and which topics are related to which core values are related.

Research Trends in the Journal of Korean Academic Society of Home Health Care Nursing from 2010 to 2019: Using the Keyword Home Health Care (가정간호학회지 게재 논문 분석: 2010년부터 2019년까지 가정간호분야를 중심으로)

  • Jun, Eun-Young;Noh, Jun Hee
    • Journal of Home Health Care Nursing
    • /
    • v.27 no.2
    • /
    • pp.210-218
    • /
    • 2020
  • Purpose: The purpose of this study was to analyze research trends, using the keyword home health care, in articles published in the Journal of Korean Academic Society of Home Health Care Nursing over the past 10 years. Methods: An analysis was conducted of 50 home health care-based studies chosen from among the 206 studies published in the Journal of Korean Academic Society of Home Health Care Nursing from 2010 to 2019. The analysis focused on research methodology and keyword. Descriptive statistics were used to examine the frequency distribution of research methods and keywords. Results: Study participation was mainly focused on nurses (52.0%). Most of the studies used quantitative methods (96.0%), and 43 studies (86.0%) used self-report structured questionnaires. The most commonly used data analyses methods were descriptive statistics, t-test, analysis of variance, correlation, and regression. Major keywords were home health nursing, elderly care facility, visiting nurse, home care service, home healthcare nurse, home care agencies, long-term care, and home care. Conclusion: The results of this study identified current trends and interests in the Journal of Korean Academic Society of Home Health Care Nursing. This study suggests that future studies include a variety of research methods and maintain appropriate standards of research ethics.

Nowcast of TV Market using Google Trend Data

  • Youn, Seongwook;Cho, Hyun-chong
    • Journal of Electrical Engineering and Technology
    • /
    • v.11 no.1
    • /
    • pp.227-233
    • /
    • 2016
  • Google Trends provides weekly information on keyword search frequency on the Google search engine. Search volume patterns for the search keyword can also be analyzed based on category and by the location of those making the search. Also, Google provides “Hot searches” and “Top charts” including top and rising searches that include the search keyword. All this information is kept up to date, and allows trend comparisons by providing past weekly figures. In this study, we present a predictive model for TV markets using the searched data in Google search engine (Google Trend data). Using a predictive model for the market and analysis of the Google Trend data, we obtained an efficient and meaningful result for the TV market, and also determined highly ranked countries and cities. This method can provide very useful information for TV manufacturers and others.

Web Site Keyword Selection Method by Considering Semantic Similarity Based on Word2Vec (Word2Vec 기반의 의미적 유사도를 고려한 웹사이트 키워드 선택 기법)

  • Lee, Donghun;Kim, Kwanho
    • The Journal of Society for e-Business Studies
    • /
    • v.23 no.2
    • /
    • pp.83-96
    • /
    • 2018
  • Extracting keywords representing documents is very important because it can be used for automated services such as document search, classification, recommendation system as well as quickly transmitting document information. However, when extracting keywords based on the frequency of words appearing in a web site documents and graph algorithms based on the co-occurrence of words, the problem of containing various words that are not related to the topic potentially in the web page structure, There is a difficulty in extracting the semantic keyword due to the limit of the performance of the Korean tokenizer. In this paper, we propose a method to select candidate keywords based on semantic similarity, and solve the problem that semantic keyword can not be extracted and the accuracy of Korean tokenizer analysis is poor. Finally, we use the technique of extracting final semantic keywords through filtering process to remove inconsistent keywords. Experimental results through real web pages of small business show that the performance of the proposed method is improved by 34.52% over the statistical similarity based keyword selection technique. Therefore, it is confirmed that the performance of extracting keywords from documents is improved by considering semantic similarity between words and removing inconsistent keywords.

An Analysis on Key Factors of Mobile Fitness Application by Using Text Mining Techniques : User Experience Perspective (텍스트마이닝 기법을 이용한 모바일 피트니스 애플리케이션 주요 요인 분석 : 사용자 경험 관점)

  • Lee, So-Hyun;Kim, Jinsol;Yoon, Sang-Hyeak;Kim, Hee-Woong
    • Journal of Information Technology Services
    • /
    • v.19 no.3
    • /
    • pp.117-137
    • /
    • 2020
  • The development of information technology leads to changes in various industries. In particular, the health care industry is more influenced so that it is focused on. With the widening of the health care market, the market of smart device based personal health care also draws attention. Since a variety of fitness applications for smartphone based exercise were introduced, more interest has been in the health care industry. But although an amount of use of mobile fitness applications increase, it fails to lead to a sustained use. It is necessary to find and understand what matters for mobile fitness application users. Therefore, this study analyze the reviews of mobile fitness application users, to draw key factors, and thereby to propose detailed strategies for promoting mobile fitness applications. We utilize text mining techniques - LDA topic modeling, term frequency analysis, and keyword extraction - to draw and analyze the issues related to mobile fitness applications. In particular, the key factors drawn by text mining techniques are explained through the concept of user experience. This study is academically meaningful in the point that the key factors of mobile fitness applications are drawn by the user experience based text mining techniques, and practically this study proposes detailed strategies for promoting mobile fitness applications in the health care area.

Changes in the Cultural Trend of Use by Type of Green Infrastructure Before and After COVID-19 Using Blog Text Mining in Seoul

  • Chae, Jinhae;Cho, MinJoon
    • Journal of People, Plants, and Environment
    • /
    • v.24 no.4
    • /
    • pp.415-427
    • /
    • 2021
  • Background and objective: This study examined the changes in the cultural trend of use for green infrastructure in Seoul due to COVID-19 pandemic. Methods: The subjects of this study are 8 sites of green infrastructure selected by type: Forested green infrastructure, Watershed green infrastructure, Park green infrastructure, Walkway green infrastructure. The data used for analysis was blog posts for a total of four years from August 1, 2016 to July 31, 2020. The analysis method was conducted keyword frequency analysis, topic modeling, and related keyword analysis. Results: The results of this study are as follows. First, the number of posts on green infrastructure has increased since COVID-19, especially forested green infrastructure and watershed green infrastructure with abundant naturalness and high openness. Second, the cultural trend keywords before and after COVID-19 changed from large-scale to small-scale, community-based to individual-based activities, and nondaily to daily culture. Third, after COVID-19, topics and keywords related to coronavirus showed that the cultural trends were reflected on appreciation, activities, and dailiness based on natural resources. In sum, the interest in green infrastructure in Seoul has increased after COVID-19. Also, the change of green infrastructure represents the increased demand for experience that reflects the need and expectation for nature. Conclusion: The new trend of green Infrastructure in the pandemic era should be considered in the the individual relaxations & activities.

Forecasting Korean National Innovation System and Science & Technology Policy after the COVID-19

  • Park, Sung-Uk;Kwon, Ki-Seok
    • Asian Journal of Innovation and Policy
    • /
    • v.9 no.2
    • /
    • pp.145-163
    • /
    • 2020
  • The COVID-19 is a pandemic that affects all facets of our life and will change many patterns in science technology and innovation. A qualitative study was conducted using Focus Group Interview involving ten industry-academia-research experts with the objective of identifying changes in Korea's national innovation system and science & technology policy after the COVID-19. Eight questions were designed, based on the major components of the national innovation system, such as companies, universities, and research institutes, to discuss the changes in the national innovation system and science & technology policy. Also, keyword analysis and cluster analysis were performed using the network analysis program VOSviewer. It is predicted that, in the wake of the COVID-19, Korea's national innovation system will shift to a new paradigm that is more decentralized, responsive, and autonomous. Furthermore, several policy agendas that can turn these changes into positive momentum of change in science & technology policy are presented.

Analytical Study on Classification and Service Quality Improvement for Keyword & Blog Advertising Marketing Services (검색 광고 마케팅 서비스 유형 분석과 서비스 품질 개선방안)

  • Choi, Yoon-Ho;Lee, Jae-Won
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.11
    • /
    • pp.456-466
    • /
    • 2015
  • This study is focusing to the keyword and blog advertising marketing services that are implementing a viral marketing utilizing keyword searches of the search portal and advertiser's blogs with convergent way. Through a case study for the company operating the service to pinpoint consumers to the advertisers site by indirect exposure via keyword advertising blog at the top of the search results, we analyzed the primitive service operation model on transactional relationship between the business players. We have a research purpose to generate improvement alternatives for the company's keyword advertising marketing services and operation solution using the survey study on the service quality perception and the perceptional gap between user groups. As results of study, we founded 4 types of the service solution and 4 models of service operating architecture on the transactional relations, and we recommended some improvements on the service and solution operation based on the SERVQUAL questionnaire analysis of the difference between the ads sponsor group and ads agency group.

An Empirical Study on Statistical Optimization Model for the Portfolio Construction of Sponsored Search Advertising(SSA) (키워드검색광고 포트폴리오 구성을 위한 통계적 최적화 모델에 대한 실증분석)

  • Yang, Hognkyu;Hong, Juneseok;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.167-194
    • /
    • 2019
  • This research starts from the four basic concepts of incentive incompatibility, limited information, myopia and decision variable which are confronted when making decisions in keyword bidding. In order to make these concept concrete, four framework approaches are designed as follows; Strategic approach for the incentive incompatibility, Statistical approach for the limited information, Alternative optimization for myopia, and New model approach for decision variable. The purpose of this research is to propose the statistical optimization model in constructing the portfolio of Sponsored Search Advertising (SSA) in the Sponsor's perspective through empirical tests which can be used in portfolio decision making. Previous research up to date formulates the CTR estimation model using CPC, Rank, Impression, CVR, etc., individually or collectively as the independent variables. However, many of the variables are not controllable in keyword bidding. Only CPC and Rank can be used as decision variables in the bidding system. Classical SSA model is designed on the basic assumption that the CPC is the decision variable and CTR is the response variable. However, this classical model has so many huddles in the estimation of CTR. The main problem is the uncertainty between CPC and Rank. In keyword bid, CPC is continuously fluctuating even at the same Rank. This uncertainty usually raises questions about the credibility of CTR, along with the practical management problems. Sponsors make decisions in keyword bids under the limited information, and the strategic portfolio approach based on statistical models is necessary. In order to solve the problem in Classical SSA model, the New SSA model frame is designed on the basic assumption that Rank is the decision variable. Rank is proposed as the best decision variable in predicting the CTR in many papers. Further, most of the search engine platforms provide the options and algorithms to make it possible to bid with Rank. Sponsors can participate in the keyword bidding with Rank. Therefore, this paper tries to test the validity of this new SSA model and the applicability to construct the optimal portfolio in keyword bidding. Research process is as follows; In order to perform the optimization analysis in constructing the keyword portfolio under the New SSA model, this study proposes the criteria for categorizing the keywords, selects the representing keywords for each category, shows the non-linearity relationship, screens the scenarios for CTR and CPC estimation, selects the best fit model through Goodness-of-Fit (GOF) test, formulates the optimization models, confirms the Spillover effects, and suggests the modified optimization model reflecting Spillover and some strategic recommendations. Tests of Optimization models using these CTR/CPC estimation models are empirically performed with the objective functions of (1) maximizing CTR (CTR optimization model) and of (2) maximizing expected profit reflecting CVR (namely, CVR optimization model). Both of the CTR and CVR optimization test result show that the suggested SSA model confirms the significant improvements and this model is valid in constructing the keyword portfolio using the CTR/CPC estimation models suggested in this study. However, one critical problem is found in the CVR optimization model. Important keywords are excluded from the keyword portfolio due to the myopia of the immediate low profit at present. In order to solve this problem, Markov Chain analysis is carried out and the concept of Core Transit Keyword (CTK) and Expected Opportunity Profit (EOP) are introduced. The Revised CVR Optimization model is proposed and is tested and shows validity in constructing the portfolio. Strategic guidelines and insights are as follows; Brand keywords are usually dominant in almost every aspects of CTR, CVR, the expected profit, etc. Now, it is found that the Generic keywords are the CTK and have the spillover potentials which might increase consumers awareness and lead them to Brand keyword. That's why the Generic keyword should be focused in the keyword bidding. The contribution of the thesis is to propose the novel SSA model based on Rank as decision variable, to propose to manage the keyword portfolio by categories according to the characteristics of keywords, to propose the statistical modelling and managing based on the Rank in constructing the keyword portfolio, and to perform empirical tests and propose a new strategic guidelines to focus on the CTK and to propose the modified CVR optimization objective function reflecting the spillover effect in stead of the previous expected profit models.

Patent data analysis using clique analysis in a keyword network (키워드 네트워크의 클릭 분석을 이용한 특허 데이터 분석)

  • Kim, Hyon Hee;Kim, Donggeon;Jo, Jinnam
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.5
    • /
    • pp.1273-1284
    • /
    • 2016
  • In this paper, we analyzed the patents on machine learning using keyword network analysis and clique analysis. To construct a keyword network, important keywords were extracted based on the TF-IDF weight and their association, and network structure analysis and clique analysis was performed. Density and clustering coefficient of the patent keyword network are low, which shows that patent keywords on machine learning are weakly connected with each other. It is because the important patents on machine learning are mainly registered in the application system of machine learning rather thant machine learning techniques. Also, our results of clique analysis showed that the keywords found by cliques in 2005 patents are the subjects such as newsmaker verification, product forecasting, virus detection, biomarkers, and workflow management, while those in 2015 patents contain the subjects such as digital imaging, payment card, calling system, mammogram system, price prediction, etc. The clique analysis can be used not only for identifying specialized subjects, but also for search keywords in patent search systems.