• 제목/요약/키워드: big data growth

Search Result 326, Processing Time 0.03 seconds

A Study on the Real-time user purchase pattern analysis User Product Recommendation System in E-Commerce Environment (E-commerce 환경에서 실시간 사용자 구매 패턴 분석을 통한 사용자 상품 추천 시스템 연구)

  • Beom Jung Kim;Ji Hye Huh;Hyeopgeon Lee;Young Woon Kim
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.05a
    • /
    • pp.413-414
    • /
    • 2023
  • IT 기술의 발달로 E-Commerce 분야는 실시간으로 발생되는 데이터양이 증가하고 있으며, 발생된 데이터는 개인화 맞춤 서비스에 많이 활용되고 있다. 그러나 신생 E-commerce 기업은 신규 상품 및 기존 상품에 대한 정보와 고객 간의 상호 작용 데이터가 존재하지 않아 콜드 스타트 문제가 발생한다. 이에 본 논문에서는 E-commerce 환경에서 실시간 사용자 구매패턴 분석을 통한 사용자 상품 추천 시스템을 제안한다. 제안하는 시스템은 Kafka와 Spark를 사용해 실시간 스트림을 데이터를 처리한다. 주요 기능은 ALS 알고리즘과, FP-Growth 알고리즘을 적용해 콜트 스타트 문제를 해결하며, 사용자 구매 패턴 분석을 통한 분석 결과에 맞는 상품을 사용자에게 추천한다.

Research on Changes in the Coffee and Tourism Industries After the End of COVID-19 Through Big Data Analysis

  • Hyeon-Seok Kim;Gi-Hwan Ryu
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.16 no.2
    • /
    • pp.43-49
    • /
    • 2024
  • In early 2020, as the COVID-19 pandemic hit the world, widespread changes occurred throughout society. COVID-19 also brought changes in consumers' consumption behaviors and preferences. This study aims to find out how the current status of the tourism industry and the coffee industry has changed since the end of COVID-19 by conducting big data analysis focusing on the search frequency of Naver, Google, and the following, which are representative social networks in Korea. Designating "Coffee Industry + Tourism Industry" as the representative keyword, January 1, 2020 to December 31, 2020, the time of each COVID-19 outbreak, was set before the COVID-19 type, and January 1, 2023 to December 31, 2023 was set after the end of COVID-19. Based on the analyzed search binder big data analysis within the period, we would like to find out how the current status of the tourism industry and the coffee industry has changed since the end of COVID-19. Finaly, the coffee and tourism industries are on the path of recovery and growth. In particular, the rise in coffee consumption, the recovery of the number of tourists, the emphasis on local tourism, and the strengthening of links with global markets are prominent.

Consumption Changes during COVID-19 through the Analysis of Credit Card Usage : Focused on Jeju Province

  • YOON, Dong-Hwa;YANG, Kwon-Min;OH, Hyeon-Gon;KIM, Mincheol;CHANG, Mona
    • The Journal of Economics, Marketing and Management
    • /
    • v.9 no.5
    • /
    • pp.39-50
    • /
    • 2021
  • Purpose: This study is to analyze the changes of consumption patterns to diagnose the economic impacts on consumers' market during COVID-19, and to suggest implications to overcome the new social and economic crisis of Jeju Island. Research design, data, and methodology: We collected a set of credit card transaction records issued by BC Card Company from merchants in Jeju Special Self-Governing Province for past 4 years from 2017 to 2020 from the Jeju Data Hub run by Jeju Special Self-Governing Province. The big data contains details of approved credit card transactions including the approval numbers, amount, locations and types of merchants, time and age of users, etc. The researchers summed up amount in monthly basis, transforming big data to small data to analyze the changes of consumption before and after COVID-19. Results: Sales fell sharply in transportation industries including airlines, and overall consumption by age group decreased while the decrease in consumption among the seniors was relatively small. The sales of Yeon-dong and Yongdam-dong in Jeju City also fell significantly compared to other regions. As a result of the paired t-test of all 73 samples in Jeju City, the p-value of the mean consumption of the credit card in 2019 and 2020 is significant, statistically proven that the total consumption amount in the two years is different. Conclusions: We found there are sensitive spots that can be strategically approached based on the changes in consumption patterns by industry, region, and age although most of companies and small businesses have been hit by COVID-19. It is necessary for local companies and for the government to be focusing their support on upgrading services, in order to prevent declining sales and job instability for their employees, creating strategies to retain jobs and prevent customer churn in the face of the crisis. As Jeju Province is highly dependent on the tertiary industry, including tourism, it is suggested to create various strategies to overcome the crisis of the pandemic by constantly monitoring the sales trends of local companies.

Development of Examination Model of Weather Factors on Garlic Yield Using Big Data Analysis (빅데이터 분석을 활용한 마늘 생산에 미치는 날씨 요인에 관한 영향 조사 모형 개발)

  • Kim, Shinkon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.5
    • /
    • pp.480-488
    • /
    • 2018
  • The development of information and communication technology has been carried out actively in the field of agriculture to generate valuable information from large amounts of data and apply big data technology to utilize it. Crops and their varieties are determined by the influence of the natural environment such as temperature, precipitation, and sunshine hours. This paper derives the climatic factors affecting the production of crops using the garlic growth process and daily meteorological variables. A prediction model was also developed for the production of garlic per unit area. A big data analysis technique considering the growth stage of garlic was used. In the exploratory data analysis process, various agricultural production data, such as the production volume, wholesale market load, and growth data were provided from the National Statistical Office, the Rural Development Administration, and Korea Rural Economic Institute. Various meteorological data, such as AWS, ASOS, and special status data, were collected and utilized from the Korea Meteorological Agency. The correlation analysis process was designed by comparing the prediction power of the models and fitness of models derived from the variable selection, candidate model derivation, model diagnosis, and scenario prediction. Numerous weather factor variables were selected as descriptive variables by factor analysis to reduce the dimensions. Using this method, it was possible to effectively control the multicollinearity and low degree of freedom that can occur in regression analysis and improve the fitness and predictive power of regression analysis.

A cache placement algorithm based on comprehensive utility in big data multi-access edge computing

  • Liu, Yanpei;Huang, Wei;Han, Li;Wang, Liping
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.11
    • /
    • pp.3892-3912
    • /
    • 2021
  • The recent rapid growth of mobile network traffic places multi-access edge computing in an important position to reduce network load and improve network capacity and service quality. Contrasting with traditional mobile cloud computing, multi-access edge computing includes a base station cooperative cache layer and user cooperative cache layer. Selecting the most appropriate cache content according to actual needs and determining the most appropriate location to optimize the cache performance have emerged as serious issues in multi-access edge computing that must be solved urgently. For this reason, a cache placement algorithm based on comprehensive utility in big data multi-access edge computing (CPBCU) is proposed in this work. Firstly, the cache value generated by cache placement is calculated using the cache capacity, data popularity, and node replacement rate. Secondly, the cache placement problem is then modeled according to the cache value, data object acquisition, and replacement cost. The cache placement model is then transformed into a combinatorial optimization problem and the cache objects are placed on the appropriate data nodes using tabu search algorithm. Finally, to verify the feasibility and effectiveness of the algorithm, a multi-access edge computing experimental environment is built. Experimental results show that CPBCU provides a significant improvement in cache service rate, data response time, and replacement number compared with other cache placement algorithms.

BigData Research in Information Systems : Focusing on Journal Articles about Information Systems (정보시스템 분야의 빅데이터 연구 흐름 분석 : Information Systems 관련 저널을 중심으로)

  • Park, Kyungbo;Kim, Juyeong;Kim, Han-Min
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.9 no.6
    • /
    • pp.681-689
    • /
    • 2019
  • The 46th Davos Forum of the World Economic Forum (WEF) predicts the continued growth of the 4th industry in the future. Currently, the 4th industry is attracting attention in various academic and practical fields. As a core technology of the 4th industry, Big Data is regarded as a major resource to lead the 4th industrial revolution along with artificial intelligence. As the growing interest in Big Data, researches on it are actively being done. However, literature studies on existing Big Data are focused on qualitative research, and quantitative research is insufficient. Therefore, this study aims to analyze the big data research flow in MIS field and to make academic thirst for quantification. This study has collected 145 abstracts of big data papers published in major journals in MIS field and confirmed that a majority of papers are published in Decision Support Systems Journal. Text mining and text network analysis were performed only for DSS journals to eliminate bias. As a result of the analysis, it was found out that researches on combining big data in the management field between 2012 and 2014, and researches on system development and analysis method for using big data from 2015 to 2017 were conducted.

De-identification Techniques for Big Data and Issues (빅데이타 비식별화 기술과 이슈)

  • Woo, SungHee
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2017.05a
    • /
    • pp.750-753
    • /
    • 2017
  • Recently, the processing and utilization of big data, which is generated by the spread of smartphone, SNS, and the internet of things, is emerging as a new growth engine of ICT field. However, in order to utilize such big data, De-identification of personal information should be done. De-identification removes identifying information from a data set so that individual data cannot be linked with specific individuals. De-identification can reduce the privacy risk associated with collecting, processing, archiving, distributing or publishing information, thus it attempts to balance the contradictory goals of using and sharing personal information while protecting privacy. De-identified information has also been re-identified and has been controversial for the protection of personal information, but the number of instances where personal information such as big data is de-identified and processed is increasing. In addition, many de-identification guidelines have been introduced and a method for de-identification of personal information has been proposed. Therefore, in this study, we describe the big data de-identification process and follow-up management, and then compare and analyze de-identification methods. Finally we provide personal information protection issues and solutions.

  • PDF

Analysis of the supportive care needs of the parents of preterm children in South Korea using big data text-mining: Topic modeling

  • Park, Ji Hyeon;Lee, Hanna;Cho, Haeryun
    • Child Health Nursing Research
    • /
    • v.27 no.1
    • /
    • pp.34-42
    • /
    • 2021
  • Purpose: The purpose of this study was to identify the supportive care needs of parents of preterm children in South Korea using text data from a portal site. Methods: In total, 628 online newspaper articles and 1,966 social network service posts published between January 1 and December 31, 2019 were analyzed. The procedures in this study were conducted in the following order: keyword selection, data collection, morpheme analysis, keyword analysis, and topic modeling. Results: The term "yirundung-yi", which is a native Korean word referring to premature infants, was confirmed to be a useful term for parents. The following four topics were identified as the supportive care needs of parents of preterm children: 1) a vague fear of caring for a baby upon imminent neonatal intensive care unit discharge, 2) real-world difficulties encountered while caring for preterm children, 3) concerns about growth and development problems, and 4) anxiety about possible complications. Conclusion: Supportive care interventions for parents of preterm children should include general parenting methods for babies. A team composed of multidisciplinary experts must support the individual growth and development of preterm children and manage the complications of prematurity using highly accessible media.

A Study on the Human Resource Recruitment and R&D by the Growth Stage of ICT SMEs (ICT 중소기업의 성장단계별 인적자원 채용 및 연구개발에 관한 연구)

  • Jung, Byoungho;Joo, Hyungkun
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.17 no.4
    • /
    • pp.177-195
    • /
    • 2021
  • The purpose of this study is to examine the trouble of recruitment and research and development of ICT SMEs. Recently, many ICT SMEs have emerged for selling products and services using the technology of the 4th industrial revolution. However, SMEs have relatively deficient resources compared to large companies, the difficulty of maintenance or growth of human resources and intangible resources. This research methodology organized the four stages of the analysis process. The first analysis is the association rules for human resource recruitment. The second analysis is the difficulty of hiring jobs and experienced workers by each stage of company growth. The third analysis is a regression analysis of the trouble of R&D activity. The last analysis is an analysis of association rules on the difficulties of management activities by company growth. As the research result, the first analysis has shown a difference in favored human resources by the ICT industry. The second analysis also showed factor differences in job recruitment difficulties for each stage of corporate growth. In the third analysis, the operation of research institutes in ICT SMEs is influenced by industry type, corporate certification, corporate growth stage, self-technology development, joint technology development, technology transfer, and commercialization. As the last analysis, ICT SMEs showed factor differences in difficulties in management activities by stage of corporate growth. This study contributed empirically emphasizing the troubling phenomenon of human resources and R&D necessary for the growth of ICT SMEs. As a theoretical implication, this research contributed to the research-area expansion of management information using big-data technologies. In particular, this research practically suggests the differentiated direction of recruitment and R&D by ICT SMEs based on industry and each stage of company growth through the association rules of big data.

A Study on Impact of Deep Learning on Korean Economic Growth Factor

  • Dong Hwa Kim;Dae Sung Seo
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.4
    • /
    • pp.90-99
    • /
    • 2023
  • This paper deals with studying strategy about impact of deep learning (DL) on the factor of Korean economic growth. To study classification of impact factors of Korean economic growth, we suggest dynamic equation of microeconomy and study methods on economic growth impact of deep learning. Next step is to suggest DL model to dynamic equation with Korean economy data with growth related factors to classify what factor is import and dominant factors to build policy and education. DL gives an influence in many areas because it can be implemented with ease as just normal editing works and speak including code development by using huge data. Currently, young generations will take a big impact on their job selection because generative AI can do well as much as humans can do it everywhere. Therefore, policy and education methods should be rearranged as new paradigm. However, government and officers do not understand well how it is serious in policy and education. This paper provides method of policy and education for AI education including generative AI through analysing many papers and reports, and experience.