• Title/Summary/Keyword: Bigdata

Search Result 590, Processing Time 0.031 seconds

A Trend Analysis and Book Recommendation through Bigdata Analysis (빅데이터 분석을 통한 트렌드 파악 및 사용자 맞춤 도서 추천)

  • Kyungseo Yoon;Seungshik Kang
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.11a
    • /
    • pp.363-364
    • /
    • 2023
  • 카테고리별 베스트셀러를 통해 트렌드 파악 및 사용자 맞춤형 도서 추천을 위해 카테고리별로 도서 데이터를 수집하고, 대용량 데이터인 위키피디어 데이터를 이용하여 워드임베딩 모델을 구축한다. 도서 데이터에 대한 키워드 분석 및 LDA 주제분석 기법에 의해 카테고리별 핵심 단어 분석을 통해 도서 트렌드를 파악하고, 사용자 맞춤형 도서 정보 제공 및 도서를 추천하는 기능을 구현한다.

Visual Cell : Image Analysis and Visual Retrieval System for Biology Cell Image Bigdata (Visual Cell : 바이오세포 이미지 빅데이터를 위한 이미지 분석 및 시각적 검색 시스템)

  • Park, Beomjun;Jo, Sunhwa;Lee, Suan;Shin, Jiwoon;Yoo, Hyuk Sang;Kim, Jinho
    • The Journal of Bigdata
    • /
    • v.4 no.1
    • /
    • pp.53-61
    • /
    • 2019
  • The extracellular matrix, which provides the structural and biochemical support of surrounding cells, is a cell physiological modulator that controls cell division and differentiation. In the bio sector, the company produces Scapold, a three-dimensional support for tissue engineering, and cultivates stem cells in the produced Scapold to be transplanted into animals to assess tissue regeneration. This depends on components such as collagen in the tissue. Therefore, it is very important to identify the inclusion rate and distribution of components in the tissue, and the data are obtained by analyzing the color of the dyed tissue image. The process from image collection to analysis is costly, and the data collected and analyzed are managed in different formats by different research institutions. Therefore, data integration management and analysis results search are not being performed. In this paper, we establish a database that can manage relevant bigdata in an integrated manner, and propose a bio-image integrated management and retrieval system that can be searched based on color, an important analytical measure in this field of study.

  • PDF

An Experimental Evaluation of Box office Revenue Prediction through Social Bigdata Analysis and Machine Learning (소셜 빅데이터 분석과 기계학습을 이용한 영화흥행예측 기법의 실험적 평가)

  • Chang, Jae-Young
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.17 no.3
    • /
    • pp.167-173
    • /
    • 2017
  • With increased interest in the fourth industrial revolution represented by artificial intelligence, it has been very active to utilize bigdata and machine learning techniques in almost areas of society. Also, such activities have been realized by development of forecasting systems in various applications. Especially in the movie industry, there have been numerous attempts to predict whether they would be success or not. In the past, most of studies considered only the static factors in the process of prediction, but recently, several efforts are tried to utilize realtime social bigdata produced in SNS. In this paper, we propose the prediction technique utilizing various feedback information such as news articles, blogs and reviews as well as static factors of movies. Additionally, we also experimentally evaluate whether the proposed technique could precisely forecast their revenue targeting on the relatively successful movies.

Bigdata Prediction Support Service for Citizen Data Scientists (시민 데이터과학자를 위한 빅데이터 예측 지원 서비스)

  • Chang, Jae-Young
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.19 no.2
    • /
    • pp.151-159
    • /
    • 2019
  • As the era of big data, which is the foundation of the fourth industry, has come, most related industries are developing related solutions focusing on the technologies of data storage, statistical analysis and visualization. However, for the diffusion of bigdata technology, it is necessary to develop the prediction analysis technologies using artificial intelligence. But these advanced technologies are only possible by some experts now called data scientists. For big data-related industries to develop, a non-expert, called a citizen data scientist, should be able to easily access the big data analysis process at low cost because they have insight into their own data. In this paper, we propose a system for analyzing bigdata and building business models with the support of easy-to-use analysis system without knowledge of high-level data science. We also define the necessary components and environment for the prediction analysis system and present the overall service plan.

Predicting win-loss using game data and deriving the importance of subdivided variables (게임데이터를 이용한 승패예측 및 세분화된 변수 중요도 도출 기법)

  • Oh, Min-Ji;Choi, Eun-Seon;Oui, Som Akhamixay;Cho, Wan-Sup
    • The Journal of Bigdata
    • /
    • v.5 no.2
    • /
    • pp.231-240
    • /
    • 2020
  • With the development in the IT industry and the growth in the game industry, user's game data is recorded in seconds according to various plays and options, and a vast amount of game data can be analyzed based on Bigdata. Combined with business, Bigdata is used to discover new values for profit creation in various fields, but it is utilized in the game industry in insufficient ways. In this study, considering the characteristics of the subdivided lines, we constructed a win-loss prediction model for each line using the game data of League of Legends, and derived the importance of variables. This study can contribute to planning of strategies for general game users to get information about team members in advance and increase the win rate by using the record search sites.

Analysis of Social Network Service Data to Estimate Tourist Interests in Green Tour Activities

  • Rah, HyungChul;Park, Sungho;Kim, Miok;Cho, Youngbeen;Yoo, Kwan-Hee
    • International Journal of Contents
    • /
    • v.14 no.3
    • /
    • pp.27-31
    • /
    • 2018
  • Social network service (SNS) data related to green tourism were used to estimate preferred tour sites and users' interests. Keywords related with green tour activities were employed to search the SNS data. SNS data were collected from Korean blogs such as Naver and Daum from June $1^{st}$ to August $31^{st}$ between 2015 and 2017 using text-mining solution. During the study period, seven hundred and five posts were analyzed. Associated words that frequently co-occurred with keywords were classified into different categories depending on the nature of associated words. Associated words included swimming pools and camping sites (location); experience and swimming pools (attribute); and water play and culture (culture/leisure). Our data suggest that SNS users with experience of green tourism in Korea exhibited interest in green tourism with swimming pools, camping sites, experience, water play and/or culture rather than particular popular sites. Based on the findings, it is recommended that preferred facilities such as swimming pools should be provided at green tourism sites to meet the users' needs and to facilitate green tourism.

A Study of Bigdata Platform for Supporting Engineering Services (엔지니어링 서비스 지원을 위한 클라우드 기반 빅데이터 플랫폼 개발 연구)

  • Seo, Dongwoo;Kim, Myungil;Park, Sangjin;Kim, Jaesung;Jeong, Seok Chan
    • The Journal of Bigdata
    • /
    • v.4 no.1
    • /
    • pp.119-127
    • /
    • 2019
  • This study explains how to solve engineering problems easily and efficiently by using cloud based big data platform. To do this, we propose a cloud based big data analysis platform. The application helps users easily create models for data analysis using cloud based big data analysis platform. Analytical models modeled using components are analyzed through an analysis engine. Our platform include pre-processing, analysis, and visualization algorithms required for data analysis. Finally, we show an application of effluent concentration in a sewage treatment process.

  • PDF

Comprehensive Knowledge Archive Network harvester improvement for efficient open-data collection and management

  • Kim, Dasol;Gil, Myeong-Seon;Nguyen, Minh Chau;Won, Heesun;Moon, Yang-Sae
    • ETRI Journal
    • /
    • v.43 no.5
    • /
    • pp.835-855
    • /
    • 2021
  • With the recent increase in data disclosure, the Comprehensive Knowledge Archive Network (CKAN), which is an open-source data distribution platform, is drawing much attention. CKAN is used together with additional extensions, such as Datastore and Datapusher for data management and Harvest and DCAT for data collection. This study derives the problems of CKAN itself and Harvest Extension. First, CKAN causes two problems of data inconsistency and storage space waste for data deletion. Second, Harvest Extension causes three additional problems, namely source deletion that deletes only sources without deleting data themselves, job stop that cannot delete job during data collection, and service interruption that cannot provide service, even if data exist. Based on these observations, we propose herein an improved CKAN that provides a new deletion function solving data inconsistency and storage space waste problems. In addition, we present an improved Harvest Extension solving three problems of the legacy Harvest Extension. We verify the correctness and the usefulness of the improved CKAN and Harvest Extension functions through actual implementation and extensive experiments.

News Article Identification Methods with Fact-Checking Guideline on Artificial Intelligence & Bigdata

  • Kang, Jangmook;Lee, Sangwon
    • International Journal of Advanced Culture Technology
    • /
    • v.9 no.3
    • /
    • pp.352-359
    • /
    • 2021
  • The purpose of this study is to design and build fake news discrimination systems and methods using fact-checking guidelines. In other words, the main content of this study is the system for identifying fake news using Artificial Intelligence -based Fact-checking guidelines. Specifically planned guidelines are needed to determine fake news that is prevalent these days, and the purpose of these guidelines is fact-checking. Identifying fake news immediately after seeing a huge amount of news is inefficient in handling and ineffective in handling. For this reason, we would like to design a fake news identification system using the fact-checking guidelines to create guidelines based on pattern analysis against fake news and real news data. The model will monitor the fact-checking guideline model modeled to determine the Fact-checking target within the news article and news articles shared on social networking service sites. Through this, the model is reflected in the fact-checking guideline model by analyzing news monitoring devices that select suspicious news articles based on their user responses. The core of this research model is a fake news identification device that determines the authenticity of this suspected news article. So, we propose news article identification methods with fact-checking guideline on Artificial Intelligence & Bigdata. This study will help news subscribers determine news that is unclear in its authenticity.

An Effect of O2O Service Users' Motivation on Loyalty through Expectation-Confirmation and Satisfaction (O2O 서비스 이용자의 동기가 기대충족과 만족을 통해 충성도에 미치는 영향)

  • An, Ki-Hoon;Lee, Sin Bok;Lee, Sae Bom;Suh, Yung Ho
    • Journal of Korean Society for Quality Management
    • /
    • v.46 no.4
    • /
    • pp.923-938
    • /
    • 2018
  • Purpose: O2O service are becoming popular in various industries such as food delivery and taxi. This research explores how users' motivation of O2O service influence customer loyalty through expectation-confirmation and satisfaction. this study attempts to explore the motivation factor (i.e. pricing, enjoyment, immediately, social influence) and to empirically examine the relationships between those and users' loyalty to O2O service. Methods: To test the proposed research model, a survey research methodology was used. Paper survey was distributed to O2O service users in Korea. A total of 198 data were used for the analysis. Structural equation modeling was used to test hypotheses. Results: According to our findings, this study found that satisfaction was positively influenced by users' motivation factors. all hypotheses about the effect of motivation on expectation-confirmation were statistically not significant. Conclusion: O2O service providers should consider the results of this study to satisfy users' expectations and satisfaction for building a better O2O market.