• Title/Summary/Keyword: Big Data Analytics Process

Search Result 51, Processing Time 0.027 seconds

Precision nutrition: approach for understanding intra-individual biological variation (정밀영양: 개인 간 대사 다양성을 이해하기 위한 접근)

  • Kim, Yangha
    • Journal of Nutrition and Health
    • /
    • v.55 no.1
    • /
    • pp.1-9
    • /
    • 2022
  • In the past few decades, great progress has been made on understanding the interaction between nutrition and health status. But despite this wealth of knowledge, health problems related to nutrition continue to increase. This leads us to postulate that the continuing trend may result from a lack of consideration for intra-individual biological variation on dietary responses. Precision nutrition utilizes personal information such as age, gender, lifestyle, diet intake, environmental exposure, genetic variants, microbiome, and epigenetics to provide better dietary advices and interventions. Recent technological advances in the artificial intelligence, big data analytics, cloud computing, and machine learning, have made it possible to process data on a scale and in ways that were previously impossible. A big data platform is built by collecting numerous parameters such as meal features, medical metadata, lifestyle variation, genome diversity and microbiome composition. Sophisticated techniques based on machine learning algorithm can be used to integrate and interpret multiple factors and provide dietary guidance at a personalized or stratified level. The development of a suitable machine learning algorithm would make it possible to suggest a personalized diet or functional food based on analysis of intra-individual metabolic variation. This novel precision nutrition might become one of the most exciting and promising approaches of improving health conditions, especially in the context of non-communicable disease prevention.

Analyzing Learners Behavior and Resources Effectiveness in a Distance Learning Course: A Case Study of the Hellenic Open University

  • Alachiotis, Nikolaos S.;Stavropoulos, Elias C.;Verykios, Vassilios S.
    • Journal of Information Science Theory and Practice
    • /
    • v.7 no.3
    • /
    • pp.6-20
    • /
    • 2019
  • Learning analytics, or educational data mining, is an emerging field that applies data mining methods and tools for the exploitation of data coming from educational environments. Learning management systems, like Moodle, offer large amounts of data concerning students' activity, performance, behavior, and interaction with their peers and their tutors. The analysis of these data can be elaborated to make decisions that will assist stakeholders (students, faculty, and administration) to elevate the learning process in higher education. In this work, the power of Excel is exploited to analyze data in Moodle, utilizing an e-learning course developed for enhancing the information computer technology skills of school teachers in primary and secondary education in Greece. Moodle log files are appropriately manipulated in order to trace daily and weekly activity of the learners concerning distribution of access to resources, forum participation, and quizzes and assignments submission. Learners' activity was visualized for every hour of the day and for every day of the week. The visualization of access to every activity or resource during the course is also obtained. In this fashion teachers can schedule online synchronous lectures or discussions more effectively in order to maximize the learners' participation. Results depict the interest of learners for each structural component, their dedication to the course, their participation in the fora, and how it affects the submission of quizzes and assignments. Instructional designers may take advice and redesign the course according to the popularity of the educational material and learners' dedication. Moreover, the final grade of the learners is predicted according to their previous grades using multiple linear regression and sensitivity analysis. These outcomes can be suitably exploited in order for instructors to improve the design of their courses, faculty to alter their educational methodology, and administration to make decisions that will improve the educational services provided.

Customer Classification and Market Basket Analysis Using K-Means Clustering and Association Rules: Evidence from Distribution Big Data of Korean Retailing Company (군집분석과 연관규칙을 활용한 고객 분류 및 장바구니 분석: 소매 유통 빅데이터를 중심으로)

  • Liu, Run-Qing;Lee, Young-Chan;Mu, Hong-Lei
    • Knowledge Management Research
    • /
    • v.19 no.4
    • /
    • pp.59-76
    • /
    • 2018
  • With the arrival of the big data era, customer data and data mining analysis have gradually dominated the process of Customer Relationship Management (CRM). This phenomenon indicates that customer data along with the use of information techniques (IT) have become the basis for building a successful CRM strategy. However, some companies can not discover valuable information through a large amount of customer data, which leads to the failure of making appropriate business strategy. Without suitable strategies, the companies may lose the competitive advantage or probably go bankrupt. The purpose of this study is to propose CRM strategies by segmenting customers into VIPs and Non-VIPs and identifying purchase patterns using the the VIPs' transaction data and data mining techniques (K-means clustering and association rules) of online shopping mall in Korea. The results of this paper indicate that 227 customers were segmented into VIPs among 1866 customers. And according to 51,080 transactions data of VIPs, home product and women wear are frequently associated with food, which means that the purchase of home product or women wears mainly affect the purchase of food. Therefore, marketing managers of shopping mall should consider these shopping patterns when they build CRM strategy.

Stochastics and Artificial Intelligence-based Analytics of Wastewater Plant Operation

  • Sung-Hyun Kwon;Daechul Cho
    • Clean Technology
    • /
    • v.29 no.2
    • /
    • pp.145-150
    • /
    • 2023
  • Tele-metering systems have been useful tools for managing domestic wastewater treatment plants (WWTP) over the last decade. They mostly generate water quality data for discharged water to ensure that it complies with mandatory regulations and they may be able to produce every operation parameter and additional measurements in the near future. A sub-big data group, comprised of about 150,000 data points from four domestic WWTPs, was ready to be classified and also analyzed to optimize the WWTP process. We used the Statistical Product and Service Solutions (SPSS) 25 package in order to statistically treat the data with linear regression and correlation analysis. The major independent variables for analysis were water temperature, sludge recycle rate, electricity used, and water quality of the influent while the dependent variables representing the water quality of the effluent included the total nitrogen, which is the most emphasized index for discharged flow in plants. The water temperature and consumed electricity showed a strong correlation with the total nitrogen but the other indices' mutual correlations with other variables were found to be fuzzy due to the large errors involved. In addition, a multilayer perceptron analysis method was applied to TMS data along with root mean square error (RMSE) analysis. This study showed that the RMSE in the SS, T-N, and TOC predictions were in the range of 10% to 20%.

A multi-channel CNN based online review helpfulness prediction model (Multi-channel CNN 기반 온라인 리뷰 유용성 예측 모델 개발에 관한 연구)

  • Li, Xinzhe;Yun, Hyorim;Li, Qinglong;Kim, Jaekyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.2
    • /
    • pp.171-189
    • /
    • 2022
  • Online reviews play an essential role in the consumer's purchasing decision-making process, and thus, providing helpful and reliable reviews is essential to consumers. Previous online review helpfulness prediction studies mainly predicted review helpfulness based on the consistency of text and rating information of online reviews. However, there is a limitation in that representation capacity or review text and rating interaction. We propose a CNN-RHP model that effectively learns the interaction between review text and rating information to improve the limitations of previous studies. Multi-channel CNNs were applied to extract the semantic representation of the review text. We also converted rating into independent high-dimensional embedding vectors representing the same dimension as the text vector. The consistency between the review text and the rating information is learned based on element-wise operations between the review text and the star rating vector. To evaluate the performance of the proposed CNN-RHP model in this study, we used online reviews collected from Amazom.com. Experimental results show that the CNN-RHP model indicates excellent performance compared to several benchmark models. The results of this study can provide practical implications when providing services related to review helpfulness on online e-commerce platforms.

Design and Utilization of Connected Data Architecture-based AI Service of Mass Distributed Abyss Storage (대용량 분산 Abyss 스토리지의 CDA (Connected Data Architecture) 기반 AI 서비스의 설계 및 활용)

  • Cha, ByungRae;Park, Sun;Seo, JaeHyun;Kim, JongWon;Shin, Byeong-Chun
    • Smart Media Journal
    • /
    • v.10 no.1
    • /
    • pp.99-107
    • /
    • 2021
  • In addition to the 4th Industrial Revolution and Industry 4.0, the recent megatrends in the ICT field are Big-data, IoT, Cloud Computing, and Artificial Intelligence. Therefore, rapid digital transformation according to the convergence of various industrial areas and ICT fields is an ongoing trend that is due to the development of technology of AI services suitable for the era of the 4th industrial revolution and the development of subdivided technologies such as (Business Intelligence), IA (Intelligent Analytics, BI + AI), AIoT (Artificial Intelligence of Things), AIOPS (Artificial Intelligence for IT Operations), and RPA 2.0 (Robotic Process Automation + AI). This study aims to integrate and advance various machine learning services of infrastructure-side GPU, CDA (Connected Data Architecture) framework, and AI based on mass distributed Abyss storage in accordance with these technical situations. Also, we want to utilize AI business revenue model in various industries.

Analysis of Process-focused, Innovative Assessment Cases in Australia, Singapore, the U.S.A. and Korea (과정중심평가를 위한 국가별 학교 평가혁신 사례분석)

  • Kang, Jihye;Lee, Ji-Yeon
    • Journal of Information Technology Services
    • /
    • v.18 no.3
    • /
    • pp.143-154
    • /
    • 2019
  • The purpose of this study is to explore innovative adaptation of IT systems worldwide to support process-focus assessment. To this end, the study presents four cases from Australia, Singapore, the U.S.A., and Korea to inform educational policy and technology researchers and practitioners. Based on comparing the four chosen cases as benchmarks, the study concluded that IT systems and technologies can expedite and improve school interventions to enhance student learning in terms of time and quality. Also, educational big data and learning analytics can be used to systematically monitor and communicate individual student's progress among school stakeholders (i.e., teachers, students, parents, and administrators). Lastly, the study made some suggestions to support process-focused assessment in schools as following : 1) A more evidence-based, systems approach is needed to integrate the curriculum, instruction, and assessment to bridge the gap between educational policy and school practice; 2) It is critical to create ICT-friendly school environments so that meaningful data could be collected, analyzed, and stored from individual students and school units; 3) Teacher professional development is another area that needs special considerations and support to successfully implement process-focused assessment in schools.

Operational Big Data Analytics platform for Smart Factory (스마트팩토리를 위한 운영빅데이터 분석 플랫폼)

  • Bae, Hyerim;Park, Sanghyuck;Choi, Yulim;Joo, Byeongjun;Sutrisnowati, Riska Asriana;Pulshashi, Iq Reviessay;Putra, Ahmad Dzulfikar Adi;Adi, Taufik Nur;Lee, Sanghwa;Won, Seokrae
    • The Journal of Bigdata
    • /
    • v.1 no.2
    • /
    • pp.9-19
    • /
    • 2016
  • Since ICT convergence became a major issue, German government has carried forward a policy 'Industry 4.0' that triggered ICT convergence with manufacturing. Now this trend gets into our stride. From this facts, we can expect great leap up to quality perfection in low cost. Recently Korean government also enforces policy with 'Manufacturing 3.0' for upgrading Korean manufacturing industry with being accelerated by many related technologies. We, in the paper, developed a custom-made operational big data analysis platform for the implementation of operational intelligence to improve industry capability. Our platform is designed based on spring framework and web. In addition, HDFS and spark architectures helps our system analyze massive data on the field with streamed data processed by process mining algorithm. Extracted knowledge from data will support enhancement of manufacturing performance.

  • PDF

A Study on Adaptive Learning Model for Performance Improvement of Stream Analytics (실시간 데이터 분석의 성능개선을 위한 적응형 학습 모델 연구)

  • Ku, Jin-Hee
    • Journal of Convergence for Information Technology
    • /
    • v.8 no.1
    • /
    • pp.201-206
    • /
    • 2018
  • Recently, as technologies for realizing artificial intelligence have become more common, machine learning is widely used. Machine learning provides insight into collecting large amounts of data, batch processing, and taking final action, but the effects of the work are not immediately integrated into the learning process. In this paper proposed an adaptive learning model to improve the performance of real-time stream analysis as a big business issue. Adaptive learning generates the ensemble by adapting to the complexity of the data set, and the algorithm uses the data needed to determine the optimal data point to sample. In an experiment for six standard data sets, the adaptive learning model outperformed the simple machine learning model for classification at the learning time and accuracy. In particular, the support vector machine showed excellent performance at the end of all ensembles. Adaptive learning is expected to be applicable to a wide range of problems that need to be adaptively updated in the inference of changes in various parameters over time.

A Study on Enhancing Personalization Recommendation Service Performance with CNN-based Review Helpfulness Score Prediction (CNN 기반 리뷰 유용성 점수 예측을 통한 개인화 추천 서비스 성능 향상에 관한 연구)

  • Li, Qinglong;Lee, Byunghyun;Li, Xinzhe;Kim, Jae Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.29-56
    • /
    • 2021
  • Recently, various types of products have been launched with the rapid growth of the e-commerce market. As a result, many users face information overload problems, which is time-consuming in the purchasing decision-making process. Therefore, the importance of a personalized recommendation service that can provide customized products and services to users is emerging. For example, global companies such as Netflix, Amazon, and Google have introduced personalized recommendation services to support users' purchasing decisions. Accordingly, the user's information search cost can reduce which can positively affect the company's sales increase. The existing personalized recommendation service research applied Collaborative Filtering (CF) technique predicts user preference mainly use quantified information. However, the recommendation performance may have decreased if only use quantitative information. To improve the problems of such existing studies, many studies using reviews to enhance recommendation performance. However, reviews contain factors that hinder purchasing decisions, such as advertising content, false comments, meaningless or irrelevant content. When providing recommendation service uses a review that includes these factors can lead to decrease recommendation performance. Therefore, we proposed a novel recommendation methodology through CNN-based review usefulness score prediction to improve these problems. The results show that the proposed methodology has better prediction performance than the recommendation method considering all existing preference ratings. In addition, the results suggest that can enhance the performance of traditional CF when the information on review usefulness reflects in the personalized recommendation service.