• Title/Summary/Keyword: Learning data set

Search Result 1,101, Processing Time 0.026 seconds

A Study on the Classification of Variables Affecting Smartphone Addiction in Decision Tree Environment Using Python Program

  • Kim, Seung-Jae
    • International journal of advanced smart convergence
    • /
    • v.11 no.4
    • /
    • pp.68-80
    • /
    • 2022
  • Since the launch of AI, technology development to implement complete and sophisticated AI functions has continued. In efforts to develop technologies for complete automation, Machine Learning techniques and deep learning techniques are mainly used. These techniques deal with supervised learning, unsupervised learning, and reinforcement learning as internal technical elements, and use the Big-data Analysis method again to set the cornerstone for decision-making. In addition, established decision-making is being improved through subsequent repetition and renewal of decision-making standards. In other words, big data analysis, which enables data classification and recognition/recognition, is important enough to be called a key technical element of AI function. Therefore, big data analysis itself is important and requires sophisticated analysis. In this study, among various tools that can analyze big data, we will use a Python program to find out what variables can affect addiction according to smartphone use in a decision tree environment. We the Python program checks whether data classification by decision tree shows the same performance as other tools, and sees if it can give reliability to decision-making about the addictiveness of smartphone use. Through the results of this study, it can be seen that there is no problem in performing big data analysis using any of the various statistical tools such as Python and R when analyzing big data.

Design of a machine learning based mobile application with GPS, mobile sensors, public GIS: real time prediction on personal daily routes

  • Shin, Hyunkyung
    • International journal of advanced smart convergence
    • /
    • v.7 no.4
    • /
    • pp.27-39
    • /
    • 2018
  • Since the global positioning system (GPS) has been included in mobile devices (e.g., for car navigation, in smartphones, and in smart watches), the impact of personal GPS log data on daily life has been unprecedented. For example, such log data have been used to solve public problems, such as mass transit traffic patterns, finding optimum travelers' routes, and determining prospective business zones. However, a real-time analysis technique for GPS log data has been unattainable due to theoretical limitations. We introduced a machine learning model in order to resolve the limitation. In this paper presents a new, three-stage real-time prediction model for a person's daily route activity. In the first stage, a machine learning-based clustering algorithm is adopted for place detection. The training data set was a personal GPS tracking history. In the second stage, prediction of a new person's transient mode is studied. In the third stage, to represent the person's activity on those daily routes, inference rules are applied.

A Study on Defect Prediction through Real-time Monitoring of Die-Casting Process Equipment (주조공정 설비에 대한 실시간 모니터링을 통한 불량예측에 대한 연구)

  • Chulsoon Park;Heungseob Kim
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.45 no.4
    • /
    • pp.157-166
    • /
    • 2022
  • In the case of a die-casting process, defects that are difficult to confirm by visual inspection, such as shrinkage bubbles, may occur due to an error in maintaining a vacuum state. Since these casting defects are discovered during post-processing operations such as heat treatment or finishing work, they cannot be taken in advance at the casting time, which can cause a large number of defects. In this study, we propose an approach that can predict the occurrence of casting defects by defect type using machine learning technology based on casting parameter data collected from equipment in the die casting process in real time. Die-casting parameter data can basically be collected through the casting equipment controller. In order to perform classification analysis for predicting defects by defect type, labeling of casting parameters must be performed. In this study, first, the defective data set is separated by performing the primary clustering based on the total defect rate obtained during the post-processing. Second, the secondary cluster analysis is performed using the defect rate by type for the separated defect data set, and the labeling task is performed by defect type using the cluster analysis result. Finally, a classification learning model is created by collecting the entire labeled data set, and a real-time monitoring system for defect prediction using LabView and Python was implemented. When a defect is predicted, notification is performed so that the operator can cope with it, such as displaying on the monitoring screen and alarm notification.

Identification of Mesiodens Using Machine Learning Application in Panoramic Images (기계 학습 어플리케이션을 활용한 파노라마 영상에서의 정중 과잉치 식별)

  • Seung, Jaegook;Kim, Jaegon;Yang, Yeonmi;Lim, Hyungbin;Le, Van Nhat Thang;Lee, Daewoo
    • Journal of the korean academy of Pediatric Dentistry
    • /
    • v.48 no.2
    • /
    • pp.221-228
    • /
    • 2021
  • The aim of this study was to evaluate the use of easily accessible machine learning application to identify mesiodens, and to compare the ability to identify mesiodens between trained model and human. A total of 1604 panoramic images (805 images with mesiodens, 799 images without mesiodens) of patients aged 5 - 7 years were used for this study. The model used for machine learning was Google's teachable machine. Data set 1 was used to train model and to verify the model. Data set 2 was used to compare the ability between the learning model and human group. As a result of data set 1, the average accuracy of the model was 0.82. After testing data set 2, the accuracy of the model was 0.78. From the resident group and the student group, the accuracy was 0.82, 0.69. This study developed a model for identifying mesiodens using panoramic radiographs of children in primary and early mixed dentition. The classification accuracy of the model was lower than that of the resident group. However, the classification accuracy (0.78) was higher than that of dental students (0.69), so it could be used to assist the diagnosis of mesiodens for non-expert students or general dentists.

Comparison of Association Rule Learning and Subgroup Discovery for Mining Traffic Accident Data (교통사고 데이터의 마이닝을 위한 연관규칙 학습기법과 서브그룹 발견기법의 비교)

  • Kim, Jeongmin;Ryu, Kwang Ryel
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.4
    • /
    • pp.1-16
    • /
    • 2015
  • Traffic accident is one of the major cause of death worldwide for the last several decades. According to the statistics of world health organization, approximately 1.24 million deaths occurred on the world's roads in 2010. In order to reduce future traffic accident, multipronged approaches have been adopted including traffic regulations, injury-reducing technologies, driving training program and so on. Records on traffic accidents are generated and maintained for this purpose. To make these records meaningful and effective, it is necessary to analyze relationship between traffic accident and related factors including vehicle design, road design, weather, driver behavior etc. Insight derived from these analysis can be used for accident prevention approaches. Traffic accident data mining is an activity to find useful knowledges about such relationship that is not well-known and user may interested in it. Many studies about mining accident data have been reported over the past two decades. Most of studies mainly focused on predict risk of accident using accident related factors. Supervised learning methods like decision tree, logistic regression, k-nearest neighbor, neural network are used for these prediction. However, derived prediction model from these algorithms are too complex to understand for human itself because the main purpose of these algorithms are prediction, not explanation of the data. Some of studies use unsupervised clustering algorithm to dividing the data into several groups, but derived group itself is still not easy to understand for human, so it is necessary to do some additional analytic works. Rule based learning methods are adequate when we want to derive comprehensive form of knowledge about the target domain. It derives a set of if-then rules that represent relationship between the target feature with other features. Rules are fairly easy for human to understand its meaning therefore it can help provide insight and comprehensible results for human. Association rule learning methods and subgroup discovery methods are representing rule based learning methods for descriptive task. These two algorithms have been used in a wide range of area from transaction analysis, accident data analysis, detection of statistically significant patient risk groups, discovering key person in social communities and so on. We use both the association rule learning method and the subgroup discovery method to discover useful patterns from a traffic accident dataset consisting of many features including profile of driver, location of accident, types of accident, information of vehicle, violation of regulation and so on. The association rule learning method, which is one of the unsupervised learning methods, searches for frequent item sets from the data and translates them into rules. In contrast, the subgroup discovery method is a kind of supervised learning method that discovers rules of user specified concepts satisfying certain degree of generality and unusualness. Depending on what aspect of the data we are focusing our attention to, we may combine different multiple relevant features of interest to make a synthetic target feature, and give it to the rule learning algorithms. After a set of rules is derived, some postprocessing steps are taken to make the ruleset more compact and easier to understand by removing some uninteresting or redundant rules. We conducted a set of experiments of mining our traffic accident data in both unsupervised mode and supervised mode for comparison of these rule based learning algorithms. Experiments with the traffic accident data reveals that the association rule learning, in its pure unsupervised mode, can discover some hidden relationship among the features. Under supervised learning setting with combinatorial target feature, however, the subgroup discovery method finds good rules much more easily than the association rule learning method that requires a lot of efforts to tune the parameters.

CNN deep learning based estimation of damage locations of a PSC bridge using static strain data (정적 변형률 데이터를 사용한 CNN 딥러닝 기반 PSC 교량 손상위치 추정)

  • Han, Man-Seok;Shin, Soo-Bong;An, Hyo-Joon
    • Journal of KIBIM
    • /
    • v.10 no.2
    • /
    • pp.21-28
    • /
    • 2020
  • As the number of aging bridges increases, more studies are being conducted on developing effective and reliable methods for the assessment and maintenance of bridges. With the advancement in new sensing systems and data learning techniques through AI technology, there is growing interests in how to evaluate bridges using these advanced techniques. This paper presents a CNN(Convolution Neural Network) deep learning based technique for evaluating the damage existence and for estimating the damage location in PSC bridges using static strain data. Simulation studies were conducted to investigate the proposed method with error analysis. Damage was simulated as the reduction in the stiffness of a finite element. A data learning model was constructed by applying the CNN technique as a type of deep learning. The damage status and its location were estimated using data set built through simulation. It was assumed that the strain gauges were installed in a regular interval under the PSC bridge girders. In order to increase the accuracy in evaluating damage, the squared error between the intact and measured strains are computed and applied for training the data model. Considering the damage occurring near the supports, the results of error analysis were compared according to whether strain data near the supports were included.

Study on the Surface Defect Classification of Al 6061 Extruded Material By Using CNN-Based Algorithms (CNN을 이용한 Al 6061 압출재의 표면 결함 분류 연구)

  • Kim, S.B.;Lee, K.A.
    • Transactions of Materials Processing
    • /
    • v.31 no.4
    • /
    • pp.229-239
    • /
    • 2022
  • Convolution Neural Network(CNN) is a class of deep learning algorithms and can be used for image analysis. In particular, it has excellent performance in finding the pattern of images. Therefore, CNN is commonly applied for recognizing, learning and classifying images. In this study, the surface defect classification performance of Al 6061 extruded material using CNN-based algorithms were compared and evaluated. First, the data collection criteria were suggested and a total of 2,024 datasets were prepared. And they were randomly classified into 1,417 learning data and 607 evaluation data. After that, the size and quality of the training data set were improved using data augmentation techniques to increase the performance of deep learning. The CNN-based algorithms used in this study were VGGNet-16, VGGNet-19, ResNet-50 and DenseNet-121. The evaluation of the defect classification performance was made by comparing the accuracy, loss, and learning speed using verification data. The DenseNet-121 algorithm showed better performance than other algorithms with an accuracy of 99.13% and a loss value of 0.037. This was due to the structural characteristics of the DenseNet model, and the information loss was reduced by acquiring information from all previous layers for image identification in this algorithm. Based on the above results, the possibility of machine vision application of CNN-based model for the surface defect classification of Al extruded materials was also discussed.

Wave Prediction in a Harbour using Deep Learning with Offshore Data (딥러닝을 이용한 외해 해양기상자료로부터의 항내파고 예측)

  • Lee, Geun Se;Jeong, Dong Hyeon;Moon, Yong Ho;Park, Won Kyung;Chae, Jang Won
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.33 no.6
    • /
    • pp.367-373
    • /
    • 2021
  • In this study, deep learning model was set up to predict the wave heights inside a harbour. Various machine learning techniques were applied to the model in consideration of the transformation characteristics of offshore waves while propagating into the harbour. Pohang New Port was selected for model application, which had a serious problem of unloading due to swell and has lots of available wave data. Wave height, wave period, and wave direction at offshore sites and wave heights inside the harbour were used for the model input and output, respectively, and then the model was trained using deep learning method. By considering the correlation between the time series wave data of offshore and inside the harbour, the data set was separated into prevailing wave directions as a pre-processing method. As a result, It was confirmed that accuracy and stability of the model prediction are considerably increased.

Mask Wearing Detection Using OpenCV Training Data (OpenCV 학습 데이터를 이용한 마스크 착용 감지)

  • Snowberger, Aaron Daniel;Lee, Choong Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.10a
    • /
    • pp.303-304
    • /
    • 2021
  • It is an important issue to detect automatically whether a mask is worn or not for corona prevention. It is known that mask wearing detection can be solved by learning the face data set. However, the search for whether a person is wearing a mask can be detected in a simpler way using OpenCV. In this paper, we describe that it is possible to easily detect whether a single person is wearing a mask or not with a general PC camera using OpenCV learning data results and simple OpenCV functions. Through experiments, the proposed method was shown to be effective.

  • PDF

A study on categories of questions when holding counselling on learning math in regards to grounded theoretical approaches (근거이론적 접근에 따른 수학학습 상담 발문 유형에 대한 연구)

  • Ko, Ho Kyoung;Kim, Dong Won;Lee, Hwan Chul;Choi, Tae Young
    • Journal of the Korean School Mathematics Society
    • /
    • v.17 no.1
    • /
    • pp.73-92
    • /
    • 2014
  • This study was performed in part with the task to find measures to improve the defining characteristics of feelings, value, interest, self-efficacy, and others aspects in regards to learning math among elementary and middle school students. For this study, it was essential to understand the appropriate questions that are needed to be asked during a consultation at a math clinic, for students that are having a hard time learning math. As a method for performing this study, the content of scheduled counseling over 2 years from a math clinic were collected and the questions that were given and taken were analyzed in order to figure out the types of questions needed in order to effectively examine students that are facing difficulty with learning math. The analysis was performed using Grounded theory analysis by Strauss & Corbin(1998) and went through the process of open coding, axial coding, and selective coding. For the paradigm in the categorical analysis stage, 'attitude towards learning math' was set as the casual condition, 'feelings towards learning math' was set as the contextual condition, 'confidence in one's ability to learn math' was set as the phenomenon, 'individual tendencies when learning math' was set as the intervening condition, 'self-management of learning math' was set as the action/interaction strategy, and 'method of learning' was set as the consequence. Through this, the questions that appeared during counseling were linked into categories and subcategories. Through this process, 81 concepts were deducted, which were grouped into 31 categories. I believe that this data can be used as grounded theory for standardization of consultation in clinics.

  • PDF