• Title/Summary/Keyword: train model

Search Result 1,719, Processing Time 0.023 seconds

A Modular Based Approach on the Development of AI Math Curriculum Model (인공지능 수학교육과정의 모듈화 접근방법 연구)

  • Baik, Ran
    • Journal of Engineering Education Research
    • /
    • v.24 no.3
    • /
    • pp.50-57
    • /
    • 2021
  • Although the mathematics education process in AI education is a very important issue, little cases are reported in developing effective methods on AI and mathematics education at the university level. The universities cover all fields of mathematics in their curriculums, but they lack in connecting and applying the math knowledge to AI in an efficient manner. Students are hardly interested in taking many math courses and it gets worse for the students in humanities, social sciences and arts. But university education is very slow in adapting to rapidly changing new technologies in the real world. AI is a technology that is changing the paradigm of the century, so every one should be familiar with this technology but it requires fundamental math knowledge. It is not fair for the students to study all math subjects and ride on the AI train. We recognize that three key elements, SW knowledge, mathematical knowledge, and domain knowledge, are required in applying AI technology to the real world problems. This study proposes a modular approach of studying mathematics knowledge while connecting the math to different domain problems using AI techniques. We also show a modular curriculum that is developed for using math for AI-driven autonomous driving.

Self-Supervised Rigid Registration for Small Images

  • Ma, Ruoxin;Zhao, Shengjie;Cheng, Samuel
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.1
    • /
    • pp.180-194
    • /
    • 2021
  • For small image registration, feature-based approaches are likely to fail as feature detectors cannot detect enough feature points from low-resolution images. The classic FFT approach's prediction accuracy is high, but the registration time can be relatively long, about several seconds to register one image pair. To achieve real-time and high-precision rigid registration for small images, we apply deep neural networks for supervised rigid transformation prediction, which directly predicts the transformation parameters. We train deep registration models with rigidly transformed CIFAR-10 images and STL-10 images, and evaluate the generalization ability of deep registration models with transformed CIFAR-10 images, STL-10 images, and randomly generated images. Experimental results show that the deep registration models we propose can achieve comparable accuracy to the classic FFT approach for small CIFAR-10 images (32×32) and our LSTM registration model takes less than 1ms to register one pair of images. For moderate size STL-10 images (96×96), FFT significantly outperforms deep registration models in terms of accuracy but is also considerably slower. Our results suggest that deep registration models have competitive advantages over conventional approaches, at least for small images.

A Binary Classifier Using Fully Connected Neural Network for Alzheimer's Disease Classification

  • Prajapati, Rukesh;Kwon, Goo-Rak
    • Journal of Multimedia Information System
    • /
    • v.9 no.1
    • /
    • pp.21-32
    • /
    • 2022
  • Early-stage diagnosis of Alzheimer's Disease (AD) from Cognitively Normal (CN) patients is crucial because treatment at an early stage of AD can prevent further progress in the AD's severity in the future. Recently, computer-aided diagnosis using magnetic resonance image (MRI) has shown better performance in the classification of AD. However, these methods use a traditional machine learning algorithm that requires supervision and uses a combination of many complicated processes. In recent research, the performance of deep neural networks has outperformed the traditional machine learning algorithms. The ability to learn from the data and extract features on its own makes the neural networks less prone to errors. In this paper, a dense neural network is designed for binary classification of Alzheimer's disease. To create a classifier with better results, we studied result of different activation functions in the prediction. We obtained results from 5-folds validations with combinations of different activation functions and compared with each other, and the one with the best validation score is used to classify the test data. In this experiment, features used to train the model are obtained from the ADNI database after processing them using FreeSurfer software. For 5-folds validation, two groups: AD and CN are classified. The proposed DNN obtained better accuracy than the traditional machine learning algorithms and the compared previous studies for AD vs. CN, AD vs. Mild Cognitive Impairment (MCI), and MCI vs. CN classifications, respectively. This neural network is robust and better.

No-Reference Image Quality Assessment based on Quality Awareness Feature and Multi-task Training

  • Lai, Lijing;Chu, Jun;Leng, Lu
    • Journal of Multimedia Information System
    • /
    • v.9 no.2
    • /
    • pp.75-86
    • /
    • 2022
  • The existing image quality assessment (IQA) datasets have a small number of samples. Some methods based on transfer learning or data augmentation cannot make good use of image quality-related features. A No Reference (NR)-IQA method based on multi-task training and quality awareness is proposed. First, single or multiple distortion types and levels are imposed on the original image, and different strategies are used to augment different types of distortion datasets. With the idea of weak supervision, we use the Full Reference (FR)-IQA methods to obtain the pseudo-score label of the generated image. Then, we combine the classification information of the distortion type, level, and the information of the image quality score. The ResNet50 network is trained in the pre-train stage on the augmented dataset to obtain more quality-aware pre-training weights. Finally, the fine-tuning stage training is performed on the target IQA dataset using the quality-aware weights to predicate the final prediction score. Various experiments designed on the synthetic distortions and authentic distortions datasets (LIVE, CSIQ, TID2013, LIVEC, KonIQ-10K) prove that the proposed method can utilize the image quality-related features better than the method using only single-task training. The extracted quality-aware features improve the accuracy of the model.

An Analysis on the Properties of Features against Various Distortions in Deep Neural Networks

  • Kang, Jung Heum;Jeong, Hye Won;Choi, Chang Kyun;Ali, Muhammad Salman;Bae, Sung-Ho;Kim, Hui Yong
    • Journal of Broadcast Engineering
    • /
    • v.26 no.7
    • /
    • pp.868-876
    • /
    • 2021
  • Deploying deep neural network model training performs remarkable performance in the fields of Object detection and Instance segmentation. To train these models, features are first extracted from the input image using a backbone network. The extracted features can be reused by various tasks. Research has been actively conducted to serve various tasks by using these learned features. In this process, standardization discussions about encoding, decoding, and transmission methods are proceeding actively. In this scenario, it is necessary to analyze the response characteristics of features against various distortions that may occur in the data transmission or data compression process. In this paper, experiment was conducted to inject various distortions into the feature in the object recognition task. And analyze the mAP (mean Average Precision) metric between the predicted value output from the neural network and the target value as the intensity of various distortions was increased. Experiments have shown that features are more robust to distortion than images. And this points out that using the feature as transmission means can prevent the loss of information against the various distortions during data transmission and compression process.

Comparative study of text representation and learning for Persian named entity recognition

  • Pour, Mohammad Mahdi Abdollah;Momtazi, Saeedeh
    • ETRI Journal
    • /
    • v.44 no.5
    • /
    • pp.794-804
    • /
    • 2022
  • Transformer models have had a great impact on natural language processing (NLP) in recent years by realizing outstanding and efficient contextualized language models. Recent studies have used transformer-based language models for various NLP tasks, including Persian named entity recognition (NER). However, in complex tasks, for example, NER, it is difficult to determine which contextualized embedding will produce the best representation for the tasks. Considering the lack of comparative studies to investigate the use of different contextualized pretrained models with sequence modeling classifiers, we conducted a comparative study about using different classifiers and embedding models. In this paper, we use different transformer-based language models tuned with different classifiers, and we evaluate these models on the Persian NER task. We perform a comparative analysis to assess the impact of text representation and text classification methods on Persian NER performance. We train and evaluate the models on three different Persian NER datasets, that is, MoNa, Peyma, and Arman. Experimental results demonstrate that XLM-R with a linear layer and conditional random field (CRF) layer exhibited the best performance. This model achieved phrase-based F-measures of 70.04, 86.37, and 79.25 and word-based F scores of 78, 84.02, and 89.73 on the MoNa, Peyma, and Arman datasets, respectively. These results represent state-of-the-art performance on the Persian NER task.

Indirect displacement monitoring of high-speed railway box girders consider bending and torsion coupling effects

  • Wang, Xin;Li, Zhonglong;Zhuo, Yi;Di, Hao;Wei, Jianfeng;Li, Yuchen;Li, Shunlong
    • Smart Structures and Systems
    • /
    • v.28 no.6
    • /
    • pp.827-838
    • /
    • 2021
  • The dynamic displacement is considered to be an important indicator of structural safety, and becomes an indispensable part of Structural Health Monitoring (SHM) system for high-speed railway bridges. This paper proposes an indirect strain based dynamic displacement reconstruction methodology for high-speed railway box girders. For the typical box girders under eccentric train load, the plane section assumption and elementary beam theory is no longer applicable due to the bend-torsion coupling effects. The monitored strain was decoupled into bend and torsion induced strain, pre-trained multi-output support vector regression (M-SVR) model was employed for such decoupling process considering the sensor layout cost and reconstruction accuracy. The decoupled strained based displacement could be reconstructed respectively using box girder plate element analysis and mode superposition principle. For the transformation modal matrix has a significant impact on the reconstructed displacement accuracy, the modal order would be optimized using particle swarm algorithm (PSO), aiming to minimize the ill conditioned degree of transformation modal matrix and the displacement reconstruction error. Numerical simulation and dynamic load testing results show that the reconstructed displacement was in good agreement with the simulated or measured results, which verifies the validity and accuracy of the algorithm proposed in this paper.

Analysis and Prediction of Behavioral Changes in Angelfish Pterophyllum scalare Under Stress Conditions (스트레스 조건에 노출된 Angelfish Pterophyllum scalare의 행동 변화 분석 및 예측)

  • Kim, Yoon-Jae;NO, Hea-Min;Kim, Do-Hyung
    • Korean Journal of Fisheries and Aquatic Sciences
    • /
    • v.54 no.6
    • /
    • pp.965-973
    • /
    • 2021
  • The behavior of angelfish Pterophyllum scalare exposed to low and high temperatures was monitored by video tracking, and information such as the initial speed, changes in speed, and locations of the fish in the tank were analyzed. The water temperature was raised from 26℃ to 36℃ or lowered from 26℃ to 16℃ for 4 h. The control group was maintained at 26℃ for 8 h. The experiment was repeated five times for each group. Machine learning analysis comprising a long short-term memory model was used to train and test the behavioral data (80 s) after pre-processing. Results showed that when the water temperature changed to 36℃ or 16℃, the average speed, changes in speed and fractal dimension value were significantly lower than those in the control group. Machine learning analysis revealed that the accuracy of 80-s video footage data was 87.4%. The machine learning used in this study could distinguish between the optimal temperature group and changing temperature groups with specificity and sensitivity percentages of 86.9% and 87.4%, respectively. Therefore, video tracking technology can be used to effectively analyze fish behavior. In addition, it can be used as an early warning system for fish health in aquariums and fish farms.

Artificial neural fuzzy system and monitoring the process via IoT for optimization synthesis of nano-size polymeric chains

  • Hou, Shihao;Qiao, Luyu;Xing, Lumin
    • Advances in nano research
    • /
    • v.12 no.4
    • /
    • pp.375-386
    • /
    • 2022
  • Synthesis of acrylate-based dispersion resins involves many parameters including temperature, ingredients concentrations, and rate of adding ingredients. Proper controlling of these parameters results in a uniform nano-size chain of polymer on one side and elimination of hazardous residual monomer on the other side. In this study, we aim to screen the process parameters via Internet of Things (IoT) to ensure that, first, the nano-size polymeric chains are in an acceptable range to acquire high adhesion property and second, the remaining hazardous substance concentration is under the minimum value for safety of public and personnel health. In this regard, a set of experiments is conducted to observe the influences of the process parameters on the size and dispersity of polymer chain and residual monomer concentration. The obtained dataset is further used to train an Adaptive Neural network Fuzzy Inference System (ANFIS) to achieve a model that predicts these two output parameters based on the input parameters. Finally, the ANFIS will return values to the automation system for further decisions on parameter adjustment or halting the process to preserve the health of the personnel and final product consumers as well.

A stochastic finite element method for dynamic analysis of bridge structures under moving loads

  • Liu, Xiang;Jiang, Lizhong;Xiang, Ping;Lai, Zhipeng;Zhang, Yuntai;Liu, Lili
    • Structural Engineering and Mechanics
    • /
    • v.82 no.1
    • /
    • pp.31-40
    • /
    • 2022
  • In structural engineering, the material properties of the structures such as elastic modulus, shear modulus, density, and size may not be deterministic and may vary at different locations. The dynamic response analysis of such structures may need to consider these properties as stochastic. This paper introduces a stochastic finite element method (SFEM) approach to analyze moving loads problems. Firstly, Karhunen-Loéve expansion (KLE) is applied for expressing the stochastic field of material properties. Then the mathematical expression of the random field is substituted into the finite element model to formulate the corresponding random matrix. Finally, the statistical moment of the dynamic response is calculated by the point estimation method (PEM). The accuracy and efficiency of the dynamic response obtained from the KLE-PEM are demonstrated by the example of a moving load passing through a simply supported Euler-Bernoulli beam, in which the material properties (including elastic modulus and density) are considered as random fields. The results from the KLE-PEM are compared with those from the Monte Carlo simulation. The results demonstrate that the proposed method of KLE-PEM has high accuracy and efficiency. By using the proposed SFEM, the random vertical deflection of a high-speed railway (HSR) bridge is analyzed by considering the random fields of material properties under the moving load of a train.