• Title/Summary/Keyword: Multi-Model Training

Accuracy of one-step automated orthodontic diagnosis model using a convolutional neural network and lateral cephalogram images with different qualities obtained from nationwide multi-hospitals

  • Yim, Sunjin;Kim, Sungchul;Kim, Inhwan;Park, Jae-Woo;Cho, Jin-Hyoung;Hong, Mihee;Kang, Kyung-Hwa;Kim, Minji;Kim, Su-Jung;Kim, Yoon-Ji;Kim, Young Ho;Lim, Sung-Hoon;Sung, Sang Jin;Kim, Namkug;Baek, Seung-Hak
    • The Korean Journal of Orthodontics / v.52 no.1 / pp.3-19 / 2022
  • Objective: The purpose of this study was to investigate the accuracy of one-step automated orthodontic diagnosis of skeletodental discrepancies using a convolutional neural network (CNN) and lateral cephalogram images with different qualities from nationwide multi-hospitals. Methods: Among 2,174 lateral cephalograms, 1,993 cephalograms from two hospitals were used for training and internal test sets and 181 cephalograms from eight other hospitals were used for an external test set. They were divided into three classification groups according to anteroposterior skeletal discrepancies (Class I, II, and III), vertical skeletal discrepancies (normodivergent, hypodivergent, and hyperdivergent patterns), and vertical dental discrepancies (normal overbite, deep bite, and open bite) as a gold standard. Pre-trained DenseNet-169 was used as a CNN classifier model. Diagnostic performance was evaluated by receiver operating characteristic (ROC) analysis, t-stochastic neighbor embedding (t-SNE), and gradient-weighted class activation mapping (Grad-CAM). Results: In the ROC analysis, the mean area under the curve and the mean accuracy of all classifications were high with both internal and external test sets (all, > 0.89 and > 0.80). In the t-SNE analysis, our model succeeded in creating good separation between three classification groups. Grad-CAM figures showed differences in the location and size of the focus areas between three classification groups in each diagnosis. Conclusions: Since the accuracy of our model was validated with both internal and external test sets, it shows the possible usefulness of a one-step automated orthodontic diagnosis tool using a CNN model. However, it still needs technical improvement in terms of classifying vertical dental discrepancies.
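
The evaluation metrics named in the abstract above (area under the ROC curve and accuracy for a three-class problem) can be illustrated with a minimal sketch. It assumes one-vs-rest averaging of per-class AUCs, which the abstract does not confirm, and the function names and inputs are hypothetical.

```python
def binary_auc(scores, labels):
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as 0.5.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def mean_ovr_auc(prob_rows, true_classes, n_classes=3):
    """Average of one-vs-rest AUCs (an assumed averaging scheme, not the paper's stated one)."""
    aucs = []
    for c in range(n_classes):
        scores = [row[c] for row in prob_rows]                 # predicted probability of class c
        labels = [1 if t == c else 0 for t in true_classes]    # class c versus the rest
        aucs.append(binary_auc(scores, labels))
    return sum(aucs) / n_classes


def accuracy(prob_rows, true_classes):
    """Fraction of samples whose highest-probability class matches the true class."""
    preds = [max(range(len(row)), key=row.__getitem__) for row in prob_rows]
    return sum(p == t for p, t in zip(preds, true_classes)) / len(true_classes)
```

Here `prob_rows` would hold the classifier's per-class probabilities for each cephalogram and `true_classes` the gold-standard class indices (for example 0, 1, 2 for Class I, II, III).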

Virtual Environments for Medical Training: Soft tissue modeling (의료용 훈련을 위한 가상현실에 대한 연구)

  • Kim, Jung
    • Proceedings of the KSME Conference / 2007.05a / pp.372-377 / 2007
  • For more than 2,500 years, surgical teaching has been based on the so called "see one, do one, teach one" paradigm, in which the surgical trainee learns by operating on patients under close supervision of peers and superiors. However, higher demands on the quality of patient care and rising malpractice costs have made it increasingly risky to train on patients. Minimally invasive surgery, in particular, has made it more difficult for an instructor to demonstrate the required manual skills. It has been recognized that, similar to flight simulators for pilots, virtual reality (VR) based surgical simulators promise a safer and more comprehensive way to train manual skills of medical personnel in general and surgeons in particular. One of the major challenges in the development of VR-based surgical trainers is the real-time and realistic simulation of interactions between surgical instruments and biological tissues. It involves multi-disciplinary research areas including soft tissue mechanical behavior, tool-tissue contact mechanics, computer haptics, computer graphics and robotics integrated into VR-based training systems. The research described in this paper addresses the problem of characterizing soft tissue properties for medical virtual environments. A system to measure in vivo mechanical properties of soft tissues was designed, and eleven sets of animal experiments were performed to measure in vivo and in vitro biomechanical properties of porcine intra-abdominal organs. Viscoelastic tissue parameters were then extracted by matching finite element model predictions with the empirical data. Finally, the tissue parameters were combined with geometric organ models segmented from the Visible Human Dataset and integrated into a minimally invasive surgical simulation system consisting of haptic interface devices and a graphic display.

Monitoring and Prediction of Appliances Electricity Usage Using Neural Network (신경회로망을 이용한 가전기기 전기 사용량 모니터링 및 예측)

  • Jung, Kyung-Kwon;Choi, Woo-Seung
    • Journal of the Korea Society of Computer and Information / v.16 no.8 / pp.137-146 / 2011
  • To support increased consumer awareness of energy consumption, we present a new way of monitoring and predicting energy use in electric appliances. The proposed system is built around a common electrical power outlet, called a smart plug, that measures the current passing through a current sensor every 0.5 seconds. To acquire data for training and testing the proposed neural network, weather parameters (average daily temperature, minimum and maximum temperature, humidity, and sunshine hours) were used as input data, and power consumption measured by the smart plug was used as target data. Using the experimental data for training, a neural network model based on the back-propagation algorithm was developed. A multilayer perceptron network was used for the nonlinear mapping between the input and output data. The proposed neural network model predicted power consumption well, with a correlation coefficient of 0.9965 and a prediction mean square error of 0.02033.
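
The two figures of merit reported in this abstract, the correlation coefficient and the mean square error between measured and predicted power consumption, can be computed as in the following minimal sketch. It assumes the Pearson correlation coefficient, which the abstract does not specify, and the function names are hypothetical.

```python
import math


def pearson_r(actual, predicted):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_a * var_p)


def mean_square_error(actual, predicted):
    """Average squared difference between measured and predicted values."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
```

A correlation near 1 together with a small mean square error indicates that the predictions track the measured consumption closely in both trend and magnitude.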

Business Application of Convolutional Neural Networks for Apparel Classification Using Runway Image (합성곱 신경망의 비지니스 응용: 런웨이 이미지를 사용한 의류 분류를 중심으로)

  • Seo, Yian;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems / v.24 no.3 / pp.1-19 / 2018
  • Large amount of data is now available for research and business sectors to extract knowledge from it. This data can be in the form of unstructured data such as audio, text, and image data and can be analyzed by deep learning methodology. Deep learning is now widely used for various estimation, classification, and prediction problems. Especially, fashion business adopts deep learning techniques for apparel recognition, apparel search and retrieval engine, and automatic product recommendation. The core model of these applications is the image classification using Convolutional Neural Networks (CNN). CNN is made up of neurons which learn parameters such as weights while inputs come through and reach outputs. CNN has layer structure which is best suited for image classification as it is comprised of convolutional layer for generating feature maps, pooling layer for reducing the dimensionality of feature maps, and fully-connected layer for classifying the extracted features. However, most of the classification models have been trained using online product image, which is taken under controlled situation such as apparel image itself or professional model wearing apparel. This image may not be an effective way to train the classification model considering the situation when one might want to classify street fashion image or walking image, which is taken in uncontrolled situation and involves people's movement and unexpected pose. Therefore, we propose to train the model with runway apparel image dataset which captures mobility. This will allow the classification model to be trained with far more variable data and enhance the adaptation with diverse query image. To achieve both convergence and generalization of the model, we apply Transfer Learning on our training network. As Transfer Learning in CNN is composed of pre-training and fine-tuning stages, we divide the training step into two. 
First, we pre-train our architecture with a large-scale dataset, ImageNet, which consists of 1.2 million images in 1000 categories including animals, plants, activities, materials, instrumentations, scenes, and foods. We use GoogLeNet as our main architecture because it has achieved high accuracy with efficiency in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC). Second, we fine-tune the network with our own runway image dataset. Because we could not find any existing public runway image dataset, we collected one from Google Image Search, obtaining 2426 images of 32 major fashion brands including Anna Molinari, Balenciaga, Balmain, Brioni, Burberry, Celine, Chanel, Chloe, Christian Dior, Cividini, Dolce and Gabbana, Emilio Pucci, Ermenegildo, Fendi, Giuliana Teso, Gucci, Issey Miyake, Kenzo, Leonard, Louis Vuitton, Marc Jacobs, Marni, Max Mara, Missoni, Moschino, Ralph Lauren, Roberto Cavalli, Sonia Rykiel, Stella McCartney, Valentino, Versace, and Yves Saint Laurent. We perform 10-fold experiments to account for the random generation of training data, and our proposed model achieved an accuracy of 67.2% on the final test. Our research offers several advantages over previous related studies: to the best of our knowledge, no previous study has trained a network for apparel image classification on a runway image dataset. We suggest training the model with images capturing all the possible postures, which we denote as mobility, by using our own runway apparel image dataset. Moreover, by applying Transfer Learning and using the checkpoint and parameters provided by TensorFlow Slim, we reduced the time spent training the classification model to 6 minutes per experiment. This model can be used in many business applications where the query image can be a runway image, product image, or street fashion image. 
To be specific, runway query image can be used for mobile application service during fashion week to facilitate brand search, street style query image can be classified during fashion editorial task to classify and label the brand or style, and website query image can be processed by e-commerce multi-complex service providing item information or recommending similar item.

A GA-based Binary Classification Method for Bankruptcy Prediction (도산예측을 위한 유전 알고리듬 기반 이진분류기법의 개발)

  • Min, Jae-H.;Jeong, Chul-Woo
    • Journal of the Korean Operations Research and Management Science Society / v.33 no.2 / pp.1-16 / 2008
  • The purpose of this paper is to propose a new binary classification method for predicting corporate failure based on a genetic algorithm, and to validate its predictive power through empirical analysis. The proposed method establishes virtual companies representing bankrupt and non-bankrupt companies, measures the similarity between these virtual companies and the subject of prediction, and classifies the subject as either bankrupt or non-bankrupt. The values of the classification variables of the virtual companies and the weights of the variables are determined by a genetic algorithm so as to maximize the hit ratio on the training data set. To test the validity of the proposed method, we compare its prediction accuracy with that of existing methods such as multiple discriminant analysis, logistic regression, decision tree, and artificial neural network, and show that the proposed binary classification method can serve as a promising alternative to the existing methods for bankruptcy prediction.
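
The classification rule and the fitness measure described in this abstract might be sketched as follows. The abstract does not state the similarity measure, so a weighted Euclidean distance is assumed here (smaller distance meaning greater similarity); the genetic search itself is omitted, and all names are hypothetical.

```python
def weighted_distance(x, prototype, weights):
    """Weighted Euclidean distance between a company and a virtual company (assumed measure)."""
    return sum(w * (xi - pi) ** 2 for xi, pi, w in zip(x, prototype, weights)) ** 0.5


def classify(x, bankrupt_proto, healthy_proto, weights):
    """Return 1 (bankrupt) if x is closer to the bankrupt virtual company, else 0."""
    d_bankrupt = weighted_distance(x, bankrupt_proto, weights)
    d_healthy = weighted_distance(x, healthy_proto, weights)
    return 1 if d_bankrupt < d_healthy else 0


def hit_ratio(samples, labels, bankrupt_proto, healthy_proto, weights):
    """Fraction of training samples classified correctly.

    A genetic algorithm would use this value as the fitness of a candidate
    (bankrupt_proto, healthy_proto, weights) triple.
    """
    hits = sum(
        classify(x, bankrupt_proto, healthy_proto, weights) == y
        for x, y in zip(samples, labels)
    )
    return hits / len(samples)
```

In this framing each candidate solution encodes the two virtual companies and the variable weights, and the search keeps the candidates with the highest `hit_ratio` on the training set.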

Real-Time Estimation of TCSC Quantity for Improvement of Transient Stability Energy Margin (과도안정도 에너지 마진 향상을 위한 TCSC 적정치의 실시간 산정)

  • Kim, Soo-Nam;You, Seok-Ku
    • Proceedings of the KIEE Conference / 2000.07a / pp.242-244 / 2000
  • This paper presents a method for real-time estimation of the TCSC quantity needed to enhance the transient stability energy margin of a multi-machine power system using a fuzzy neural network. The method has two parts. The first estimates the energy margin; to set the critical energy, we use the potential energy boundary surface (PEBS) method, which is one of the transient energy function (TEF) methods. The second determines the TCSC quantity and the line into which it is to be injected. The training data for this step are generated with a genetic algorithm. The proposed method is applied to a 6-bus, 7-line, 4-machine model system to show its effectiveness.

A Research Framework for the Success Factors of Information

  • Yoo, Sangjin;Soongoo, Hong
    • Journal of Korea Society of Industrial Information Systems / v.3 no.1 / pp.117-139 / 1998
  • This study is intended to identify the factors affecting successful information warehouse (IW) implementation through the technology acceptance model. As the IW has come to play an important role within organizations, it has become a strategic management tool. However, because building an IWS requires a great amount of financing and multiple periods, managers should consider identifying the variables that predict IWS success. The related research areas, such as TAM, TRA, and innovation diffusion theory, and previous research associated with the EWS success factors are reviewed in this paper. Based on the hypotheses presented, the study will empirically test the relationships between six external variables (user involvement, computer self-efficacy, OLAP characteristics, problem difficulty, user training, and top management support) and system utilization via users' perceptions of ease of use and usefulness. This study seems to be a first attempt in this research area, and its results will provide general guidelines for IWS project managers to enhance the likelihood of system success.

The study on the Algorithm for Design of Fuzzy Logic Controller Using Neural Network (신경회로망을 이용한 퍼지제어기 설계 알고리즘에 관한 연구)

  • 채명기;이상배
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 1996.10a / pp.243-248 / 1996
  • In this paper, a general neural-network-based connectionist model, called the Fuzzy Neural Network (FNN), is proposed for the realization of a fuzzy logic control system. The proposed FNN is a feedforward multi-layered network that integrates the basic elements and functions of a traditional fuzzy logic controller into a connectionist structure with distributed learning abilities. Such an FNN can be constructed from training examples by a learning rule, and the connectionist structure can be trained to develop fuzzy logic rules and find optimal input/output membership functions. Computer simulation examples are presented to illustrate the performance and applicability of the proposed FNN and its associated learning algorithms.

Shape Optimization of Sedimentation Tank Using Response Surface Method (반응면기법을 이용한 침전조의 형상최적설계)

  • Kim, Hong-Min;Choi, Seung-Man;Kim, Kwang-Yong
    • The KSFM Journal of Fluid Machinery / v.7 no.6 s.27 / pp.55-61 / 2004
  • A numerical procedure for optimizing the shape of a three-dimensional sedimentation tank is presented to maximize its sedimentation efficiency. Response-surface-based optimization is used with Reynolds-averaged Navier-Stokes analysis for multi-phase flow. The standard $k$-$\epsilon$ model is used as the turbulence closure. Three design variables are chosen: the ratio of tank height to center feed wall diameter, the blockage ratio of the center feed wall, and the angle of the distributor. Sedimentation efficiency is defined as the objective function. The full-factorial method is used as the design of experiments to determine the training points. The sensitivity of the objective function to each design variable has been evaluated, and optimal values of the design variables have been obtained.
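
A full-factorial design evaluates every combination of the chosen levels of each design variable. The sketch below enumerates such training points for the three design variables named in the abstract; the number of levels and the numeric values are hypothetical placeholders, since the abstract does not report them.

```python
from itertools import product


def full_factorial(levels_by_variable):
    """Return one dict per combination of levels, covering every combination exactly once."""
    names = list(levels_by_variable)
    level_lists = [levels_by_variable[name] for name in names]
    return [dict(zip(names, combo)) for combo in product(*level_lists)]


# Hypothetical levels (three per variable); the paper's actual values are not given.
design_levels = {
    "height_to_feedwall_diameter_ratio": [0.5, 1.0, 1.5],
    "feedwall_blockage_ratio": [0.2, 0.4, 0.6],
    "distributor_angle_deg": [30, 45, 60],
}
training_points = full_factorial(design_levels)
```

With three levels per variable this yields 3 × 3 × 3 = 27 training points, each of which would require one flow analysis before the response surface is fitted.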

A Study on the Discriminate between Magnetizing Inrush and Internal Faults of Power Transformer by Artificial Neural Network (신경회로망에 의한 변압기의 여자돌입과 내부고장 판별에 관한 연구)

  • Park, Chul-Won;Cho, Phil-Hun;Shin, Myong-Chul;Yoon, Sug-Moo
    • Proceedings of the KIEE Conference / 1995.07b / pp.606-609 / 1995
  • This paper presents a method for discriminating between magnetizing inrush and internal faults of a power transformer using artificial neural networks trained on preprocessed fault discriminants. The proposed neural networks are multi-layer perceptrons trained with the back-propagation learning algorithm and a logistic sigmoid activation function. For training and testing, we used relaying signals obtained from an EMTP simulation of a model power system. The proposed neural-network-based transformer protection system never misoperated.
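
The forward pass of a multi-layer perceptron with logistic sigmoid activations, the network type named in this abstract, can be sketched as below. The single hidden layer, the single output unit, and all parameter names are assumptions for illustration; the abstract does not give the network's size or inputs.

```python
import math


def sigmoid(z):
    """Logistic sigmoid activation, mapping any real input into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def mlp_forward(x, hidden_weights, hidden_biases, output_weights, output_bias):
    """One hidden layer with sigmoid units, followed by a single sigmoid output unit."""
    hidden = [
        sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(hidden_weights, hidden_biases)
    ]
    return sigmoid(sum(w * h for w, h in zip(output_weights, hidden)) + output_bias)
```

An output above a chosen threshold (0.5 is a common choice) could be read as one class, such as an internal fault, and an output below it as the other; back-propagation would adjust the weights and biases to reduce the error on the training signals.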
