• Title/Summary/Keyword: Learning Structure

Search Result 2,189, Processing Time 0.028 seconds

A Hierarchical deep model for food classification from photographs

  • Yang, Heekyung;Kang, Sungyong;Park, Chanung;Lee, JeongWook;Yu, Kyungmin;Min, Kyungha
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.4
    • /
    • pp.1704-1720
    • /
    • 2020
  • Recognizing food from photographs presents many applications for machine learning, computer vision and dietetics, etc. Recent progress of deep learning techniques accelerates the recognition of food in a great scale. We build a hierarchical structure composed of deep CNN to recognize and classify food from photographs. We build a dataset for Korean food of 18 classes, which are further categorized in 4 major classes. Our hierarchical recognizer classifies foods into four major classes in the first step. Each food in the major classes is further classified into the exact class in the second step. We employ DenseNet structure for the baseline of our recognizer. The hierarchical structure provides higher accuracy and F1 score than those from the single-structured recognizer.

Stable Wavelet Based Fuzzy Neural Network for the Identification of Nonlinear Systems (비선형 시스템의 동정을 위한 안정한 웨이블릿 기반 퍼지 뉴럴 네트워크)

  • Oh, Joon-Seop;Park, Jin-Bae;Choi, Yoon-Ho
    • Proceedings of the KIEE Conference
    • /
    • 2005.07d
    • /
    • pp.2681-2683
    • /
    • 2005
  • In this paper, we present the structure of fuzzy neural network(FNN) based on wavelet function, and apply this network structure to the identification of nonlinear systems. For adjusting the shape of membership function and the connection weights, the parameter learning method based on the gradient descent scheme is adopted. And an approach that uses adaptive learning rates is driven via a Lyapunov stability analysis to guarantee the fast convergence. Finally, to verify the efficiency of our network structure. we compare the Identification performance of proposed wavelet based fuzzy neural network(WFNN) with those of the FNN, the wavelet fuzzy model(WFM) and the wavelet neural network(WNN) through the computer simulation.

  • PDF

A Study on the Digital Implementation of Multi-layered Neural Networks for Pattern Recognition (패턴인식을 위한 다층 신경망의 디지털 구현에 관한 연구)

  • 박영석
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.2 no.2
    • /
    • pp.111-118
    • /
    • 2001
  • In this paper, in order to implement the multi-layered perceptron neural network using pure digital logic circuit model, we propose the new logic neuron structure, the digital canonical multi-layered logic neural network structure, and the multi-stage multi-layered logic neural network structure for pattern recognition applications. And we show that the proposed approach provides an incremental additive learning algorithm, which is very simple and effective.

  • PDF

Mathematical Thinking through Different Representations and Analogy

  • Cheng, Chun Chor Litwin
    • Research in Mathematical Education
    • /
    • v.15 no.1
    • /
    • pp.45-57
    • /
    • 2011
  • Mathematical thinking is a core element in mathematics education and classroom learning. This paper wish to investigate how primary four (grade 4) students develop their mathematical thinking through working on tasks in multiplication where greatest products of multiplication are required. The tasks include the format of many digit times one digit, 2 digits times 2 digits up to 3 digits times 3 digits. It is found that the process of mathematical thinking of students depends on their own representation in obtaining the product. And the solution is obtained through a pattern/analogy and "pattern plus analogy" process. This specific learning process provides data for understanding structure and mapping in problem solving. The result shows that analogy allows successful extension of solution structure in the tasks.

U-net and Residual-based Cycle-GAN for Improving Object Transfiguration Performance (물체 변형 성능을 향상하기 위한 U-net 및 Residual 기반의 Cycle-GAN)

  • Kim, Sewoon;Park, Kwang-Hyun
    • The Journal of Korea Robotics Society
    • /
    • v.13 no.1
    • /
    • pp.1-7
    • /
    • 2018
  • The image-to-image translation is one of the deep learning applications using image data. In this paper, we aim at improving the performance of object transfiguration which transforms a specific object in an image into another specific object. For object transfiguration, it is required to transform only the target object and maintain background images. In the existing results, however, it is observed that other parts in the image are also transformed. In this paper, we have focused on the structure of artificial neural networks that are frequently used in the existing methods and have improved the performance by adding constraints to the exiting structure. We also propose the advanced structure that combines the existing structures to maintain their advantages and complement their drawbacks. The effectiveness of the proposed methods are shown in experimental results.

Depth Map Extraction from the Single Image Using Pix2Pix Model (Pix2Pix 모델을 활용한 단일 영상의 깊이맵 추출)

  • Gang, Su Myung;Lee, Joon Jae
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.5
    • /
    • pp.547-557
    • /
    • 2019
  • To extract the depth map from a single image, a number of CNN-based deep learning methods have been performed in recent research. In this study, the GAN structure of Pix2Pix is maintained. this model allows to converge well, because it has the structure of the generator and the discriminator. But the convolution in this model takes a long time to compute. So we change the convolution form in the generator to a depthwise convolution to improve the speed while preserving the result. Thus, the seven down-sizing convolutional hidden layers in the generator U-Net are changed to depthwise convolution. This type of convolution decreases the number of parameters, and also speeds up computation time. The proposed model shows similar depth map prediction results as in the case of the existing structure, and the computation time in case of a inference is decreased by 64%.

Design of CNN with MLP Layer (MLP 층을 갖는 CNN의 설계)

  • Park, Jin-Hyun;Hwang, Kwang-Bok;Choi, Young-Kiu
    • Journal of the Korean Society of Mechanical Technology
    • /
    • v.20 no.6
    • /
    • pp.776-782
    • /
    • 2018
  • After CNN basic structure was introduced by LeCun in 1989, there has not been a major structure change except for more deep network until recently. The deep network enhances the expression power due to improve the abstraction ability of the network, and can learn complex problems by increasing non linearity. However, the learning of a deep network means that it has vanishing gradient or longer learning time. In this study, we proposes a CNN structure with MLP layer. The proposed CNNs are superior to the general CNN in their classification performance. It is confirmed that classification accuracy is high due to include MLP layer which improves non linearity by experiment. In order to increase the performance without making a deep network, it is confirmed that the performance is improved by increasing the non linearity of the network.

Lightweight Single Image Super-Resolution by Channel Split Residual Convolution

  • Liu, Buzhong
    • Journal of Information Processing Systems
    • /
    • v.18 no.1
    • /
    • pp.12-25
    • /
    • 2022
  • In recent years, deep convolutional neural networks have made significant progress in the research of single image super-resolution. However, it is difficult to be applied in practical computing terminals or embedded devices due to a large number of parameters and computational effort. To balance these problems, we propose CSRNet, a lightweight neural network based on channel split residual learning structure, to reconstruct highresolution images from low-resolution images. Lightweight refers to designing a neural network with fewer parameters and a simplified structure for lower memory consumption and faster inference speed. At the same time, it is ensured that the performance of recovering high-resolution images is not degraded. In CSRNet, we reduce the parameters and computation by channel split residual learning. Simultaneously, we propose a double-upsampling network structure to improve the performance of the lightweight super-resolution network and make it easy to train. Finally, we propose a new evaluation metric for the lightweight approaches named 100_FPS. Experiments show that our proposed CSRNet not only speeds up the inference of the neural network and reduces memory consumption, but also performs well on single image super-resolution.

Improving Transformer with Dynamic Convolution and Shortcut for Video-Text Retrieval

  • Liu, Zhi;Cai, Jincen;Zhang, Mengmeng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.7
    • /
    • pp.2407-2424
    • /
    • 2022
  • Recently, Transformer has made great progress in video retrieval tasks due to its high representation capability. For the structure of a Transformer, the cascaded self-attention modules are capable of capturing long-distance feature dependencies. However, the local feature details are likely to have deteriorated. In addition, increasing the depth of the structure is likely to produce learning bias in the learned features. In this paper, an improved Transformer structure named TransDCS (Transformer with Dynamic Convolution and Shortcut) is proposed. A Multi-head Conv-Self-Attention module is introduced to model the local dependencies and improve the efficiency of local features extraction. Meanwhile, the augmented shortcuts module based on a dual identity matrix is applied to enhance the conduction of input features, and mitigate the learning bias. The proposed model is tested on MSRVTT, LSMDC and Activity-Net benchmarks, and it surpasses all previous solutions for the video-text retrieval task. For example, on the LSMDC benchmark, a gain of about 2.3% MdR and 6.1% MnR is obtained over recently proposed multimodal-based methods.

Smart modified repetitive-control design for nonlinear structure with tuned mass damper

  • ZY Chen;Ruei-Yuan Wang;Yahui Meng;Timothy Chen
    • Steel and Composite Structures
    • /
    • v.46 no.1
    • /
    • pp.107-114
    • /
    • 2023
  • A new intelligent adaptive control scheme was proposed that combines observer disturbance-based adaptive control and fuzzy adaptive control for a composite structure with a mass-adjustable damper. The most important advantage is that the control structures do not need to know the uncertainty limits and the interference effect is eliminated. Three adjustable parameters in LMI are used to control the gain of the 2D fuzzy control. Binary performance indices with weighted matrices are constructed to separately evaluate validation and training performance using the revalidation learning function. Determining the appropriate weight matrix balances control and learning efficiency and prevents large gains in control. It is proved that the stability of the control system can be ensured by a linear matrix theory of equality based on Lyapunov's theory. Simulation results show that the multilevel simulation approach combines accuracy with high computational efficiency. The M-TMD system, by slightly reducing critical joint load amplitudes, can significantly improve the overall response of an uncontrolled structure.