• Title/Abstract/Keywords: ART2 neural network

Search results: 136 items (processing time: 0.022 s)

Emotion Recognition and Expression System of Robot Based on 2D Facial Image

  • 이동훈;심귀보
    • 제어로봇시스템학회논문지 / Vol. 13, No. 4 / pp. 371-376 / 2007
  • This paper presents an emotion recognition and expression system for an intelligent robot such as a home robot or a service robot. The robot recognizes emotion from a facial image, using the motion and position of multiple facial features. A tracking algorithm lets the mobile robot follow a moving user, and a facial region detection algorithm removes the skin color of the hands and the background outside the facial region from the captured user image. After normalization, which enlarges or reduces the image according to the distance of the detected facial region and rotates it according to the angle of the face, the mobile robot obtains a facial image of fixed size. A multi-feature selection algorithm is implemented so that the robot can recognize the user's emotion. A multilayer perceptron, a type of artificial neural network (ANN), is used as the pattern recognition technique, with the back-propagation (BP) algorithm as the learning algorithm. The emotion recognized by the robot is expressed on a graphic LCD. Two coordinates are changed according to the number of times each emotion is output by the ANN, and the parameters of the facial elements (eyes, eyebrows, mouth) are changed according to these two coordinates. With the implemented system, complex human emotions are expressed by an avatar on the LCD.
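The size and rotation normalization mentioned in the abstract can be sketched for a single facial landmark as follows. This is a minimal illustration and not the authors' implementation: the function name `normalize_landmark`, the 64-pixel target width, and the convention that the face angle is measured counterclockwise in a y-up coordinate frame are all assumptions.

```python
import math
from typing import Tuple

Point = Tuple[float, float]

def normalize_landmark(point: Point,
                       face_center: Point,
                       face_angle_deg: float,
                       face_width: float,
                       target_width: float = 64.0) -> Point:
    """Map a landmark into an upright face frame of fixed width.

    Assumptions (not from the source): the angle is counterclockwise in a
    y-up frame, and the output is expressed relative to the face center.
    """
    # Scale so that every detected face ends up with the same width.
    scale = target_width / face_width
    # Rotate by the negative of the measured tilt to make the face upright.
    theta = math.radians(-face_angle_deg)
    dx = point[0] - face_center[0]
    dy = point[1] - face_center[1]
    x = scale * (dx * math.cos(theta) - dy * math.sin(theta))
    y = scale * (dx * math.sin(theta) + dy * math.cos(theta))
    return (x, y)
```

For an image coordinate system with the y axis pointing down, the sign of the angle would need to be flipped.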

CCTV-Based Multi-Factor Authentication System

  • Kwon, Byoung-Wook;Sharma, Pradip Kumar;Park, Jong-Hyuk
    • Journal of Information Processing Systems / Vol. 15, No. 4 / pp. 904-919 / 2019
  • Many security systems rely solely on solutions based on artificial intelligence, which are inherently weak. Malicious users can easily manipulate these solutions to gain unlawful access. Some security systems suggest using fingerprint-based solutions, but these can be easily deceived by copying fingerprints with clay. Image-based security is undoubtedly easy to manipulate, but it does not require any special training on the part of the user. In this paper, we propose a multi-factor security framework that authenticates the user in a three-step process. The motivation of the research lies in using commonly available and inexpensive devices, such as on-site CCTV cameras and smartphone cameras, to provide fully secure user authentication. We use technologies such as Argon2 for hashing image features and physically unclonable identification for secure device-server communication. We also discuss the methodological workflow of the proposed multi-factor authentication framework and present its service scenario. Finally, we qualitatively analyze the proposed model and compare it with state-of-the-art methods to evaluate its usability in real-world applications.

Proposing a gamma radiation based intelligent system for simultaneous analyzing and detecting type and amount of petroleum by-products

  • Roshani, Mohammadmehdi;Phan, Giang;Faraj, Rezhna Hassan;Phan, Nhut-Huan;Roshani, Gholam Hossein;Nazemi, Behrooz;Corniani, Enrico;Nazemi, Ehsan
    • Nuclear Engineering and Technology / Vol. 53, No. 4 / pp. 1277-1283 / 2021
  • Operators of poly-pipelines in the petroleum industry need to continuously monitor characteristics of the transferred fluid, such as its type and amount. To this end, this study proposes a dual-energy gamma attenuation technique combined with an artificial neural network (ANN) to simultaneously determine the type and amount of four different petroleum by-products. The detection system is composed of a dual-energy gamma source, containing americium-241 and barium-133 radioisotopes, and one 2.54 cm × 2.54 cm sodium iodide detector that records the transmitted photons. Two signals recorded in the transmission detector, namely the counts under the americium-241 photopeak at 59.5 keV and the counts under the barium-133 photopeak at 356 keV, were applied to the ANN as the two inputs, and the volume percentages of the petroleum by-products were assigned as the outputs.
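As background for why two photopeaks carry information about composition, the standard exponential attenuation law (Beer-Lambert) for a beam crossing several materials can be written as below. The abstract does not state which attenuation model the authors used, so the symbols are assumptions introduced here: $I_0(E_k)$ is the unattenuated count rate at energy $E_k$, $\mu_j(E_k)$ is the linear attenuation coefficient of material $j$, and $x_j$ is the path length through it.

```latex
I(E_k) = I_0(E_k)\,\exp\!\Big(-\sum_{j} \mu_j(E_k)\, x_j\Big),
\qquad E_1 = 59.5\ \text{keV},\quad E_2 = 356\ \text{keV}
```

Because the attenuation coefficients depend on energy differently for different materials, the two measured count rates give two independent constraints. In the study, the ANN learns the mapping from these two counts to the volume percentages rather than inverting this relation analytically.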

Hybrid adaptive neuro-fuzzy inference system method for energy absorption of nano-composite reinforced beam with piezoelectric face-sheets

  • Lili Xiao
    • Advances in nano research / Vol. 14, No. 2 / pp. 141-154 / 2023
  • This study investigates the effects of a viscoelastic foundation on the vibration of a curved-beam structure with clamped and simply supported boundary conditions. A micro-scale laminated composite beam with two piezoelectric face layers and a carbon nanotube reinforced composite core is considered. The whole beam structure rests on a viscoelastic substrate, as normally occurs in practice. Because of the small scale of the structure, non-classical elasticity theory provides more accurate results. Nonlocal strain gradient theory is therefore employed to capture both the nano-scale effects of the carbon nanotubes and the micro-scale effects arising from the overall size of the structure. The equivalent homogeneous properties of the composite core are obtained using the Halpin-Tsai equation. The equations of motion are derived from the energy terms of the beam and a variational principle that minimizes the total energy. The beam is assumed to be clamped at one end and simply supported at the other. Because the equations of motion contain nonlinear terms, the semi-analytical general differential quadrature method is used to solve them. In addition, given the complexity of developing and solving the equations of motion of arches, an artificial neural network is designed and implemented to capture the effects of different parameters on the in-plane vibration of sandwich arches. Finally, the effects of several parameters, including the nonlocal and gradient parameters, geometrical aspect ratios, and substrate constants, on the natural frequency and amplitude are derived. Increasing the nonlocal and gradient parameters is observed to have opposing effects on the amplitude and frequency of vibration of the laminated beam.
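For reference, the classical form of the Halpin-Tsai equation for the effective modulus of a fiber-reinforced composite is shown below. The abstract does not say which variant the author used (carbon nanotube studies often add efficiency factors), so this is the textbook form with assumed symbols: $E_c$, $E_f$, and $E_m$ are the composite, fiber, and matrix moduli, $V_f$ is the fiber volume fraction, and $\xi$ is a geometry-dependent shape parameter.

```latex
\frac{E_c}{E_m} = \frac{1 + \xi\,\eta\,V_f}{1 - \eta\,V_f},
\qquad
\eta = \frac{E_f/E_m - 1}{E_f/E_m + \xi}
```

As a sanity check, $V_f = 0$ gives $E_c = E_m$, and $V_f = 1$ gives $E_c = E_f$ for any $\xi$.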

Hierarchical Recognition of English Calling Card by Using Multiresolution Images and Enhanced RBF Network

  • 김광백;김영주
    • 정보처리학회논문지B / Vol. 10B, No. 4 / pp. 443-450 / 2003
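  • This paper proposes a new hierarchical business card recognition algorithm that extracts characters through hierarchical image processing of multiresolution images of English business cards and recognizes them using an enhanced neural network technique. To shorten the processing time of each stage of the recognition process while improving performance, the hierarchical algorithm separates the input business card image into images of different resolutions and applies each stage to one of them. First, horizontal smearing is applied to the image reduced to 1/3 scale to extract the text-line regions that contain characters, and vertical smearing and contour-tracking masking are then applied to the image reduced to 1/2 scale to extract individual characters from those regions. Finally, the original image, which retains the morphological characteristics of the characters, is used for recognition, and an enhanced RBF network based on ART1 is proposed and applied to recognize the variously shaped characters found on business cards. Experiments on real English business card images confirmed that character extraction and recognition performance improved considerably compared with existing methods.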
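The horizontal smearing step can be illustrated with a small run-length smoothing sketch. The abstract does not give the authors' exact smearing rule, so this uses the common variant that fills short background gaps between foreground pixels. The function names, the binary image layout (lists of 0/1 rows), and the `max_gap` threshold are assumptions made for the example.

```python
from typing import List, Tuple

def horizontal_smear(row: List[int], max_gap: int) -> List[int]:
    """Fill runs of background (0) no longer than max_gap that lie
    between two foreground pixels (1) in a single image row."""
    out = list(row)
    last_fg = None  # index of the most recent foreground pixel
    for i, px in enumerate(row):
        if px == 1:
            if last_fg is not None and 0 < i - last_fg - 1 <= max_gap:
                for j in range(last_fg + 1, i):
                    out[j] = 1
            last_fg = i
    return out

def smear_image(image: List[List[int]], max_gap: int) -> List[List[int]]:
    """Apply horizontal smearing to every row of a binary image."""
    return [horizontal_smear(row, max_gap) for row in image]

def text_line_bands(image: List[List[int]]) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges that contain any foreground pixel."""
    bands = []
    start = None
    for y, row in enumerate(image):
        has_fg = any(row)
        if has_fg and start is None:
            start = y
        elif not has_fg and start is not None:
            bands.append((start, y - 1))
            start = None
    if start is not None:
        bands.append((start, len(image) - 1))
    return bands
```

After smearing, neighboring characters merge into solid horizontal blobs, so candidate text lines can be read off as contiguous bands of non-empty rows. Vertical smearing is the same operation applied along columns.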

Ubiquitous healthcare model based on context recognition

  • 김정원
    • 한국컴퓨터정보학회논문지 / Vol. 15, No. 9 / pp. 129-136 / 2010
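  • Mobile computing, wireless sensor networks, and sensor technology are making ubiquitous computing services a reality and are expected to make medical services more conveniently available to everyone. Because this u-Healthcare service can deliver medical care regardless of time and place, it can improve quality of life. In this paper, we therefore implemented a healthcare service prototype for heart disease patients as a system that realizes this service. The system consists of two parts. The front-end collects various biosignals such as body temperature, blood pressure, blood oxygen concentration, and the cardiac waveform, and the back-end provides medical services based on the collected signals. Simply forwarding biosignals to medical staff amounts to no more than monitoring and cannot satisfy the requirements of a ubiquitous healthcare service. This study therefore presents a healthcare model that uses artificial neural network technology to recognize the current context, remove unnecessary signals, and optimize for the characteristics of the individual. Thorough experiments confirmed that the proposed model satisfies the requirements of ubiquitous services and can provide improved service.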

Dual-stream Co-enhanced Network for Unsupervised Video Object Segmentation

  • Hongliang Zhu;Hui Yin;Yanting Liu;Ning Chen
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 18, No. 4 / pp. 938-958 / 2024
  • Unsupervised Video Object Segmentation (UVOS) is a highly challenging problem in computer vision because no annotation of the target object in the test video is available. The main difficulty is to effectively handle the complicated and changeable motion state of the target object and the confusion caused by similar background objects in the video sequence. In this paper, we propose a novel deep Dual-stream Co-enhanced Network (DC-Net) for UVOS based on bidirectional motion cue refinement and multi-level feature aggregation, which takes full advantage of motion cues and effectively integrates features from different levels to produce high-quality segmentation masks. DC-Net is a dual-stream architecture in which the two streams enhance each other. One is a motion stream with a Motion-cues Refine Module (MRM), which learns from bidirectional optical flow images and produces a fine-grained, complete, and distinctive motion saliency map. The other is an appearance stream with a Multi-level Feature Aggregation Module (MFAM) and a Context Attention Module (CAM), which are designed to integrate the different-level features effectively. Specifically, the motion saliency map obtained by the motion stream is fused with each stage of the decoder in the appearance stream to improve the segmentation, and in turn the segmentation loss in the appearance stream feeds back into the motion stream to enhance the motion refinement. Experimental results on three datasets (Davis2016, VideoSD, SegTrack-v2) demonstrate that DC-Net achieves results comparable with some state-of-the-art methods.

Super Resolution Fusion Scheme for General- and Face Dataset

  • 문준원;김재석
    • 한국멀티미디어학회논문지 / Vol. 22, No. 11 / pp. 1242-1250 / 2019
  • Super resolution aims to convert a low-resolution image with coarse details into a corresponding high-resolution image with refined details. In the past decades, performance has improved greatly thanks to progress in deep learning models. However, a universal solution for various objects is still a challenging issue. We observe that super resolution learned from a general dataset performs poorly on faces. In this paper, we propose a super resolution fusion scheme that works well for both general and face datasets, moving toward a more universal solution. In addition, an object-specific feature extractor is employed for better reconstruction performance. In our experiments, we compare our fusion image with super-resolved images from one of the state-of-the-art deep learning models trained with the DIV2K and FFHQ datasets. Quantitative and qualitative evaluations show that our fusion scheme works well for both datasets. We expect our fusion scheme to be effective on other objects with poor performance, which will lead to universal solutions.

Robust Deep Age Estimation Method Using Artificially Generated Image Set

  • Jang, Jaeyoon;Jeon, Seung-Hyuk;Kim, Jaehong;Yoon, Hosub
    • ETRI Journal / Vol. 39, No. 5 / pp. 643-651 / 2017
  • Human age estimation is one of the key factors in the field of Human-Robot Interaction/Human-Computer Interaction (HRI/HCI). Owing to the development of deep-learning technologies, age recognition has recently been attempted. In general, however, deep learning techniques require a large-scale database, and for age learning with variations, a conventional database is insufficient. For this reason, we propose an age estimation method using artificially generated data. Image data are artificially generated through 3D information, thus solving the problem of shortage of training data, and helping with the training of the deep-learning technique. Augmentation using 3D has advantages over 2D because it creates new images with more information. We use a deep architecture as a pre-trained model, and improve the estimation capacity using artificially augmented training images. The deep architecture can outperform traditional estimation methods, and the improved method showed increased reliability. We have achieved state-of-the-art performance using the proposed method in the Morph-II dataset and have proven that the proposed method can be used effectively using the Adience dataset.

EER-ASSL: Combining Rollback Learning and Deep Learning for Rapid Adaptive Object Detection

  • Ahmed, Minhaz Uddin;Kim, Yeong Hyeon;Rhee, Phill Kyu
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 14, No. 12 / pp. 4776-4794 / 2020
  • We propose a rapid adaptive learning framework for streaming object detection, called EER-ASSL. The method combines expected error reduction (EER) dependent rollback learning and active semi-supervised learning (ASSL) to build a rapidly adaptive CNN detector. Most CNN object detectors are built on the assumption of a static data distribution. However, images are often noisy and biased, and the data distribution is imbalanced in a real-world environment. The proposed method consists of collaborative sampling and EER-ASSL. EER-ASSL uses active learning (AL) and rollback-based semi-supervised learning (SSL). AL allows us to select more informative and representative samples by measuring uncertainty and diversity. SSL divides the selected streaming image samples into bins, and each bin repeatedly transfers the discriminative knowledge of the EER and CNN models to the next bin until convergence and incorporation with the EER rollback learning algorithm is achieved. The EER models provide rapid, short-term, myopic adaptation, and the CNN models provide incremental long-term performance improvement. EER-ASSL can overcome noisy and biased labels in varying data distributions. Extensive experiments show that EER-ASSL obtained 70.9 mAP compared to state-of-the-art technology such as Faster RCNN, SSD300, and YOLOv2.
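The uncertainty part of the active learning step can be illustrated with a generic entropy-based ranking. This is a common stand-in and not the EER-ASSL selection rule, which the abstract does not specify. It also ignores the diversity criterion. The function names and the input format (one class-probability vector per sample) are assumptions.

```python
import math
from typing import List, Sequence

def predictive_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a class-probability vector.
    Higher values mean the model is less certain about the sample."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def select_most_uncertain(batch: Sequence[Sequence[float]], k: int) -> List[int]:
    """Return the indices of the k samples with the highest entropy,
    ordered from most to least uncertain."""
    ranked = sorted(range(len(batch)),
                    key=lambda i: predictive_entropy(batch[i]),
                    reverse=True)
    return ranked[:k]
```

Samples picked this way would then be sent for labeling, while confidently predicted samples could be handled by the semi-supervised part of a pipeline.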