• Title/Summary/Keyword: Deep Features

Search Result 1,096, Processing Time 0.025 seconds

A Survey on Image Emotion Recognition

  • Zhao, Guangzhe;Yang, Hanting;Tu, Bing;Zhang, Lei
    • Journal of Information Processing Systems
    • /
    • v.17 no.6
    • /
    • pp.1138-1156
    • /
    • 2021
  • Emotional semantics are the highest level of semantics that can be extracted from an image. Constructing a system that can automatically recognize the emotional semantics from images will be significant for marketing, smart healthcare, and deep human-computer interaction. To understand the direction of image emotion recognition as well as the general research methods, we summarize the current development trends and shed light on potential future research. The primary contributions of this paper are as follows. We investigate the color, texture, shape and contour features used for emotional semantics extraction. We establish two models that map images into emotional space and introduce in detail the various processes in the image emotional semantic recognition framework. We also discuss important datasets and useful applications in the field such as garment image and image retrieval. We conclude with a brief discussion about future research trends.

Bottleneck-based Siam-CNN Algorithm for Object Tracking (객체 추적을 위한 보틀넥 기반 Siam-CNN 알고리즘)

  • Lim, Su-Chang;Kim, Jong-Chan
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.1
    • /
    • pp.72-81
    • /
    • 2022
  • Visual Object Tracking is known as the most fundamental problem in the field of computer vision. Object tracking localize the region of target object with bounding box in the video. In this paper, a custom CNN is created to extract object feature that has strong and various information. This network was constructed as a Siamese network for use as a feature extractor. The input images are passed convolution block composed of a bottleneck layers, and features are emphasized. The feature map of the target object and the search area, extracted from the Siamese network, was input as a local proposal network. Estimate the object area using the feature map. The performance of the tracking algorithm was evaluated using the OTB2013 dataset. Success Plot and Precision Plot were used as evaluation matrix. As a result of the experiment, 0.611 in Success Plot and 0.831 in Precision Plot were achieved.

A Margin-based Face Liveness Detection with Behavioral Confirmation

  • Tolendiyev, Gabit;Lim, Hyotaek;Lee, Byung-Gook
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.13 no.2
    • /
    • pp.187-194
    • /
    • 2021
  • This paper presents a margin-based face liveness detection method with behavioral confirmation to prevent spoofing attacks using deep learning techniques. The proposed method provides a possibility to prevent biometric person authentication systems from replay and printed spoofing attacks. For this work, a set of real face images and fake face images was collected and a face liveness detection model is trained on the constructed dataset. Traditional face liveness detection methods exploit the face image covering only the face regions of the human head image. However, outside of this region of interest (ROI) might include useful features such as phone edges and fingers. The proposed face liveness detection method was experimentally tested on the author's own dataset. Collected databases are trained and experimental results show that the trained model distinguishes real face images and fake images correctly.

Features Of The Implementation Of Inclusive Education: The Role Of The Teacher

  • Klochko, Oksana;Pohoda, Olena;Rybalko, Petro;Kravchenko, Anatoly;Tytovych, Andrii;Kondratenko, Viktoriia
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.11
    • /
    • pp.109-114
    • /
    • 2022
  • The article theoretically analyzed and specified definitions such as: "professional development of personality, competence, professional competence of a teacher". Structural components of professional competence are defined, namely: theoretical involves deep knowledge in the field of special pedagogy, special psychology; technological involves the use of acquired knowledge in practical activities and personal in which important personal characteristics of a special teacher are noted. Criteria and levels of development of professional competence of future special teachers are determined. The article analyzes the peculiarities of the professional activity of a teacher in the conditions of an inclusive educational space, in particular, the special training of a teacher as an integral component of this process. Emphasis is placed on the cooperation of teachers in an inclusive educational institution for the socialization of a child with special needs and her preparation for independent life.

Aiding the operator during novel fault diagnosis

  • Yoon, Wan-C.;Hammer, John-M.
    • Journal of the Ergonomics Society of Korea
    • /
    • v.6 no.1
    • /
    • pp.9-24
    • /
    • 1987
  • The design and philosophy are presented for an intelligent aid for a hyman operator who must diagnose a novel fault in a physical system. A novel fault is defined as one that the operator has not experienced in either real system operation or training. When the operator must diagnose a novel fault, deep reasoning about the behavior of the system components is required. To aid the human operator in this situation, four aiding approaches which provide useful information are proposed. The aiding information is generated by a qualitative, component-level model of the physical system. Both the aid and the human are able to reason causally about the system in a cooperative search for a diagnosis. The aiding features were designed to help the hyman's use of his/her mental model in predicting the normal system behavior, integrating the observations into the actual system behavior, or finding discrepancies between the two. The aid can also have direct access to the operator's hypotheses and run a hypothetical system model. The different aiding approaches will be evaluated by a series of experiments.

  • PDF

Neural Networks-Based Method for Electrocardiogram Classification

  • Maksym Kovalchuk;Viktoriia Kharchenko;Andrii Yavorskyi;Igor Bieda;Taras Panchenko
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.9
    • /
    • pp.186-191
    • /
    • 2023
  • Neural Networks are widely used for huge variety of tasks solution. Machine Learning methods are used also for signal and time series analysis, including electrocardiograms. Contemporary wearable devices, both medical and non-medical type like smart watch, allow to gather the data in real time uninterruptedly. This allows us to transfer these data for analysis or make an analysis on the device, and thus provide preliminary diagnosis, or at least fix some serious deviations. Different methods are being used for this kind of analysis, ranging from medical-oriented using distinctive features of the signal to machine learning and deep learning approaches. Here we will demonstrate a neural network-based approach to this task by building an ensemble of 1D CNN classifiers and a final classifier of selection using logistic regression, random forest or support vector machine, and make the conclusions of the comparison with other approaches.

Multi-scale U-SegNet architecture with cascaded dilated convolutions for brain MRI Segmentation

  • Dayananda, Chaitra;Lee, Bumshik
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.11a
    • /
    • pp.25-28
    • /
    • 2020
  • Automatic segmentation of brain tissues such as WM, GM, and CSF from brain MRI scans is helpful for the diagnosis of many neurological disorders. Accurate segmentation of these brain structures is a very challenging task due to low tissue contrast, bias filed, and partial volume effects. With the aim to improve brain MRI segmentation accuracy, we propose an end-to-end convolutional based U-SegNet architecture designed with multi-scale kernels, which includes cascaded dilated convolutions for the task of brain MRI segmentation. The multi-scale convolution kernels are designed to extract abundant semantic features and capture context information at different scales. Further, the cascaded dilated convolution scheme helps to alleviate the vanishing gradient problem in the proposed model. Experimental outcomes indicate that the proposed architecture is superior to the traditional deep-learning methods such as Segnet, U-net, and U-Segnet and achieves high performance with an average DSC of 93% and 86% of JI value for brain MRI segmentation.

  • PDF

Endoscopic Management of Gastric Subepithelial Tumor (위상피하종양의 내시경적 진단 및 치료)

  • Hyunchul Lim
    • Journal of Digestive Cancer Research
    • /
    • v.10 no.1
    • /
    • pp.16-21
    • /
    • 2022
  • Diagnosis of gastric subepithelial tumors (SETs) is sometimes difficult with conventional endoscopy or tissue sampling with standard biopsy, so non-invasive imaging modalities such as endoscopic ultrasound (EUS) and computed tomography are used to evaluate the characteristics of SETs features (size, location, originating layer, echogenicity, shape). However imaging modalities alone is not able to distinguish among all types of SETs, so histology is the gold standard for obtaining the final diagnosis. For tissue sampling, mucosal cutting biopsy and mucosal incision-assisted biopsy and EUS-guided fine-needle aspiration or biopsy (EUS-FNA or EUS-FNB) is commonly recommended. Endoscopic mucosal resection (EMR) and endoscopic submucosal dissection (ESD) are used for resection of SETs involving the mucosal and superficial submucosal layers, could not treat adequately and safely the SETs involving the deep mucosa and muscularis propria. Submucosal tunneling endoscopic resection (STER) and endoscopic full-thickness resection (EFTR) is used as a therapeutic option for the treatment of SETs with the development of reliable endoscopic closure techniques and tools.

Motion classification using distributional features of 3D skeleton data

  • Woohyun Kim;Daeun Kim;Kyoung Shin Park;Sungim Lee
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.6
    • /
    • pp.551-560
    • /
    • 2023
  • Recently, there has been significant research into the recognition of human activities using three-dimensional sequential skeleton data captured by the Kinect depth sensor. Many of these studies employ deep learning models. This study introduces a novel feature selection method for this data and analyzes it using machine learning models. Due to the high-dimensional nature of the original Kinect data, effective feature extraction methods are required to address the classification challenge. In this research, we propose using the first four moments as predictors to represent the distribution of joint sequences and evaluate their effectiveness using two datasets: The exergame dataset, consisting of three activities, and the MSR daily activity dataset, composed of ten activities. The results show that the accuracy of our approach outperforms existing methods on average across different classifiers.

Discrimination model using denoising autoencoder-based majority vote classification for reducing false alarm rate

  • Heonyong Lee;Kyungtak Yu;Shiu Kim
    • Nuclear Engineering and Technology
    • /
    • v.55 no.10
    • /
    • pp.3716-3724
    • /
    • 2023
  • Loose parts monitoring and detecting alarm type in real Nuclear Power Plant have challenges such as background noise, insufficient alarm data, and difficulty of distinction between alarm data that occur during start and stop. Although many signal processing methods and alarm determination algorithms have been developed, it is not easy to determine valid alarm and extract the meaning data from alarm signal including background noise. To address these issues, this paper proposes a denoising autoencoder-based majority vote classification. Training and test data are prepared by acquiring alarm data from real NPP and simulation facility for data augmentation, and noisy data is reproduced by adding Gaussian noise. Using DAEs with 3, 5, 7, and 9 layers, features are extracted for each model and classified into neural networks. Finally, the results obtained from each DAE are classified by majority voting. Also, through comparison with other methods, the accuracy and the false alarm rate are compared, and the excellence of the proposed method is confirmed.