Search | Korea Science

Facial Expression Classification Using Deep Convolutional Neural Network

Choi, In-kyu;Ahn, Ha-eun;Yoo, Jisang
- Journal of Electrical Engineering and Technology
- /
- v.13 no.1
- /
- pp.485-492
- /
- 2018
In this paper, we propose facial expression recognition using CNN (Convolutional Neural Network), one of the deep learning technologies. The proposed structure has general classification performance for any environment or subject. For this purpose, we collect a variety of databases and organize the database into six expression classes such as 'expressionless', 'happy', 'sad', 'angry', 'surprised' and 'disgusted'. Pre-processing and data augmentation techniques are applied to improve training efficiency and classification performance. In the existing CNN structure, the optimal structure that best expresses the features of six facial expressions is found by adjusting the number of feature maps of the convolutional layer and the number of nodes of fully-connected layer. The experimental results show good classification performance compared to the state-of-the-arts in experiments of the cross validation and the cross database. Also, compared to other conventional models, it is confirmed that the proposed structure is superior in classification performance with less execution time.
https://doi.org/10.5370/JEET.2018.13.1.485 인용 PDF KSCI HTML

A CTR Prediction Approach for Text Advertising Based on the SAE-LR Deep Neural Network

Jiang, Zilong;Gao, Shu;Dai, Wei
- Journal of Information Processing Systems
- /
- v.13 no.5
- /
- pp.1052-1070
- /
- 2017
For the autoencoder (AE) implemented as a construction component, this paper uses the method of greedy layer-by-layer pre-training without supervision to construct the stacked autoencoder (SAE) to extract the abstract features of the original input data, which is regarded as the input of the logistic regression (LR) model, after which the click-through rate (CTR) of the user to the advertisement under the contextual environment can be obtained. These experiments show that, compared with the usual logistic regression model and support vector regression model used in the field of predicting the advertising CTR in the industry, the SAE-LR model has a relatively large promotion in the AUC value. Based on the improvement of accuracy of advertising CTR prediction, the enterprises can accurately understand and have cognition for the needs of their customers, which promotes the multi-path development with high efficiency and low cost under the condition of internet finance.
https://doi.org/10.3745/JIPS.02.0069 인용 PDF KSCI

Deep Residual Networks for Single Image De-snowing (이미지의 눈제거를 위한 심층 Resnet)

Wan, Weiguo;Lee, Hyo Jong
- Annual Conference of KIPS
- /
- 2019.05a
- /
- pp.525-528
- /
- 2019
Atmospheric particle removal is a challenging task and attacks wide interests in computer vision filed. In this paper, we proposed a single image snow removal framework based on deep residual networks. According to the fact that there are various snow sizes in a snow image, the inception module which consists of different filter kernels was adopted to extract multiple resolution features of the input snow image. Except the traditional mean square error loss, the perceptual loss and total variation loss were employed to generate more clean images. Experimental results on synthetic and realistic snow images indicated that the proposed method achieves superior performance in respect of visual perception and objective evaluation.
https://doi.org/10.3745/PKIPS.y2019m05a.525 인용 PDF

Forecasting COVID-19 confirmed cases in South Korea using Spatio-Temporal Graph Neural Networks

Ngoc, Kien Mai;Lee, Minho
- International Journal of Contents
- /
- v.17 no.3
- /
- pp.1-14
- /
- 2021
Since the outbreak of the coronavirus disease 2019 (COVID-19) pandemic, a lot of efforts have been made in the field of data science to help combat against this disease. Among them, forecasting the number of cases of infection is a crucial problem to predict the development of the pandemic. Many deep learning-based models can be applied to solve this type of time series problem. In this research, we would like to take a step forward to incorporate spatial data (geography) with time series data to forecast the cases of region-level infection simultaneously. Specifically, we model a single spatio-temporal graph, in which nodes represent the geographic regions, spatial edges represent the distance between each pair of regions, and temporal edges indicate the node features through time. We evaluate this approach in COVID-19 in a Korean dataset, and we show a decrease of approximately 10% in both RMSE and MAE, and a significant boost to the training speed compared to the baseline models. Moreover, the training efficiency allows this approach to be extended for a large-scale spatio-temporal dataset.
https://doi.org/10.5392/IJoC.2021.17.3.001 인용 PDF KSCI HTML

Road Damage Detection and Classification based on Multi-level Feature Pyramids

Yin, Junru;Qu, Jiantao;Huang, Wei;Chen, Qiqiang
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.15 no.2
- /
- pp.786-799
- /
- 2021
Road damage detection is important for road maintenance. With the development of deep learning, more and more road damage detection methods have been proposed, such as Fast R-CNN, Faster R-CNN, Mask R-CNN and RetinaNet. However, because shallow and deep layers cannot be extracted at the same time, the existing methods do not perform well in detecting objects with fewer samples. In addition, these methods cannot obtain a highly accurate detecting bounding box. This paper presents a Multi-level Feature Pyramids method based on M2det. Because the feature layer has multi-scale and multi-level architecture, the feature layer containing more information and obvious features can be extracted. Moreover, an attention mechanism is used to improve the accuracy of local boundary boxes in the dataset. Experimental results show that the proposed method is better than the current state-of-the-art methods.
https://doi.org/10.3837/tiis.2021.02.022 인용 PDF KSCI HTML

Research Trends on Deep Reinforcement Learning (심층 강화학습 기술 동향)

Jang, S.Y.;Yoon, H.J.;Park, N.S.;Yun, J.K.;Son, Y.S.
- Electronics and Telecommunications Trends
- /
- v.34 no.4
- /
- pp.1-14
- /
- 2019
Recent trends in deep reinforcement learning (DRL) have revealed the considerable improvements to DRL algorithms in terms of performance, learning stability, and computational efficiency. DRL also enables the scenarios that it covers (e.g., partial observability; cooperation, competition, coexistence, and communications among multiple agents; multi-task; decentralized intelligence) to be vastly expanded. These features have cultivated multi-agent reinforcement learning research. DRL is also expanding its applications from robotics to natural language processing and computer vision into a wide array of fields such as finance, healthcare, chemistry, and even art. In this report, we briefly summarize various DRL techniques and research directions.
https://doi.org/10.22648/ETRI.2019.J.340401 인용 PDF

ADD-Net: Attention Based 3D Dense Network for Action Recognition

Man, Qiaoyue;Cho, Young Im
- Journal of the Korea Society of Computer and Information
- /
- v.24 no.6
- /
- pp.21-28
- /
- 2019
Recent years with the development of artificial intelligence and the success of the deep model, they have been deployed in all fields of computer vision. Action recognition, as an important branch of human perception and computer vision system research, has attracted more and more attention. Action recognition is a challenging task due to the special complexity of human movement, the same movement may exist between multiple individuals. The human action exists as a continuous image frame in the video, so action recognition requires more computational power than processing static images. And the simple use of the CNN network cannot achieve the desired results. Recently, the attention model has achieved good results in computer vision and natural language processing. In particular, for video action classification, after adding the attention model, it is more effective to focus on motion features and improve performance. It intuitively explains which part the model attends to when making a particular decision, which is very helpful in real applications. In this paper, we proposed a 3D dense convolutional network based on attention mechanism(ADD-Net), recognition of human motion behavior in the video.
https://doi.org/10.9708/jksci.2019.24.06.021 인용 PDF KSCI HTML

A Video Smoke Detection Algorithm Based on Cascade Classification and Deep Learning

Nguyen, Manh Dung;Kim, Dongkeun;Ro, Soonghwan
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.12
- /
- pp.6018-6033
- /
- 2018
Fires are a common cause of catastrophic personal injuries and devastating property damage. Every year, many fires occur and threaten human lives and property around the world. Providing early important sign for early fire detection, and therefore the detection of smoke is always the first step in fire-alarm systems. In this paper we propose an automatic smoke detection system built on camera surveillance and image processing technologies. The key features used in our algorithm are to detect and track smoke as moving objects and distinguish smoke from non-smoke objects using a convolutional neural network (CNN) model for cascade classification. The results of our experiment, in comparison with those of some earlier studies, show that the proposed algorithm is very effective not only in detecting smoke, but also in reducing false positives.
https://doi.org/10.3837/tiis.2018.12.022 인용 PDF KSCI

No-reference quality assessment of dynamic sports videos based on a spatiotemporal motion model

Kim, Hyoung-Gook;Shin, Seung-Su;Kim, Sang-Wook;Lee, Gi Yong
- ETRI Journal
- /
- v.43 no.3
- /
- pp.538-548
- /
- 2021
This paper proposes an approach to improve the performance of no-reference video quality assessment for sports videos with dynamic motion scenes using an efficient spatiotemporal model. In the proposed method, we divide the video sequences into video blocks and apply a 3D shearlet transform that can efficiently extract primary spatiotemporal features to capture dynamic natural motion scene statistics from the incoming video blocks. The concatenation of a deep residual bidirectional gated recurrent neural network and logistic regression is used to learn the spatiotemporal correlation more robustly and predict the perceptual quality score. In addition, conditional video block-wise constraints are incorporated into the objective function to improve quality estimation performance for the entire video. The experimental results show that the proposed method extracts spatiotemporal motion information more effectively and predicts the video quality with higher accuracy than the conventional no-reference video quality assessment methods.
https://doi.org/10.4218/etrij.2020-0160 인용 PDF KSCI

Evaluations of AI-based malicious PowerShell detection with feature optimizations

Song, Jihyeon;Kim, Jungtae;Choi, Sunoh;Kim, Jonghyun;Kim, Ikkyun
- ETRI Journal
- /
- v.43 no.3
- /
- pp.549-560
- /
- 2021
Cyberattacks are often difficult to identify with traditional signature-based detection, because attackers continually find ways to bypass the detection methods. Therefore, researchers have introduced artificial intelligence (AI) technology for cybersecurity analysis to detect malicious PowerShell scripts. In this paper, we propose a feature optimization technique for AI-based approaches to enhance the accuracy of malicious PowerShell script detection. We statically analyze the PowerShell script and preprocess it with a method based on the tokens and abstract syntax tree (AST) for feature selection. Here, tokens and AST represent the vocabulary and structure of the PowerShell script, respectively. Performance evaluations with optimized features yield detection rates of 98% in both machine learning (ML) and deep learning (DL) experiments. Among them, the ML model with the 3-gram of selected five tokens and the DL model with experiments based on the AST 3-gram deliver the best performance.
https://doi.org/10.4218/etrij.2020-0215 인용 PDF KSCI

Search Result 1,096, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)