Search | Korea Science

Multi-band Approach to Deep Learning-Based Artificial Stereo Extension

Jeon, Kwang Myung;Park, Su Yeon;Chun, Chan Jun;Park, Nam In;Kim, Hong Kook
- ETRI Journal
- /
- v.39 no.3
- /
- pp.398-405
- /
- 2017
In this paper, an artificial stereo extension method that creates stereophonic sound from a mono sound source is proposed. The proposed method first trains deep neural networks (DNNs) that model the nonlinear relationship between the dominant and residual signals of the stereo channel. In the training stage, the band-wise log spectral magnitude and unwrapped phase of both the dominant and residual signals are utilized to model the nonlinearities of each sub-band through deep architecture. From that point, stereo extension is conducted by estimating the residual signal that corresponds to the input mono channel signal with the trained DNN model in a sub-band domain. The performance of the proposed method was evaluated using a log spectral distortion (LSD) measure and multiple stimuli with a hidden reference and anchor (MUSHRA) test. The results showed that the proposed method provided a lower LSD and higher MUSHRA score than conventional methods that use hidden Markov models and DNN with full-band processing.
https://doi.org/10.4218/etrij.17.0116.0773 인용 PDF KSCI

Android malicious code Classification using Deep Belief Network

Shiqi, Luo;Shengwei, Tian;Long, Yu;Jiong, Yu;Hua, Sun
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.1
- /
- pp.454-475
- /
- 2018
This paper presents a novel Android malware classification model planned to classify and categorize Android malicious code at Drebin dataset. The amount of malicious mobile application targeting Android based smartphones has increased rapidly. In this paper, Restricted Boltzmann Machine and Deep Belief Network are used to classify malware into families of Android application. A texture-fingerprint based approach is proposed to extract or detect the feature of malware content. A malware has a unique "image texture" in feature spatial relations. The method uses information on texture image extracted from malicious or benign code, which are mapped to uncompressed gray-scale according to the texture image-based approach. By studying and extracting the implicit features of the API call from a large number of training samples, we get the original dynamic activity features sets. In order to improve the accuracy of classification algorithm on the features selection, on the basis of which, it combines the implicit features of the texture image and API call in malicious code, to train Restricted Boltzmann Machine and Back Propagation. In an evaluation with different malware and benign samples, the experimental results suggest that the usability of this method---using Deep Belief Network to classify Android malware by their texture images and API calls, it detects more than 94% of the malware with few false alarms. Which is higher than shallow machine learning algorithm clearly.
https://doi.org/10.3837/tiis.2018.01.022 인용 PDF KSCI

FAGON: Fake News Detection Model Using Grammatical Transformation on Deep Neural Network

Seo, Youngkyung;Han, Seong-Soo;Jeon, You-Boo;Jeong, Chang-Sung
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.13 no.10
- /
- pp.4958-4970
- /
- 2019
As technology advances, the amount of fake news is increasing more and more by various reasons such as political issues and advertisement exaggeration. However, there have been very few research works on fake news detection, especially which uses grammatical transformation on deep neural network. In this paper, we shall present a new Fake News Detection Model, called FAGON(Fake news detection model using Grammatical transformation On deep Neural network) which determines efficiently if the proposition is true or not for the given article by learning grammatical transformation on neural network. Especially, our model focuses the Korean language. It consists of two modules: sentence generator and classification. The former generates multiple sentences which have the same meaning as the proposition, but with different grammar by training the grammatical transformation. The latter classifies the proposition as true or false by training with vectors generated from each sentence of the article and the multiple sentences obtained from the former model respectively. We shall show that our model is designed to detect fake news effectively by exploiting various grammatical transformation and proper classification structure.
https://doi.org/10.3837/tiis.2019.10.008 인용 PDF KSCI HTML

Modeling of Suspended Sediment Transport Using Deep Neural Networks (심층 신경망 기법을 통한 부유사 이동 모델링)

Bong, Tae-Ho;Son, Young-Hwan;Kim, Kyu-Sun;Kim, Dong-Geun
- Journal of The Korean Society of Agricultural Engineers
- /
- v.60 no.4
- /
- pp.83-91
- /
- 2018
Land reclamation, coastal construction, coastline extension and port construction, all of which involve dredging, are increasingly required to meet the growing economic and societal demands in the coastal zone. During the land reclamation, a portion of landfills are lost from the desired location due to a variety of causes, and therefore prediction of sediment transport is very important for economical and efficient land reclamation management. In this study, laboratory disposal tests were performed using an open channel, and suspended sediment transport was analyzed according to flow velocity and grain size. The relationships between the average and standard deviation of the deposition distance and the flow velocity were almost linear, and the relationships between the average and standard deviation of deposition distance and the grain size were found to have high non-linearity in the form of power law. The deposition distribution of sediments was demonstrated to have log-normal distributions regardless of the flow velocity. Based on the experimental results, modeling of suspended sediment transport was performed using deep neural network, one of deep learning techniques, and the deposition distribution was reproduced through log-normal distribution.
https://doi.org/10.5389/KSAE.2018.60.4.083 인용 PDF KSCI

Development of Real-Time Objects Segmentation for Dual-Camera Synthesis in iOS (iOS 기반 실시간 객체 분리 및 듀얼 카메라 합성 개발)

Jang, Yoo-jin;Kim, Ji-yeong;Lee, Ju-hyun;Hwang, Jun
- Journal of Internet Computing and Services
- /
- v.22 no.3
- /
- pp.37-43
- /
- 2021
In this paper, we study how objects from front and back cameras can be recognized in real time in a mobile environment to segment regions of object pixels and synthesize them through image processing. To this work, we applied DeepLabV3 machine learning model to dual cameras provided by Apple's iOS. We also propose methods using Core Image and Core Graphics libraries from Apple for image synthesis and postprocessing. Furthermore, we improved CPU usage than previous works and compared the throughput rates and results of Depth and DeepLabV3. Finally, We also developed a camera application using these two methods.
https://doi.org/10.7472/jksii.2021.22.3.37 인용 PDF KSCI HTML

Electroencephalography-based imagined speech recognition using deep long short-term memory network

Agarwal, Prabhakar;Kumar, Sandeep
- ETRI Journal
- /
- v.44 no.4
- /
- pp.672-685
- /
- 2022
This article proposes a subject-independent application of brain-computer interfacing (BCI). A 32-channel Electroencephalography (EEG) device is used to measure imagined speech (SI) of four words (sos, stop, medicine, washroom) and one phrase (come-here) across 13 subjects. A deep long short-term memory (LSTM) network has been adopted to recognize the above signals in seven EEG frequency bands individually in nine major regions of the brain. The results show a maximum accuracy of 73.56% and a network prediction time (NPT) of 0.14 s which are superior to other state-of-the-art techniques in the literature. Our analysis reveals that the alpha band can recognize SI better than other EEG frequencies. To reinforce our findings, the above work has been compared by models based on the gated recurrent unit (GRU), convolutional neural network (CNN), and six conventional classifiers. The results show that the LSTM model has 46.86% more average accuracy in the alpha band and 74.54% less average NPT than CNN. The maximum accuracy of GRU was 8.34% less than the LSTM network. Deep networks performed better than traditional classifiers.
https://doi.org/10.4218/etrij.2021-0118 인용 PDF KSCI

Crack segmentation in high-resolution images using cascaded deep convolutional neural networks and Bayesian data fusion

Tang, Wen;Wu, Rih-Teng;Jahanshahi, Mohammad R.
- Smart Structures and Systems
- /
- v.29 no.1
- /
- pp.221-235
- /
- 2022
Manual inspection of steel box girders on long span bridges is time-consuming and labor-intensive. The quality of inspection relies on the subjective judgements of the inspectors. This study proposes an automated approach to detect and segment cracks in high-resolution images. An end-to-end cascaded framework is proposed to first detect the existence of cracks using a deep convolutional neural network (CNN) and then segment the crack using a modified U-Net encoder-decoder architecture. A Naïve Bayes data fusion scheme is proposed to reduce the false positives and false negatives effectively. To generate the binary crack mask, first, the original images are divided into 448 × 448 overlapping image patches where these image patches are classified as cracks versus non-cracks using a deep CNN. Next, a modified U-Net is trained from scratch using only the crack patches for segmentation. A customized loss function that consists of binary cross entropy loss and the Dice loss is introduced to enhance the segmentation performance. Additionally, a Naïve Bayes fusion strategy is employed to integrate the crack score maps from different overlapping crack patches and to decide whether a pixel is crack or not. Comprehensive experiments have demonstrated that the proposed approach achieves an 81.71% mean intersection over union (mIoU) score across 5 different training/test splits, which is 7.29% higher than the baseline reference implemented with the original U-Net.
https://doi.org/10.12989/sss.2022.29.1.221 인용 KSCI

Multimodal MRI analysis model based on deep neural network for glioma grading classification (신경교종 등급 분류를 위한 심층신경망 기반 멀티모달 MRI 영상 분석 모델)

Kim, Jonghun;Park, Hyunjin
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.05a
- /
- pp.425-427
- /
- 2022
The grade of glioma is important information related to survival and thus is important to classify the grade of glioma before treatment to evaluate tumor progression and treatment planning. Glioma grading is mostly divided into high-grade glioma (HGG) and low-grade glioma (LGG). In this study, image preprocessing techniques are applied to analyze magnetic resonance imaging (MRI) using the deep neural network model. Classification performance of the deep neural network model is evaluated. The highest-performance EfficientNet-B6 model shows results of accuracy 0.9046, sensitivity 0.9570, specificity 0.7976, AUC 0.8702, and F1-Score 0.8152 in 5-fold cross-validation.
PDF

Developing Optimal Demand Forecasting Models for a Very Short Shelf-Life Item: A Case of Perishable Products in Online's Retail Business

Wiwat Premrudikul;Songwut Ahmornahnukul;Akkaranan Pongsathornwiwat
- Journal of Information Technology Applications and Management
- /
- v.30 no.3
- /
- pp.1-13
- /
- 2023
Demand forecasting is a crucial task for an online retail where has to manage daily fresh foods effectively. Failing in forecasting results loss of profitability because of incompetent inventory management. This study investigated the optimal performance of different forecasting models for a very short shelf-life product. Demand data of 13 perishable items with aging of 210 days were used for analysis. Our comparison results of four methods: Trivial Identity, Seasonal Naïve, Feed-Forward and Autoregressive Recurrent Neural Networks (DeepAR) reveals that DeepAR outperforms with the lowest MAPE. This study also suggests the managerial implications by employing coefficient of variation (CV) as demand variation indicators. Three classes: Low, Medium and High variation are introduced for classify 13 products into groups. Our analysis found that DeepAR is suitable for medium and high variations, while the low group can use any methods. With this approach, the case can gain benefit of better fill-rate performance.
https://doi.org/10.21219/jitam.2023.30.3.001 인용 PDF

Design of a Recommendation System for Improving Deep Neural Network Performance

Juhyoung Sung;Kiwon Kwon;Byoungchul Song
- Journal of Internet Computing and Services
- /
- v.25 no.1
- /
- pp.49-56
- /
- 2024
There have been emerging many use-cases applying recommendation systems especially in online platform. Although the performance of recommendation systems is affected by a variety of factors, selecting appropriate features is difficult since most of recommendation systems have sparse data. Conventional matrix factorization (MF) method is a basic way to handle with problems in the recommendation systems. However, the MF based scheme cannot reflect non-linearity characteristics well. As deep learning technology has been attracted widely, a deep neural network (DNN) framework based collaborative filtering (CF) was introduced to complement the non-linearity issue. However, there is still a problem related to feature embedding for use as input to the DNN. In this paper, we propose an effective method using singular value decomposition (SVD) based feature embedding for improving the DNN performance of recommendation algorithms. We evaluate the performance of recommendation systems using MovieLens dataset and show the proposed scheme outperforms the existing methods. Moreover, we analyze the performance according to the number of latent features in the proposed algorithm. We expect that the proposed scheme can be applied to the generalized recommendation systems.
https://doi.org/10.7472/jksii.2024.25.1.49 인용 PDF HTML

Search Result 5,178, Processing Time 0.033 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)