• Title/Summary/Keyword: deep Learning

Search Result 5,795, Processing Time 0.03 seconds

Effects of CNN Backbone on Trajectory Prediction Models for Autonomous Vehicle

  • Seoyoung Lee;Hyogyeong Park;Yeonhwi You;Sungjung Yong;Il-Young Moon
    • Journal of information and communication convergence engineering
    • /
    • v.21 no.4
    • /
    • pp.346-350
    • /
    • 2023
  • Trajectory prediction is an essential element for driving autonomous vehicles, and various trajectory prediction models have emerged with the development of deep learning technology. Convolutional neural network (CNN) is the most commonly used neural network architecture for extracting the features of visual images, and the latest models exhibit high performances. This study was conducted to identify an efficient CNN backbone model among the components of deep learning models for trajectory prediction. We changed the existing CNN backbone network of multiple-trajectory prediction models used as feature extractors to various state-of-the-art CNN models. The experiment was conducted using nuScenes, which is a dataset used for the development of autonomous vehicles. The results of each model were compared using frequently used evaluation metrics for trajectory prediction. Analyzing the impact of the backbone can improve the performance of the trajectory prediction task. Investigating the influence of the backbone on multiple deep learning models can be a future challenge.

3D Object Generation and Renderer System based on VAE ResNet-GAN

  • Min-Su Yu;Tae-Won Jung;GyoungHyun Kim;Soonchul Kwon;Kye-Dong Jung
    • International journal of advanced smart convergence
    • /
    • v.12 no.4
    • /
    • pp.142-146
    • /
    • 2023
  • We present a method for generating 3D structures and rendering objects by combining VAE (Variational Autoencoder) and GAN (Generative Adversarial Network). This approach focuses on generating and rendering 3D models with improved quality using residual learning as the learning method for the encoder. We deep stack the encoder layers to accurately reflect the features of the image and apply residual blocks to solve the problems of deep layers to improve the encoder performance. This solves the problems of gradient vanishing and exploding, which are problems when constructing a deep neural network, and creates a 3D model of improved quality. To accurately extract image features, we construct deep layers of the encoder model and apply the residual function to learning to model with more detailed information. The generated model has more detailed voxels for more accurate representation, is rendered by adding materials and lighting, and is finally converted into a mesh model. 3D models have excellent visual quality and accuracy, making them useful in various fields such as virtual reality, game development, and metaverse.

Prediction of Budget Prices in Electronic Bidding using Deep Learning Model (딥러닝 모델을 이용한 전자 입찰에서의 예정가격 예측)

  • Eun-Seo Lee;Gwi-Man Bak;Ji-Eun Lee;Young-Chul Bae
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.18 no.6
    • /
    • pp.1171-1176
    • /
    • 2023
  • In this paper, we predicts the estimated price using the DNBP (Deep learning Network to predict Budget Price) model with bidding data obtained from the bidding websites, ElecNet and OK EMS. We use the DNBP model to predict four lottery preliminary price, calculate their arithmetic mean, and then estimate the expected budget price ratio. We evaluate the model's performance by comparing it with the actual expected budget price ratio. We train the DNBP model by removing some of the 15 input nodes. The prediction results showed the lowest RMSE of 0.75788% when the model had 6 input nodes (a, g, h, i, j, k).

Improving the quality of light-field data extracted from a hologram using deep learning

  • Dae-youl Park;Joongki Park
    • ETRI Journal
    • /
    • v.46 no.2
    • /
    • pp.165-174
    • /
    • 2024
  • We propose a method to suppress the speckle noise and blur effects of the light field extracted from a hologram using a deep-learning technique. The light field can be extracted by bandpass filtering in the hologram's frequency domain. The extracted light field has reduced spatial resolution owing to the limited passband size of the bandpass filter and the blurring that occurs when the object is far from the hologram plane and also contains speckle noise caused by the random phase distribution of the three-dimensional object surface. These limitations degrade the reconstruction quality of the hologram resynthesized using the extracted light field. In the proposed method, a deep-learning model based on a generative adversarial network is designed to suppress speckle noise and blurring, resulting in improved quality of the light field extracted from the hologram. The model is trained using pairs of original two-dimensional images and their corresponding light-field data extracted from the complex field generated by the images. Validation of the proposed method is performed using light-field data extracted from holograms of objects with single and multiple depths and mesh-based computer-generated holograms.

Effect Analysis of a Deep Learning-Based Attention Redirection Compensation Strategy System on the Data Labeling Work Productivity of Individuals with Developmental Disabilities (딥러닝 기반의 주의환기 보상전략 시스템이 발달장애인의 데이터 라벨링 작업 생산성에 미치는 효과분석)

  • Yong-Man Ha;Jong-Wook Jang
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.24 no.1
    • /
    • pp.175-180
    • /
    • 2024
  • This paper investigates the effect of a deep learning-based system on data labeling task productivity by individuals with developmental disabilities. It was found that interventions, particularly those using AI, significantly improved productivity compared to self-serving task. AI interventions were notably more effective than job coach-led approaches. This research underscores the positive role of AI in enhancing task efficiency for those with developmental disabilities. This study is the first to apply AI technology to the data labeling tasks of individuals with developmental disabilities and highlighting deep learning's potential in vocational training and productivity enhancement for this group.

How Long Will Your Videos Remain Popular? Empirical Study with Deep Learning and Survival Analysis

  • Min Gyeong Choi;Jae Hong Park
    • Asia pacific journal of information systems
    • /
    • v.33 no.2
    • /
    • pp.282-297
    • /
    • 2023
  • One of the emerging trends in the marketing field is digital video marketing. Online videos offer rich content typically containing more information than any other type of content (e.g., audible or textual content). Accordingly, previous researchers have examined factors influencing videos' popularity. However, few studies have examined what causes a video to remain popular. Some videos achieve continuous, ongoing popularity, while others fade out quickly. For practitioners, videos at the recommendation slots may serve as strong communication channels, as many potential consumers are exposed to such videos. So,this study will provide practitioners important advice regarding how to choose videos that will survive as long-lasting favorites, allowing them to advertise in a cost-effective manner. Using deep learning techniques, this study extracts text from videos and measured the videos' tones, including factual and emotional tones. Additionally, we measure the aesthetic score by analyzing the thumbnail images in the data. We then empirically show that the cognitive features of a video, such as the tone of a message and the aesthetic assessment of a thumbnail image, play an important role in determining videos' long-term popularity. We believe that this is the first study of its kind to examine new factors that aid in ensuring a video remains popular using both deep learning and econometric methodologies.

Deep Learning-based X-ray Inspection for Battery Defect Detection (배터리 불량 검출을 위한 딥러닝 기반 X-ray 검사)

  • Daejin Jeong;Heon Huh
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.24 no.2
    • /
    • pp.147-153
    • /
    • 2024
  • X-rays are extensively employed for non-destructive inspection, applied to packaged food, human anatomy, and industrial products. Recently, this technology has extended to inspecting batteries in electric vehicles. Given the challenge of manual inspection for a substantial volume of batteries, deep learning is leveraged to detect battery defects. However, the effectiveness of deep learning heavily depends upon data size, and acquiring authentic defective images is a difficult and time-consuming task. In this study, we use data augmentation and investigate the impact of data size on battery inspection performance. The results provide valuable insights for enhancing the capabilities of the inspection process.

A Novel RFID Dynamic Testing Method Based on Optical Measurement

  • Zhenlu Liu;Xiaolei Yu;Lin Li;Weichun Zhang;Xiao Zhuang;Zhimin Zhao
    • Current Optics and Photonics
    • /
    • v.8 no.2
    • /
    • pp.127-137
    • /
    • 2024
  • The distribution of tags is an important factor that affects the performance of radio-frequency identification (RFID). To study RFID performance, it is necessary to obtain RFID tags' coordinates. However, the positioning method of RFID technology has large errors, and is easily affected by the environment. Therefore, a new method using optical measurement is proposed to achieve RFID performance analysis. First, due to the possibility of blurring during image acquisition, the paper derives a new image prior to removing blurring. A nonlocal means-based method for image deconvolution is proposed. Experimental results show that the PSNR and SSIM indicators of our algorithm are better than those of a learning deep convolutional neural network and fast total variation. Second, an RFID dynamic testing system based on photoelectric sensing technology is designed. The reading distance of RFID and the three-dimensional coordinates of the tags are obtained. Finally, deep learning is used to model the RFID reading distance and tag distribution. The error is 3.02%, which is better than other algorithms such as a particle-swarm optimization back-propagation neural network, an extreme learning machine, and a deep neural network. The paper proposes the use of optical methods to measure and collect RFID data, and to analyze and predict RFID performance. This provides a new method for testing RFID performance.

A Study on the Explainability of Inception Network-Derived Image Classification AI Using National Defense Data (국방 데이터를 활용한 인셉션 네트워크 파생 이미지 분류 AI의 설명 가능성 연구)

  • Kangun Cho
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.27 no.2
    • /
    • pp.256-264
    • /
    • 2024
  • In the last 10 years, AI has made rapid progress, and image classification, in particular, are showing excellent performance based on deep learning. Nevertheless, due to the nature of deep learning represented by a black box, it is difficult to actually use it in critical decision-making situations such as national defense, autonomous driving, medical care, and finance due to the lack of explainability of judgement results. In order to overcome these limitations, in this study, a model description algorithm capable of local interpretation was applied to the inception network-derived AI to analyze what grounds they made when classifying national defense data. Specifically, we conduct a comparative analysis of explainability based on confidence values by performing LIME analysis from the Inception v2_resnet model and verify the similarity between human interpretations and LIME explanations. Furthermore, by comparing the LIME explanation results through the Top1 output results for Inception v3, Inception v2_resnet, and Xception models, we confirm the feasibility of comparing the efficiency and availability of deep learning networks using XAI.

Network Intrusion Detection Using One-Class Models (단일 클래스 모델을 활용한 네트워크 침입 탐지)

  • Byeongjun Min;Daekyeong Park
    • Convergence Security Journal
    • /
    • v.24 no.3
    • /
    • pp.13-21
    • /
    • 2024
  • Recently, with the rapid expansion of networks driven by the advancements of the Fourth Industrial Revolution, cybersecurity threats are becoming increasingly severe. Traditional signature-based Network Intrusion Detection Systems (NIDS) are effective in detecting known attacks but show limitations when faced with new threats such as Advanced Persistent Threats (APT). Additionally, deep learning models based on supervised learning can lead to biased decision boundaries due to the imbalanced nature of network traffic data, where normal traffic vastly outnumbers malicious traffic. To address these challenges, this paper proposes a network intrusion detection method based on one-class models that learn only from normal data to identify abnormal traffic. The effectiveness of this approach is validated through experiments using the Deep SVDD and MemAE models on the NSL-KDD dataset. Comparative analysis with supervised learning models demonstrates that the proposed method offers superior adaptability and performance in real-world scenarios.