Search | Korea Science

Survey on Deep Learning-based Panoptic Segmentation Methods (딥 러닝 기반의 팬옵틱 분할 기법 분석)

Kwon, Jung Eun;Cho, Sung In
- IEMEK Journal of Embedded Systems and Applications
- /
- v.16 no.5
- /
- pp.209-214
- /
- 2021
Panoptic segmentation, which is now widely used in computer vision such as medical image analysis, and autonomous driving, helps understanding an image with holistic view. It identifies each pixel by assigning a unique class ID, and an instance ID. Specifically, it can classify 'thing' from 'stuff', and provide pixel-wise results of semantic prediction and object detection. As a result, it can solve both semantic segmentation and instance segmentation tasks through a unified single model, producing two different contexts for two segmentation tasks. Semantic segmentation task focuses on how to obtain multi-scale features from large receptive field, without losing low-level features. On the other hand, instance segmentation task focuses on how to separate 'thing' from 'stuff' and how to produce the representation of detected objects. With the advances of both segmentation techniques, several panoptic segmentation models have been proposed. Many researchers try to solve discrepancy problems between results of two segmentation branches that can be caused on the boundary of the object. In this survey paper, we will introduce the concept of panoptic segmentation, categorize the existing method into two representative methods and explain how it is operated on two methods: top-down method and bottom-up method. Then, we will analyze the performance of various methods with experimental results.
https://doi.org/10.14372/IEMEK.2021.16.5.209 인용 PDF KSCI

A Suggestion for Worker Feature Extraction and Multiple-Object Tracking Method in Apartment Construction Sites (아파트 건설 현장 작업자 특징 추출 및 다중 객체 추적 방법 제안)

Kang, Kyung-Su;Cho, Young-Woon;Ryu, Han-Guk
- Proceedings of the Korean Institute of Building Construction Conference
- /
- 2021.05a
- /
- pp.40-41
- /
- 2021
The construction industry has the highest occupational accidents/injuries among all industries. Korean government installed surveillance camera systems at construction sites to reduce occupational accident rates. Construction safety managers are monitoring potential hazards at the sites through surveillance system; however, the human capability of monitoring surveillance system with their own eyes has critical issues. Therefore, this study proposed to build a deep learning-based safety monitoring system that can obtain information on the recognition, location, identification of workers and heavy equipment in the construction sites by applying multiple-object tracking with instance segmentation. To evaluate the system's performance, we utilized the MS COCO and MOT challenge metrics. These results present that it is optimal for efficiently automating monitoring surveillance system task at construction sites.
PDF

Triplet Class-Wise Difficulty-Based Loss for Long Tail Classification

Yaw Darkwah Jnr.;Dae-Ki Kang
- International Journal of Internet, Broadcasting and Communication
- /
- v.15 no.3
- /
- pp.66-72
- /
- 2023
Little attention appears to have been paid to the relevance of learning a good representation function in solving long tail tasks. Therefore, we propose a new loss function to ensure a good representation is learnt while learning to classify. We call this loss function Triplet Class-Wise Difficulty-Based (TriCDB-CE) Loss. It is a combination of the Triplet Loss and Class-wise Difficulty-Based Cross-Entropy (CDB-CE) Loss. We prove its effectiveness empirically by performing experiments on three benchmark datasets. We find improvement in accuracy after comparing with some baseline methods. For instance, in the CIFAR-10-LT, 7 percentage points (pp) increase relative to the CDB-CE Loss was recorded. There is more room for improvement on Places-LT.
https://doi.org/10.7236/IJIBC.2023.15.3.66 인용 PDF

A Study on the Attributes Classification of Agricultural Land Based on Deep Learning Comparison of Accuracy between TIF Image and ECW Image (딥러닝 기반 농경지 속성분류를 위한 TIF 이미지와 ECW 이미지 간 정확도 비교 연구)

Kim, Ji Young;Wee, Seong Seung
- Journal of The Korean Society of Agricultural Engineers
- /
- v.65 no.6
- /
- pp.15-22
- /
- 2023
In this study, We conduct a comparative study of deep learning-based classification of agricultural field attributes using Tagged Image File (TIF) and Enhanced Compression Wavelet (ECW) images. The goal is to interpret and classify the attributes of agricultural fields by analyzing the differences between these two image formats. "FarmMap," initiated by the Ministry of Agriculture, Food and Rural Affairs in 2014, serves as the first digital map of agricultural land in South Korea. It comprises attributes such as paddy, field, orchard, agricultural facility and ginseng cultivation areas. For the purpose of comparing deep learning-based agricultural attribute classification, we consider the location and class information of objects, as well as the attribute information of FarmMap. We utilize the ResNet-50 instance segmentation model, which is suitable for this task, to conduct simulated experiments. The comparison of agricultural attribute classification between the two images is measured in terms of accuracy. The experimental results indicate that the accuracy of TIF images is 90.44%, while that of ECW images is 91.72%. The ECW image model demonstrates approximately 1.28% higher accuracy. However, statistical validation, specifically Wilcoxon rank-sum tests, did not reveal a significant difference in accuracy between the two images.
https://doi.org/10.5389/KSAE.2023.65.6.015 인용 PDF HTML

Design of Character-based Conversational Instruction-Learning System Design for Science Education of Elementary School (초등 과학수업을 위한 캐릭터 기반의 대화형 교수-학습 시스템 설계)

Jeong Sang-Mok;Song Ki-Sang
- Journal of the Korea Society of Computer and Information
- /
- v.10 no.5 s.37
- /
- pp.343-352
- /
- 2005
The existing CAI or web-based science learning system of elementary school has some disadvantages. For instance, it is composed of uniform courses designed by an instructor without considering the learner's characters, and the learner's opinions or questions raised during learning can not be delivered to the system. This structure has diminished the willingness or the motive of the learner and make an adverse effect on the learning efficiency. In this regards, Instruction-Learning System is needed to provide learning environment Pertinent to the learner's individual character and motivate the learner's active attendance and learning. This study is to design a character-based conversational Instruction-Learning System. This may induce the learner's active attendance through the communications between instructor and learner and furnish various learning materials to motivate the learners and attract their consistent interests in learning.
PDF

A Memory-based Learning using Repetitive Fixed Partitioning Averaging (반복적 고정분할 평균기법을 이용한 메모리기반 학습기법)

Yih, Hyeong-Il
- Journal of Korea Multimedia Society
- /
- v.10 no.11
- /
- pp.1516-1522
- /
- 2007
We had proposed the FPA(Fixed Partition Averaging) method in order to improve the storage requirement and classification rate of the Memory Based Reasoning. The algorithm worked not bad in many area, but it lead to some overhead for memory usage and lengthy computation in the multi classes area. We propose an Repetitive FPA algorithm which repetitively partitioning pattern space in the multi classes area. Our proposed methods have been successfully shown to exhibit comparable performance to k-NN with a lot less number of patterns and better result than EACH system which implements the NGE theory.
PDF

A Contrastive Learning Framework for Weakly Supervised Video Anomaly Detection

Hyeon Jeong Park;Je Hyeong Hong
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2022.11a
- /
- pp.171-174
- /
- 2022
Weakly-supervised learning is a widely adopted approach in video anomaly detection whereby only video labels are utilized instead of expensive frame-level annotations. Since the success of multi-instance learning (MIL), almost all recent approaches are based on maximizing the margin between the set of abnormal video snippets and those of normal video snippets. In this work, we present a simple contrastive approach for weakly supervised video anomaly detection (WS-VAD) with aims to enhance the performance of existing models. The method is generic in nature and introduces a loss function to encourage attraction of output features from the same video class and repel those from different video classes. Experimental results demonstrate our method can be applied to existing algorithms to improve detection accuracy in public video anomaly dataset.
PDF

Balancing Fun and Learning through a User Interface: A Case Study of Wii Game

Kim, Si Jung;Lee, Kichol;Park, Yeonjeong
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.13 no.7
- /
- pp.3638-3653
- /
- 2019
Designing a user interface is important because the user interface determines the level of physical and mental engagement of the user resulting in their level of learning. This paper investigated how physical engagement through a different user interfaces is associated with fun and learning and presented a theoretical physical engagement model called, PEM, developed based on an empirical user study. The PEM model describes how a game user interface is associated with the level of fun and learning, particularly in playing a full body engaged game. There are many different types of games but the Wii Tennis, an embodied interactive game, was chosen as an instance of full body engaged game. A user study with 32 participant's age ranged from 21 to 40 years old revealed that there is a positive correlation between both fun and learning and the level of physical engagement through two different user interfaces. The results of the study showed that the extent of fun and learning are associated with the physical engagement of the player through an interface. As an implication from the study, the result recommend that the level of user engagement is realized by an effective user interface, and the level of physical engagement is determined by the level of authenticity bridged by the user interface.
https://doi.org/10.3837/tiis.2019.07.017 인용 PDF KSCI HTML

Android Malware Detection using Machine Learning Techniques KNN-SVM, DBN and GRU

Sk Heena Kauser;V.Maria Anu
- International Journal of Computer Science & Network Security
- /
- v.23 no.7
- /
- pp.202-209
- /
- 2023
Android malware is now on the rise, because of the rising interest in the Android operating system. Machine learning models may be used to classify unknown Android malware utilizing characteristics gathered from the dynamic and static analysis of an Android applications. Anti-virus software simply searches for the signs of the virus instance in a specific programme to detect it while scanning. Anti-virus software that competes with it keeps these in large databases and examines each file for all existing virus and malware signatures. The proposed model aims to provide a machine learning method that depend on the malware detection method for Android inability to detect malware apps and improve phone users' security and privacy. This system tracks numerous permission-based characteristics and events collected from Android apps and analyses them using a classifier model to determine whether the program is good ware or malware. This method used the machine learning techniques KNN-SVM, DBN, and GRU in which help to find the accuracy which gives the different values like KNN gives 87.20 percents accuracy, SVM gives 91.40 accuracy, Naive Bayes gives 85.10 and DBN-GRU Gives 97.90. Furthermore, in this paper, we simply employ standard machine learning techniques; but, in future work, we will attempt to improve those machine learning algorithms in order to develop a better detection algorithm.
https://doi.org/10.22937/IJCSNS.2023.23.7.23 인용 PDF

Data Augmentation for Tomato Detection and Pose Estimation (토마토 위치 및 자세 추정을 위한 데이터 증대기법)

Jang, Minho;Hwang, Youngbae
- Journal of Broadcast Engineering
- /
- v.27 no.1
- /
- pp.44-55
- /
- 2022
In order to automatically provide information on fruits in agricultural related broadcasting contents, instance image segmentation of target fruits is required. In addition, the information on the 3D pose of the corresponding fruit may be meaningfully used. This paper represents research that provides information about tomatoes in video content. A large amount of data is required to learn the instance segmentation, but it is difficult to obtain sufficient training data. Therefore, the training data is generated through a data augmentation technique based on a small amount of real images. Compared to the result using only the real images, it is shown that the detection performance is improved as a result of learning through the synthesized image created by separating the foreground and background. As a result of learning augmented images using images created using conventional image pre-processing techniques, it was shown that higher performance was obtained than synthetic images in which foreground and background were separated. To estimate the pose from the result of object detection, a point cloud was obtained using an RGB-D camera. Then, cylinder fitting based on least square minimization is performed, and the tomato pose is estimated through the axial direction of the cylinder. We show that the results of detection, instance image segmentation, and cylinder fitting of a target object effectively through various experiments.
https://doi.org/10.5909/JBE.2022.27.1.44 인용 PDF KSCI KPUBS

Search Result 129, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)