Search | Korea Science

Candidate Word List and Probability Score Guided for Korean Scene Text Recognition (후보 단어 리스트와 확률 점수에 기반한 한국어 문자 인식 모델)

Lee, Yoonji;Lee, Jong-Min
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.05a
- /
- pp.73-75
- /
- 2022
Scene Text Recognition is a technology used in the field of artificial intelligence that requires manless robot, automatic vehicles and human-computer interaction. Though scene text images are distorted by noise interference, such as illumination, low resolution and blurring. Unlike previous studies that recognized only English, this paper shows a strong recognition accuracy including various characters, English, Korean, special character and numbers. Instead of selecting only one class having the highest probability value, a candidate word can be generated by considering the probability value of the second rank as well, thus a method can be corrected an existing language misrecognition problem.
PDF

Current Status of Robotic-assisted Surgery in Gastric Cancer

Eli Kakiashvili
- Journal of Digestive Cancer Research
- /
- v.4 no.2
- /
- pp.99-106
- /
- 2016
Minimally invasive surgery for gastric cancer has increased in popularity during the last two decades mainly in the Asia for patients with early-stage cancer. Nevertheless, the development of laparoscopic surgery for gastric cancers in the Western world has been slow because of the advanced stage at diagnosis for which LG is not yet considered an acceptable alternative to standard open surgery. RAG has been reported as a safe alternative to conventional surgery for treating of early gastric carcinoma. We assess the current status of robotic surgery in the treatment of gastric cancer focusing on the technical details, postoperative outcome, oncological considerations and future perspectives. In gastrectomy the biggest advantage of the robotic approach is the ease and reproducibility of lymphadenectomy. Reports also show that even the intra corporeal digestive restoration is facilitated by use of the robotic approach, particularly following TG. Additionally, the accuracy of robotic dissection is confirmed by decreased blood loss in comparison to conventional laparoscopy. The learning curve and technical reproducibility also appear to be shorter with robotic surgery and, consequently, robotics can help to standardize and diffuse minimally invasive surgery in the treatment of gastric cancer. While published reports have shown no significant differences in surgical morbidity, mortality, or oncological adequacy between robot-assisted and conventional gastrectomy. There are some advantages in terms of postoperative recovery of patients after robotic surgery. More studies are needed to assess the true indications and oncological effectiveness of robotic use in the treatment of gastric carcinoma.
PDF

STAGCN-based Human Action Recognition System for Immersive Large-Scale Signage Content (몰입형 대형 사이니지 콘텐츠를 위한 STAGCN 기반 인간 행동 인식 시스템)

Jeongho Kim;Byungsun Hwang;Jinwook Kim;Joonho Seon;Young Ghyu Sun;Jin Young Kim
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.23 no.6
- /
- pp.89-95
- /
- 2023
In recent decades, human action recognition (HAR) has demonstrated potential applications in sports analysis, human-robot interaction, and large-scale signage content. In this paper, spatial temporal attention graph convolutional network (STAGCN)-based HAR system is proposed. Spatioal-temmporal features of skeleton sequences are assigned different weights by STAGCN, enabling the consideration of key joints and viewpoints. From simulation results, it has been shown that the performance of the proposed model can be improved in terms of classification accuracy in the NTU RGB+D dataset.
https://doi.org/10.7236/JIIBC.2023.23.6.89 인용 PDF HTML

Egocentric Vision for Human Activity Recognition Using Deep Learning

Malika Douache;Badra Nawal Benmoussat
- Journal of Information Processing Systems
- /
- v.19 no.6
- /
- pp.730-744
- /
- 2023
The topic of this paper is the recognition of human activities using egocentric vision, particularly captured by body-worn cameras, which could be helpful for video surveillance, automatic search and video indexing. This being the case, it could also be helpful in assistance to elderly and frail persons for revolutionizing and improving their lives. The process throws up the task of human activities recognition remaining problematic, because of the important variations, where it is realized through the use of an external device, similar to a robot, as a personal assistant. The inferred information is used both online to assist the person, and offline to support the personal assistant. With our proposed method being robust against the various factors of variability problem in action executions, the major purpose of this paper is to perform an efficient and simple recognition method from egocentric camera data only using convolutional neural network and deep learning. In terms of accuracy improvement, simulation results outperform the current state of the art by a significant margin of 61% when using egocentric camera data only, more than 44% when using egocentric camera and several stationary cameras data and more than 12% when using both inertial measurement unit (IMU) and egocentric camera data.
https://doi.org/10.3745/JIPS.02.0207 인용 PDF

Development of Digital Twin and Intelligent Monorail Robot for Road Tunnel Smart Management (도로 터널 스마트관리를 위한 디지털 트윈 및 지능형 레일 로봇 개발)

Youngwoo Sohn;Jaehong Park;Eung-Ug Kim;Young Sik Joung
- Journal of the Korean Society of Industry Convergence
- /
- v.27 no.1
- /
- pp.25-37
- /
- 2024
The objective of this study was to create intelligent rail robots that are optimized for facility management and implement digital twin systems for smart road tunnel management. An autonomous surveillance system is formed by combining the sensing platform consisting of railing robots, fixed cameras and environmental detection sensors with the digital twin data platform technology for tunnel monitoring and early fire suppression. In order to develop mobile rail robots for fire extinguishing, we also designed and manufactured robots for extinguishing & monitoring and fire extinguishing devices, and then we examined the optimization of all parts. Our next step was to build a digital twin for road tunnel management by developing continuous image display system and implementing 3D modeling. After constructing prototypes, we attempted simulations by configuring abnormal symptom scenarios, such as vehicles fires. This study's proposal proposes high-accuracy risk prediction services that will enable intelligent management of risks in the tunnel with early response at each stage, using the data collected from the intelligent rail robots and digital twin systems.
https://doi.org/10.21289/KSIC.2024.27.1.25 인용 PDF HTML

Localization of ripe tomato bunch using deep neural networks and class activation mapping

Seung-Woo Kang;Soo-Hyun Cho;Dae-Hyun Lee;Kyung-Chul Kim
- Korean Journal of Agricultural Science
- /
- v.50 no.3
- /
- pp.357-364
- /
- 2023
In this study, we propose a ripe tomato bunch localization method based on convolutional neural networks, to be applied in robotic harvesting systems. Tomato images were obtained from a smart greenhouse at the Rural Development Administration (RDA). The sample images for training were extracted based on tomato maturity and resized to 128 × 128 pixels for use in the classification model. The model was constructed based on four-layer convolutional neural networks, and the classes were determined based on stage of maturity, using a Softmax classifier. The localization of the ripe tomato bunch region was indicated on a class activation map. The class activation map could show the approximate location of the tomato bunch but tends to present a local part or a large part of the ripe tomato bunch region, which could lead to poor performance. Therefore, we suggest a recursive method to improve the performance of the model. The classification results indicated that the accuracy, precision, recall, and F1-score were 0.98, 0.87, 0.98, and 0.92, respectively. The localization performance was 0.52, estimated by the Intersection over Union (IoU), and through input recursion, the IoU was improved by 13%. Based on the results, the proposed localization of the ripe tomato bunch area can be incorporated in robotic harvesting systems to establish the optimal harvesting paths.
https://doi.org/10.7744/kjoas.500305 인용 PDF

Fault diagnosis of wafer transfer robot based on time domain statistics (시간 영역 통계 기반 웨이퍼 이송 로봇의 고장 진단)

Hyejin Kim;Subin Hong;Youngdae Lee;Arum Park
- The Journal of the Convergence on Culture Technology
- /
- v.10 no.4
- /
- pp.663-668
- /
- 2024
This paper applies statistical analysis methods in the time domain to the fault diagnosis of wafer transfer robots, and proposes a methodology to discern the critical characteristics of vibration and torque signals. Subsequently, principal component analysis (PCA) is applied to diminish the data's dimensionality, followed by the development of a fault diagnosis algorithm utilizing Euclidean distance and Hotelling's T-square statistics. The algorithm establishes decision boundaries to categorize failure states based on the observed data. Our findings indicate that data classification incorporating velocity parameters enhances diagnostic accuracy. This approach serves to enhance the precision and efficacy of fault diagnosis.
https://doi.org/10.17703/JCCT.2024.10.4.663 인용 PDF

Improvement of Disparity Map using Loopy Belief Propagation based on Color and Edge (Disparity 보정을 위한 컬러와 윤곽선 기반 루피 신뢰도 전파 기법)

Kim, Eun Kyeong;Cho, Hyunhak;Lee, Hansoo;Wibowo, Suryo Adhi;Kim, Sungshin
- Journal of the Korean Institute of Intelligent Systems
- /
- v.25 no.5
- /
- pp.502-508
- /
- 2015
Stereo images have an advantage of calculating depth(distance) values which can not analyze from 2-D images. However, depth information obtained by stereo images has due to following reasons: it can be obtained by computation process; mismatching occurs when stereo matching is processing in occlusion which has an effect on accuracy of calculating depth information. Also, if global method is used for stereo matching, it needs a lot of computation. Therefore, this paper proposes the method obtaining disparity map which can reduce computation time and has higher accuracy than established method. Edge extraction which is image segmentation based on feature is used for improving accuracy and reducing computation time. Color K-Means method which is image segmentation based on color estimates correlation of objects in an image. And it extracts region of interest for applying Loopy Belief Propagation(LBP). For this, disparity map can be compensated by considering correlation of objects in the image. And it can reduce computation time because of calculating region of interest not all pixels. As a result, disparity map has more accurate and the proposed method reduces computation time.
https://doi.org/10.5391/JKIIS.2015.25.5.502 인용 PDF KSCI

Scaling Attack Method for Misalignment Error of Camera-LiDAR Calibration Model (카메라-라이다 융합 모델의 오류 유발을 위한 스케일링 공격 방법)

Yi-ji Im;Dae-seon Choi
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.33 no.6
- /
- pp.1099-1110
- /
- 2023
The recognition system of autonomous driving and robot navigation performs vision work such as object recognition, tracking, and lane detection after multi-sensor fusion to improve performance. Currently, research on a deep learning model based on the fusion of a camera and a lidar sensor is being actively conducted. However, deep learning models are vulnerable to adversarial attacks through modulation of input data. Attacks on the existing multi-sensor-based autonomous driving recognition system are focused on inducing obstacle detection by lowering the confidence score of the object recognition model.However, there is a limitation that an attack is possible only in the target model. In the case of attacks on the sensor fusion stage, errors in vision work after fusion can be cascaded, and this risk needs to be considered. In addition, an attack on LIDAR's point cloud data, which is difficult to judge visually, makes it difficult to determine whether it is an attack. In this study, image scaling-based camera-lidar We propose an attack method that reduces the accuracy of LCCNet, a fusion model (camera-LiDAR calibration model). The proposed method is to perform a scaling attack on the point of the input lidar. As a result of conducting an attack performance experiment by size with a scaling algorithm, an average of more than 77% of fusion errors were caused.
https://doi.org/10.13089/JKIISC.2023.33.6.1099 인용 PDF HTML

Performance Improvement of Stereo Matching by Image Segmentation based on Color and Multi-threshold (컬러와 다중 임계값 기반 영상 분할 기법을 통한 스테레오 매칭의 성능 향상)

Kim, Eun Kyeong;Cho, Hyunhak;Jang, Eunseok;Kim, Sungshin
- Journal of the Korean Institute of Intelligent Systems
- /
- v.26 no.1
- /
- pp.44-49
- /
- 2016
This paper proposed the method to improve performance of a pixel, which has low accuracy, by applying image segmentation methods based on color and multi-threshold of brightness. Stereo matching is the process to find the corresponding point on the right image with the point on the left image. For this process, distance(depth) information in stereo images is calculated. However, in the case of a region which has textureless, stereo matching has low accuracy and bad pixels occur on the disparity map. In the proposed method, the relationship between adjacent pixels is considered for compensating bad pixels. Generally, the object has similar color and brightness. Therefore, by considering the relationship between regions based on segmented regions by means of color and multi-threshold of brightness respectively, the region which is considered as parts of same object is re-segmented. According to relationship information of segmented sets of pixels, bad pixels in the disparity map are compensated efficiently. By applying the proposed method, the results show a decrease of nearly 28% in the number of bad pixels of the image applied the method which is established.
https://doi.org/10.5391/JKIIS.2016.26.1.044 인용 PDF KSCI

Search Result 578, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)