Search | Korea Science

Scene Text Detection with Length of Text (글자 수 정보를 이용한 이미지 내 글자 영역 검출 방법)

Yeong Woo Kim;Wonjun Kim
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2022.11a
- /
- pp.177-179
- /
- 2022
딥러닝의 발전과 함께 합성곱 신경망 기반의 이미지 내 글자 영역 검출(Scene Text Detection) 방법들이 제안됐다. 그러나 이러한 방법들은 대부분 데이터셋이 제공하는 단어의 위치 정보만을 이용할 뿐 글자 영역이 갖는 고유한 정보인 글자 수는 활용하지 않는다. 따라서 본 논문에서는 글자 수 정보를 학습하여 효과적으로 이미지 내의 글자 영역을 검출하는 모듈을 제안한다. 제안하는 방법은 간단한 합성곱 신경망으로 구성된 이미지 내 글자 영역 검출 모델에 글자 수를 예측하는 모듈을 추가하여 학습을 진행하였다. 글자 영역 검출 성능 평가에 널리 사용되는 ICDAR 2015 데이터셋을 통해 기존 방법 대비 성능이 향상됨을 보였고, 글자 수 정보가 글자 영역을 감지하는 데 유효한 정보임을 확인했다.
PDF

Image Deblurring Based on ADMM and Deep CNN Denoiser Image Prior (ADMM과 깊은 합성곱 신경망 잡음 제거기 이미지 Prior에 기반한 이미지 디블러링)

Kwon, Junhyeong;Soh, Jae Woong;Cho, Nam Ik
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2020.07a
- /
- pp.680-683
- /
- 2020
오래 전부터 모델 기반 최적화 방법이 이미지 디블러링을 위해 널리 사용되어 왔고, 최근에는 학습 기반 기술이 영상 디블러링에서 좋은 성과를 보이고 있다. 본 논문은 ADMM과 깊은 합성곱 신경망 잡음 제거기 이미지 prior를 이용하여 모델 기반 최적화 방법의 장점과 학습 기반 방법의 장점을 모두 활용할 수 있는 방법을 제안한다. 본 방법을 이용하여 기존 방법보다 더 좋은 디블러링 성능을 얻을 수 있었다.
PDF

Performance change of defect classification model of rotating machinery according to noise addition and denoising process (노이즈 추가와 디노이징 처리에 따른 회전 기계설비의 결함 분류 모델 성능 변화)

Se-Hoon Lee;Sung-Soo Kim;Bi-gun Cho
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2023.07a
- /
- pp.1-2
- /
- 2023
본 연구는 환경 요인이 통제되어 있는 실험실 데이터에 산업 현장에서 발생하는 유사 잡음을 노이즈로 추가하였을 때, SNR비에 따른 노이즈별 STFT Log Spectrogram, Mel-Spectrogram, CWT Spectrogram 총 3가지의 이미지를 생성하고, 각 이미지를 입력으로 한 CNN 결함 분류 모델의 성능 결과를 확인하였다. 원본 데이터의 영향력이 큰 0db 이상의 SNR비로 합성할 경우 원본 데이터와 분류 결과상 큰 차이가 존재하지 않았으며, 노이즈 데이터의 영향이 큰 0db 이하의 SNR비로 합성할 경우, -20db의 STFT 이미지 기준 약 26%의 성능 저하가 발생하였다. 또한, Wiener Filtering을 통한 디노이징 처리 이후, 노이즈를 효과적으로 제거하여 분류 성능의 결과가 높아지는 점을 확인하였다.
PDF

Image Augmentation of Paralichthys Olivaceus Disease Using SinGAN Deep Learning Model (SinGAN 딥러닝 모델을 이용한 넙치 질병 이미지 증강)

Son, Hyun Seung;Choi, Han Suk
- The Journal of the Korea Contents Association
- /
- v.21 no.12
- /
- pp.322-330
- /
- 2021
In modern aquaculture, mass mortality is a very important issue that determines the success of aquaculture business. If a fish disease is not detected at an early stage in the farm, the disease spreads quickly because the farm is a closed environment. Therefore, early detection of diseases is crucial to prevent mass mortality of fish raised in farms. Recently deep learning-based automatic identification of fish diseases has been widely used, but there are many difficulties in identifying objects due to insufficient images of fish diseases. Therefore, this paper suggests a method to generate a large number of fish disease images by synthesizing normal images and disease images using SinGAN deep learning model in order to to solve the lack of fish disease images. We generate images from the three most frequently occurring Paralichthys Olivaceus diseases such as Scuticociliatida, Vibriosis, and Lymphocytosis and compare them with the original image. In this study, a total of 330 sheets of scutica disease, 110 sheets of vibrioemia, and 110 sheets of limphosis were made by synthesizing 10 disease patterns with 11 normal halibut images, and 1,320 images were produced by quadrupling the images.
https://doi.org/10.5392/JKCA.2021.21.12.322 인용 PDF KSCI HTML

Object Detection Based on Virtual Humans Learning (가상 휴먼 학습 기반 영상 객체 검출 기법)

Lee, JongMin;Jo, Dongsik
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2022.10a
- /
- pp.376-378
- /
- 2022
Artificial intelligence technology is widely used in various fields such as artificial intelligence speakers, artificial intelligence chatbots, and autonomous vehicles. Among these AI application fields, the image processing field shows various uses such as detecting objects or recognizing objects using artificial intelligence. In this paper, data synthesized by a virtual human is used as a method to analyze images taken in a specific space.
PDF

A Study on Game Background and Character Effect Setting (게임 배경과 캐릭터 효과 설정에 관한 연구)

Joo, Heon-Sik
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2016.01a
- /
- pp.227-228
- /
- 2016
본 연구에서는 게임 배경과 게임 캐릭터의 효과에 대해서 나타낸다. 먼저 3개의 캐릭터 이미지를 주 캐릭터 로 만들 이미지에 합성기법을 적용하여 캐릭터들을 합성한다. 합성하여 만든 캐릭터와 다른 캐릭터의 모양과 형태에 맞게 알맞은 배경 색상으로 만들고, 각 캐릭터의 크기, 원근, 모양, 색상에 따라 효과를 적용한다. 따라서 게임 콘텐츠를 제작하는데 정지 영상으로 표현하여 캐릭터 애니메이션과 장면에 맞는 사운드를 삽입하여 게임 콘텐츠로서 시각과 청각과 움직임이 있는 게임 캐릭터를 제작한다. 특히 배경색, 캐릭터의 번개 및 라이팅 효과색상 같은 것에 관심을 갖고 제작하여 앞으로 게임 콘텐츠의 배경과 캐릭터 효과를 제작하는데 모션그래픽을 이용하여 간단하게 제작 할 수 있음을 나타내었다.
PDF

Plant leaf area estimation using synthetic dataset and deep learning model (합성 데이터셋과 딥러닝 모델을 이용한 식물 엽면적 추정)

Suh, Hyun Kwon;Ahn, Juyeon;Park, Hyeonji
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2022.06a
- /
- pp.90-92
- /
- 2022
이 논문에서는 합성된 애기장대 데이터셋을 활용하여 딸기의 엽면적을 추정할 수 있는 딥러닝 모델을 제안한다. 제안된 모델에서는 개별 잎 검출을 위하여 합성 데이터셋으로 학습된 Mask R-CNN 의 객체 검출 모델을 사용하였고, 이어 이미지 후처리 작업에 해당되는 모폴로지 연산의 침식 및 팽창, 픽셀 카운터를 통해 엽면적을 추정하였다. 각기 다른 역할을 수행하는 신경망 계층에 어텐션 메커니즘 적용하여 검출 성능의 향상과 검출 시간을 단축하였다. 제안된 모델은 딸기 데이터셋을 사용하지 않는 합성된 데이터셋만으로도 실제 온실에서 획득한 다양한 이미지에서의 딸기 엽면적을 추정하는 데에 우수한 성능을 보여준다.
PDF

Identification of natural colored diamonds using UV fluorescent and X-ray Lang images (UV 형광과 X-선 Lang 표면이미지를 이용한 천연유색다이아몬드의 감별 연구)

Kim, Jun-Hwan;Ha, Jun-Seok;Kim, Ki-Hoon;Song, Oh-Sung
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.10 no.12
- /
- pp.3540-3545
- /
- 2009
Due to recent development of high temperature high pressure(HTHP) diamond synthetic and treatment technology, we need to identify the natural diamonds fast, reliable, and economically. We proposed using new method of UV fluorescence and X-ray Lang topography imaging for distinguishing one synthetic diamond from four natural colored diamonds. We observe unique local stress field uneven image in synthetic diamond using UV fluorescence and Lang topography characterization, while uniform images in natural diamonds. Especially, X-ray Lang method offered the better identification power with better high resolution on stress field images.
https://doi.org/10.5762/KAIS.2009.10.12.3540 인용 PDF

Machine Learning Data Extension Way for Confirming Genuine of Trademark Image which is Rotated (회전한 상표 이미지의 진위 결정을 위한 기계 학습 데이터 확장 방법)

Gu, Bongen
- Journal of Platform Technology
- /
- v.8 no.1
- /
- pp.16-23
- /
- 2020
For protecting copyright for trademark, convolutional neural network can be used to confirm genuine of trademark image. For this, repeated training one trademark image degrades the performance of machine learning because of overfitting problem. Therefore, this type of machine learning application generates training data in various way. But if genuine trademark image is rotated, this image is classified as not genuine trademark. In this paper, we propose the way for extending training data to confirm genuine of trademark image which is rotated. Our proposed way generates rotated image from genuine trademark image as training data. To show effectiveness of our proposed way, we use CNN machine learning model, and evaluate the accuracy with test image. From evaluation result, our way can be used to generate training data for machine learning application which confirms genuine of rotated trademark image.
PDF

Text Augmentation Using Hierarchy-based Word Replacement

Kim, Museong;Kim, Namgyu
- Journal of the Korea Society of Computer and Information
- /
- v.26 no.1
- /
- pp.57-67
- /
- 2021
Recently, multi-modal deep learning techniques that combine heterogeneous data for deep learning analysis have been utilized a lot. In particular, studies on the synthesis of Text to Image that automatically generate images from text are being actively conducted. Deep learning for image synthesis requires a vast amount of data consisting of pairs of images and text describing the image. Therefore, various data augmentation techniques have been devised to generate a large amount of data from small data. A number of text augmentation techniques based on synonym replacement have been proposed so far. However, these techniques have a common limitation in that there is a possibility of generating a incorrect text from the content of an image when replacing the synonym for a noun word. In this study, we propose a text augmentation method to replace words using word hierarchy information for noun words. Additionally, we performed experiments using MSCOCO data in order to evaluate the performance of the proposed methodology.
https://doi.org/10.9708/jksci.2021.26.01.057 인용 PDF KSCI

Search Result 554, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)