Search | Korea Science

Generate Korean image captions using LSTM (LSTM을 이용한 한국어 이미지 캡션 생성)

Park, Seong-Jae;Cha, Jeong-Won
- Annual Conference on Human and Language Technology
- /
- 2017.10a
- /
- pp.82-84
- /
- 2017
본 논문에서는 한국어 이미지 캡션을 학습하기 위한 데이터를 작성하고 딥러닝을 통해 예측하는 모델을 제안한다. 한국어 데이터 생성을 위해 MS COCO 영어 캡션을 번역하여 한국어로 변환하고 수정하였다. 이미지 캡션 생성을 위한 모델은 CNN을 이용하여 이미지를 512차원의 자질로 인코딩한다. 인코딩된 자질을 LSTM의 입력으로 사용하여 캡션을 생성하였다. 생성된 한국어 MS COCO 데이터에 대해 어절 단위, 형태소 단위, 의미형태소 단위 실험을 진행하였고 그 중 가장 높은 성능을 보인 형태소 단위 모델을 영어 모델과 비교하여 영어 모델과 비슷한 성능을 얻음을 증명하였다.
PDF

An Implementation on The Facial Character Drawing System Using Control Points (통제점 조절 방식의 얼굴 캐릭터 제작 시스템에 관한 연구)

정연준;김용진;이현주;조윤석;조창석
- Proceedings of the Korea Multimedia Society Conference
- /
- 2001.11a
- /
- pp.148-152
- /
- 2001
본 논문은 이미지 조합형 캐릭터 생성 방법이 아닌 통제점 조절을 통한 얼굴 캐릭터 생성에 관한 것으로, 캐릭터로 표현 하고자 하는 얼굴 이미지를 바탕으로 얼굴외곽, 눈, 코, 입, 귀, 눈썹 형태에 맞추어 통제점을 조정하고, 통제점에 의해 조절되는 스플라인 곡선으로 얼굴 이미지를 단순화함으로써 다양한 형태의 캐릭터 이미지를 생성할 수 있다 얼굴이외의 헤어스타일과 몸, 기타 악세사리는 샘플 이미지 조합의 방법을 사용하였다.
PDF

Individual 3D facial avatar synthesis using elastic matching of facial mesh and image (얼굴 메쉬와 이미지의 동적 매칭을 이용한 개인 아바타의 3차원 얼굴 합성)

강명진;김창헌
- Proceedings of the Korean Information Science Society Conference
- /
- 1998.10c
- /
- pp.600-602
- /
- 1998
본 논문은 정면과 측면 얼굴 이미지의 특성을 살린 3차원 개인 아바타 합성에 관한 연구이다. 표준 얼굴 메쉬를 얼굴 이미지의 특징점에 맞추려는 힘을 특징점 이외의 점들까지의 거리에 대한 가우스 분포를 따라 부드럽게 전달시켜 매쉬를 탄성있게 변형하는 힘으로 작용시켜 메쉬를 얼굴 이미지의 윤곽선을 중심으로 매칭시키고, 매칭된 메쉬가 매칭 이전의 메쉬의 기하학적 특성을 유지할 수 있도록 메쉬에 동적 피부 모델을 적용한다. 이렇게 생성한 3차원 메쉬에 이미지를 텍스춰 매핑하여 개인 특성을 살린 3차원 개인 아바타를 생성한다.
PDF

Variational Auto Encoder Distributed Restrictions for Image Generation (이미지 생성을 위한 변동 자동 인코더 분산 제약)

Yong-Gil Kim
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.23 no.3
- /
- pp.91-97
- /
- 2023
Recent research shows that latent directions can be used to image process towards certain attributes. However, controlling the generation process of generative model is very difficult. Though the latent directions are used to image process for certain attributes, many restrictions are required to enhance the attributes received the latent vectors according to certain text and prompts and other attributes largely unaffected. This study presents a generative model having certain restriction to the latent vectors for image generation and manipulation. The suggested method requires only few minutes per manipulation, and the simulation results through Tensorflow Variational Auto-encoder show the effectiveness of the suggested approach with extensive results.
https://doi.org/10.7236/JIIBC.2023.23.3.91 인용 PDF HTML

Gray Image Generation Methods Using Genetic Algorithm (유전자 알고리즘을 이용한 흑백 이미지 생성 기법)

Cha, Joo Hyoung;Kang, Dong Sung;Song, Moo Sang;Kweon, Tae Hyeon;Woo, Young Woon
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2019.05a
- /
- pp.265-267
- /
- 2019
In this paper, we propose a method to automatically generate gray images similar to existing images using genetic algorithms. We have proposed two techniques for gene modeling, which is the most important design element to apply genetic algorithm to real field problems. Experiments were performed on two different sizes of gray images using each of the proposed techniques. Experimental results show that there is a large difference in the evolutionary performance of each technique in gene modeling for image generation. Therefore, it can be understood that gene modeling should be carefully decided in order to generate an image similar to the existing image in the future, or to learn quickly and naturally to generate an image synthesized from different images.
PDF

Automatic Composition Algorithm based on Fractal Tree (프랙탈 트리를 이용한 자동 작곡 방법)

Kwak, Sung-Ho;Yoo, Min-Joon;Lee, In-Kwon
- 한국HCI학회:학술대회논문집
- /
- 2008.02a
- /
- pp.618-622
- /
- 2008
In this paper, we suggest new music composition algorithm based on fractal theory. User can define and control fractal shape by setting an initial state and production rules in L-System. We generate an asymmetric fractal tree based on L-System and probability. Then a music is generated by the fractal tree image using sonification techniques. We introduce two composition algorithm using the fractal tree. First, monophonic music can be generated by mapping x and y axis to velocity and pitch, respectively Second, harmonic music also can be generated by mapping x and y axis to time and pitch, respectively Using our composition algorithm, user can easily generate a music which has repeated pattern created by recursive feature of fractal, and a music which has structure similar to fractal tree image.
PDF

3D Object Extraction Mechanism via UML Sequence Models from Natural Language Requirements (자연어 요구사항으로부터 UML 시퀀스 모델을 경유한 3D 객체 추출 메커니즘)

Hyuntae Kim;Janghwan Kim;R. Young Chul Kim
- Proceedings of the Korea Information Processing Society Conference
- /
- 2024.05a
- /
- pp.490-493
- /
- 2024
현재 다양한 분야에서 AI 가 사용되고 있다. 최근에는 소프트웨어공학 관점에서 요구 사항 분석에 Chat GPT 와 같은 LLM 모델을 적용하고 있다. 하지만 1) 대부분의 생성형 AI 는 불투명한 공정을 통해 3D 이미지가 생성하고, 3D 이미지를 생성할 때마다 다른 이미지를 생성한다. 이에 따라 동일한 인물이나 사물을 사용하고 싶은 사용자들은 동일한 객체가 들어간 그림을 일관성 있게 생성할 수 없다. 2) 또한 LLM 과 이미지 생성 AI 와의 결합이 시도 되고 있지만 문장 의미 분석 성능이 부족하다. 이를 해결하기 위해, 자연어 요구사항을 언어학적 기법을 통해 분석하고, 분석 결과를 기반으로 UML 시퀀스 다이어그램 및 3D 객체 생성 메커니즘을 제안한다. 즉 언어학적 분석 기법을 통해, 요구사항의 정확한 의미와 속성을 추출한다. 그런 다음 추출된 정보를 시퀀스 다이어그램과 매핑하여 3D 객체 이미지를 생성한다. 제안하는 방법을 통해 3D 객체 생성의 소프트웨어 개발 공정 사용으로 생산성을 높여 시간과 비용을 단축할 수 있을 것으로 기대한다.
PDF

A Generative Design Algorithm to Generate 3D shapes Using 2D Images (2차원 이미지로 3차원 형태를 생성하는 생성적 디자인 알고리듬)

Kim, Hyeon Ji;Chung, Yun Chan
- Design Convergence Study
- /
- v.15 no.6
- /
- pp.229-241
- /
- 2016
In generative design computer automatically and quickly generates many alternative design solutions, and the computer algorithms which perform the design tasks are important. The main purpose of this study is to propose a computer algorithm which generates three-dimensional shapes from a two-dimensional digital image such as a photograph or a painting. The base geometry of the final shape is a cylinder or sphere. A surface of the cylinder or sphere is deformed depending on the used image. The proposed algorithm was implemented as a computer program, and the program tested with several famous paintings. The algorithm and results presented in this study implicate the possibilities of the generative design which generates three-dimensional shapes from two-dimensional images. It is necessary to find and measure the values of the generated shape, and it will be a future research to find the relations of emotional and cognitive aspects between the input images and the generated shapes. Those studies are expected to expand the possibilities of generative design.

Multi Cycle Consistent Adversarial Networks for Multi Attribute Image to Image Translation

Jo, Seok Hee;Cho, Kyu Cheol
- Journal of the Korea Society of Computer and Information
- /
- v.25 no.9
- /
- pp.63-69
- /
- 2020
Image-image translation is a technology that creates a target image through input images, and has recently shown high performance in creating a more realistic image by utilizing GAN, which is a non-map learning structure. Therefore, there are various studies on image-to-image translation using GAN. At this point, most image-to-image translations basically target one attribute translation. But the data used and obtainable in real life consist of a variety of features that are hard to explain with one feature. Therefore, if you aim to change multiple attributes that can divide the image creation process by attributes to take advantage of the various attributes, you will be able to play a better role in image-to-image translation. In this paper, we propose Multi CycleGAN, a dual attribute transformation structure, by utilizing CycleGAN, which showed high performance among image-image translation structures using GAN. This structure implements a dual transformation structure in which three domains conduct two-way learning to learn about the two properties of an input domain. Experiments have shown that images through the new structure maintain the properties of the input area and show high performance with the target properties applied. Using this structure, it is possible to create more diverse images in the future, so we can expect to utilize image generation in more diverse areas.
https://doi.org/10.9708/jksci.2020.25.09.063 인용 PDF KSCI

Design and Implementation of Deep-Learning-Based Image Tag for Semantic Image Annotation in Mobile Environment (모바일 환경에서 딥러닝을 활용한 의미기반 이미지 어노테이션을 위한 이미지 태그 설계 및 구현)

Shin, YoonMi;Ahn, Jinhyun;Im, Dong-Hyuk
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.10a
- /
- pp.895-897
- /
- 2019
모바일의 기술 발전과 소셜미디어 사용의 증가로 수없이 많은 멀티미디어 콘텐츠들이 생성되고 있다. 이러한 많은 양의 콘텐츠 중에서 사용자가 원하는 이미지를 효율적으로 찾기 위해 의미 기반 이미지 검색을 이용한다. 이 검색 기법은 이미지에 의미 있는 정보들을 이용하여 사용자가 찾고 자하는 이미지를 정확하게 찾을 수 있다. 본 연구에서는 모바일 환경에서 이미지가 가질 수 있는 의미적 정보를 어노테이션 하고 이와 더불어 모바일에 있는 이미지에 풍성한 어노테이션을 위해 딥러닝 기술을 이용하여 다양한 태그들을 자동 생성하도록 구현하였다. 이렇게 생성된 어노테이션 정보들은 의미적 기반 태그를 통해 RDF 트리플로 확장된다. SPARQL 질의어를 이용하여 의미 기반 이미지 검색을 할 수 있다.
https://doi.org/10.3745/PKIPS.y2019m10a.895 인용 PDF

Search Result 1,469, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)