• Title/Summary/Keyword: AI 생성 이미지

Search Result 106, Processing Time 0.026 seconds

Development of a Web Service for Cosmetics Recommendation based on an Artificial Intelligence for User Personal Color Generation (사용자 퍼스널 컬러 생성을 위한 인공지능 기반 화장품 추천 웹 서비스 개발)

  • Suk-Hyung Hwang;Min-Taek Lim;Hun-Tae Hwang;Seung-Jun Lee;Soo-Hwan Kim;Se-Woong Hwang
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2023.01a
    • /
    • pp.461-463
    • /
    • 2023
  • MZ세대를 중심으로 자기관리를 열심히 하는 사람들이 증가함에 따라 화장의 기본이 되는 개인 피부톤(퍼스널 컬러)을 찾는 것이 중요시되고 있다. 현재 대다수 사람은 자신에게 어울리는 퍼스널 컬러를 찾기 위해 높은 비용을 지불하여 전문가를 이용하거나 객관적이고 정량화된 기준 없이 오랜 시간을 투자하여 스스로 퍼스널 컬러를 찾는 등 시간과 비용 측면에서의 한계점을 가지고 있다. 본 논문에서는 이를 보완하기 위해 이미지 기반 인공지능 기술(객체 탐지, 객체 분할, BeautyGAN)을 적용하여 데이터 기반의 정량적인 기준을 생성하고, 퍼스널 컬러에 알맞은 화장품 추천 웹 서비스를 제안한다.

  • PDF

CINEMAPIC : Generative AI-based movie concept photo booth system (시네마픽 : 생성형 AI기반 영화 컨셉 포토부스 시스템)

  • Seokhyun Jeong;Seungkyu Leem;Jungjin Lee
    • Journal of the Korea Computer Graphics Society
    • /
    • v.30 no.3
    • /
    • pp.149-158
    • /
    • 2024
  • Photo booths have traditionally provided a fun and easy way to capture and print photos to cherish memories. These booths allow individuals to capture their desired poses and props, sharing memories with friends and family. To enable diverse expressions, generative AI-powered photo booths have emerged. However, existing AI photo booths face challenges such as difficulty in taking group photos, inability to accurately reflect user's poses, and the challenge of applying different concepts to individual subjects. To tackle these issues, we present CINEMAPIC, a photo booth system that allows users to freely choose poses, positions, and concepts for their photos. The system workflow includes three main steps: pre-processing, generation, and post-processing to apply individualized concepts. To produce high-quality group photos, the system generates a transparent image for each character and enhances the backdrop-composited image through a small number of denoising steps. The workflow is accelerated by applying an optimized diffusion model and GPU parallelization. The system was implemented as a prototype, and its effectiveness was validated through a user study and a large-scale pilot operation involving approximately 400 users. The results showed a significant preference for the proposed system over existing methods, confirming its potential for real-world photo booth applications. The proposed CINEMAPIC photo booth is expected to lead the way in a more creative and differentiated market, with potential for widespread application in various fields.

Comparative Evaluation of 18F-FDG Brain PET/CT AI Images Obtained Using Generative Adversarial Network (생성적 적대 신경망(Generative Adversarial Network)을 이용하여 획득한 18F-FDG Brain PET/CT 인공지능 영상의 비교평가)

  • Kim, Jong-Wan;Kim, Jung-Yul;Lim, Han-sang;Kim, Jae-sam
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.24 no.1
    • /
    • pp.15-19
    • /
    • 2020
  • Purpose Generative Adversarial Network(GAN) is one of deep learning technologies. This is a way to create a real fake image after learning the real image. In this study, after acquiring artificial intelligence images through GAN, We were compared and evaluated with real scan time images. We want to see if these technologies are potentially useful. Materials and Methods 30 patients who underwent 18F-FDG Brain PET/CT scanning at Severance Hospital, were acquired in 15-minute List mode and reconstructed into 1,2,3,4,5 and 15minute images, respectively. 25 out of 30 patients were used as learning images for learning of GAN and 5 patients used as verification images for confirming the learning model. The program was implemented using the Python and Tensorflow frameworks. After learning using the Pix2Pix model of GAN technology, this learning model generated artificial intelligence images. The artificial intelligence image generated in this way were evaluated as Mean Square Error(MSE), Peak Signal to Noise Ratio(PSNR), and Structural Similarity Index(SSIM) with real scan time image. Results The trained model was evaluated with the verification image. As a result, The 15-minute image created by the 5-minute image rather than 1-minute after the start of the scan showed a smaller MSE, and the PSNR and SSIM increased. Conclusion Through this study, it was confirmed that AI imaging technology is applicable. In the future, if these artificial intelligence imaging technologies are applied to nuclear medicine imaging, it will be possible to acquire images even with a short scan time, which can be expected to reduce artifacts caused by patient movement and increase the efficiency of the scanning room.

A Study of 3D Digital Fashion Design Using Kazmir Malevich's Formative Elements as AI Prompt (카지미르 말레비치의 조형적 요소를 AI 프롬프트로 활용한 3D 디지털 패션디자인 연구)

  • Jooyoung Lee
    • Journal of Fashion Business
    • /
    • v.28 no.3
    • /
    • pp.122-139
    • /
    • 2024
  • Image-generated AI is rapidly emerging as a powerful tool to augment human creativity and transform the art and design process through deep learning capabilities. The purpose of this study was to propose and demonstrate the feasibility of a new design development method that combined traditional design methods and technology by constructing image-generated AI prompts based on artists' formative elements. The study methodology consisted of analyzing Kazmir Malevich's theoretical considerations and applying them to AI prompts for design, print pattern development, and 3D digital design. This study found that the suprematist works of Kazmir Malevich were suitable as design and print pattern prompts due to their clear geometric shapes, colors, and spatial arrangement. The AI-prompted designs and print patterns produced diverse results quickly and enabled an efficient design process compared to traditional methods, although additional refinement was required to perfect the details. The AI-generated designs were successfully produced as 3D garments, thereby demonstrating that AI technology could significantly contribute to fashion design through its integration with artistic principles. This study has academic significance in that it proposes a prompt composition method applicable to fashion design by combining AI and artistic elements. It also has industrial significance in that it contributes to design innovation and the implementation of creative ideas by presenting an AI-based design process that can be practically applied.

Analysis of Generative AI Technology Trends Based on Patent Data (특허 데이터 기반 생성형 AI 기술 동향 분석)

  • Seongmu Ryu;Taewon Song;Minjeong Lee;Yoonju Choi;Soonuk Seol
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.17 no.1
    • /
    • pp.1-9
    • /
    • 2024
  • This paper analyzes the trends in generative AI technology based on patent application documents. To achieve this, we selected 5,433 generative AI-related patents filed in South Korea, the United States, and Europe from 2003 to 2023, and analyzed the data by country, technology category, year, and applicant, presenting it visually to find insights and understand the flow of technology. The analysis shows that patents in the image category account for 36.9%, the largest share, with a continuous increase in filings, while filings in the text/document and music/speech categories have either decreased or remained stable since 2019. Although the company with the highest number of filings is a South Korean company, four out of the top five filers are U.S. companies, and all companies have filed the majority of their patents in the U.S., indicating that generative AI is growing and competing centered around the U.S. market. The findings of this paper are expected to be useful for future research and development in generative AI, as well as for formulating strategies for acquiring intellectual property.

A Study on How to Operate the Curriculum·Comparative Division for Animation Majors in the Era of Image-generating AI: Focusing on the AI Technology Convergence Process (이미지생성AI시대 애니메이션학과의 교과·비교과 운영 안 연구: AI기술융합 과정을 중심으로)

  • Sung Won Park;You Jin Gong
    • Journal of Information Technology Applications and Management
    • /
    • v.31 no.4
    • /
    • pp.99-119
    • /
    • 2024
  • Focusing on the rapid progress of image generation AI, this study examines the changes in talent required according to changes in the production process of the content industry, and proposes an educational management plan for the subject and comparative department of the university's animation major. First, through environmental analysis, the trend of the animation content industry is analyzed in three stages, and the necessity of producing AI-adapted content talent is derived by re-establishing the talent image of the university's animation major and introducing it into rapid education. Next, we present a case designed by applying teaching methods to improve technology convergence capabilities and project-oriented capabilities by presenting subject and non-curricular cases operated in the animation department of the researcher's university. Through this, we propose the necessity of education to cultivate animation content talent who can play technical and administrative roles by utilizing various AI systems in the future. The goal of this study is to establish a cornerstone study by presenting application cases and having the status of a university as a talent supplier that can lead the content industry beyond the era of AI content production that breaks the boundaries of genres between contents. In conclusion, it is intended to propose the application of education to create value through technology convergence capabilities and project-oriented capabilities to cultivate AI-adapted content talents.

AI Art Creation Case Study for AI Film & Video Content (AI 영화영상콘텐츠를 위한 AI 예술창작 사례연구)

  • Jeon, Byoungwon
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.2
    • /
    • pp.85-95
    • /
    • 2021
  • Currently, we stand between computers as creative tools and computers as creators. A new genre of movies, which can be called a post-cinema situation, is emerging. This paper aims to diagnose the possibility of the emergence of AI cinema. To confirm the possibility of AI cinema, it was examined through a case study whether the creation of a story, narrative, image, and sound, which are necessary conditions for film creation, is possible by artificial intelligence. First, we checked the visual creation of AI painting algorithms Obvious, GAN, and CAN. Second, AI music has already entered the distribution stage in the market in cooperation with humans. Third, AI can already complete drama scripts, and automatic scenario creation programs using big data are also gaining popularity. That said, we confirmed that the filmmaking requirements could be met with AI algorithms. From the perspective of Manovich's 'AI Genre Convention', web documentaries and desktop documentaries, typical trends post-cinema, can be said to be representative genres that can be expected as AI cinemas. The conditions for AI, web documentaries and desktop documentaries to exist are the same. This article suggests a new path for the media of the 4th Industrial Revolution era through research on AI as a creator of post-cinema.

3D Object State Representation via State Diagram based on Informal Natural Language Requirement Specifications (비정형 자연어 요구 사항 기반 상태 모델을 통한 3D 객체의 상태 표현 메커니즘)

  • Ye Jin Jin;Chae Yun Seo;R. Young Chul Kim
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2024.05a
    • /
    • pp.494-496
    • /
    • 2024
  • 현재 소프트웨어 산업에서 자연어 요구사항의 정확한 분석 연구는 활발히 진행되고 있다. 그러나, 문법적인 분석만을 통해 해석하는 것이 일반적이다. 본 연구는 요구공학과 언어학 그리고 카툰 공학을 접목을 제안한다. 이를 위해서, 1) 언어학적 관점에는 촘스키의 구문 구조 분석 이론과 필모어의 의미역 이론을 결합하여 문법적, 의미적 분석을 수행한다. 2) 요구공학 관점에서는 요구사항 분석으로 상태 모델 속성 추출 및 접목한다. 3) 카툰 공학에서는 3D 이미지 생성한다. 또한, 해결 못했던 동사와 형용사에 대해 분석하여 범위를 확장한다. 즉 언어학적 분석을 바탕으로 UML 상태 다이어그램을 추출하고, 이를 3D 상태 이미지 생성한다. 본 연구는 AI 기술(Text to Image)에 소프트웨어 공학적 방법에서의 절차적인 공정과 재사용 적용함으로써, AI 내부 작동 원리에 대해 체계적으로 연구하고자 한다.

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

  • Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.143-174
    • /
    • 2023
  • With the development of deep learning technologies, Artificial Intelligence powered Optical Character Recognition (AI-OCR) has evolved to read multiple languages from various forms of images accurately. For the financial industry, where a large number of diverse documents are processed through manpower, the potential for using AI-OCR is great. In this study, we present a configuration and a design of an AI-OCR modality for use in the financial industry and discuss the platform construction with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing and are configured as a microservice architected platform to process a broad variety of documents. We have demonstrated the AI-OCR platform by applying it to financial domain tasks of document sorting, document verification, and typing assistance The demonstrations confirm the increasing work efficiency and conveniences.

Enhancing the performance of the facial keypoint detection model by improving the quality of low-resolution facial images (저화질 안면 이미지의 화질 개선를 통한 안면 특징점 검출 모델의 성능 향상)

  • KyoungOok Lee;Yejin Lee;Jonghyuk Park
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.2
    • /
    • pp.171-187
    • /
    • 2023
  • When a person's face is recognized through a recording device such as a low-pixel surveillance camera, it is difficult to capture the face due to low image quality. In situations where it is difficult to recognize a person's face, problems such as not being able to identify a criminal suspect or a missing person may occur. Existing studies on face recognition used refined datasets, so the performance could not be measured in various environments. Therefore, to solve the problem of poor face recognition performance in low-quality images, this paper proposes a method to generate high-quality images by performing image quality improvement on low-quality facial images considering various environments, and then improve the performance of facial feature point detection. To confirm the practical applicability of the proposed architecture, an experiment was conducted by selecting a data set in which people appear relatively small in the entire image. In addition, by choosing a facial image dataset considering the mask-wearing situation, the possibility of expanding to real problems was explored. As a result of measuring the performance of the feature point detection model by improving the image quality of the face image, it was confirmed that the face detection after improvement was enhanced by an average of 3.47 times in the case of images without a mask and 9.92 times in the case of wearing a mask. It was confirmed that the RMSE for facial feature points decreased by an average of 8.49 times when wearing a mask and by an average of 2.02 times when not wearing a mask. Therefore, it was possible to verify the applicability of the proposed method by increasing the recognition rate for facial images captured in low quality through image quality improvement.