• Title/Summary/Keyword: Image Generative AI

Search Result 40, Processing Time 0.017 seconds

Transforming Text into Video: A Proposed Methodology for Video Production Using the VQGAN-CLIP Image Generative AI Model

  • SukChang Lee
    • International Journal of Advanced Culture Technology
    • /
    • v.11 no.3
    • /
    • pp.225-230
    • /
    • 2023
  • With the development of AI technology, there is a growing discussion about Text-to-Image Generative AI. We presented a Generative AI video production method and delineated a methodology for the production of personalized AI-generated videos with the objective of broadening the landscape of the video domain. And we meticulously examined the procedural steps involved in AI-driven video production and directly implemented a video creation approach utilizing the VQGAN-CLIP model. The outcomes produced by the VQGAN-CLIP model exhibited a relatively moderate resolution and frame rate, and predominantly manifested as abstract images. Such characteristics indicated potential applicability in OTT-based video content or the realm of visual arts. It is anticipated that AI-driven video production techniques will see heightened utilization in forthcoming endeavors.

Development of a Shoe Recommendation Model for Matching Outfits Using Generative Artificial Intelligence (생성형 인공지능을 활용한 신발 추천 모델 개발)

  • Jun Woo CHOI
    • Journal of Korea Artificial Intelligence Association
    • /
    • v.1 no.1
    • /
    • pp.7-10
    • /
    • 2023
  • This study proposes an AI-based shoe recommendation model based on user clothing image data to solve the problem of the global fashion industry, which is worsening due to factors such as the economic downturn. Shoes are an important part of modern fashion, and this research aims to improve user satisfaction and contribute to economic growth through a generative AI-based shoe recommendation service. By utilizing generative AI in the personalized consumer market, we show the feasibility, efficiency, and improvements through an accessible web-based implementation. In conclusion, this study provides insights to help fulfill consumer needs in the ever-changing fashion market by implementing a generative AI-based shoe recommendation model.

A Study on User Experience through Analysis of the Creative Process of Using Image Generative AI: Focusing on User Agency in Creativity (이미지 생성형 AI의 창작 과정 분석을 통한 사용자 경험 연구: 사용자의 창작 주체감을 중심으로)

  • Daeun Han;Dahye Choi;Changhoon Oh
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.4
    • /
    • pp.667-679
    • /
    • 2023
  • The advent of image generative AI has made it possible for people who are not experts in art and design to create finished artworks through text input. With the increasing availability of generated images and their impact on the art industry, there is a need for research on how users perceive the process of co-creating with AI. In this study, we conducted an experimental study to investigate the expected and experienced processes of image generative AI creation among general users and to find out which processes affect users' sense of creative agency. The results showed that there was a gap between the expected and experienced creative process, and users tended to perceive a low sense of creative agency. We recommend eight ways that AI can act as an enabler to support users' creative intentions so that they can experience a higher sense of creative agency. This study can contribute to the future development of image-generating AI by considering user-centered creative experiences.

A Research on AI Generated 2D Image to 3D Modeling Technology

  • Ke Ma;Jeanhun Chung
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.16 no.2
    • /
    • pp.81-86
    • /
    • 2024
  • Advancements in generative AI are reshaping graphic and 3D content design landscapes, where AI not only enriches graphic design but extends its reach to 3D content creation. Though 3D texture mapping through AI is advancing, AI-generated 3D modeling technology in this realm remains nascent. This paper presents AI 2D image-driven 3D modeling techniques, assessing their viability in 3D content design by scrutinizing various algorithms. Initially, four OBJ model-exporting AI algorithms are screened, and two are further evaluated. Results indicate that while AI-generated 3D models may not be directly usable, they effectively capture reference object structures, offering substantial time savings and enhanced design efficiency through manual refinements. This endeavor pioneers new avenues for 3D content creators, anticipating a dynamic fusion of AI and 3D design.

Generative AI-based Exterior Building Design Visualization Approach in the Early Design Stage - Leveraging Architects' Style-trained Models - (생성형 AI 기반 초기설계단계 외관디자인 시각화 접근방안 - 건축가 스타일 추가학습 모델 활용을 바탕으로 -)

  • Yoo, Youngjin;Lee, Jin-Kook
    • Journal of KIBIM
    • /
    • v.14 no.2
    • /
    • pp.13-24
    • /
    • 2024
  • This research suggests a novel visualization approach utilizing Generative AI to render photorealistic architectural alternatives images in the early design phase. Photorealistic rendering intuitively describes alternatives and facilitates clear communication between stakeholders. Nevertheless, the conventional rendering process, utilizing 3D modelling and rendering engines, demands sophisticate model and processing time. In this context, the paper suggests a rendering approach employing the text-to-image method aimed at generating a broader range of intuitive and relevant reference images. Additionally, it employs an Text-to-Image method focused on producing a diverse array of alternatives reflecting architects' styles when visualizing the exteriors of residential buildings from the mass model images. To achieve this, fine-tuning for architects' styles was conducted using the Low-Rank Adaptation (LoRA) method. This approach, supported by fine-tuned models, allows not only single style-applied alternatives, but also the fusion of two or more styles to generate new alternatives. Using the proposed approach, we generated more than 15,000 meaningful images, with each image taking only about 5 seconds to produce. This demonstrates that the Generative AI-based visualization approach significantly reduces the labour and time required in conventional visualization processes, holding significant potential for transforming abstract ideas into tangible images, even in the early stages of design.

A Research on Aesthetic Aspects of Checkpoint Models in [Stable Diffusion]

  • Ke Ma;Jeanhun Chung
    • International journal of advanced smart convergence
    • /
    • v.13 no.2
    • /
    • pp.130-135
    • /
    • 2024
  • The Stable diffsuion AI tool is popular among designers because of its flexible and powerful image generation capabilities. However, due to the diversity of its AI models, it needs to spend a lot of time testing different AI models in the face of different design plans, so choosing a suitable general AI model has become a big problem at present. In this paper, by comparing the AI images generated by two different Stable diffsuion models, the advantages and disadvantages of each model are analyzed from the aspects of the matching degree of the AI image and the prompt, the color composition and light composition of the image, and the general AI model that the generated AI image has an aesthetic sense is analyzed, and the designer does not need to take cumbersome steps. A satisfactory AI image can be obtained. The results show that Playground V2.5 model can be used as a general AI model, which has both aesthetic and design sense in various style design requirements. As a result, content designers can focus more on creative content development, and expect more groundbreaking technologies to merge generative AI with content design.

Utilization Strategies of Generative AI Platforms for CG Education (CG 교육을 위한 생성형 인공지능 플랫폼 활용 방안)

  • Donghee Suh
    • Journal of Practical Engineering Education
    • /
    • v.15 no.2
    • /
    • pp.357-364
    • /
    • 2023
  • Due to the rapid advancement of AI technology, generative artificial intelligence platforms are experiencing innovative applications in various fields. In this paper, it examines research cases involving the utilization of AI in education, explore instances where generative AI platforms are applied in the realm of creative endeavors, and discuss the direction of utilizing generative AI in educational contexts. In the field of computer graphics, this study introduced generative AI platforms that are applicable for image creation, editing, and video editing. It also proposed platforms that can be utilized in the video editing production process. These generative AI platforms not only offer advantages in terms of efficiency, by reducing the efforts of creators and saving time in the production process, but they also present positive aspects in enhancing individual capabilities. It is advocated that their swift integration into education is necessary, considering these benefits. This study aims to provide direction for the expansion of creative education utilizing generative AI platforms.

Research on AI Painting Generation Technology Based on the [Stable Diffusion]

  • Chenghao Wang;Jeanhun Chung
    • International journal of advanced smart convergence
    • /
    • v.12 no.2
    • /
    • pp.90-95
    • /
    • 2023
  • With the rapid development of deep learning and artificial intelligence, generative models have achieved remarkable success in the field of image generation. By combining the stable diffusion method with Web UI technology, a novel solution is provided for the application of AI painting generation. The application prospects of this technology are very broad and can be applied to multiple fields, such as digital art, concept design, game development, and more. Furthermore, the platform based on Web UI facilitates user operations, making the technology more easily applicable to practical scenarios. This paper introduces the basic principles of Stable Diffusion Web UI technology. This technique utilizes the stability of diffusion processes to improve the output quality of generative models. By gradually introducing noise during the generation process, the model can generate smoother and more coherent images. Additionally, the analysis of different model types and applications within Stable Diffusion Web UI provides creators with a more comprehensive understanding, offering valuable insights for fields such as artistic creation and design.

Analysis and Forecast of Venture Capital Investment on Generative AI Startups: Focusing on the U.S. and South Korea (생성 AI 스타트업에 대한 벤처투자 분석과 예측: 미국과 한국을 중심으로)

  • Lee, Seungah;Jung, Taehyun
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.18 no.4
    • /
    • pp.21-35
    • /
    • 2023
  • Expectations surrounding generative AI technology and its profound ramifications are sweeping across various industrial domains. Given the anticipated pivotal role of the startup ecosystem in the utilization and advancement of generative AI technology, it is imperative to cultivate a deeper comprehension of the present state and distinctive attributes characterizing venture capital (VC) investments within this domain. The current investigation delves into South Korea's landscape of VC investment deals and prognosticates the projected VC investments by juxtaposing these against the United States, the frontrunner in the generative AI industry and its associated ecosystem. For analytical purposes, a compilation of 286 investment deals originating from 117 U.S. generative AI startups spanning the period from 2008 to 2023, as well as 144 investment deals from 42 South Korean generative AI startups covering the years 2011 to 2023, was amassed to construct new datasets. The outcomes of this endeavor reveal an upward trajectory in the count of VC investment deals within both the U.S. and South Korea during recent years. Predominantly, these deals have been concentrated within the early-stage investment realm. Noteworthy disparities between the two nations have also come to light. Specifically, in the U.S., in contrast to South Korea, the quantum of recent VC deals has escalated, marking an augmentation ranging from 285% to 488% in the corresponding developmental stage. While the interval between disparate investment stages demonstrated a slight elongation in South Korea relative to the U.S., this discrepancy did not achieve statistical significance. Furthermore, the proportion of VC investments channeled into generative AI enterprises, relative to the aggregate number of deals, exhibited a higher quotient in South Korea compared to the U.S. Upon a comprehensive sectoral breakdown of generative AI, it was discerned that within the U.S., 59.2% of total deals were concentrated in the text and model sectors, whereas in South Korea, 61.9% of deals centered around the video, image, and chat sectors. Through forecasting, the anticipated VC investments in South Korea from 2023 to 2029 were derived via four distinct models, culminating in an estimated average requirement of 3.4 trillion Korean won (ranging from at least 2.408 trillion won to a maximum of 5.919 trillion won). This research bears pragmatic significance as it methodically dissects VC investments within the generative AI domain across both the U.S. and South Korea, culminating in the presentation of an estimated VC investment projection for the latter. Furthermore, its academic significance lies in laying the groundwork for prospective scholarly inquiries by dissecting the current landscape of generative AI VC investments, a sphere that has hitherto remained void of rigorous academic investigation supported by empirical data. Additionally, the study introduces two innovative methodologies for the prediction of VC investment sums. Upon broader integration, application, and refinement of these methodologies within diverse academic explorations, they stand poised to enhance the prognosticative capacity pertaining to VC investment costs.

  • PDF