• 제목/요약/키워드: AI image analysis

Search Result 165, Processing Time 0.025 seconds

A Study on the Explainability of Inception Network-Derived Image Classification AI Using National Defense Data (국방 데이터를 활용한 인셉션 네트워크 파생 이미지 분류 AI의 설명 가능성 연구)

  • Kangun Cho
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.27 no.2
    • /
    • pp.256-264
    • /
    • 2024
  • In the last 10 years, AI has made rapid progress, and image classification, in particular, are showing excellent performance based on deep learning. Nevertheless, due to the nature of deep learning represented by a black box, it is difficult to actually use it in critical decision-making situations such as national defense, autonomous driving, medical care, and finance due to the lack of explainability of judgement results. In order to overcome these limitations, in this study, a model description algorithm capable of local interpretation was applied to the inception network-derived AI to analyze what grounds they made when classifying national defense data. Specifically, we conduct a comparative analysis of explainability based on confidence values by performing LIME analysis from the Inception v2_resnet model and verify the similarity between human interpretations and LIME explanations. Furthermore, by comparing the LIME explanation results through the Top1 output results for Inception v3, Inception v2_resnet, and Xception models, we confirm the feasibility of comparing the efficiency and availability of deep learning networks using XAI.

Development of a Slope Condition Analysis System using IoT Sensors and AI Camera (IoT 센서와 AI 카메라를 융합한 급경사지 상태 분석 시스템 개발)

  • Seungjoo Lee;Kiyen Jeong;Taehoon Lee;YoungSeok Kim
    • Journal of the Korean Geosynthetics Society
    • /
    • v.23 no.2
    • /
    • pp.43-52
    • /
    • 2024
  • Recent abnormal climate conditions have increased the risk of slope collapses, which frequently result in significant loss of life and property due to the absence of early prediction and warning dissemination. In this paper, we develop a slope condition analysis system using IoT sensors and AI-based camera to assess the condition of slopes. To develop the system, we conducted hardware and firmware design for measurement sensors considering the ground conditions of slopes, designed AI-based image analysis algorithms, and developed prediction and warning solutions and systems. We aimed to minimize errors in sensor data through the integration of IoT sensor data and AI camera image analysis, ultimately enhancing the reliability of the data. Additionally, we evaluated the accuracy (reliability) by applying it to actual slopes. As a result, sensor measurement errors were maintained within 0.1°, and the data transmission rate exceeded 95%. Moreover, the AI-based image analysis system demonstrated nighttime partial recognition rates of over 99%, indicating excellent performance even in low-light conditions. Through this research, it is anticipated that the analysis of slope conditions and smart maintenance management in various fields of Social Overhead Capital (SOC) facilities can be applied.

A Comparative Study on the Features and Applications of AI Tools -Focus on PIKA Labs and RUNWAY

  • Biying Guo;Xinyi Shan;Jeanhun Chung
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.16 no.1
    • /
    • pp.86-91
    • /
    • 2024
  • In the field of artistic creation, the iterative development of AI-generated video software has pushed the boundaries of multimedia content creation and provided powerful creative tools for non-professionals. This paper extensively examines two leading AI-generated video software, PIKA Labs and RUNWAY, discussing their functions, performance differences, and application scopes in the video generation domain. Through detailed operational examples, a comparative analysis of their functionalities, as well as the advantages and limitations of each in generating video content, is presented. By comparison, it can be found that PIKA Labs and RUNWAY have excellent performance in stability and creativity. Therefore, the purpose of this study is to comprehensively elucidate the operating mechanisms of these two AI software, in order to intuitively demonstrate the advantages of each software. Simultaneously, this study provides valuable references for professionals and creators in the video production field, assisting them in selecting the most suitable tools for different scenarios, thereby advancing the application and development of AI-generated video software in multimedia content creation.

A study on the analysis of characteristics of fashion images shown in an AI image generation program (AI 이미지 생성 프로그램에서 나타난 패션 이미지의 특징 분석 연구)

  • Park, Keunsoo
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.3
    • /
    • pp.199-207
    • /
    • 2024
  • Today, AI image creation technology is being expanded and utilized across industries. Accordingly, various AI image creation programs optimized for the fashion industry are being developed and commercialized. In this study, we compared and analyzed the visual characteristics of fashion images created by AI image creation programs such as Playground, Midjourney, and The New Black to identify the characteristics of each program and point out areas where each program can be used and problems. The results are as follows: First, while Playground and Midjourney intuitively applied the contents of the command to create images that were different from actual fashion trends, Dannew Black created images that were relatively similar to fashion trends. Second, while Playground separates or combines images corresponding to the command content, Midjourny tends to create new images by adding and fusing various details. Third, in Playground, colors not included in the command appear randomly, and in The New Black, colors not included in the command appear coordinated, and Midjourney generates the color specified in the command relatively accurately. In conclusion, Midjourney can be used when seeking inspiration for developing unique and creative fashion designs, and The New Black can be helpful in referencing fashion trends or fashion styling. On the other hand, playgrounds can be somewhat confusing when it comes to color creation, so this is something to be careful about. It is expected that AI image creation tools can be used more efficiently in fashion design development.

Research Trend Analysis by using Text-Mining Techniques on the Convergence Studies of AI and Healthcare Technologies (텍스트 마이닝 기법을 활용한 인공지능과 헬스케어 융·복합 분야 연구동향 분석)

  • Yoon, Jee-Eun;Suh, Chang-Jin
    • Journal of Information Technology Services
    • /
    • v.18 no.2
    • /
    • pp.123-141
    • /
    • 2019
  • The goal of this study is to review the major research trend on the convergence studies of AI and healthcare technologies. For the study, 15,260 English articles on AI and healthcare related topics were collected from Scopus for 55 years from 1963, and text mining techniques were conducted. As a result, seven key research topics were defined : "AI for Clinical Decision Support System (CDSS)", "AI for Medical Image", "Internet of Healthcare Things (IoHT)", "Big Data Analytics in Healthcare", "Medical Robotics", "Blockchain in Healthcare", and "Evidence Based Medicine (EBM)". The result of this study can be utilized to set up and develop the appropriate healthcare R&D strategies for the researchers and government. In this study, text mining techniques such as Text Analysis, Frequency Analysis, Topic Modeling on LDA (Latent Dirichlet Allocation), Word Cloud, and Ego Network Analysis were conducted.

A Study on the Comparative Analysis of Overseas Medical Care Video and Domestic Medical Care Video (해외 의료케어 전문 영상과 국내 의료케어 영상 비교분석에 관한 연구)

  • Cho, Hyun Kyung
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.4
    • /
    • pp.415-420
    • /
    • 2021
  • In a situation where the medical care field is developing from various angles, medical promotion video analysis has an important meaning. It is important as a matter of improving competitiveness, and in the era of acceleration of AI systems, medical care is also the leading field. Accordingly, the importance of videos on publicity, advertisements, and explanations is very important, and it is also an important direction to change the image of a company. In this study, the design characteristics and differences in the video were compared, focusing on the comparative analysis of professional videos of AI medical brands, with two foreign major companies (Stryker and Hill-rom) and one domestic leading company (Nine Bell), and detailed part analysis and section analysis were performed accordingly. As a technical partial analysis of image editing, the transition method and infographic graphics were considered. In an in-depth comparison, we found that AI medical imaging Points such as differences in image tone and image color harmony were analyzed and compared. For a detailed analysis in the video image determination part, we compared and studied the differentiated elements appearing in the promotional design and specific scenes of the video intro part and the product description video part of each video.

Feasibility Study of CNN-based Super-Resolution Algorithm Applied to Low-Resolution CT Images

  • Doo Bin KIM;Mi Jo LEE;Joo Wan HONG
    • Korean Journal of Artificial Intelligence
    • /
    • v.12 no.1
    • /
    • pp.1-6
    • /
    • 2024
  • Recently, various techniques are being applied through the development of medical AI, and research has been conducted on the application of super-resolution AI models. In this study, evaluate the results of the application of the super-resolution AI model to brain CT as the basic data for future research. Acquiring CT images of the brain, algorithm for brain and bone windowing setting, and the resolution was downscaled to 5 types resolution image based on the original resolution image, and then upscaled to resolution to create an LR image and used for network input with the original imaging. The SRCNN model was applied to each of these images and analyzed using PSNR, SSIM, Loss. As a result of quantitative index analysis, the results were the best at 256×256, the brain and bone window setting PSNR were the same at 33.72, 35.2, and SSIM at 0.98 respectively, and the loss was 0.0004 and 0.0003, respectively, showing relatively excellent performance in the bone window setting CT image. The possibility of future studies aimed image quality and exposure dose is confirmed, and additional studies that need to be verified are also presented, which can be used as basic data for the above studies.

Application of Artificial Intelligence-based Digital Pathology in Biomedical Research

  • Jin Seok Kang
    • Biomedical Science Letters
    • /
    • v.29 no.2
    • /
    • pp.53-57
    • /
    • 2023
  • The main objective of pathologists is to achieve accurate lesion diagnoses, which has become increasingly challenging due to the growing number of pathological slides that need to be examined. However, using digital technology has made it easier to complete this task compared to older methods. Digital pathology is a specialized field that manages data from digitized specimen slides, utilizing image processing technology to automate and improve analysis. It aims to enhance the precision, reproducibility, and standardization of pathology-based researches, preclinical, and clinical trials through the sophisticated techniques it employs. The advent of whole slide imaging (WSI) technology is revolutionizing the pathology field by replacing glass slides as the primary method of pathology evaluation. Image processing technology that utilizes WSI is being implemented to automate and enhance analysis. Artificial intelligence (AI) algorithms are being developed to assist pathologic diagnosis and detection and segmentation of specific objects. Application of AI-based digital pathology in biomedical researches is classified into four areas: diagnosis and rapid peer review, quantification, prognosis prediction, and education. AI-based digital pathology can result in a higher accuracy rate for lesion diagnosis than using either a pathologist or AI alone. Combining AI with pathologists can enhance and standardize pathology-based investigations, reducing the time and cost required for pathologists to screen tissue slides for abnormalities. And AI-based digital pathology can identify and quantify structures in tissues. Lastly, it can help predict and monitor disease progression and response to therapy, contributing to personalized medicine.

Development of AI Image Analysis Emergency Door Opening and Closing System linked Wired/Wireless Counting (유무선 카운팅 연동형 AI 영상분석 비상문 개폐 시스템 개발)

  • Cheol-soo, Kang;Ji-yun, Hong;Bong-hyun, Kim
    • Journal of Digital Policy
    • /
    • v.1 no.2
    • /
    • pp.1-8
    • /
    • 2022
  • In case of a dangerous situation, the roof, which serves as an emergency exit, must be open in case of fire according to the Fire Act. However, when the roof door is opened, it has become a place of various incidents and accidents such as illegal entry, crime, and suicide. As a result, it is a reality to close the roof door in terms of facility management to prevent crime, various incidents, and accidents. Accordingly, the government is pushing to legislate regulations on housing construction standards, etc. that mandate the installation of electronic automatic opening and closing devices on rooftop doors. Therefore, in this paper, an intelligent emergency door opening/closing device system is proposed. To this end, an intelligent emergency door opening and closing system was developed by linking wired and wireless access counting and AI image analysis. Finally, it is possible to build a wireless communication-based integrated management platform that provides remote control and history management in a centralized method of device status real-time monitoring and event alarm.

Fashion attribute-based mixed reality visualization service (패션 속성기반 혼합현실 시각화 서비스)

  • Yoo, Yongmin;Lee, Kyounguk;Kim, Kyungsun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.2-5
    • /
    • 2022
  • With the advent of deep learning and the rapid development of ICT (Information and Communication Technology), research using artificial intelligence is being actively conducted in various fields of society such as politics, economy, and culture and so on. Deep learning-based artificial intelligence technology is subdivided into various domains such as natural language processing, image processing, speech processing, and recommendation system. In particular, as the industry is advanced, the need for a recommendation system that analyzes market trends and individual characteristics and recommends them to consumers is increasingly required. In line with these technological developments, this paper extracts and classifies attribute information from structured or unstructured text and image big data through deep learning-based technology development of 'language processing intelligence' and 'image processing intelligence', and We propose an artificial intelligence-based 'customized fashion advisor' service integration system that analyzes trends and new materials, discovers 'market-consumer' insights through consumer taste analysis, and can recommend style, virtual fitting, and design support.

  • PDF