• Title/Summary/Keyword: AI-based image analysis

Robot Journalism Research Trends and Future Prospects (로봇 저널리즘 연구 동향 및 미래 전망)

  • Cui, Jian-Dong;Song, Seung-keun
    • Journal of the Korea Institute of Information and Communication Engineering / v.24 no.2 / pp.333-336 / 2020
  • AI-powered robot news is drawing attention as artificial intelligence technology spreads throughout the news distribution field. Robot news still has many technical and ethical problems, but academic research on them is insufficient. This study analyzes the issues of robot writing in the artificial intelligence-based robot journalism industry using SWOT analysis. The analysis identified strengths (big data processing, accurate information gathering, and high efficiency), weaknesses (lack of independent arguments and lack of evidence), opportunities (technical development, government support, academic development, and industrial applications), and threats (uncritical acceptance and lack of talent). Drawing on previous studies of the development direction of robot journalism-based article writing, this study suggests three future-oriented directions: human-machine collaboration, intelligent news, and chatbots.

Performance Analysis of Human Facial Age Classification Method Based on Vision Transformer (Vision Transformer 기반 얼굴 연령 분류 기법의 성능 분석)

  • Junhwi Park;Namjung Kim;Changjoon Park;Jaehyun Lee;Jeonghwan Gwak
    • Proceedings of the Korean Society of Computer Information Conference / 2024.01a / pp.343-345 / 2024
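  • Facial age classification can be extended to a variety of applications, such as advancing identity verification systems, building automated foot-traffic statistics systems, and improving age-restricted content management systems. Given this broad applicability, high accuracy in facial age classification is essential for the stability of the systems that use it. This paper therefore compares and analyzes the facial age classification performance of Vision Transformer (ViT)-based classification algorithms. Three widely used ViT-based algorithms were selected: ViT, Swin Transformer (ST), and Neighborhood Attention Transformer (NAT). ViT achieved a facial age classification accuracy of 65.19%.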

Performance of ChatGPT 3.5 and 4 on U.S. dental examinations: the INBDE, ADAT, and DAT

  • Mahmood Dashti;Shohreh Ghasemi;Niloofar Ghadimi;Delband Hefzi;Azizeh Karimian;Niusha Zare;Amir Fahimipour;Zohaib Khurshid;Maryam Mohammadalizadeh Chafjiri;Sahar Ghaedsharaf
    • Imaging Science in Dentistry / v.54 no.3 / pp.271-275 / 2024
  • Purpose: Recent advancements in artificial intelligence (AI), particularly tools such as ChatGPT developed by OpenAI, a U.S.-based AI research organization, have transformed the healthcare and education sectors. This study investigated the effectiveness of ChatGPT in answering dentistry exam questions, demonstrating its potential to enhance professional practice and patient care. Materials and Methods: This study assessed the performance of ChatGPT 3.5 and 4 on U.S. dental exams - specifically, the Integrated National Board Dental Examination (INBDE), Dental Admission Test (DAT), and Advanced Dental Admission Test (ADAT) - excluding image-based questions. Using customized prompts, ChatGPT's answers were evaluated against official answer sheets. Results: ChatGPT 3.5 and 4 were tested with 253 questions from the INBDE, ADAT, and DAT exams. On the INBDE, both versions achieved 80% accuracy on knowledge-based questions and 66-69% on case history questions. On the ADAT, they scored 66-83% on knowledge-based questions and 76% on case history questions. ChatGPT 4 excelled on the DAT, with 94% accuracy on knowledge-based questions, 57% on mathematical analysis items, and 100% on comprehension questions, surpassing ChatGPT 3.5's rates of 83%, 31%, and 82%, respectively. The difference was significant for knowledge-based questions (P=0.009). Both versions showed similar patterns in incorrect responses. Conclusion: Both ChatGPT 3.5 and 4 effectively handled knowledge-based, case history, and comprehension questions, with ChatGPT 4 being more reliable and surpassing the performance of 3.5. ChatGPT 4's perfect score on comprehension questions underscores its trainability in specific subjects. However, both versions exhibited weaker performance in mathematical analysis, suggesting this as an area for improvement.

A study on an artificial intelligence model for measuring object speed using road markers that can respond to external forces (외부력에 대응할 수 있는 도로 마커 활용 개체 속도 측정 인공지능 모델 연구)

  • Lim, Dong Hyun;Park, Dae-woo
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.05a / pp.228-231 / 2022
  • Most CCTV cameras operated by public institutions for crime prevention and parking enforcement are located on roads. Their viewing angle often changes for various reasons, such as bolts loosened by vibration or impacts from vehicles and workers. To provide AI services effectively from the collected images, the service target area (region of interest, ROI) must remain available within the image without interruption. This also relates to the efficient use of computing power for image analysis. This study explains how to maximize the application of artificial intelligence technology by setting the ROI based on markers on the road, restricting image analysis to that area, and studying the process of finding the ROI.

Computer Vision-based Continuous Large-scale Site Monitoring System through Edge Computing and Small-Object Detection

  • Kim, Yeonjoo;Kim, Siyeon;Hwang, Sungjoo;Hong, Seok Hwan
    • International conference on construction engineering and project management / 2022.06a / pp.1243-1244 / 2022
  • In recent years, the growing interest in off-site construction has led to factories scaling up their manufacturing and production processes in the construction sector. Consequently, continuous large-scale site monitoring in low-variability environments, such as prefabricated components production plants (precast concrete production), has gained increasing importance. Although many studies on computer vision-based site monitoring have been conducted, challenges for deploying this technology for large-scale field applications still remain. One of the issues is collecting and transmitting vast amounts of video data. Continuous site monitoring systems are based on real-time video data collection and analysis, which requires excessive computational resources and network traffic. In addition, it is difficult to integrate various object information with different sizes and scales into a single scene. Various sizes and types of objects (e.g., workers, heavy equipment, and materials) exist in a plant production environment, and these objects should be detected simultaneously for effective site monitoring. However, with the existing object detection algorithms, it is difficult to simultaneously detect objects with significant differences in size because collecting and training massive amounts of object image data with various scales is necessary. This study thus developed a large-scale site monitoring system using edge computing and a small-object detection system to solve these problems. Edge computing is a distributed information technology architecture wherein the image or video data is processed near the originating source, not on a centralized server or cloud. By inferring information from the AI computing module equipped with CCTVs and communicating only the processed information with the server, it is possible to reduce excessive network traffic. 
Small-object detection is an innovative method to detect different-sized objects by cropping the raw image and setting the appropriate number of rows and columns for image splitting based on the target object size. This enables the detection of small objects from cropped and magnified images. The detected small objects can then be expressed in the original image. In the inference process, this study used the YOLO-v5 algorithm, known for its fast processing speed and widely used for real-time object detection. This method could effectively detect large and even small objects that were difficult to detect with the existing object detection algorithms. When the large-scale site monitoring system was tested, it performed well in detecting small objects, such as workers in a large-scale view of construction sites, which were inaccurately detected by the existing algorithms. Our next goal is to incorporate various safety monitoring and risk analysis algorithms into this system, such as collision risk estimation, based on the time-to-collision concept, enabling the optimization of safety routes by accumulating workers' paths and inferring the risky areas based on workers' trajectory patterns. Through such developments, this continuous large-scale site monitoring system can guide a construction plant's safety management system more effectively.
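
The image-splitting step described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`tile_grid`, `to_original`, `detect_tiled`), the `(x_min, y_min, x_max, y_max)` box format, and the `detect` callback that stands in for YOLO-v5 inference are all assumptions made for illustration. The sketch only shows how a frame might be divided into a rows-by-columns grid and how a box found in a cropped, magnified tile could be expressed again in original image coordinates.

```python
from typing import Callable, List, Tuple

Tile = Tuple[int, int, int, int]          # (x0, y0, x1, y1) in original pixels
Box = Tuple[float, float, float, float]   # (x_min, y_min, x_max, y_max)


def tile_grid(width: int, height: int, rows: int, cols: int) -> List[Tile]:
    """Return pixel bounds for a rows x cols grid covering the whole image."""
    tiles = []
    for r in range(rows):
        for c in range(cols):
            x0 = c * width // cols
            x1 = (c + 1) * width // cols
            y0 = r * height // rows
            y1 = (r + 1) * height // rows
            tiles.append((x0, y0, x1, y1))
    return tiles


def to_original(box: Box, tile: Tile, scale: float = 1.0) -> Box:
    """Map a box found in a tile magnified by `scale` back to the original image."""
    x0, y0, _, _ = tile
    bx0, by0, bx1, by1 = box
    return (x0 + bx0 / scale, y0 + by0 / scale,
            x0 + bx1 / scale, y0 + by1 / scale)


def detect_tiled(width: int, height: int, rows: int, cols: int,
                 detect: Callable[[Tile, float], List[Box]],
                 scale: float = 1.0) -> List[Box]:
    """Run a (hypothetical) per-tile detector and collect boxes in original coordinates."""
    results = []
    for tile in tile_grid(width, height, rows, cols):
        for box in detect(tile, scale):
            results.append(to_original(box, tile, scale))
    return results
```

In this sketch the number of rows and columns would be chosen from the expected target size, as the abstract describes. The grid has no overlap between tiles, so an object that straddles a tile boundary may be split; how the original system handles that case is not stated in the abstract.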

Big Data using Artificial Intelligence CNN on Unstructured Financial Data (비정형 금융 데이터에 관한 인공지능 CNN 활용 빅데이터 연구)

  • Ko, Young-Bong;Park, Dea-Woo
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.05a / pp.232-234 / 2022
  • Big data is widely used in customer relationship management, relationship marketing, financial business improvement, credit information, and risk management. Moreover, as non-face-to-face financial transactions have recently become more active due to COVID-19, demand for financial big data in customer relationships has grown. In terms of customer relationships, financial big data has reached a point that requires an emotional rather than a technical approach. In relationship marketing, the emotional aspect needs to be emphasized over the cognitive and rational aspects. Traditional financial data was collected and used through text-type customer transaction data, corporate financial information, and questionnaires. In this study, the customer's emotional image data, that is, unstructured data based on the customer's cultural and leisure activities, is acquired through SNS, and the customer's activity images are analyzed with an artificial intelligence CNN algorithm. The activity analysis is then applied to the annotated AI, and the AI big data model is designed to analyze the behavior model shown in the annotation.

Application of Quantitative Assessment of Coronary Atherosclerosis by Coronary Computed Tomographic Angiography

  • Su Nam Lee;Andrew Lin;Damini Dey;Daniel S. Berman;Donghee Han
    • Korean Journal of Radiology / v.25 no.6 / pp.518-539 / 2024
  • Coronary computed tomography angiography (CCTA) has emerged as a pivotal tool for diagnosing and risk-stratifying patients with suspected coronary artery disease (CAD). Recent advancements in image analysis and artificial intelligence (AI) techniques have enabled the comprehensive quantitative analysis of coronary atherosclerosis. Fully quantitative assessments of coronary stenosis and lumen attenuation have improved the accuracy of assessing stenosis severity and predicting hemodynamically significant lesions. In addition to stenosis evaluation, quantitative plaque analysis plays a crucial role in predicting and monitoring CAD progression. Studies have demonstrated that the quantitative assessment of plaque subtypes based on CT attenuation provides a nuanced understanding of plaque characteristics and their association with cardiovascular events. Quantitative analysis of serial CCTA scans offers a unique perspective on the impact of medical therapies on plaque modification. However, challenges such as time-intensive analyses and variability in software platforms still need to be addressed for broader clinical implementation. The paradigm of CCTA has shifted towards comprehensive quantitative plaque analysis facilitated by technological advancements. As these methods continue to evolve, their integration into routine clinical practice has the potential to enhance risk assessment and guide individualized patient management. This article reviews the evolving landscape of quantitative plaque analysis in CCTA and explores its applications and limitations.

Study of the Application of VQA Deep Learning Technology to the Operation and Management of Urban Parks - Analysis of SNS Images - (도시공원 운영 및 관리를 위한 VQA 딥러닝 기술 활용 연구 - SNS 이미지 분석을 중심으로 -)

  • Lee, Da-Yeon;Park, Seo-Eun;Lee, Jae Ho
    • Journal of the Korean Institute of Landscape Architecture / v.51 no.5 / pp.44-56 / 2023
  • This research explores the enhancement of park operation and management by analyzing the changing demands of park users. While traditional methods depended on surveys, there has been a recent shift towards utilizing social media data to understand park usage trends. Notably, most research has focused on text data from social media, overlooking the valuable insights from image data. Addressing this gap, our study introduces a novel method of assessing park usage using social media image data and then applies it to actual city park evaluations. A unique image analysis tool, built on Visual Question Answering (VQA) deep learning technology, was developed. This tool revealed specific city park details such as user demographics, behaviors, and locations. Our findings highlight three main points: (1) The VQA-based image analysis tool's validity was proven by matching its results with traditional text analysis outcomes. (2) VQA deep learning technology offers insights like gender, age, and usage time, which aren't accessible from text analysis alone. (3) Using VQA, we derived operational and management strategies for city parks. In conclusion, our VQA-based method offers significant methodological advancements for future park usage studies.

A Review on Advanced Methodologies to Identify the Breast Cancer Classification using the Deep Learning Techniques

  • Bandaru, Satish Babu;Babu, G. Rama Mohan
    • International Journal of Computer Science & Network Security / v.22 no.4 / pp.420-426 / 2022
  • Breast cancer is among the cancers that can be cured if diagnosed early, before it spreads throughout the body. The Automatic Analysis of Diagnostic Tests (AAT) is an automated aid for physicians that can deliver reliable findings when analyzing critical diseases. Deep learning, a family of machine learning methods, has grown at an astonishing pace in recent years. It is used to search and render diagnoses in fields from banking to medicine to machine learning. We attempt to create a deep learning algorithm that can reliably diagnose breast cancer in mammograms. The algorithm should classify each image as cancer or non-cancer, using training data that has either strong clinical annotations or only the cancer status, with just a few cancer or non-cancer images annotated. Even with this technique, the images are annotated with the condition; an optional portion of the annotated image then acts as the label. The final stage of the proposed system does not require these labels to be available during model training. Furthermore, the results of the review suggest that deep learning approaches have surpassed the previous state of the art in tumor identification, feature extraction, and classification. The paper explains three ways in which learning algorithms were applied: training the network from scratch, transplanting certain deep learning concepts and constraints into a network, and reducing the number of parameters in the trained networks; the latter two help expand the scope of the networks. Researchers in economically developing countries have applied deep learning imaging devices to cancer detection; meanwhile, cancer incidence has risen sharply in Africa.
A Convolutional Neural Network (CNN) is a type of deep learning model that can help with a variety of tasks, such as speech recognition, image recognition, and classification. To accomplish this goal, this article uses a CNN to categorize and identify breast cancer images from the available databases of the US Centers for Disease Control and Prevention.

Comparison of Classification Rules Regarding SaMD Between the Regulation EU 2017/745 and the Directive 93/42/EEC

  • Ryu, Gyuha;Lee, Jiyoon
    • Journal of Biomedical Engineering Research / v.42 no.6 / pp.277-286 / 2021
  • The global market for AI-based SaMD for medical imaging is anticipated to reach around 620 billion won (518 million dollars) in 2023. To help Korean manufacturers efficiently obtain CE marking for marketing in EU countries, this paper offers recommendations and suggestions on how to reclassify SaMD under the classification rules of the MDR, because Regulation EU 2017/745 substantially modified the classification rules of Directive 93/42/EEC and added new ones. In addition, the paper presents several MDR rules that may be applicable when deciding the classification of SaMD. Lastly, the paper examines various secondary data supported by qualitative data, because its suggestions and recommendations are intended to carry public trust on the basis of secondary data derived from the analysis of field data. In conclusion, the paper found that SaMD previously classified under the rules of the MDD should be reclassified based on Regulation EU 2017/745. The suggestions and recommendations are therefore useful for Korean manufacturers seeking to understand the classification of SaMD for marketing in EU countries.