• Title/Summary/Keyword: 이미지 기반

Search Result 3,951, Processing Time 0.029 seconds

Basic Research on the Possibility of Developing a Landscape Perceptual Response Prediction Model Using Artificial Intelligence - Focusing on Machine Learning Techniques - (인공지능을 활용한 경관 지각반응 예측모델 개발 가능성 기초연구 - 머신러닝 기법을 중심으로 -)

  • Kim, Jin-Pyo;Suh, Joo-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.51 no.3
    • /
    • pp.70-82
    • /
    • 2023
  • The recent surge of IT and data acquisition is shifting the paradigm in all aspects of life, and these advances are also affecting academic fields. Research topics and methods are being improved through academic exchange and connections. In particular, data-based research methods are employed in various academic fields, including landscape architecture, where continuous research is needed. Therefore, this study aims to investigate the possibility of developing a landscape preference evaluation and prediction model using machine learning, a branch of Artificial Intelligence, reflecting the current situation. To achieve the goal of this study, machine learning techniques were applied to the landscaping field to build a landscape preference evaluation and prediction model to verify the simulation accuracy of the model. For this, wind power facility landscape images, recently attracting attention as a renewable energy source, were selected as the research objects. For analysis, images of the wind power facility landscapes were collected using web crawling techniques, and an analysis dataset was built. Orange version 3.33, a program from the University of Ljubljana was used for machine learning analysis to derive a prediction model with excellent performance. IA model that integrates the evaluation criteria of machine learning and a separate model structure for the evaluation criteria were used to generate a model using kNN, SVM, Random Forest, Logistic Regression, and Neural Network algorithms suitable for machine learning classification models. The performance evaluation of the generated models was conducted to derive the most suitable prediction model. The prediction model derived in this study separately evaluates three evaluation criteria, including classification by type of landscape, classification by distance between landscape and target, and classification by preference, and then synthesizes and predicts results. As a result of the study, a prediction model with a high accuracy of 0.986 for the evaluation criterion according to the type of landscape, 0.973 for the evaluation criterion according to the distance, and 0.952 for the evaluation criterion according to the preference was developed, and it can be seen that the verification process through the evaluation of data prediction results exceeds the required performance value of the model. As an experimental attempt to investigate the possibility of developing a prediction model using machine learning in landscape-related research, this study was able to confirm the possibility of creating a high-performance prediction model by building a data set through the collection and refinement of image data and subsequently utilizing it in landscape-related research fields. Based on the results, implications, and limitations of this study, it is believed that it is possible to develop various types of landscape prediction models, including wind power facility natural, and cultural landscapes. Machine learning techniques can be more useful and valuable in the field of landscape architecture by exploring and applying research methods appropriate to the topic, reducing the time of data classification through the study of a model that classifies images according to landscape types or analyzing the importance of landscape planning factors through the analysis of landscape prediction factors using machine learning.

Stereoscopic Effect of 3D images according to the Quality of the Depth Map and the Change in the Depth of a Subject (깊이맵의 상세도와 주피사체의 깊이 변화에 따른 3D 이미지의 입체효과)

  • Lee, Won-Jae;Choi, Yoo-Joo;Lee, Ju-Hwan
    • Science of Emotion and Sensibility
    • /
    • v.16 no.1
    • /
    • pp.29-42
    • /
    • 2013
  • In this paper, we analyze the effect of the depth perception, volume perception and visual discomfort according to the change of the quality of the depth image and the depth of the major object. For the analysis, a 2D image was converted to eighteen 3D images using depth images generated based on the different depth position of a major object and background, which were represented in three detail levels. The subjective test was carried out using eighteen 3D images so that the degrees of the depth perception, volume perception and visual discomfort recognized by the subjects were investigated according to the change in the depth position of the major object and the quality of depth map. The absolute depth position of a major object and the relative depth difference between background and the major object were adjusted in three levels, respectively. The details of the depth map was also represented in three levels. Experimental results showed that the quality of the depth image differently affected the depth perception, volume perception and visual discomfort according to the absolute and relative depth position of the major object. In the case of the cardboard depth image, it severely damaged the volume perception regardless of the depth position of the major object. Especially, the depth perception was also more severely deteriorated by the cardboard depth image as the major object was located inside the screen than outside the screen. Furthermore, the subjects did not felt the difference of the depth perception, volume perception and visual comport from the 3D images generated by the detail depth map and by the rough depth map. As a result, it was analyzed that the excessively detail depth map was not necessary for enhancement of the stereoscopic perception in the 2D-to-3D conversion.

  • PDF

PTV Margins for Prostate Treatments with an Endorectal Balloon (전립선 암의 방사선치료 시 직장 내 풍선삽입에 따른 계획표적부피마진)

  • Kim, Hee-Jung;Chung, Jin-Beom;Ha, Sung-Whan;Kim, Jae-Sun;Ye, Sung-Joon
    • Radiation Oncology Journal
    • /
    • v.28 no.3
    • /
    • pp.166-176
    • /
    • 2010
  • Purpose: To determine the appropriate prostate planning target volume (PTV) margins for 3-dimensitional (3D) conformal radiotherapy (CRT) and intensity-modulated radiation therapy (IMRT) patients treated with an endorectal balloon (ERB) under our institutional treatment condition. Materials and Methods: Patients were treated in the supine position. An ERB was inserted into the rectum with 70 cc air prior to planning a CT scan and then each treatment fraction. Electronic portal images (EPIs) and digital reconstructed radiographs (DRR) of planning CT images were used to evaluate inter-fractional patient's setup and ERB errors. To register both image sets, we developed an in-house program written in visual $C^{++}$. A new method to determine prostate PTV margins with an ERB was developed by using the common method. Results: The mean value of patient setup errors was within 1 mm in all directions. The ERB inter-fractional errors in the superior-inferior (SI) and anterior-posterior (AP) directions were larger than in the left-right (LR) direction. The calculated 1D symmetric PTV margins were 3.0 mm, 8.2 mm, and 8.5 mm for 3D CRT and 4.1 mm, 7.9 mm, and 10.3 mm for IMRT in LR, SI, and AP, respectively according to the new method including ERB random errors. Conclusion: The ERB random error contributes to the deformation of the prostate, which affects the original treatment planning. Thus, a new PTV margin method includes dose blurring effects of ERB. The correction of ERB systematic error is a prerequisite since the new method only accounts for ERB random error.

A Comprehensive Review of the Foreign Literature regarding Protest Crowd Counting (집회시위 참가인원 집계방식에 대한 선행연구 고찰 - 국외연구 분석 중심으로 -)

  • Kim, Hak-kyong
    • Korean Security Journal
    • /
    • no.58
    • /
    • pp.9-34
    • /
    • 2019
  • The Korean Police Force is equipped with the dual responsibility to not only protect the constitutional right to protest, but also prevent potential disorder and misconduct might be caused by the abuse of such a right. To this end, the Korean national police employ the crowd counting methodology, termed 'Maximum Figure at Any One Time' with a view to dispatching the proportionate number of police officers to protest scenes for safety management. However, protest organizers rather take advantage of 'Cumulative Figure' methodology, the purpose of which being to publicize the wide recognition of success, noticeably by demonstrating that as many people as possible support for their cause or voice. Hence, different estimates generated by different methods have raised serious political issues in Korean society. Nevertheless, it is found out that there are only three existing academic studies in Korea regarding crowd counting methods, and they are mainly geared towards comparing the two methods, unfortunately without any attempt to analyze the foreign literature in details. Keeping the research gap in mind, the research conducts a comprehensive review of the foreign literature with relation to protest crowd counting methods. Derived from the review and analysis, the counting methods can be broadly categorized into the three models such as: 1) Grid/Density Model, 2) Moving Crowds Model, and 3) Electronic & Non-Image Model. In the end, the research provides brief explanations regarding specific research findings per each model, and further, suggests some policy implications for the development of more accurate crowd counting methodology at protests in Korea.

Basic Study for Selection of Factors Constituents of User Satisfaction for Micro Electric Vehicles (초소형전기차 사용자만족도 구성요인 선정을 위한 기반연구)

  • Jin, Eunju;Seo, Imki;Kim, Jongmin;Park, Jejin
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.41 no.5
    • /
    • pp.581-589
    • /
    • 2021
  • With the recent increase in the introduction of micro-electric vehicles in Korea, interest in micro-electric vehicle user satisfaction is increasing to revitalize related markets. In this paper, a basic study was conducted on the development of public services using micro-electric vehicle based on the constituent factors of user satisfaction. The survey includes: ① 'Analytic Hierarchy Process (AHP) for selecting the priority factors of user satisfaction of micro-electric vehicles', ② 'A survey of micro-electric vehicles image' to collect data in advance for providing users' preferences and transportation services for micro-electric vehicles, ③ In order to investigate the user satisfaction level of users who actually operated micro-electric vehicles, the order of 'user satisfaction survey of micro-electric vehicle drivers' was conducted. In the Analytic Hierarchy Process (AHP) analysis, it was found that users regarded as important in the order of 'user utilization data', 'vehicle movement data', and 'charging service data'. In the micro-electric vehicle image survey, users perceived micro-electric vehicles more positively in terms of "safety", 'durability', 'Ride comfort', 'design', 'MOOE (Maintenance and other operating expense)', and 'environment-friendly' when comparing micro-electric vehicles with electric motorcycles. In the survey on the user satisfaction of micro-electric vehicle drivers, the use of micro-electric vehicle did not directly affect work performance efficiency, and there was an experience of being disadvantaged on the road due to the size of the micro-electric vehicle, and driving in a cluster of micro-electric vehicle for outdoor advertisements. The city's public relations effect was great, but it was concerned about safety. In the future, based on the results of this study, we plan to build a user satisfaction structural equation model, preemptively discover feedback R&D for micro-electric vehicle utilization services in the public field, and actively seek to discover new public mobility support services.

Multiple SL-AVS(Small size & Low power Around View System) Synchronization Maintenance Method (다중 SL-AVS 동기화 유지기법)

  • Park, Hyun-Moon;Park, Soo-Huyn;Seo, Hae-Moon;Park, Woo-Chool
    • Journal of the Korea Society for Simulation
    • /
    • v.18 no.3
    • /
    • pp.73-82
    • /
    • 2009
  • Due to the many advantages including low price, low power consumption, and miniaturization, the CMOS camera has been utilized in many applications, including mobile phones, the automotive industry, medical sciences and sensoring, robotic controls, and research in the security field. In particular, the 360 degree omni-directional camera when utilized in multi-camera applications has displayed issues of software nature, interface communication management, delays, and a complicated image display control. Other issues include energy management problems, and miniaturization of a multi-camera in the hardware field. Traditional CMOS camera systems are comprised of an embedded system that consists of a high-performance MCU enabling a camera to send and receive images and a multi-layer system similar to an individual control system that consists of the camera's high performance Micro Controller Unit. We proposed the SL-AVS (Small Size/Low power Around-View System) to be able to control a camera while collecting image data using a high speed synchronization technique on the foundation of a single layer low performance MCU. It is an initial model of the omni-directional camera that takes images from a 360 view drawing from several CMOS camera utilizing a 110 degree view. We then connected a single MCU with four low-power CMOS cameras and implemented controls that include synchronization, controlling, and transmit/receive functions of individual camera compared with the traditional system. The synchronization of the respective cameras were controlled and then memorized by handling each interrupt through the MCU. We were able to improve the efficiency of data transmission that minimizes re-synchronization amongst a target, the CMOS camera, and the MCU. Further, depending on the choice of users, respective or groups of images divided into 4 domains were then provided with a target. We finally analyzed and compared the performance of the developed camera system including the synchronization and time of data transfer and image data loss, etc.

A Study on the Interactive Narrative - Focusing on the analysis of VR animation <Wolves in the Walls> (인터랙티브 내러티브에 관한 연구 - VR 애니메이션 <Wolves in the Walls>의 분석을 중심으로)

  • Zhuang Sheng
    • Trans-
    • /
    • v.15
    • /
    • pp.25-56
    • /
    • 2023
  • VR is a dynamic image simulation technology with very high information density. Among them, spatial depth, temporality, and realism bring an unprecedented sense of immersion to the experience. However, due to its high information density, the information contained in it is very easy to be manipulated, creating an illusion of objectivity. Users need guidance to help them interpret the high density of dynamic image information. Just like setting up navigation interfaces and interactivity in games, interactivity in virtual reality is a way to interpret virtual content. At present, domestic research on VR content is mainly focused on technology exploration and visual aesthetic experience. However, there is still a lack of research on interactive storytelling design, which is an important part of VR content creation. In order to explore a better interactive storytelling model in virtual reality content, this paper analyzes the interactive storytelling features of the VR animated version of <Wolves in the walls> through the methods of literature review and case study. We find that the following rules can be followed when creating VR content: 1. the VR environment should fully utilize the advantages of free movement for users, and users should not be viewed as mere observers. The user's sense of presence should be fully considered when designing interaction modules. Break down the "fourth wall" to encourage audience interaction in the virtual reality environment, and make the hot media of VR "cool". 2.Provide developer-driven narrative in the early stages of the work so that users are not confused about the ambiguous world situation when they first enter a virtual environment with a high degree of freedom. 1.Unlike some games that guide users through text, you can guide them through a more natural interactive approach that adds natural dialog between the user and story characters (NPC). Also, since gaze guidance is an important part of story progression, you should set up spatial scene user gaze guidance elements within it. For example, you can provide eye-following cues, motion cues, language cues, and more. By analyzing the interactive storytelling features and innovations of the VR animation <Wolves in the walls>, I hope to summarize the main elements of interactive storytelling from its content. Based on this, I hope to explore how to better showcase interactive storytelling in virtual reality content and provide thoughts on future VR content creation.

Deep Learning-based Fracture Mode Determination in Composite Laminates (복합 적층판의 딥러닝 기반 파괴 모드 결정)

  • Muhammad Muzammil Azad;Atta Ur Rehman Shah;M.N. Prabhakar;Heung Soo Kim
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.37 no.4
    • /
    • pp.225-232
    • /
    • 2024
  • This study focuses on the determination of the fracture mode in composite laminates using deep learning. With the increase in the use of laminated composites in numerous engineering applications, the insurance of their integrity and performance is of paramount importance. However, owing to the complex nature of these materials, the identification of fracture modes is often a tedious and time-consuming task that requires critical domain knowledge. Therefore, to alleviate these issues, this study aims to utilize modern artificial intelligence technology to automate the fractographic analysis of laminated composites. To accomplish this goal, scanning electron microscopy (SEM) images of fractured tensile test specimens are obtained from laminated composites to showcase various fracture modes. These SEM images are then categorized based on numerous fracture modes, including fiber breakage, fiber pull-out, mix-mode fracture, matrix brittle fracture, and matrix ductile fracture. Next, the collective data for all classes are divided into train, test, and validation datasets. Two state-of-the-art, deep learning-based pre-trained models, namely, DenseNet and GoogleNet, are trained to learn the discriminative features for each fracture mode. The DenseNet models shows training and testing accuracies of 94.01% and 75.49%, respectively, whereas those of the GoogleNet model are 84.55% and 54.48%, respectively. The trained deep learning models are then validated on unseen validation datasets. This validation demonstrates that the DenseNet model, owing to its deeper architecture, can extract high-quality features, resulting in 84.44% validation accuracy. This value is 36.84% higher than that of the GoogleNet model. Hence, these results affirm that the DenseNet model is effective in performing fractographic analyses of laminated composites by predicting fracture modes with high precision.

Comparison of Convolutional Neural Network (CNN) Models for Lettuce Leaf Width and Length Prediction (상추잎 너비와 길이 예측을 위한 합성곱 신경망 모델 비교)

  • Ji Su Song;Dong Suk Kim;Hyo Sung Kim;Eun Ji Jung;Hyun Jung Hwang;Jaesung Park
    • Journal of Bio-Environment Control
    • /
    • v.32 no.4
    • /
    • pp.434-441
    • /
    • 2023
  • Determining the size or area of a plant's leaves is an important factor in predicting plant growth and improving the productivity of indoor farms. In this study, we developed a convolutional neural network (CNN)-based model to accurately predict the length and width of lettuce leaves using photographs of the leaves. A callback function was applied to overcome data limitations and overfitting problems, and K-fold cross-validation was used to improve the generalization ability of the model. In addition, ImageDataGenerator function was used to increase the diversity of training data through data augmentation. To compare model performance, we evaluated pre-trained models such as VGG16, Resnet152, and NASNetMobile. As a result, NASNetMobile showed the highest performance, especially in width prediction, with an R_squared value of 0.9436, and RMSE of 0.5659. In length prediction, the R_squared value was 0.9537, and RMSE of 0.8713. The optimized model adopted the NASNetMobile architecture, the RMSprop optimization tool, the MSE loss functions, and the ELU activation functions. The training time of the model averaged 73 minutes per Epoch, and it took the model an average of 0.29 seconds to process a single lettuce leaf photo. In this study, we developed a CNN-based model to predict the leaf length and leaf width of plants in indoor farms, which is expected to enable rapid and accurate assessment of plant growth status by simply taking images. It is also expected to contribute to increasing the productivity and resource efficiency of farms by taking appropriate agricultural measures such as adjusting nutrient solution in real time.

Contents Conversion System for Mobile Devices using Light-Weight Web Document (웹 문서 경량화에 의한 모바일용 콘텐츠 변환 시스템)

  • Kim Jeong-Hee;Kwon Hoon;Kwak Ho-Young
    • Journal of Internet Computing and Services
    • /
    • v.6 no.6
    • /
    • pp.13-22
    • /
    • 2005
  • This paper aims to develop a system for converting web contents to mobile contents that can be used on mobile devices. Since web contents generally consist of pop-up ad windows, a bunch of unnecessary images and useless links, it is difficult to efficiently display them on common mobile devices that have lower bandwidth and memory, as well as much smaller screen, than the online environment. It is also troublesome for mobile device users to directly access contents. Thus, there has been a great demand for a new method for extracting useful and adequate contents from web documents, and optimizing them for use on mobile phones, In the paper, a system based on WAP 2,0 and XHTML Basic, which is a content creation language adopted for WAP 2,0, has been suggested. The system is designed to convert web contents by using the conversion rules of the existing filtering method after making the size of web documents smaller. The adopted conversion rules use the XHTML Basic's module units so that modification and deletion can be carried out with ease. In addition, it has been defined in a XSL document written in XSLT to maintain the extensibility of conversion and the validity of documents, In order to allow it to efficiently work together with WAP l.X's legacy services, the system has been built in a way that can have modules, which analyze information about CC/PP profiles and mobile device headers.

  • PDF