• Title/Summary/Keyword: Input Data

Search Result 8,338, Processing Time 0.037 seconds

Automated Story Generation with Image Captions and Recursiva Calls (이미지 캡션 및 재귀호출을 통한 스토리 생성 방법)

  • Isle Jeon;Dongha Jo;Mikyeong Moon
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.24 no.1
    • /
    • pp.42-50
    • /
    • 2023
  • The development of technology has achieved digital innovation throughout the media industry, including production techniques and editing technologies, and has brought diversity in the form of consumer viewing through the OTT service and streaming era. The convergence of big data and deep learning networks automatically generated text in format such as news articles, novels, and scripts, but there were insufficient studies that reflected the author's intention and generated story with contextually smooth. In this paper, we describe the flow of pictures in the storyboard with image caption generation techniques, and the automatic generation of story-tailored scenarios through language models. Image caption using CNN and Attention Mechanism, we generate sentences describing pictures on the storyboard, and input the generated sentences into the artificial intelligence natural language processing model KoGPT-2 in order to automatically generate scenarios that meet the planning intention. Through this paper, the author's intention and story customized scenarios are created in large quantities to alleviate the pain of content creation, and artificial intelligence participates in the overall process of digital content production to activate media intelligence.

Application of Multiple Linear Regression Analysis and Tree-Based Machine Learning Techniques for Cutter Life Index(CLI) Prediction (커터수명지수 예측을 위한 다중선형회귀분석과 트리 기반 머신러닝 기법 적용)

  • Ju-Pyo Hong;Tae Young Ko
    • Tunnel and Underground Space
    • /
    • v.33 no.6
    • /
    • pp.594-609
    • /
    • 2023
  • TBM (Tunnel Boring Machine) method is gaining popularity in urban and underwater tunneling projects due to its ability to ensure excavation face stability and minimize environmental impact. Among the prominent models for predicting disc cutter life, the NTNU model uses the Cutter Life Index(CLI) as a key parameter, but the complexity of testing procedures and rarity of equipment make measurement challenging. In this study, CLI was predicted using multiple linear regression analysis and tree-based machine learning techniques, utilizing rock properties. Through literature review, a database including rock uniaxial compressive strength, Brazilian tensile strength, equivalent quartz content, and Cerchar abrasivity index was built, and derived variables were added. The multiple linear regression analysis selected input variables based on statistical significance and multicollinearity, while the machine learning prediction model chose variables based on their importance. Dividing the data into 80% for training and 20% for testing, a comparative analysis of the predictive performance was conducted, and XGBoost was identified as the optimal model. The validity of the multiple linear regression and XGBoost models derived in this study was confirmed by comparing their predictive performance with prior research.

A Freight Company Management Model based on Perception Difference between Freight Company and Consigned Vehicle Owners (화물자동차 회사와 위수탁차주의 인식차이에 기반한 화물자동차 운송회사 경영모델)

  • Park, Doo-Jin;Kim, Jung-Yee
    • Journal of Korea Port Economic Association
    • /
    • v.39 no.3
    • /
    • pp.91-105
    • /
    • 2023
  • This paper identified the difference in perception of the consignment system between the freight company and the consigned vehicle owners, derived factors that urgently need to improve the difference in perception, and presented the basis for establishing a management model based on this. A management model based on the difference in perception of consigned borrowers was established, and research hypotheses were established to conduct reliability, validity, and confirmatory factor analysis, path analysis, and mediation effect verification. As a result of the analysis, out of 58 questions, 56 of the total 65 questions showed a difference in perception of the consignment system between trucking companies and consigned vehicle owners, and the questions were input as management model factors. As a result of conducting confirmatory factor analysis, path analysis, and mediating effect verification using only recognition data, 5 hypotheses were adopted, and it was analyzed that the factor had no mediating effect. It can be analyzed that the appropriateness of the compensation system affects the improvement of each other's relationship, and the improvement of each other's relationship leads to the improvement of transportation services. However, since the mediating effect was rejected because it was not significant, this effect should not be exaggerated.

Effective Multi-Modal Feature Fusion for 3D Semantic Segmentation with Multi-View Images (멀티-뷰 영상들을 활용하는 3차원 의미적 분할을 위한 효과적인 멀티-모달 특징 융합)

  • Hye-Lim Bae;Incheol Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.12
    • /
    • pp.505-518
    • /
    • 2023
  • 3D point cloud semantic segmentation is a computer vision task that involves dividing the point cloud into different objects and regions by predicting the class label of each point. Existing 3D semantic segmentation models have some limitations in performing sufficient fusion of multi-modal features while ensuring both characteristics of 2D visual features extracted from RGB images and 3D geometric features extracted from point cloud. Therefore, in this paper, we propose MMCA-Net, a novel 3D semantic segmentation model using 2D-3D multi-modal features. The proposed model effectively fuses two heterogeneous 2D visual features and 3D geometric features by using an intermediate fusion strategy and a multi-modal cross attention-based fusion operation. Also, the proposed model extracts context-rich 3D geometric features from input point cloud consisting of irregularly distributed points by adopting PTv2 as 3D geometric encoder. In this paper, we conducted both quantitative and qualitative experiments with the benchmark dataset, ScanNetv2 in order to analyze the performance of the proposed model. In terms of the metric mIoU, the proposed model showed a 9.2% performance improvement over the PTv2 model using only 3D geometric features, and a 12.12% performance improvement over the MVPNet model using 2D-3D multi-modal features. As a result, we proved the effectiveness and usefulness of the proposed model.

Comparative Analysis of Self-supervised Deephashing Models for Efficient Image Retrieval System (효율적인 이미지 검색 시스템을 위한 자기 감독 딥해싱 모델의 비교 분석)

  • Kim Soo In;Jeon Young Jin;Lee Sang Bum;Kim Won Gyum
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.12
    • /
    • pp.519-524
    • /
    • 2023
  • In hashing-based image retrieval, the hash code of a manipulated image is different from the original image, making it difficult to search for the same image. This paper proposes and evaluates a self-supervised deephashing model that generates perceptual hash codes from feature information such as texture, shape, and color of images. The comparison models are autoencoder-based variational inference models, but the encoder is designed with a fully connected layer, convolutional neural network, and transformer modules. The proposed model is a variational inference model that includes a SimAM module of extracting geometric patterns and positional relationships within images. The SimAM module can learn latent vectors highlighting objects or local regions through an energy function using the activation values of neurons and surrounding neurons. The proposed method is a representation learning model that can generate low-dimensional latent vectors from high-dimensional input images, and the latent vectors are binarized into distinguishable hash code. From the experimental results on public datasets such as CIFAR-10, ImageNet, and NUS-WIDE, the proposed model is superior to the comparative model and analyzed to have equivalent performance to the supervised learning-based deephashing model. The proposed model can be used in application systems that require low-dimensional representation of images, such as image search or copyright image determination.

Neural Network-Based Prediction of Dynamic Properties (인공신경망을 활용한 동적 물성치 산정 연구)

  • Min, Dae-Hong;Kim, YoungSeok;Kim, Sewon;Choi, Hyun-Jun;Yoon, Hyung-Koo
    • Journal of the Korean Geotechnical Society
    • /
    • v.39 no.12
    • /
    • pp.37-46
    • /
    • 2023
  • Dynamic soil properties are essential factors for predicting the detailed behavior of the ground. However, there are limitations to gathering soil samples and performing additional experiments. In this study, we used an artificial neural network (ANN) to predict dynamic soil properties based on static soil properties. The selected static soil properties were soil cohesion, internal friction angle, porosity, specific gravity, and uniaxial compressive strength, whereas the compressional and shear wave velocities were determined for the dynamic soil properties. The Levenberg-Marquardt and Bayesian regularization methods were used to enhance the reliability of the ANN results, and the reliability associated with each optimization method was compared. The accuracy of the ANN model was represented by the coefficient of determination, which was greater than 0.9 in the training and testing phases, indicating that the proposed ANN model exhibits high reliability. Further, the reliability of the output values was verified with new input data, and the results showed high accuracy.

RNA helicase DEAD-box-5 is involved in R-loop dynamics of preimplantation embryos

  • Hyeonji Lee;Dong Wook Han;Seonho Yoo;Ohbeom Kwon;Hyeonwoo La;Chanhyeok Park;Heeji Lee;Kiye Kang;Sang Jun Uhm;Hyuk Song;Jeong Tae Do;Youngsok Choi;Kwonho Hong
    • Animal Bioscience
    • /
    • v.37 no.6
    • /
    • pp.1021-1030
    • /
    • 2024
  • Objective: R-loops are DNA:RNA triplex hybrids, and their metabolism is tightly regulated by transcriptional regulation, DNA damage response, and chromatin structure dynamics. R-loop homeostasis is dynamically regulated and closely associated with gene transcription in mouse zygotes. However, the factors responsible for regulating these dynamic changes in the R-loops of fertilized mouse eggs have not yet been investigated. This study examined the functions of candidate factors that interact with R-loops during zygotic gene activation. Methods: In this study, we used publicly available next-generation sequencing datasets, including low-input ribosome profiling analysis and polymerase II chromatin immunoprecipitation-sequencing (ChIP-seq), to identify potential regulators of R-loop dynamics in zygotes. These datasets were downloaded, reanalyzed, and compared with mass spectrometry data to identify candidate factors involved in regulating R-loop dynamics. To validate the functions of these candidate factors, we treated mouse zygotes with chemical inhibitors using in vitro fertilization. Immunofluorescence with an anti-R-loop antibody was then performed to quantify changes in R-loop metabolism. Results: We identified DEAD-box-5 (DDX5) and histone deacetylase-2 (HDAC2) as candidates that potentially regulate R-loop metabolism in oocytes, zygotes and two-cell embryos based on change of their gene translation. Our analysis revealed that the DDX5 inhibition of activity led to decreased R-loop accumulation in pronuclei, indicating its involvement in regulating R-loop dynamics. However, the inhibition of histone deacetylase-2 activity did not significantly affect R-loop levels in pronuclei. Conclusion: These findings suggest that dynamic changes in R-loops during mouse zygote development are likely regulated by RNA helicases, particularly DDX5, in conjunction with transcriptional processes. Our study provides compelling evidence for the involvement of these factors in regulating R-loop dynamics during early embryonic development.

Analysis of Infrared Characteristics According to Common Depth Using RP Images Converted into Numerical Data (수치 데이터로 변환된 RP 이미지를 활용하여 공동 깊이에 따른 적외선 특성 분석)

  • Jang, Byeong-Su;Kim, YoungSeok;Kim, Sewon;Choi, Hyun-Jun;Yoon, Hyung-Koo
    • Journal of the Korean Geotechnical Society
    • /
    • v.40 no.3
    • /
    • pp.77-84
    • /
    • 2024
  • Aging and damaged underground utilities cause cavity and ground subsidence under roads, which can cause economic losses and risk user safety. This study used infrared cameras to assess the thermal characteristics of such cavities and evaluate their reliability using a CNN algorithm. PVC pipes were embedded at various depths in a test site measuring 400 cm × 50 cm × 40 cm. Concrete blocks were used to simulate road surfaces, and measurements were taken from 4 PM to noon the following day. The initial temperatures measured by the infrared camera were 43.7℃, 43.8℃, and 41.9℃, reflecting atmospheric temperature changes during the measurement period. The RP algorithm generates images in four resolutions, i.e., 10,000 × 10,000, 2,000 × 2,000, 1,000 × 1,000, and 100 × 100 pixels. The accuracy of the CNN model using RP images as input was 99%, 97%, 98%, and 96%, respectively. These results represent a considerable improvement over the 73% accuracy obtained using time-series images, with an improvement greater than 20% when using the RP algorithm-based inputs.

Changes in Mesozooplankton Community Around the Rainy Season in Asan Bay, Korea (아산만 해역에서 장마기 전후 중형동물플랑크톤 군집의 변화)

  • Lee, Doo-Byoul;Park, Chul;Yang, Sung-Ryull;Shin, Yong-Sik
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.12 no.4
    • /
    • pp.337-348
    • /
    • 2007
  • Characteristics in distributions of T, S, nutrients, chlorophyll ${\alpha}$ concentrations and meso-zooplankton abundances and the relations among these parameters were investigated with the data collected in Asan Bay around the rainy season from May 24 till August 25, 2006 at about 10 days interval. Freshwater input during the rainy season clearly affected the distributions of zooplankton and phytoplankton (chlorophyll ${\alpha}$). Freshwater discharge resulted in high nutrients decreased zooplankton abundances. On the contrary, chlorophyll ${\alpha}$ concentrations increased at the end of the rainy season. It seemed that the increase of chlorophyll ${\alpha}$ concentrations was the result of the decreased zooplankton and enriched nutrients caused by freshwater discharges. Seawater temperatures were certainly the reason for the zooplankton succession. However, overall abundance of zooplankton and abundances of some zooplankton such as Noctiluca scintillans, Acartia pacifica, and Sagitta crassa seemed to be influenced by lowered salinity caused by heavy rain rather than seawater temperatures.

Establishment of Risk Database and Development of Risk Classification System for NATM Tunnel (NATM 터널 공정리스크 데이터베이스 구축 및 리스크 분류체계 개발)

  • Kim, Hyunbee;Karunarathne, Batagalle Vinuri;Kim, ByungSoo
    • Korean Journal of Construction Engineering and Management
    • /
    • v.25 no.1
    • /
    • pp.32-41
    • /
    • 2024
  • In the construction industry, not only safety accidents, but also various complex risks such as construction delays, cost increases, and environmental pollution occur, and management technologies are needed to solve them. Among them, process risk management, which directly affects the project, lacks related information compared to its importance. This study tried to develop a MATM tunnel process risk classification system to solve the difficulty of risk information retrieval due to the use of different classification systems for each project. Risk collection used existing literature review and experience mining techniques, and DB construction utilized the concept of natural language processing. For the structure of the classification system, the existing WBS structure was adopted in consideration of compatibility of data, and an RBS linked to the work species of the WBS was established. As a result of the research, a risk classification system was completed that easily identifies risks by work type and intuitively reveals risk characteristics and risk factors linked to risks. As a result of verifying the usability of the established classification system, it was found that the classification system was effective as risks and risk factors for each work type were easily identified by user input of keywords. Through this study, it is expected to contribute to preventing an increase in cost and construction period by identifying risks according to work types in advance when planning and designing NATM tunnels and establishing countermeasures suitable for those factors.