• Title/Summary/Keyword: artificial intelligence-based models

Search Result 575, Processing Time 0.033 seconds

VS3-NET: Neural variational inference model for machine-reading comprehension

  • Park, Cheoneum;Lee, Changki;Song, Heejun
    • ETRI Journal
    • /
    • v.41 no.6
    • /
    • pp.771-781
    • /
    • 2019
  • We propose the VS3-NET model to solve the task of question answering questions with machine-reading comprehension that searches for an appropriate answer in a given context. VS3-NET is a model that trains latent variables for each question using variational inferences based on a model of a simple recurrent unit-based sentences and self-matching networks. The types of questions vary, and the answers depend on the type of question. To perform efficient inference and learning, we introduce neural question-type models to approximate the prior and posterior distributions of the latent variables, and we use these approximated distributions to optimize a reparameterized variational lower bound. The context given in machine-reading comprehension usually comprises several sentences, leading to performance degradation caused by context length. Therefore, we model a hierarchical structure using sentence encoding, in which as the context becomes longer, the performance degrades. Experimental results show that the proposed VS3-NET model has an exact-match score of 76.8% and an F1 score of 84.5% on the SQuAD test set.

Robot Vision to Audio Description Based on Deep Learning for Effective Human-Robot Interaction (효과적인 인간-로봇 상호작용을 위한 딥러닝 기반 로봇 비전 자연어 설명문 생성 및 발화 기술)

  • Park, Dongkeon;Kang, Kyeong-Min;Bae, Jin-Woo;Han, Ji-Hyeong
    • The Journal of Korea Robotics Society
    • /
    • v.14 no.1
    • /
    • pp.22-30
    • /
    • 2019
  • For effective human-robot interaction, robots need to understand the current situation context well, but also the robots need to transfer its understanding to the human participant in efficient way. The most convenient way to deliver robot's understanding to the human participant is that the robot expresses its understanding using voice and natural language. Recently, the artificial intelligence for video understanding and natural language process has been developed very rapidly especially based on deep learning. Thus, this paper proposes robot vision to audio description method using deep learning. The applied deep learning model is a pipeline of two deep learning models for generating natural language sentence from robot vision and generating voice from the generated natural language sentence. Also, we conduct the real robot experiment to show the effectiveness of our method in human-robot interaction.

Coordinated Millimeter Wave Beam Selection Using Fingerprint for Cellular-Connected Unmanned Aerial Vehicle

  • Moon, Sangmi;Kim, Hyeonsung;You, Young-Hwan;Kim, Cheol Hong;Hwang, Intae
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1929-1943
    • /
    • 2021
  • Millimeter wave (mmWave) communication based on the wide bandwidth of >28 GHz is one of the key technologies for cellular-connected unmanned aerial vehicles (UAVs). The selection of mmWave beams in such cellular-connected UAVs is challenging and critical, especially when downlink transmissions toward aerial user equipment (UE) suffer from poor signal-to-interference-plus-noise ratio (SINR) more often than their terrestrial counterparts. This study proposed a coordinated mmWave beam selection scheme using fingerprint for cellular-connected UAV. The scheme comprises fingerprint database configuration and coordinated beam selection. In the fingerprint database configuration, the best beam index from the serving cell and interference beam indexes from neighboring cells are stored. In the coordinated beam selection, the best and interference beams are determined using the fingerprint database information instead of performing an exhaustive search, and the coordinated beam transmission improves the SINR for aerial UEs. System-level simulations assess the UAV effect based on the third-generation partnership project-new radio mmWave and UAV channel models. Simulation results show that the proposed scheme can reduce the overhead of exhaustive search and improve the SINR and spectral efficiency.

Towards a Value-Creation Framework for Proptech Business (프롭테크 비즈니스 가치창출 프레임워크)

  • Kim, Jae-Young;Park, Seung-Bong
    • Knowledge Management Research
    • /
    • v.22 no.1
    • /
    • pp.105-120
    • /
    • 2021
  • Recently, there has been a dramatic change in real estate markets with the development of information technology. The word, Proptech, is defined as the real estate transaction innovation motivated by various types of information technology including artificial intelligence, sensing technology and big data. The objective of this study is to provide a value-creation framework for Proptech business based on the understanding of how and what types of values are created and shared, which gives organization to develop strategies and business models. And a new classification scheme of Proptech business is also suggested based on the recognition of created values along the development of Proptech business. Then, the proposed matrix is applied to derive the business value such as intangibility value, relational value and enhancement value with the case analysis on the each components of Proptech business.

A Study on Improvement of Level of Highway Maintenance Service Using Self-Organizing Map Neural Network (자기조직화 신경망을 이용한 고속도로 유지관리 서비스 등급 개선에 대한 연구)

  • Shin, Duksoon;Park, Sungbum
    • Journal of Information Technology Services
    • /
    • v.20 no.1
    • /
    • pp.81-92
    • /
    • 2021
  • As the degree of economic development of society increases, the maintenance issues on the existing social overhead capital becomes essential. Accordingly, the adaptation of the concept of Level of service in highway maintenance is indispensable. It is also crucial to manage and perform the service level such as road assets to provide universal services to users. In this regards, the purpose of this study is to improve the maintenance service rating model and to focus on the assessment items and weights among the improvements. Particularly, in determining weights, an Analytic Hierarchy Process (AHP) is performed based on the survey response results. After then, this study conducts unsupervised neural network models such as Self-Organizing Map (SOM) and Davies-Bouldin (DB) Index to divide proper sub-groups and determine priorities. This paper identifies similar cases by grouping the results of the responses based on the similarity of the survey responses. This can effectively support decision making in general situations where many evaluation factors need to be considered at once, resulting in reasonable policy decisions. It is the process of using advanced technology to find optimized management methods for maintenance.

A many-objective evolutionary algorithm based on integrated strategy for skin cancer detection

  • Lan, Yang;Xie, Lijie;Cai, Xingjuan;Wang, Lifang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.1
    • /
    • pp.80-96
    • /
    • 2022
  • Nowadays, artificial intelligence promotes the rapid development of skin cancer detection technology, and the federated skin cancer detection model (FSDM) and dual generative adversarial network model (DGANM) solves the fragmentation and privacy of data to a certain extent. To overcome the problem that the many-objective evolutionary algorithm (MaOEA) cannot guarantee the convergence and diversity of the population when solving the above models, a many-objective evolutionary algorithm based on integrated strategy (MaOEA-IS) is proposed. First, the idea of federated learning is introduced into population mutation, the new parents are generated through sub-populations employs different mating selection operators. Then, the distance between each solution to the ideal point (SID) and the Achievement Scalarizing Function (ASF) value of each solution are considered comprehensively for environment selection, meanwhile, the elimination mechanism is used to carry out the select offspring operation. Eventually, the FSDM and DGANM are solved through MaOEA-IS. The experimental results show that the MaOEA-IS has better convergence and diversity, and it has superior performance in solving the FSDM and DGANM. The proposed MaOEA-IS provides more reasonable solutions scheme for many scholars of skin cancer detection and promotes the progress of intelligent medicine.

Fundamental Function Design of Real-Time Unmanned Monitoring System Applying YOLOv5s on NVIDIA TX2TM AI Edge Computing Platform

  • LEE, SI HYUN
    • International journal of advanced smart convergence
    • /
    • v.11 no.2
    • /
    • pp.22-29
    • /
    • 2022
  • In this paper, for the purpose of designing an real-time unmanned monitoring system, the YOLOv5s (small) object detection model was applied on the NVIDIA TX2TM AI (Artificial Intelligence) edge computing platform in order to design the fundamental function of an unmanned monitoring system that can detect objects in real time. YOLOv5s was applied to the our real-time unmanned monitoring system based on the performance evaluation of object detection algorithms (for example, R-CNN, SSD, RetinaNet, and YOLOv5). In addition, the performance of the four YOLOv5 models (small, medium, large, and xlarge) was compared and evaluated. Furthermore, based on these results, the YOLOv5s model suitable for the design purpose of this paper was ported to the NVIDIA TX2TM AI edge computing system and it was confirmed that it operates normally. The real-time unmanned monitoring system designed as a result of the research can be applied to various application fields such as an security or monitoring system. Future research is to apply NMS (Non-Maximum Suppression) modification, model reconstruction, and parallel processing programming techniques using CUDA (Compute Unified Device Architecture) for the improvement of object detection speed and performance.

Ensemble Deep Learning Model using Random Forest for Patient Shock Detection

  • Minsu Jeong;Namhwa Lee;Byuk Sung Ko;Inwhee Joe
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.4
    • /
    • pp.1080-1099
    • /
    • 2023
  • Digital healthcare combined with telemedicine services in the form of convergence with digital technology and AI is developing rapidly. Digital healthcare research is being conducted on many conditions including shock. However, the causes of shock are diverse, and the treatment is very complicated, requiring a high level of medical knowledge. In this paper, we propose a shock detection method based on the correlation between shock and data extracted from hemodynamic monitoring equipment. From the various parameters expressed by this equipment, four parameters closely related to patient shock were used as the input data for a machine learning model in order to detect the shock. Using the four parameters as input data, that is, feature values, a random forest-based ensemble machine learning model was constructed. The value of the mean arterial pressure was used as the correct answer value, the so called label value, to detect the patient's shock state. The performance was then compared with the decision tree and logistic regression model using a confusion matrix. The average accuracy of the random forest model was 92.80%, which shows superior performance compared to other models. We look forward to our work playing a role in helping medical staff by making recommendations for the diagnosis and treatment of complex and difficult cases of shock.

ICLAL: In-Context Learning-Based Audio-Language Multi-Modal Deep Learning Models (ICLAL: 인 컨텍스트 러닝 기반 오디오-언어 멀티 모달 딥러닝 모델)

  • Jun Yeong Park;Jinyoung Yeo;Go-Eun Lee;Chang Hwan Choi;Sang-Il Choi
    • Annual Conference of KIPS
    • /
    • 2023.11a
    • /
    • pp.514-517
    • /
    • 2023
  • 본 연구는 인 컨택스트 러닝 (In-Context Learning)을 오디오-언어 작업에 적용하기 위한 멀티모달 (Multi-Modal) 딥러닝 모델을 다룬다. 해당 모델을 통해 학습 단계에서 오디오와 텍스트의 소통 가능한 형태의 표현 (Representation)을 학습하고 여러가지 오디오-텍스트 작업을 수행할 수 있는 멀티모달 딥러닝 모델을 개발하는 것이 본 연구의 목적이다. 모델은 오디오 인코더와 언어 인코더가 연결된 구조를 가지고 있으며, 언어 모델은 6.7B, 30B 의 파라미터 수를 가진 자동회귀 (Autoregressive) 대형 언어 모델 (Large Language Model)을 사용한다 오디오 인코더는 자기지도학습 (Self-Supervised Learning)을 기반으로 사전학습 된 오디오 특징 추출 모델이다. 언어모델이 상대적으로 대용량이기 언어모델의 파라미터를 고정하고 오디오 인코더의 파라미터만 업데이트하는 프로즌 (Frozen) 방법으로 학습한다. 학습을 위한 과제는 음성인식 (Automatic Speech Recognition)과 요약 (Abstractive Summarization) 이다. 학습을 마친 후 질의응답 (Question Answering) 작업으로 테스트를 진행했다. 그 결과, 정답 문장을 생성하기 위해서는 추가적인 학습이 필요한 것으로 보였으나, 음성인식으로 사전학습 한 모델의 경우 정답과 유사한 키워드를 사용하는 문법적으로 올바른 문장을 생성함을 확인했다.

Learning-based Inertial-wheel Odometry for a Mobile Robot (모바일 로봇을 위한 학습 기반 관성-바퀴 오도메트리)

  • Myeongsoo Kim;Keunwoo Jang;Jaeheung Park
    • The Journal of Korea Robotics Society
    • /
    • v.18 no.4
    • /
    • pp.427-435
    • /
    • 2023
  • This paper proposes a method of estimating the pose of a mobile robot by using a learning model. When estimating the pose of a mobile robot, wheel encoder and inertial measurement unit (IMU) data are generally utilized. However, depending on the condition of the ground surface, slip occurs due to interaction between the wheel and the floor. In this case, it is hard to predict pose accurately by using only encoder and IMU. Thus, in order to reduce pose error even in such conditions, this paper introduces a pose estimation method based on a learning model using data of the wheel encoder and IMU. As the learning model, long short-term memory (LSTM) network is adopted. The inputs to LSTM are velocity and acceleration data from the wheel encoder and IMU. Outputs from network are corrected linear and angular velocity. Estimated pose is calculated through numerically integrating output velocities. Dataset used as ground truth of learning model is collected in various ground conditions. Experimental results demonstrate that proposed learning model has higher accuracy of pose estimation than extended Kalman filter (EKF) and other learning models using the same data under various ground conditions.