• Title/Summary/Keyword: machine learning models

Search Result 1,361, Processing Time 0.028 seconds

A Study on the Current Status and Application Strategies for Intelligent Archival Information Services (지능형 기록정보서비스를 위한 선진 기술 현황 분석 및 적용 방안)

  • Kim, Tae-Young;Gang, Ju-Yeon;Kim, Geon;Oh, Hyo-Jung
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.18 no.4
    • /
    • pp.149-182
    • /
    • 2018
  • In the era of digital transformation, new technologies have begun to be applied in the field of records management, away from the traditional view that emphasized the existing institutional and administrative aspects. Therefore, this study analyzed the service status of archives, libraries, and museums applied with advanced intelligent technology and identified the differences. Then, we proposed how to apply intelligent archival information services based on the analysis results. The reason for including libraries and museums in the research is that they are covered by a single category as an information service provider. To achieve our study aims, we conducted literature and case studies. Based on the results of the case study, we proposed the application strategies of intelligent archival information services. The results of this study are expected to help develop intelligent archival service models that are suitable for the changed electronic records environment.

Analysis of Tensor Processing Unit and Simulation Using Python (텐서 처리부의 분석 및 파이썬을 이용한 모의실행)

  • Lee, Jongbok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.19 no.3
    • /
    • pp.165-171
    • /
    • 2019
  • The study of the computer architecture has shown that major improvements in price-to-energy performance stems from domain-specific hardware development. This paper analyzes the tensor processing unit (TPU) ASIC which can accelerate the reasoning of the artificial neural network (NN). The core device of the TPU is a MAC matrix multiplier capable of high-speed operation and software-managed on-chip memory. The execution model of the TPU can meet the reaction time requirements of the artificial neural network better than the existing CPU and the GPU execution models, with the small area and the low power consumption even though it has many MAC and large memory. Utilizing the TPU for the tensor flow benchmark framework, it can achieve higher performance and better power efficiency than the CPU or CPU. In this paper, we analyze TPU, simulate the Python modeled OpenTPU, and synthesize the matrix multiplication unit, which is the key hardware.

Application of power spectral density function for damage diagnosis of bridge piers

  • Bayat, Mahmoud;Ahmadi, Hamid Reza;Mahdavi, Navideh
    • Structural Engineering and Mechanics
    • /
    • v.71 no.1
    • /
    • pp.57-63
    • /
    • 2019
  • During the last two decades, much joint research regarding vibration based methods has been done, leading to developing various algorithms and techniques. These algorithms and techniques can be divided into modal methods and signal methods. Although modal methods have been widely used for health monitoring and damage detection, signal methods due to higher efficiency have received considerable attention in various fields, including aerospace, mechanical and civil engineering. Signal-based methods are derived directly from the recorded responses through signal processing algorithms to detect damage. According to different signal processing techniques, signal-based methods can be divided into three categories including time domain methods, frequency domain methods, and time-frequency domain methods. The frequency domain methods are well-known and interest in using them has increased in recent years. To determine dynamic behaviours, to identify systems and to detect damages of bridges, different methods and algorithms have been proposed by researchers. In this study, a new algorithm to detect seismic damage in the bridge's piers is suggested. To evaluate the algorithm, an analytical model of a bridge with simple spans is used. Based on the algorithm, before and after damage, the bridge is excited by a sine force, and the piers' responses are measured. The dynamic specifications of the bridge are extracted by Power Spectral Density function. In addition, the Least Square Method is used to detect damage in the bridge's piers. The results indicate that the proposed algorithm can identify the seismic damage effectively. The algorithm is output-only method and measuring the excitation force is not needed. Moreover, the proposed approach does not need numerical models.

Consumer behavior prediction using Airbnb web log data (에어비앤비(Airbnb) 웹 로그 데이터를 이용한 고객 행동 예측)

  • An, Hyoin;Choi, Yuri;Oh, Raeeun;Song, Jongwoo
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.3
    • /
    • pp.391-404
    • /
    • 2019
  • Customers' fixed characteristics have often been used to predict customer behavior. It has recently become possible to track customer web logs as customer activities move from offline to online. It has become possible to collect large amounts of web log data; however, the researchers only focused on organizing the log data or describing the technical characteristics. In this study, we predict the decision-making time until each customer makes the first reservation, using Airbnb customer data provided by the Kaggle website. This data set includes basic customer information such as gender, age, and web logs. We use various methodologies to find the optimal model and compare prediction errors for cases with web log data and without it. We consider six models such as Lasso, SVM, Random Forest, and XGBoost to explore the effectiveness of the web log data. As a result, we choose Random Forest as our optimal model with a misclassification rate of about 20%. In addition, we confirm that using web log data in our study doubles the prediction accuracy in predicting customer behavior compared to not using it.

Multivariate Outlier Removing for the Risk Prediction of Gas Leakage based Methane Gas (메탄 가스 기반 가스 누출 위험 예측을 위한 다변량 특이치 제거)

  • Dashdondov, Khongorzul;Kim, Mi-Hye
    • Journal of the Korea Convergence Society
    • /
    • v.11 no.12
    • /
    • pp.23-30
    • /
    • 2020
  • In this study, the relationship between natural gas (NG) data and gas-related environmental elements was performed using machine learning algorithms to predict the level of gas leakage risk without directly measuring gas leakage data. The study was based on open data provided by the server using the IoT-based remote control Picarro gas sensor specification. The naturel gas leaks into the air, it is a big problem for air pollution, environment and the health. The proposed method is multivariate outlier removing method based Random Forest (RF) classification for predicting risk of NG leak. After, unsupervised k-means clustering, the experimental dataset has done imbalanced data. Therefore, we focusing our proposed models can predict medium and high risk so best. In this case, we compared the receiver operating characteristic (ROC) curve, accuracy, area under the ROC curve (AUC), and mean standard error (MSE) for each classification model. As a result of our experiments, the evaluation measurements include accuracy, area under the ROC curve (AUC), and MSE; 99.71%, 99.57%, and 0.0016 for MOL_RF respectively.

Strain demand prediction of buried steel pipeline at strike-slip fault crossings: A surrogate model approach

  • Xie, Junyao;Zhang, Lu;Zheng, Qian;Liu, Xiaoben;Dubljevic, Stevan;Zhang, Hong
    • Earthquakes and Structures
    • /
    • v.20 no.1
    • /
    • pp.109-122
    • /
    • 2021
  • Significant progress in the oil and gas industry advances the application of pipeline into an intelligent era, which poses rigorous requirements on pipeline safety, reliability, and maintainability, especially when crossing seismic zones. In general, strike-slip faults are prone to induce large deformation leading to local buckling and global rupture eventually. To evaluate the performance and safety of pipelines in this situation, numerical simulations are proved to be a relatively accurate and reliable technique based on the built-in physical models and advanced grid technology. However, the computational cost is prohibitive, so one has to wait for a long time to attain a calculation result for complex large-scale pipelines. In this manuscript, an efficient and accurate surrogate model based on machine learning is proposed for strain demand prediction of buried X80 pipelines subjected to strike-slip faults. Specifically, the support vector regression model serves as a surrogate model to learn the high-dimensional nonlinear relationship which maps multiple input variables, including pipe geometries, internal pressures, and strike-slip displacements, to output variables (namely tensile strains and compressive strains). The effectiveness and efficiency of the proposed method are validated by numerical studies considering different effects caused by structural sizes, internal pressure, and strike-slip movements.

AutoML and CNN-based Soft-voting Ensemble Classification Model For Road Traffic Emerging Risk Detection (도로교통 이머징 리스크 탐지를 위한 AutoML과 CNN 기반 소프트 보팅 앙상블 분류 모델)

  • Jeon, Byeong-Uk;Kang, Ji-Soo;Chung, Kyungyong
    • Journal of Convergence for Information Technology
    • /
    • v.11 no.7
    • /
    • pp.14-20
    • /
    • 2021
  • Most accidents caused by road icing in winter lead to major accidents. Because it is difficult for the driver to detect the road icing in advance. In this work, we study how to accurately detect road traffic emerging risk using AutoML and CNN's ensemble model that use both structured and unstructured data. We train CNN-based road traffic emerging risk classification model using images that are unstructured data and AutoML-based road traffic emerging risk classification model using weather data that is structured data, respectively. After that the ensemble model is designed to complement the CNN-based classification model by inputting probability values derived from of each models. Through this, improves road traffic emerging risk classification performance and alerts drivers more accurately and quickly to enable safe driving.

Classes in Object-Oriented Modeling (UML): Further Understanding and Abstraction

  • Al-Fedaghi, Sabah
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.5
    • /
    • pp.139-150
    • /
    • 2021
  • Object orientation has become the predominant paradigm for conceptual modeling (e.g., UML), where the notions of class and object form the primitive building blocks of thought. Classes act as templates for objects that have attributes and methods (actions). The modeled systems are not even necessarily software systems: They can be human and artificial systems of many different kinds (e.g., teaching and learning systems). The UML class diagram is described as a central component of model-driven software development. It is the most common diagram in object-oriented models and used to model the static design view of a system. Objects both carry data and execute actions. According to some authorities in modeling, a certain degree of difficulty exists in understanding the semantics of these notions in UML class diagrams. Some researchers claim class diagrams have limited use for conceptual analysis and that they are best used for logical design. Performing conceptual analysis should not concern the ways facts are grouped into structures. Whether a fact will end up in the design as an attribute is not a conceptual issue. UML leads to drilling down into physical design details (e.g., private/public attributes, encapsulated operations, and navigating direction of an association). This paper is a venture to further the understanding of object-orientated concepts as exemplified in UML with the aim of developing a broad comprehension of conceptual modeling fundamentals. Thinging machine (TM) modeling is a new modeling language employed in such an undertaking. TM modeling interlaces structure (components) and actionality where actions infiltrate the attributes as much as the classes. Although space limitations affect some aspects of the class diagram, the concluding assessment of this study reveals the class description is a kind of shorthand for a richer sematic TM construct.

Object detection in financial reporting documents for subsequent recognition

  • Sokerin, Petr;Volkova, Alla;Kushnarev, Kirill
    • International journal of advanced smart convergence
    • /
    • v.10 no.1
    • /
    • pp.1-11
    • /
    • 2021
  • Document page segmentation is an important step in building a quality optical character recognition module. The study examined already existing work on the topic of page segmentation and focused on the development of a segmentation model that has greater functional significance for application in an organization, as well as broad capabilities for managing the quality of the model. The main problems of document segmentation were highlighted, which include a complex background of intersecting objects. As classes for detection, not only classic text, table and figure were selected, but also additional types, such as signature, logo and table without borders (or with partially missing borders). This made it possible to pose a non-trivial task of detecting non-standard document elements. The authors compared existing neural network architectures for object detection based on published research data. The most suitable architecture was RetinaNet. To ensure the possibility of quality control of the model, a method based on neural network modeling using the RetinaNet architecture is proposed. During the study, several models were built, the quality of which was assessed on the test sample using the Mean average Precision metric. The best result among the constructed algorithms was shown by a model that includes four neural networks: the focus of the first neural network on detecting tables and tables without borders, the second - seals and signatures, the third - pictures and logos, and the fourth - text. As a result of the analysis, it was revealed that the approach based on four neural networks showed the best results in accordance with the objectives of the study on the test sample in the context of most classes of detection. The method proposed in the article can be used to recognize other objects. A promising direction in which the analysis can be continued is the segmentation of tables; the areas of the table that differ in function will act as classes: heading, cell with a name, cell with data, empty cell.

Accuracy evaluation of threshold rainfall impacting pedestrian using ROC (ROC를 이용한 보행에 영향을 미치는 한계강우량의 정확도 평가)

  • Choo, Kyungsu;Kang, Dongho;Kim, Byungsik
    • Journal of Korea Water Resources Association
    • /
    • v.53 no.12
    • /
    • pp.1173-1181
    • /
    • 2020
  • Recently, as local heavy rains occur frequently in a short period of time, economic and social impacts are increasing beyond the simple primary damage. In advanced meteorologically advanced countries, realistic and reliable impact forecasts are conducted by analyzing socio-economic impacts, not information transmission as simple weather forecasts. In this paper, the degree of flooding was derived using the Spatial Runoff Assessment Tool (S-RAT) and FLO-2D models to calculate the threshold rainfall that can affect human walking, and the threshold rainfall of the concept of Grid to Grid (G2G) was calculated. In addition, although it was used a lot in the medical field in the past, a quantitative accuracy analysis was performed through the ROC analysis technique, which is widely used in natural phenomena such as drought or flood and machine learning. As a result of the analysis, the results of the time period similar to that of the actual and simulated immersion were obtained, and as a result of the ROC (Receiver Operating Characteristic) curve, the adequacy of the fair stage was secured with more than 0.7.