• Title/Summary/Keyword: random data analysis

Search Result 1,725, Processing Time 0.027 seconds

On prediction of random effects in log-normal frailty models

  • Ha, Il-Do;Cho, Geon-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.1
    • /
    • pp.203-209
    • /
    • 2009
  • Frailty models are useful for the analysis of correlated and/or heterogeneous survival data. However, the inferences of fixed parameters, rather than random effects, have been mainly studied. The prediction (or estimation) of random effects is also practically useful to investigate the heterogeneity of the hospital or patient effects. In this paper we propose how to extend the prediction method for random effects in HGLMs (hierarchical generalized linear models) to log-normal semiparametric frailty models with nonparametric baseline hazard. The proposed method is demonstrated by a simulation study.

  • PDF

Random projection ensemble adaptive nearest neighbor classification (랜덤 투영 앙상블 기법을 활용한 적응 최근접 이웃 판별분류기법)

  • Kang, Jongkyeong;Jhun, Myoungshic
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.3
    • /
    • pp.401-410
    • /
    • 2021
  • Popular in discriminant classification analysis, k-nearest neighbor classification methods have limitations that do not reflect the local characteristic of the data, considering only the number of fixed neighbors. Considering the local structure of the data, the adaptive nearest neighbor method has been developed to select the number of neighbors. In the analysis of high-dimensional data, it is common to perform dimension reduction such as random projection techniques before using k-nearest neighbor classification. Recently, an ensemble technique has been developed that carefully combines the results of such random classifiers and makes final assignments by voting. In this paper, we propose a novel discriminant classification technique that combines adaptive nearest neighbor methods with random projection ensemble techniques for analysis on high-dimensional data. Through simulation and real-world data analyses, we confirm that the proposed method outperforms in terms of classification accuracy compared to the previously developed methods.

Dynamic Characteristics of the Noise and Vibration of High-speed Train's Wheelset using Time-varying Frequency Analysis (시간-주파수 분석을 이용한 고속철도차량 윤축에서 발생하는 소음과 진동의 동적 특성)

  • Lee, Jun-Seok;Choi, Sung-Hoon;Kim, Sang-Soo;Park, Choon-Soo
    • Journal of the Korean Society for Railway
    • /
    • v.12 no.4
    • /
    • pp.465-471
    • /
    • 2009
  • In this paper, a relationship between the noise and vibration of a high-speed train's wheelset is examined by using time-varying frequency analysis with random data analysis which together contributes to a reduction in the number of experimental running. The noise and vibration of the wheelset is mainly caused by an interaction between the wheel and railway which shows in non-stationary characteristics. For the analysis, they are measured by some microphones and accelerometers, and those signals are post-processed by time-varying frequency analysis with random data analysis. From the analysis, their methods are useful for analyzing the noise and vibration of high-speed train's wheelset.

A Study on the Random Vibration Analysis of Large Scale Antenna (대형 안테나의 Random Vibration 해석에 관한 연구)

  • Shin, Geon-Ho;Hur, Jang-Wook
    • Journal of the Korean Society of Manufacturing Process Engineers
    • /
    • v.20 no.6
    • /
    • pp.44-50
    • /
    • 2021
  • This study analyzed the stability of antenna equipped on vehicles by the link of modal analysis and random vibration analysis with the vibration data of MIL-STD-810H, METHOD 514.8. As a result of the random vibration analysis of antenna, the maximum equivalent stress 41.9MPa and minimum margin of safety 8.37 was generated in the bracket of antenna by the vertical direction vibration. Thus, it was found that antenna has enough stability during the operation.

BINARY RANDOM POWER APPROACH TO MODELING ASYMMETRIC CONDITIONAL HETEROSCEDASTICITY

  • KIM S.;HWANG S.Y.
    • Journal of the Korean Statistical Society
    • /
    • v.34 no.1
    • /
    • pp.61-71
    • /
    • 2005
  • A class of asymmetric ARCH processes is proposed via binary random power transformations. This class accommodates traditional nonlinear models such as threshold ARCH (Rabemanjara and Zacoian (1993)) and Box-Cox type ARCH models(Higgins and Bera (1992)). Stationarity condition of the model is addressed. Iterative least squares(ILS) and pseudo maximum like-lihood(PML) methods are discussed for estimating parameters and related algorithms are presented. Illustrative analysis for Korea Stock Prices Index (KOSPI) data is conducted.

PPNC: Privacy Preserving Scheme for Random Linear Network Coding in Smart Grid

  • He, Shiming;Zeng, Weini;Xie, Kun;Yang, Hongming;Lai, Mingyong;Su, Xin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.3
    • /
    • pp.1510-1532
    • /
    • 2017
  • In smart grid, privacy implications to individuals and their families are an important issue because of the fine-grained usage data collection. Wireless communications are utilized by many utility companies to obtain information. Network coding is exploited in smart grids, to enhance network performance in terms of throughput, delay, robustness, and energy consumption. However, random linear network coding introduces a new challenge for privacy preserving due to the encoding of data and updating of coefficients in forwarder nodes. We propose a distributed privacy preserving scheme for random linear network coding in smart grid that considers the converged flows character of the smart grid and exploits a homomorphic encryption function to decrease the complexities in the forwarder node. It offers a data confidentiality privacy preserving feature, which can efficiently thwart traffic analysis. The data of the packet is encrypted and the tag of the packet is encrypted by a homomorphic encryption function. The forwarder node random linearly codes the encrypted data and directly processes the cryptotext tags based on the homomorphism feature. Extensive security analysis and performance evaluations demonstrate the validity and efficiency of the proposed scheme.

A Continuation-Ratio Logits Mixed Model for Structured Polytomous Data

  • Choi, Jae-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.187-193
    • /
    • 2006
  • This paper shows how to use continuation-ratio logits for the analysis of structured polytomous data. Here, response categories are considered to have a nested binary structure. Thus, conditionally nested binary random variables can be defined in each step. Two types of factors are considered as independent variables affecting response probabilities. For the purpose of analyzing categorical data with binary nested strutures a continuation-ratio mixed model is suggested. Estimation procedure for the unknown parameters in a suggested model is also discussed in detail by an example.

  • PDF

Research on artificial intelligence based battery analysis and evaluation methods using electric vehicle operation data (전기 차 운행 데이터를 활용한 인공지능 기반의 배터리 분석 및 평가 방법 연구)

  • SeungMo Hong
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.16 no.6
    • /
    • pp.385-391
    • /
    • 2023
  • As the use of electric vehicles has increased to minimize carbon emissions, the analyzing the state and performance of lithium-ion batteries that is instrumental in electric vehicles have been important. Comprehensive analysis using not only the voltage, current and temperature of the battery pack, which can affect the condition and performance of the battery, but also the driving data and charging pattern data of the electric vehicle is required. Therefore, a thorough analysis is imperative, utilizing electric vehicle operation data, charging pattern data, as well as battery pack voltage, current, and temperature data, which collectively influence the condition and performance of the battery. Therefore, collection and preprocessing of battery data collected from electric vehicles, collection and preprocessing of data on driver driving habits in addition to simple battery data, detailed design and modification of artificial intelligence algorithm based on the analyzed influencing factors, and A battery analysis and evaluation model was designed. In this paper, we gathered operational data and battery data from real-time electric buses. These data sets were then utilized to train a Random Forest algorithm. Furthermore, a comprehensive assessment of battery status, operation, and charging patterns was conducted using the explainable Artificial Intelligence (XAI) algorithm. The study identified crucial influencing factors on battery status, including rapid acceleration, rapid deceleration, sudden stops in driving patterns, the number of drives per day in the charging and discharging pattern, daily accumulated Depth of Discharge (DOD), cell voltage differences during discharge, maximum cell temperature, and minimum cell temperature. These factors were confirmed to significantly impact the battery condition. Based on the identified influencing factors, a battery analysis and evaluation model was designed and assessed using the Random Forest algorithm. The results contribute to the understanding of battery health and lay the foundation for effective battery management in electric vehicles.

불규칙 하중하의 확률론적 피로 해석의 신뢰성 평가

  • 송삼홍;장두수
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 1993.04b
    • /
    • pp.365-369
    • /
    • 1993
  • The reliability assessment of a simple porbabilistic fatigue analysis under random loading is proposed. Using the crack closure concept, the crack opening stress is assumed to be constant during random loading. The available loading history and test data are used for the comparison with the results of the reliability assessment of probabilistic fatigue analysis.

Finding a plan to improve recognition rate using classification analysis

  • Kim, SeungJae;Kim, SungHwan
    • International journal of advanced smart convergence
    • /
    • v.9 no.4
    • /
    • pp.184-191
    • /
    • 2020
  • With the emergence of the 4th Industrial Revolution, core technologies that will lead the 4th Industrial Revolution such as AI (artificial intelligence), big data, and Internet of Things (IOT) are also at the center of the topic of the general public. In particular, there is a growing trend of attempts to present future visions by discovering new models by using them for big data analysis based on data collected in a specific field, and inferring and predicting new values with the models. In order to obtain the reliability and sophistication of statistics as a result of big data analysis, it is necessary to analyze the meaning of each variable, the correlation between the variables, and multicollinearity. If the data is classified differently from the hypothesis test from the beginning, even if the analysis is performed well, unreliable results will be obtained. In other words, prior to big data analysis, it is necessary to ensure that data is well classified according to the purpose of analysis. Therefore, in this study, data is classified using a decision tree technique and a random forest technique among classification analysis, which is a machine learning technique that implements AI technology. And by evaluating the degree of classification of the data, we try to find a way to improve the classification and analysis rate of the data.