• Title/Summary/Keyword: log Data Analysis

Search Result 978, Processing Time 0.027 seconds

The Validation Study of Normality Distribution of Aquatic Toxicity Data for Statistical Analysis (수생태 독성자료의 정규성 분포 특성 확인을 통해 통계분석 시 분포 특성 적용에 대한 타당성 확인 연구)

  • OK, Seung-yeop;Moon, Hyo-Bang;Ra, Jin-Sung
    • Journal of Environmental Health Sciences
    • /
    • v.45 no.2
    • /
    • pp.192-202
    • /
    • 2019
  • Objectives: According to the central limit theorem, the samples in population might be considered to follow normal distribution if a large number of samples are available. Once we assume that toxicity dataset follow normal distribution, we can treat and process data statistically to calculate genus or species mean value with standard deviation. However, little is known and only limited studies are conducted to investigate whether toxicity dataset follows normal distribution or not. Therefore, the purpose of study is to evaluate the generally accepted normality hypothesis of aquatic toxicity dataset Methods: We selected the 8 chemicals, which consist of 4 organic and 4 inorganic chemical compounds considering data availability for the development of species sensitivity distribution. Toxicity data were collected at the US EPA ECOTOX Knowledgebase by simple search with target chemicals. Toxicity data were re-arranged to a proper format based on the endpoint and test duration, where we conducted normality test according to the Shapiro-Wilk test. Also we investigated the degree of normality by simple log transformation of toxicity data Results: Despite of the central limit theorem, only one large dataset (n>25) follow normal distribution out of 25 large dataset. By log transforming, more 7 large dataset show normality. As a result of normality test on small dataset (n<25), log transformation of toxicity value generally increases normality. Both organic and inorganic chemicals show normality growth for 26 species and 30 species, respectively. Those 56 species shows normality growth by log transformation in the taxonomic groups such as amphibian (1), crustacean (21), fish (22), insect (5), rotifer (2), and worm (5). In contrast, mollusca shows normality decrease at 1 species out of 23 that originally show normality. Conclusions: The normality of large toxicity dataset was not always satisfactory to the central limit theorem. Normality of those data could be improved through log transformation. Therefore, care should be taken when using toxicity data to induce, for example, mean value for risk assessment.

A probabilistic analysis of Miner's law for different loading conditions

  • Blason, Sergio;Correia, Jose A.F.O.;Jesus, Abilio M.P. De;Calcada, Rui A.B.;Fernandez-Canteli, Alfonso
    • Structural Engineering and Mechanics
    • /
    • v.60 no.1
    • /
    • pp.71-90
    • /
    • 2016
  • In this paper, the normalized variable V=(log N-B)(log ${\Delta}{\sigma}-C$-C), as derived from the probabilistic S-N field of Castillo and Canteli, is taken as a reference for calculation of damage accumulation and probability of failure using the Miner number in scenarios of variable amplitude loading. Alternative damage measures, such as the classical Miner and logarithmic Miner, are also considered for comparison between theoretical lifetime prediction and experimental data. The suitability of this approach is confirmed for it provides safe lifetime prediction when applied to fatigue data obtained for riveted joints made of a puddle iron original from the Fao bridge, as well as for data from experimental programs published elsewhere carried out for different materials (aluminium and concrete specimens) under distinct variable loading histories.

Design and Application of Metadata Schema in Datawebhouse System (데이터웹하우스 시스템에서 메타데이터 스키마의 설계 및 활용)

  • Park, Jong-Mo;Cho, Kyung-San
    • The KIPS Transactions:PartD
    • /
    • v.14D no.6
    • /
    • pp.701-706
    • /
    • 2007
  • Datawebhouse consists of both web log analysis used for customer management and datawarehouse used for decision support. However, datawebhouse needs complex operations for management in order to transform and integrate data from heterogeneous data sources and distributed systems. We propose a metadata schema in order to enable data integration and data management which are essential in datawebhouse environments. We show that our proposed schema supports datawebhouse development and enables integrated asset management of business information. With ETL metadata for web log extract, we can improve the data processing time of web log.

Performance Analysis of M-ary Optical Communication over Log-Normal Fading Channels for CubeSat Platforms

  • Lim, Hyung-Chul;Yu, Sung-Yeol;Sung, Ki-Pyoung;Park, Jong Uk;Choi, Chul-Sung;Choi, Mansoo
    • Journal of Astronomy and Space Sciences
    • /
    • v.37 no.4
    • /
    • pp.219-228
    • /
    • 2020
  • A CubeSat platform has become a popular choice due to inexpensive commercial off-the-shelf (COTS) components and low launch cost. However, it requires more power-efficient and higher-data rate downlink capability for space applications related to remote sensing. In addition, the platform is limited by the size, weight and power (SWaP) constraints as well as the regulatory issue of licensing the radio frequency (RF) spectrum. The requirements and limitations have put optical communications on promising alternatives to RF communications for a CubeSat platform, owing to the power efficiency and high data rate as well as the license free spectrum. In this study, we analyzed the performance of optical downlink communications compatible with CubeSat platforms in terms of data rate, bit error rate (BER) and outage probability. Mathematical models of BER and outage probability were derived based on not only the log-normal model of atmospheric turbulence but also a transmitter with a finite extinction ratio. Given the fixed slot width, the optimal guard time and modulation orders were chosen to achieve the target data rate. And the two performance metrics, BER and outage data rate, were analyzed and discussed with respect to beam divergence angle, scintillation index and zenith angle.

The Choice of an Optimal Growth Function Considering Environmental Factors and Production Style (생산방식과 환경요인들을 고려한 최적성장함수의 선택에 관한 연구)

  • Choi, Jong Du
    • Environmental and Resource Economics Review
    • /
    • v.13 no.4
    • /
    • pp.717-734
    • /
    • 2004
  • This paper examined the statistical goodness-of-fit tests for biological growth model in bioeconomic analysis. Some authors estimated usually growth function for fish in the world. However, few studies have estimated growth equations for the bivalve species. Thus, this paper studied the common functional forms of fitting growth equations for cham scallops considering environmental factors and production styles. The following functional forms are considered: linear, log-reciprocal, double log, polynomial and linear with interactions. Results of fitting these various functional forms with real data are compared and evaluated using standard statistical goodness-of-fit tests. Results also indicate that log-reciprocal function is statistically the best fit to the real data. Therefore, the log-reciprocal function is decided the best function describing cham scallop biological growth and hence might be useful for economic evaluation(i.e., optimal harvesting time).

  • PDF

Well Data Interpretation using Software Developed for Estimation of Petrophysical Properties in Gas Hydrate Bearing Sediments in Ulleung Basin, Offshore Korea (가스하이드레이트 퇴적층 물성 추정 소프트웨어를 이용한 울릉분지 시추공 자료 해석)

  • Seo, Kwang-Won;Lim, Jong-Se
    • Journal of Energy Engineering
    • /
    • v.21 no.1
    • /
    • pp.55-67
    • /
    • 2012
  • For the development of gas hydrate as new future energy resources, the drilling was carried out at the five locations where have high potential as gas hydrate bearing sediments in Ulleung basin, offshore Korea in 2007. Well log data were obtained from all wells and core data were procured from 3 wells, UBGH1-04, UBGH1-09 and UBGH1-10. In this study, user-friendly software, "KMU GH Logs 2010", is developed and this software is based on the estimation methods developed in previous study for gas hydrate bearing sediments and the properties estimated from UBGH1-04, UBGH1-09 and UBGH1-10. Petrophysical properties in un-cored wells, UBGH1-01 and UBGH1-14, are also estimated by using well log data. Porosity is estimated by density log and gas hydrate saturation is calculated by sonic log and resistivity log. Sedimentary facies are estimated by applying the linear discriminant analysis using both well log and sedimentary facies data from core analysis. It is confirmed that DITM facies and MSS facies appeared signs of gas hydrate disassociation are able to be distinguished by the method.

Derivation of Probable Rainfall Intensity Formula at Masan District (마산지방 확률강우강도식의 유도)

  • Kim, Ji-Hong;Bae, Deg-Hyo
    • Journal of Wetlands Research
    • /
    • v.2 no.1
    • /
    • pp.49-58
    • /
    • 2000
  • The frequency analysis of annual maximum rainfall data and the derivation of probable rainfall intensity formula at Masan station are performed in this study. Based on the eight different rainfall duration data from 10 minutes to 24 hours, eight types of probability distribution (Gamma, Lognormal, Log-Pearson type III, GEV, Gumbel, Log-Gumbel, Weibull, and Wakeby distributions), three types of parameter estimation scheme (moment, maximum likelihood and probability weighted methods) and three types of goodness-of-fit test (${\chi}^2$, Kolmogorov-Smirnov and Cramer von Mises tests) were considered to find an appropriate probability distribution at Masan station. The Lognormal-2 distribution was selected and the probable rainfall intensity formula was derived by regression analysis. The derived formula can be used for estimating rainfall quantiles of the Masan vicinity areas with convenience and reliability in practice.

  • PDF

Windows based PC Log Collection System using Open Source (오픈소스를 이용한 윈도우 기반 PC 로그 수집 시스템)

  • Song, Jungho;Kim, Hakmin;Yoon, Jin
    • KIISE Transactions on Computing Practices
    • /
    • v.22 no.7
    • /
    • pp.332-337
    • /
    • 2016
  • System administrator or security managers need to collect logs of computing device (desktop or server), which are used for the purpose of cause-analysis of security incident and discover if damage to system was either caused by hacking or computer virus. Furthermore, appropriate log maintenance helps preventing security breech incidents through identification of vulnerability. In addition, it can be utilized for prevention of data leakage through the insider. In the paper, we present log collection system developed using open source supported by commands and basic methods of Windows. Furthermore, we aim to collect log information to enable search and analysis from diverse perspectives and to propose a way to integrate with open source-based search engine system.

A Framework for Analyzing the Effectiveness of a Collaboration Support System for Small and Medium-sized Enterprises (중소제조기업 협업지원 시스템의 도입 및 활용 효과 분석 프레임워크)

  • Kim, Jeong-Yeon;Ahn, Jae-Hyung;Shin, Dong-Min;Moon, Yong-Ma
    • IE interfaces
    • /
    • v.25 no.1
    • /
    • pp.13-20
    • /
    • 2012
  • Recently, the collaboration among small and medium-sized enterprises(SMEs) has been recognized as an effective competitive tool. As several systems have been developed to boost the collaboration, it is necessary to analyze the effectiveness of the systems in terms of their contribution to enhance operational performance of SMEs through objective and quantitative validation. In particular, the analysis for SMEs rather than large-scaled enterprises has not received much attention due to lack of relevant information and difficulty of collecting data. This paper presents a framework for analyzing the effectiveness of the collaboration support system, called i-manufacturing hub, which has been implemented by Korean government. Identification of influential factors to the effectiveness of collaboration hub, and constructing necessary hypotheses are proposed. To overcome the difficulty in data collection only by means of surveys through subjective questionnaires, we exploit system log data that are generated while SMEs use the system. As an initial phase to analyze the effectiveness through hypothesis validation, we discuss several interesting observations and challenges in the direction of enhancing collaboration among SMEs for better operational performance improvement and more participation in the collaboration hub.

Bivariate odd-log-logistic-Weibull regression model for oral health-related quality of life

  • Cruz, Jose N. da;Ortega, Edwin M.M.;Cordeiro, Gauss M.;Suzuki, Adriano K.;Mialhe, Fabio L.
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.3
    • /
    • pp.271-290
    • /
    • 2017
  • We study a bivariate response regression model with arbitrary marginal distributions and joint distributions using Frank and Clayton's families of copulas. The proposed model is used for fitting dependent bivariate data with explanatory variables using the log-odd log-logistic Weibull distribution. We consider likelihood inferential procedures based on constrained parameters. For different parameter settings and sample sizes, various simulation studies are performed and compared to the performance of the bivariate odd-log-logistic-Weibull regression model. Sensitivity analysis methods (such as local and total influence) are investigated under three perturbation schemes. The methodology is illustrated in a study to assess changes on schoolchildren's oral health-related quality of life (OHRQoL) in a follow-up exam after three years and to evaluate the impact of caries incidence on the OHRQoL of adolescents.