• 제목/요약/키워드: Distributed Data Analysis

검색결과 2,340건 처리시간 0.034초

분포 통계 해석에 의한 계면 결함 부분방전 진단 (Partial Discharge Diagnosis of Interface Defect by the Distribution Statistical Analysis)

  • 조경순;이강원;김원종;홍진웅;신종열
    • 한국전기전자재료학회논문지
    • /
    • 제21권4호
    • /
    • pp.348-353
    • /
    • 2008
  • Most of the high voltage insulation systems, such as the power cable joint having hetero interface, are composed of more than two different insulators to improve insulating performance. The partial discharge(PD) in these hetero interface is expected to affect the total insulation performance. Thus, it is important to study electrical properties on these interfaces. This study described the influence of copper and semiconductive substance defects on $\Phi$-q-n distribution between the interface of the model cable joints to classify PD source. PD was sequentially detected for 600 cycles of the applied voltage. The K-means cluster analysis has been analyzed to investigate the $\Phi$-q-n distribution. The skewness-kurtosis(Sk-Ku) plot from K-means clustering results was defined to quantify cluster distribution and classify distribution patterns. The Sk-Ku plot is composed of skewness and kurtosis along abscissa and ordinate which indicate the asymmetry and the sharpness of distribution. As a result of the Sk-Ku plot, it was confirmed that the data was distributed in 1st 2nd and 3rd quadrant at copper foreign substance defect, but in case of semiconductive foreign substance, the data was distributed in 2nd quadrant only.

제주 외도천 유역의 통합 물수지 분석 (An Integrated Water Budget Analysis of Oedocheon Watershed in Jeju Island)

  • 김남원;정일문;나한나
    • 한국환경과학회지
    • /
    • 제24권4호
    • /
    • pp.471-480
    • /
    • 2015
  • Hydrologic component analysis was conducted to investigate water budget characteristics the Oedocheon watershed, Jeju Island. For this purpose, integrated SWAT-MODFLOW model was applied to this watershed for continuous surface water-groundwater modeling. Pasture and forest-deciduous are the major land use types and these affect general hydrologic component ratio. The spatio-temporal groundwater recharge can be obtained from SWAT and then distributed groundwater recharge can be reproduced by MODFLOW. The groundwater level variation was simulated with distributed groundwater pumping data. The water budget in this watershed was compared with the previous estimated result by Jeju-Do(2013). As this result considered discharge to the coastal side, the discrepancy was found. However, it was found that the overall tendency of both analyses were similar.

러스터 파일 시스템 기반 하둡 맵리듀스 실행 환경 구현 및 성능 분석 (Implementation and Performance Analysis of Hadoop MapReduce over Lustre Filesystem)

  • 곽재혁;김상완;허태상;황순욱
    • 정보과학회 컴퓨팅의 실제 논문지
    • /
    • 제21권8호
    • /
    • pp.561-566
    • /
    • 2015
  • 하둡은 오픈소스 기반의 분산 데이터 처리 프레임워크로서 과학 및 상용 분야에서 널리 사용되고 있는데 최근에 대규모 데이터의 실시간 처리 및 분석을 위해 고성능 컴퓨팅(HPC) 기술을 활용하여 하둡을 고성능화하기 위한 연구가 시도되고 있다. 본 논문에서는 하둡의 기본 파일시스템 구현인 하둡 분산파일시스템(HDFS)을 고성능 병렬 분산파일시스템인 러스터 파일시스템으로 대체하여 사용할 수 있도록 하둡 파일시스템 라이브러리를 확장하여 구현하였고 하둡이 제공하는 표준 벤치마크 도구를 사용하여 성능을 분석하였다. 실험 결과 러스터 파일시스템 기반으로 하둡 맵리듀스 응용을 수행하는 경우에 2-13배의 성능 향상이 있음을 확인할 수 있었다.

네트워크 분석을 이용한 『영악전서(景岳全書)』 신방팔진(新方八陣)의 탐색적 구조 분석 (Exploratory Structural Analysis on Eight Positions for New Formulations in Jingyuequanshu by Network Analysis)

  • 정윤경;김형석;김현호;박영재;박영배
    • 대한한의학회지
    • /
    • 제35권3호
    • /
    • pp.49-59
    • /
    • 2014
  • Background: Eight positions for new formulations in Jingyuequanshu is a unique classifying system and each position can be presented as a network. Network analysis is an effective method of visualizing the relationship of textual information. Purpose: This study aimed to analyze the structure of the eight positions for new formulations in Jingyuequanshu, and to acquire information about composing materia medica. Methods: We constructed an initial database and binary matrix of eight positions for new formulation in Jingyuequanshu, including its herbs and formulations. With this data, we carried out frequency and network analysis. Results: We analyzed each of the eight positions for new formulation, entire positions, and five positions after removing 'causal/cold/hot' positions. We found that the formulas of the causal position are distributed throughout the new formulation, and those of the cold and hot positions are also distributed similarly but the two groups are exclusive to each other. The other 5 positions are distributed exclusively to one another. Conclusions: Eight positions for new formulation of Jingyuequanshu were classified into three axes by exploratory network analysis. One is an axis of causal position, another is an axis of cold/hot positions, and the last is an axis of the other five positions.

다변량 분석기법에 의한 지점강우의 권역화 연구 (A Study on the Regionalization of Point Rainfall by Multivariate Analysis Technique)

  • 박상우;전병호;장석환
    • 한국수자원학회논문집
    • /
    • 제36권5호
    • /
    • pp.879-892
    • /
    • 2003
  • 본 연구에서는 강우의 지역빈도분석에 필요한 수문학적 동질성을 갖는 지점강우의 권역화를 수행하였다. 이를 위해 전국에 걸친 기상청 산하의 60개 강우관측소에 대한 32개의 강우특성자료를 추출하였으며, 추출된 각 지점의 많은 강우자료들은 다변량 분석의 자료축약기법인 주성분분석과 그룹화 기법인 군집분석을 통하여 합리적이고 효율적으로 권역화되었다. 본 연구의 결과인 지점강우의 권역은 강우지역을 수문학적 동질성의 5개 권역과 3개의 기타지역으로 분류되었으며, 각 강우성분의 권역별 평균값으로부터 각 권역의 강우특성을 상대적으로 비교 분석하였다.

Complex Segregation Analysis of Categorical Traits in Farm Animals: Comparison of Linear and Threshold Models

  • Kadarmideen, Haja N.;Ilahi, H.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제18권8호
    • /
    • pp.1088-1097
    • /
    • 2005
  • Main objectives of this study were to investigate accuracy, bias and power of linear and threshold model segregation analysis methods for detection of major genes in categorical traits in farm animals. Maximum Likelihood Linear Model (MLLM), Bayesian Linear Model (BALM) and Bayesian Threshold Model (BATM) were applied to simulated data on normal, categorical and binary scales as well as to disease data in pigs. Simulated data on the underlying normally distributed liability (NDL) were used to create categorical and binary data. MLLM method was applied to data on all scales (Normal, categorical and binary) and BATM method was developed and applied only to binary data. The MLLM analyses underestimated parameters for binary as well as categorical traits compared to normal traits; with the bias being very severe for binary traits. The accuracy of major gene and polygene parameter estimates was also very low for binary data compared with those for categorical data; the later gave results similar to normal data. When disease incidence (on binary scale) is close to 50%, segregation analysis has more accuracy and lesser bias, compared to diseases with rare incidences. NDL data were always better than categorical data. Under the MLLM method, the test statistics for categorical and binary data were consistently unusually very high (while the opposite is expected due to loss of information in categorical data), indicating high false discovery rates of major genes if linear models are applied to categorical traits. With Bayesian segregation analysis, 95% highest probability density regions of major gene variances were checked if they included the value of zero (boundary parameter); by nature of this difference between likelihood and Bayesian approaches, the Bayesian methods are likely to be more reliable for categorical data. The BATM segregation analysis of binary data also showed a significant advantage over MLLM in terms of higher accuracy. Based on the results, threshold models are recommended when the trait distributions are discontinuous. Further, segregation analysis could be used in an initial scan of the data for evidence of major genes before embarking on molecular genome mapping.

데이터웨어하우스의 개발생산성 향상을 위한 측정지표의 설계 및 분석 (Design and Analysis of Metrics for Enhancing Productivity of Datawarehouse)

  • 박종모;조경산
    • 인터넷정보학회논문지
    • /
    • 제8권5호
    • /
    • pp.151-160
    • /
    • 2007
  • 다양한 분석 정보를 추출하여 저장하는 데이터웨어하우스는 기업의 마케팅 및 의사결정 지원을 위해 사용된다. 그러나 데이터웨어하우스는 분산되어 저장된 대규모의 데이터를 통합하기 때문에 처리 시간과 비용이 증가하며 오류 발생의 위험이 높다. 따라서 개발생산성의 향상 및 비용을 절감하기 위해 본 연구에서는 생산성, 프로세스 품질, 데이터 품질의 영역에서 프로세스 측정지표를 설계하여 제안한다. 또한 측정 지표를 이용하여 프로세스를 평가하고 평가결과를 분석하여 개발생산성의 향상과 프로세스가 개선됨을 보인다.

  • PDF

한국인의 개인색채 유형에 따른 패션색채 기호의 분석 (An Analysis of Fashion Color Preferences According to Koreans' Personal Color Types)

  • 조은영;유태순
    • 복식
    • /
    • 제58권10호
    • /
    • pp.37-51
    • /
    • 2008
  • The purpose of this study is according to four Seasonal Color system, the researcher analyzed the Korean Personal color and investigated abstract color preferences and the fashion color preferences to support the Korean color preferences and the rotor recognition trend. The study was conducted according to the following procedures: 144 women from 20s to 50s were selected as participants of the survey and the interview. The fashion color preferences and the personal body color was analyzed by the questionnaire and the interview. The data were analyzed by SPSS 12.0 program. As a way of analysis, crosstabs analysis, correlation analysis, t-test, ANOVA and regression analysis were used. As a results, in the Personal color type of Korean women, it is distributed in the order of spring, summer, winter and autumn. In addition, Light image is highly distributed in the Personal color image. People preferred summer color group in the general color preference and the cosmetic color preference and they preferred winter color type in the clothes color and the suitable color for themselves. In the color recognition, as the color interest increases, the coincidence between suitable color and favorite color increased. The recognition of suitable color, the consideration of color in purchasing and the interest of outward appearances were highly showed.

GIS기반 K-DRUM을 이용한 금강권 대유역 홍수유출 적용 (Application of Flood Discharge for Gumgang Watershed Using GIS-based K-DRUM)

  • 박진혁;허영택
    • 대한공간정보학회지
    • /
    • 제18권1호
    • /
    • pp.11-20
    • /
    • 2010
  • 지금까지 국내에서 개발된 분포형 강우-유출모형은 입력 자료의 구축에 많은 시간과 노력이 필요하며, 유출과정에서 운동역학적인 이론에 근거하여 물의 흐름을 수치해석으로 추적해 나가는데 많은 계산시간이 소요되는 단점이 있었다. 그래서 지금까지 분포형모형의 적용유역은 대부분 일정규모 이내의 유역에 국한되어 적용되어왔다. 하지만, 최근 유역관리 개념의 변화에 따라 통합유역관리의 필요성이 증대되면서 댐의 상.하류를 포함한 유역전체의 유출계산이 요구되었다. 본 논문은 자체개발한 GIS기반의 물리학적 분포형 강우-유출모형인 K-DRUM을 국내 대유역인 금강권수계 전체를 대상으로 모의하여 홍수유출시 대유역에 대하여 분포형모형의 실무에서의 적용 가능성을 검토하였다. GIS수문매개변수를 DEM, 토지피복도, 토양도 등의 기본 GIS자료들로부터 추출하여 모형의 입력인자로 사용하였으며, 초기 토양함수조건을 설정하는데 있어서 자동보정기법을 적용함으로써 기존의 시행착오법으로 인해 소요되는 시간과 부정확한 설정으로 인해 발생될 수 있는 문제점을 해결하였다. 태풍 "에위니아"를 대상으로 용담, 대청, 공주 지점에서의 유출량 모의 결과 체적오차백분율은 13.3~25.6%, 첨두유량오차백분율은 5.4~22.8%, 총유출량오차는 7.3~12.5%로 실무적용 가능성이 충분한 것으로 판단된다.

제2형 당뇨병의 위험인자 분석을 위한 다층 퍼셉트론과 로지스틱 회귀 모델의 비교 (A comparison of Multilayer Perceptron with Logistic Regression for the Risk Factor Analysis of Type 2 Diabetes Mellitus)

  • 서혜숙;최진욱;이홍규
    • 대한의용생체공학회:의공학회지
    • /
    • 제22권4호
    • /
    • pp.369-375
    • /
    • 2001
  • The statistical regression model is one of the most frequently used clinical analysis methods. It has basic assumption of linearity, additivity and normal distribution of data. However, most of biological data in medical field are nonlinear and unevenly distributed. To overcome the discrepancy between the basic assumption of statistical model and actual biological data, we propose a new analytical method based on artificial neural network. The newly developed multilayer perceptron(MLP) is trained with 120 data set (60 normal, 60 patient). On applying test data, it shows the discrimination power of 0.76. The diabetic risk factors were also identified from the MLP neural network model and the logistic regression model. The signigicant risk factors identified by MLP model were post prandial glucose level(PP2), sex(male), fasting blood sugar(FBS) level, age, SBP, AC and WHR. Those from the regression model are sex(male), PP2, age and FBS. The combined risk factors can be identified using the MLP model. Those are total cholesterol and body weight, which is consistent with the result of other clinical studies. From this experiment we have learned that MLP can be applied to the combined risk factor analysis of biological data which can not be provided by the conventional statistical method.

  • PDF