• Title/Abstract/Keywords: Correlated data


The Distributions of Variance Components in Two Stage Regression Model

  • Park, Dong-Joon
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 7, No. 1
    • /
    • pp.87-92
    • /
    • 1996
  • A regression model with a nested error structure is considered. The regression model includes two error terms that are independent and normally distributed with zero means and constant variances. This error structure gives rise to correlated response variables. The distributions of the variance components in the regression model with nested error structure are derived using theorems for quadratic forms.
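
To make the link between the nested error structure and the correlated responses concrete, here is a minimal sketch in assumed notation. The symbols $y_{ij}$, $a_i$, $e_{ij}$, $\sigma_a^2$, and $\sigma_e^2$ are illustrative and are not taken from the paper; $i$ indexes first-stage units and $j$ indexes observations within a unit.

```latex
\begin{align*}
y_{ij} &= \mathbf{x}_{ij}^{\top}\boldsymbol{\beta} + a_i + e_{ij},
  \qquad a_i \sim N(0,\sigma_a^2),\quad e_{ij} \sim N(0,\sigma_e^2),\\
\operatorname{Var}(y_{ij}) &= \sigma_a^2 + \sigma_e^2,\\
\operatorname{Cov}(y_{ij}, y_{ik}) &= \sigma_a^2 \quad (j \neq k),\\
\operatorname{Cov}(y_{ij}, y_{i'k}) &= 0 \quad (i \neq i').
\end{align*}
```

Under these assumptions, two responses that share the same $a_i$ have correlation $\sigma_a^2 / (\sigma_a^2 + \sigma_e^2)$, which is why independent error terms can still produce correlated responses.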


Correlations between Pumping Rate to Irrigated Area and Rainfall Amount in a Paddy Field

  • 이성희;김태철
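    • Korean Society of Agricultural Engineers: Conference Proceedings
    • /
    • Proceedings of the Korean Society of Agricultural Engineers 2001 Annual Conference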
    • /
    • pp.89-92
    • /
    • 2001
  • This study analyzed the correlations between pumping rate and both irrigated area and rainfall amount in the Geum River basin. Data from 84 pumping stations and field data from 28,772 ha of paddy were used in the analysis. The results showed that pumping volume was highly correlated with rainfall during the irrigation period and with irrigated area. However, it was difficult to determine the exact correlation factors because data such as water-use efficiency in the paddy field were lacking.


Analysis of the Predictive Validity of College Entrance Criteria

  • Bae, Hyun-Wung
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 18, No. 4
    • /
    • pp.973-983
    • /
    • 2007
  • Korea Military Academy has been using the College Scholastic Ability Test (CSAT) and High School Grades (HSG), together with other measures such as an Essay-type Test (ET), Physical Test (PT), and Personal Interview (PI), as criteria for entrance. The purpose of this study is to investigate the appropriateness of these criteria in admission decisions by examining the relationship between college GPA and the criteria, and how well the criteria predict academic performance. The study showed that CSAT and HSG are significantly correlated with college GPA, and that these two criteria are the better predictors of academic performance. Regression analysis also indicated that HSG is a better predictor than CSAT.


EFFICIENT ESTIMATION IN SEMIPARAMETRIC RANDOM EFFECT PANEL DATA MODELS WITH AR(p) ERRORS

  • Lee, Young-Kyung
    • Journal of the Korean Statistical Society
    • /
    • Vol. 36, No. 4
    • /
    • pp.523-542
    • /
    • 2007
  • In this paper we consider semiparametric random effect panel models that contain AR(p) disturbances. We derive the efficient score function and the information bound for estimating the slope parameters. We make minimal assumptions on the distributions of the random errors, the effects, and the regressors, and provide semiparametric efficient estimates of the slope parameters. The present paper extends the previous work of Park et al. (2003), where AR(1) errors were considered.

Dynamic Personal Knowledge Network Design based on Correlated Connection Structure

  • 심정연
    • The Journal of Korean Association of Computer Education
    • /
    • Vol. 18, No. 6
    • /
    • pp.71-79
    • /
    • 2015
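  • In the new era of the cloud and big data, how to find and use the data one needs from a vast data pool is a very important question. Above all, the big data era calls for the design of an evolved, efficient, and intelligent knowledge system that can handle vast, ever-changing data well and acquire useful information quickly. As one line of research on evolved intelligent systems, this study therefore proposes a dynamic personal knowledge network that can be structurally reconfigured. Inspired by the human brain's ability to map a large world into a small space and process it efficiently, and by the neurodynamic mechanisms at work within it, we designed an intelligent system with structural flexibility. The personal knowledge network is structured so that different networks can be combined structurally and functionally, and the system is given the ability to find common nodes belonging to the core region, join the networks at those nodes, and reconfigure them. The system is also designed to extract optimal paths from the reconfigured knowledge network and to carry out inference using the extracted paths.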

Spatial Variability of Soil Properties using Nested Variograms at Multiple Scales

  • Chung, Sun-Ok;Sudduth, Kenneth A.;Drummond, Scott T.;Kitchen, Newell R.
    • Journal of Biosystems Engineering
    • /
    • Vol. 39, No. 4
    • /
    • pp.377-388
    • /
    • 2014
  • Purpose: Determining the spatial structure of data is important in understanding within-field variability for site-specific crop management. An understanding of the spatial structures present in the data may help illuminate interrelationships that are important in subsequent explanatory analyses, especially when site variables are correlated or are a combined response to multiple causative factors. Methods: In this study, correlation, principal component analysis, and single and nested variogram models were applied to soil electrical conductivity and chemical property data of two fields in central Missouri, USA. Results: Some variables that were highly correlated, or were strongly expressed in the same principal component, exhibited similar spatial ranges when fitted with a single variogram model. However, single variogram results were dependent on the active lag distance used, with short distances (30 m) required to fit short-range variability. Longer active lag distances only revealed long-range spatial components. Nested models generally yielded a better fit than single models for sensor-based conductivity data, where multiple scales of spatial structure were apparent. Gaussian-spherical nested models fit well to the data at both short (30 m) and long (300 m) active lag distances, generally capturing both short-range and long-range spatial components. As soil conductivity relates strongly to profile texture, we hypothesize that the short-range components may relate to the scale of erosion processes, while the long-range components are indicative of the scale of landscape morphology. Conclusion: In this study, we investigated the effect of changing active lag distance on the calculation of the range parameter. Future work investigating scale effects on other variogram parameters, including nugget and sill variances, may lead to better model selection and interpretation. Once this is achieved, separation of nested spatial components by factorial kriging may help to better define the correlations existing between spatial datasets.
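
As a rough illustration of what a nested Gaussian-spherical variogram model looks like, the sketch below adds a nugget, a Gaussian structure, and a spherical structure. It is not the authors' fitting code. The function names, the assignment of the Gaussian structure to the short range and the spherical structure to the long range, the practical-range convention for the Gaussian term, and all parameter values are assumptions made for illustration only.

```python
import math


def spherical(h, sill, rng):
    """Spherical structure: reaches `sill` exactly at lag `rng` and stays flat beyond it."""
    if h >= rng:
        return sill
    r = h / rng
    return sill * (1.5 * r - 0.5 * r ** 3)


def gaussian(h, sill, rng):
    """Gaussian structure; `rng` is the practical range (about 95% of `sill` is reached there)."""
    return sill * (1.0 - math.exp(-3.0 * (h / rng) ** 2))


def nested_variogram(h, nugget, short, long_range):
    """Semivariance at lag h for nugget + Gaussian (short range) + spherical (long range).

    `short` and `long_range` are (sill, range) pairs. By convention the
    semivariance is 0 at h == 0; the nugget applies only for h > 0.
    """
    if h == 0:
        return 0.0
    return nugget + gaussian(h, *short) + spherical(h, *long_range)


if __name__ == "__main__":
    # Hypothetical parameters: ranges in metres, sills in squared measurement units.
    for lag in (5.0, 30.0, 100.0, 300.0):
        gamma = nested_variogram(lag, 0.1, (1.0, 20.0), (2.0, 200.0))
        print(f"lag = {lag:6.1f} m  semivariance = {gamma:.3f}")
```

With parameters like these, the Gaussian term saturates at short lags while the spherical term keeps rising, which is the kind of two-scale behaviour a single-structure model cannot reproduce.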

A Measurement Allocation for Reliable Data Gathering in Spatially Correlated Sensor Networks

  • 변상선
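    • Korea Institute of Information and Communication Engineering: Conference Proceedings
    • /
    • Korea Institute of Information and Communication Engineering 2016 Spring Conference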
    • /
    • pp.434-437
    • /
    • 2016
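  • This paper considers how to allocate a measurement probability to each sensor for effective and reliable sensor data gathering in spatially correlated sensor networks. That is, a sensor that delivers more reliable measurement data is allocated a higher measurement probability so that it is measured more often. The correlation model accounts for each sensor's transmission power limit, the noise that can arise during measurement and wireless transmission, and the attenuation of the wireless channel. The reliability of data gathering is measured by computing the distortion error at the data-gathering node (sink node). We model this measurement allocation as a cooperative game defined on the spatial correlation and assign each sensor's measurement probability through the Shapley value. The Shapley value is a method for measuring each player's contribution in a cooperative game, and it can be used to measure each sensor's contribution to data gathering in a spatially correlated sensor network. Accordingly, we allocate measurement probabilities in proportion to each sensor's contribution.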
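
The allocation idea in this abstract can be sketched as follows: compute each sensor's Shapley value in a cooperative game, then normalize the values into measurement probabilities. This is a minimal sketch, not the paper's model. The characteristic function `coverage_value` and the `quality` numbers are hypothetical stand-ins for the paper's distortion-based value, which the abstract does not specify, and the normalization assumes all Shapley values are non-negative.

```python
from itertools import permutations


def shapley_values(players, value):
    """Exact Shapley values: average marginal contribution over all player orderings."""
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    return {p: t / len(orderings) for p, t in totals.items()}


def measurement_probabilities(players, value):
    """Normalize Shapley values so they sum to 1 (assumes non-negative values)."""
    phi = shapley_values(players, value)
    total = sum(phi.values())
    return {p: phi[p] / total for p in players}


# Hypothetical per-sensor reliability scores; NOT taken from the paper.
quality = {"s1": 0.5, "s2": 0.5, "s3": 0.2}


def coverage_value(coalition):
    """Toy characteristic function with diminishing returns, standing in for
    'reduction in distortion error' when sensors report correlated data."""
    miss = 1.0
    for sensor in coalition:
        miss *= 1.0 - quality[sensor]
    return 1.0 - miss


probs = measurement_probabilities(list(quality), coverage_value)

if __name__ == "__main__":
    for sensor, prob in probs.items():
        print(sensor, round(prob, 4))
```

Enumerating all orderings costs time factorial in the number of sensors, so this exact form only suits small examples; larger networks would need sampling or a value function with special structure.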


A modified partial least squares regression for the analysis of gene expression data with survival information

  • Lee, So-Yoon;Huh, Myung-Hoe;Park, Mira
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 25, No. 5
    • /
    • pp.1151-1160
    • /
    • 2014
  • In DNA microarray studies, the number of genes far exceeds the number of samples, and the gene expression measures are highly correlated. Partial least squares regression (PLSR) is a popular method for dimension reduction, and several studies have shown it to be useful for classifying microarray data. In this study, we suggest a modified version of partial least squares regression for analyzing gene expression data with survival information. The method is designed as a new gene selection method that uses PLSR with an iterative procedure for imputing censored survival times. The mean square error of prediction criterion is used to determine the dimension of the model. To visualize the data, plots of variables superimposed with samples are used. The method is applied to two microarray data sets, both containing survival times. The results show that the proposed method works well for interpreting gene expression microarray data.

Efficient Processing Scheme for Correlated Data in Ubiquitous Sensor Networks

  • 류제택;허남호;유승화;김기형
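    • Korea Institute of Information and Telecommunication Facilities Engineering: Conference Proceedings
    • /
    • Korea Institute of Information and Telecommunication Facilities Engineering 2008 Conference on Information and Telecommunication Facilities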
    • /
    • pp.63-68
    • /
    • 2008
  • As ubiquitous technology advances, a variety of services are being developed. The purpose of a sensor network is to collect environmental and geographic information, but sensor networks are restricted in power, cost, and other resources. Some sensor networks are used to monitor the environment, and their sensed data are related: sensor nodes sense information periodically, so each reading is correlated with the next. Some readings are identical and others are similar. Based on this observation, this paper suggests a power-saving method.
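
The abstract does not describe the paper's actual scheme. Purely as an illustration of how correlation between successive readings can save transmissions, the sketch below sends a reading only when it differs from the last transmitted value by more than a tolerance. The function name `suppress_redundant` and the `tolerance` parameter are hypothetical and are not taken from the paper.

```python
def suppress_redundant(readings, tolerance):
    """Return the (index, value) pairs a node would transmit.

    A reading is transmitted only if it differs from the last transmitted
    value by more than `tolerance`; identical or similar readings are skipped.
    """
    sent = []
    last = None
    for index, value in enumerate(readings):
        if last is None or abs(value - last) > tolerance:
            sent.append((index, value))
            last = value
    return sent


if __name__ == "__main__":
    samples = [20.0, 20.0, 20.1, 21.0, 21.0]
    print(suppress_redundant(samples, tolerance=0.5))
```

In this toy run only the first reading and the jump to 21.0 would be transmitted, so three of the five radio transmissions are avoided at the cost of a bounded error at the receiver.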


Clinical Significance of Expression and Amplification of the DcR3 Gene in Pancreatic Carcinomas

  • Zhou, Jian;Song, Shi-Duo;Li, De-Chun;Zhou, Jin;Zhu, Dong-Ming;Zheng, Shi-Ying
    • Asian Pacific Journal of Cancer Prevention
    • /
    • Vol. 13, No. 2
    • /
    • pp.719-724
    • /
    • 2012
  • This study aimed to investigate the clinical significance of expression and amplification of decoy receptor 3 (DcR3) in pancreatic carcinomas (PC). mRNA expression was detected by PQ-PCR, and amplification was determined. DcR3 protein expression was detected by immunohistochemistry and ELISA. Correlations between DcR3 expression and clinical pathological factors were analyzed. The relative amount of DcR3 in PC tissues and non-cancerous tissues showed a statistically significant difference, 21 cases displaying more than two fold DcR3 amplification, while no such amplification was found in normal pancreatic tissues. DcR3 positive cell staining was located in the cytoplasm. The positive rate of DcR3 in PC and non-cancerous tissues showed a significant difference. DcR3 mRNA expression was correlated with clinical staging, size of the tumor, lymph node metastasis and histological staging, while protein expression was correlated with clinical data like tumor size. DcR3 gene amplification only correlated with tumor size. The level of DcR3 in serum of the PC resectable group before operation was $72.2{\pm}10.2$ pg/ml, showing a significant difference compared to gallbladder carcinoma group (GC) or pancreatic benign tumor (PBT) group (P < 0.01). In conclusion, DcR3 amplification is correlated with DcR3 expression in PC tissues, especially those clinical pathological factors which reflect tumor progression. Assessment of DcR3 level in sera of PC patients may be helpful for the early diagnosis and prognostic judgement.