• Title/Summary/Keyword: penalized

Search Result 166, Processing Time 0.022 seconds

Evaluation of the DCT-PLS Method for Spatial Gap Filling of Gridded Data (격자자료 결측복원을 위한 DCT-PLS 기법의 활용성 평가)

  • Youn, Youjeong;Kim, Seoyeon;Jeong, Yemin;Cho, Subin;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.6_1
    • /
    • pp.1407-1419
    • /
    • 2020
  • Long time-series gridded data is crucial for the analyses of Earth environmental changes. Climate reanalysis and satellite images are now used as global-scale periodical and quantitative information for the atmosphere and land surface. This paper examines the feasibility of DCT-PLS (penalized least square regression based on discrete cosine transform) for the spatial gap filling of gridded data through the experiments for multiple variables. Because gap-free data is required for an objective comparison of original with gap-filled data, we used LDAPS (Local Data Assimilation and Prediction System) daily data and MODIS (Moderate Resolution Imaging Spectroradiometer) monthly products. In the experiments for relative humidity, wind speed, LST (land surface temperature), and NDVI (normalized difference vegetation index), we made sure that randomly generated gaps were retrieved very similar to the original data. The correlation coefficients were over 0.95 for the four variables. Because the DCT-PLS method does not require ancillary data and can refer to both spatial and temporal information with a fast computation, it can be applied to operative systems for satellite data processing.

Correlation between MR Image-Based Radiomics Features and Risk Scores Associated with Gene Expression Profiles in Breast Cancer (유방암에서 자기공명영상 근거 영상표현형과 유전자 발현 프로파일 근거 위험도의 관계)

  • Ga Ram Kim;You Jin Ku;Jun Ho Kim;Eun-Kyung Kim
    • Journal of the Korean Society of Radiology
    • /
    • v.81 no.3
    • /
    • pp.632-643
    • /
    • 2020
  • Purpose To investigate the correlation between magnetic resonance (MR) image-based radiomics features and the genomic features of breast cancer by focusing on biomolecular intrinsic subtypes and gene expression profiles based on risk scores. Materials and Methods We used the publicly available datasets from the Cancer Genome Atlas and the Cancer Imaging Archive to extract the radiomics features of 122 breast cancers on MR images. Furthermore, PAM50 intrinsic subtypes were classified and their risk scores were determined from gene expression profiles. The relationship between radiomics features and biomolecular characteristics was analyzed. A penalized generalized regression analysis was performed to build prediction models. Results The PAM50 subtype demonstrated a statistically significant association with the maximum 2D diameter (p = 0.0189), degree of correlation (p = 0.0386), and inverse difference moment normalized (p = 0.0337). Among risk score systems, GGI and GENE70 shared 8 correlated radiomic features (p = 0.0008-0.0492) that were statistically significant. Although the maximum 2D diameter was most significantly correlated to both score systems (p = 0.0139, and p = 0.0008), the overall degree of correlation of the prediction models was weak with the highest correlation coefficient of GENE70 being 0.2171. Conclusion Maximum 2D diameter, degree of correlation, and inverse difference moment normalized demonstrated significant relationships with the PAM50 intrinsic subtypes along with gene expression profile-based risk scores such as GENE70, despite weak correlations.

Penalized least distance estimator in the multivariate regression model (다변량 선형회귀모형의 벌점화 최소거리추정에 관한 연구)

  • Jungmin Shin;Jongkyeong Kang;Sungwan Bang
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.1
    • /
    • pp.1-12
    • /
    • 2024
  • In many real-world data, multiple response variables are often dependent on the same set of explanatory variables. In particular, if several response variables are correlated with each other, simultaneous estimation considering the correlation between response variables might be more effective way than individual analysis by each response variable. In this multivariate regression analysis, least distance estimator (LDE) can estimate the regression coefficients simultaneously to minimize the distance between each training data and the estimates in a multidimensional Euclidean space. It provides a robustness for the outliers as well. In this paper, we examine the least distance estimation method in multivariate linear regression analysis, and furthermore, we present the penalized least distance estimator (PLDE) for efficient variable selection. The LDE technique applied with the adaptive group LASSO penalty term (AGLDE) is proposed in this study which can reflect the correlation between response variables in the model and can efficiently select variables according to the importance of explanatory variables. The validity of the proposed method was confirmed through simulations and real data analysis.

Topology Optimization of a HDD Actuator Arm

  • Chang, Su-Young;Cho, Ji-Hyon;Youn, Sung-Kie;Kim, Cheol-Soon;Oh, Dong-Ho
    • Computational Structural Engineering : An International Journal
    • /
    • v.1 no.2
    • /
    • pp.89-96
    • /
    • 2001
  • A study on the topology optimization of a Hard-Disk-Driver(HDD) actuator arm is presented. The purpose of the present wert is to increase the natural frequency of tole first lateral mode of the HDD actuator arm under the constraint of total moment of inertia, so as to facilitate the position control of the high speed actuator arm. The first lateral mode is an important factor in the position control process. Thus the topology optimization for 2-D model of the HDD actuator arm is considered. A new objective function corresponding to multieigenvalue optimization is suggested to improve the solution of the eigenvalue optimization problem. The material density of the structure is treated as the design variable and the intermediate density is penalized. The effects of different element types and material property functions on the final topology are studied. When the problem is discretized using 8-node element of a uniform density, tole smoothly-varying density field is obtained without checker-board patterns incurred. AS a result of 7he study, an improved design of the HDD actuator arm is suggested. Dynamic characteristics of the suggested design are compared computationally with those of the old design. With the same amount of the moment of inertia, the natural frequency of the first lateral mode of the suggested design is subsequently increased over the existing one.

  • PDF

Multiple Group Testing Procedures for Analysis of High-Dimensional Genomic Data

  • Ko, Hyoseok;Kim, Kipoong;Sun, Hokeun
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.187-195
    • /
    • 2016
  • In genetic association studies with high-dimensional genomic data, multiple group testing procedures are often required in order to identify disease/trait-related genes or genetic regions, where multiple genetic sites or variants are located within the same gene or genetic region. However, statistical testing procedures based on an individual test suffer from multiple testing issues such as the control of family-wise error rate and dependent tests. Moreover, detecting only a few of genes associated with a phenotype outcome among tens of thousands of genes is of main interest in genetic association studies. In this reason regularization procedures, where a phenotype outcome regresses on all genomic markers and then regression coefficients are estimated based on a penalized likelihood, have been considered as a good alternative approach to analysis of high-dimensional genomic data. But, selection performance of regularization procedures has been rarely compared with that of statistical group testing procedures. In this article, we performed extensive simulation studies where commonly used group testing procedures such as principal component analysis, Hotelling's $T^2$ test, and permutation test are compared with group lasso (least absolute selection and shrinkage operator) in terms of true positive selection. Also, we applied all methods considered in simulation studies to identify genes associated with ovarian cancer from over 20,000 genetic sites generated from Illumina Infinium HumanMethylation27K Beadchip. We found a big discrepancy of selected genes between multiple group testing procedures and group lasso.

Variable Selection in Normal Mixture Model Based Clustering under Heteroscedasticity (이분산 상황 하에서 정규혼합모형 기반 군집분석의 변수선택)

  • Kim, Seung-Gu
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.6
    • /
    • pp.1213-1224
    • /
    • 2011
  • In high dimensionality where the number of variables are excessively larger than observations, it is required to remove the noninformative variables to cluster observations. Most model-based approaches for variable selection have been considered under the assumption of homoscedasticity and their models are mainly estimated by a penalized likelihood method. In this paper, a different approach is proposed to remove the noninformative variables effectively and to cluster based on the modified normal mixture model simultaneously. The validity of the model was provided and an EM algorithm was derived to estimate the parameters. Simulation studies and an experiment using real microarray dataset showed the effectiveness of the proposed method.

A Study on the Effect of Pre-cue in Simple Reactions on Control-on-Display Interfaces

  • Lim, Ji-Hyoun;Choi, Jun-Young;Kim, Young-Su
    • Journal of the Ergonomics Society of Korea
    • /
    • v.30 no.4
    • /
    • pp.563-569
    • /
    • 2011
  • Objective: This study focuses on the effects of pre-cues informing the location of upcoming visual stimulus on finger movement response in the context of control-on-display interfaces. Background: Previous research on pre-cues focus on attention allocation and motion studies were limited to indirect control conditions. The design of this study aimed to collect data on the exact landing point for finger-tap responses to a given visual stimulus. Method: Controlled visual stimuli and tasks were presented on a UI evaluation system built using mobile web standards; response accuracy and response time were measured and collected as appropriate. Among the 16 recruited participants, 11 completed the experiment. Results: Providing pre-cue on the location of stimulus affected response time and response accuracy. The response bias, which is a distance from the center of stimulus to the finger-tap location, was larger when the pre-cue was given during a one-handed operation. Conclusion: Given a pre-cue, response time decreases, but with accuracy penalized. Application: In designing touch-screen UI's - more strictly, visual components also acting as controllers - designers would do well to balance human perceptual and cognitive characteristics strategically.

Semiparametric Approach to Logistic Model with Random Intercept (준모수적 방법을 이용한 랜덤 절편 로지스틱 모형 분석)

  • Kim, Mijeong
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1121-1131
    • /
    • 2015
  • Logistic models with a random intercept are useful to analyze longitudinal binary data. Traditionally, the random intercept of the logistic model is assumed to be parametric (such as normal distribution) and is also assumed to be independent to variables. Such assumptions are very strong and restricted for application to real data. Recently, Garcia and Ma (2015) derived semiparametric efficient estimators for logistic model with a random intercept without these assumptions. Their estimator shows the consistency where we do not assume any parametric form for the random intercept. In addition, the method is computationally simple. In this paper, we apply this method to analyze toenail infection data. We compare the semiparametric estimator with maximum likelihood estimator, penalized quasi-likelihood estimator and hierarchical generalized linear estimator.

Spatial Clustering Method Via Generalized Lasso (Generalized Lasso를 이용한 공간 군집 기법)

  • Song, Eunjung;Choi, Hosik;Hwang, Seungsik;Lee, Woojoo
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.4
    • /
    • pp.561-575
    • /
    • 2014
  • In this paper, we propose a penalized likelihood method to detect local spatial clusters associated with disease. The key computational algorithm is based on genlasso by Tibshirani and Taylor (2011). The proposed method has two main advantages over Kulldorff's method which is popoular to detect local spatial clusters. First, it is not needed to specify a proper cluster size a priori. Second, any type of covariate can be incorporated and, it is possible to find local spatial clusters adjusted for some demographic variables. We illustrate our proposed method using tuberculosis data from Seoul.

Multinomial Logit Framework to Evaluate the Impact of Seating Position on Senior Occupant Injury Severity in Traffic Accidents (고령탑승자의 좌석별 상해정도에 관한 연구)

  • Choi, Jaesung
    • Journal of the Korean Society of Safety
    • /
    • v.32 no.3
    • /
    • pp.141-150
    • /
    • 2017
  • A rapid increase in traffic accidents involving senior vehicle occupants has been an issue in Korea because of the aging of the population occurring at one of the fastest rates in the world; unfortunately, few studies beyond several looking at the effect of senior occupants on the level of accident injury severity can be found in the literature. A Multinomial logit model was estimated with Newton-Raphson algorithm to perform bias-reducing penalized likelihood optimization. Model covariates integral to developing the model were included, but the main focus was on the interaction of seating position and injury to senior vehicle occupants. It was found that the likelihood of an accident resulting in a fatality increased: 2.2 times for the driver seat, 2.7 times for the front passenger seat, and even 6.7 times for the rear seat. A mandatory seatbelt law to be extended to the rear seat needs to pass the assembly as soon as possible, and government, industry, and safety groups should be encouraged to join forces to strongly carry out targeted campaigns for the wearing of seatbelts in all vehicle seats to enhance the safety of senior occupants as well as other occupants who are vulnerable to road traffic accidents.