Integrated Search | Korea Science

On Effect of Nonnormality on Size of Test for Dimensionality in Discriminant Analysis

Changha Hwang
- Communications for Statistical Applications and Methods / Vol. 3, No. 3 / pp. 25-30 / 1996
In discriminant analysis, the procedures commonly used to estimate the dimensionality involve testing a sequence of dimensionality hypotheses. The size of the test is problematic because the dimensionality hypotheses are tested sequentially, so each test is in fact a conditional test. The focus of this paper is to investigate, in an asymptotic sense, what happens to the sequential testing procedure if the assumption of normality does not hold.

Dimensionality Reduction of RNA-Seq Data

Al-Turaiki, Isra
- International Journal of Computer Science & Network Security / Vol. 21, No. 3 / pp. 31-36 / 2021
RNA sequencing (RNA-Seq) is a technology that facilitates transcriptome analysis using next-generation sequencing (NGS) tools. Information on the quantity and sequences of RNA is vital to relate our genomes to functional protein expression. RNA-Seq data are characterized as being high-dimensional in that the number of variables (i.e., transcripts) far exceeds the number of observations (e.g., experiments). Given the wide range of dimensionality reduction techniques, it is not clear which is best for RNA-Seq data analysis. In this paper, we study the effect of three dimensionality reduction techniques to improve the classification of the RNA-Seq dataset. In particular, we use PCA, SVD, and SOM to obtain a reduced feature space. We built nine classification models for a cancer dataset and compared their performance. Our experimental results indicate that better classification performance is obtained with PCA and SOM. Overall, the combinations PCA+KNN, SOM+RF, and SOM+KNN produce preferred results.
https://doi.org/10.22937/IJCSNS.2021.21.3.4

A NEW INDEX OF DIMENSIONALITY - DETECT

Kim, Hae-Rim
- 한국수학교육학회지시리즈B:순수및응용수학 (Journal of the Korean Society of Mathematical Education Series B: The Pure and Applied Mathematics) / Vol. 3, No. 2 / pp. 141-154 / 1996
This paper proposes DETECT (short for Dimensionality Evaluation To Enumerate Contributing Traits), a data-driven index of dimensionality for an educational or psychological test. It is based on estimated conditional covariances of item pairs, given the score on the remaining test items. Its purpose is to detect whatever multidimensionality structure exists, especially in the case of approximate simple structure. It does so by assigning items to relatively dimensionally homogeneous clusters via attempted maximization of DETECT over all possible item cluster partitions. The performance of DETECT is studied through real and simulated data analyses.
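The partition search described in the DETECT abstract above can be illustrated with a small sketch. It assumes one common formulation from the later DETECT literature, in which a conditional covariance counts positively when two items share a cluster and negatively otherwise; this formula is an assumption, not something the abstract states. The matrix `ccov` holds made-up values, and the estimation of conditional covariances from test data is omitted entirely. Function names are hypothetical.

```python
from itertools import combinations


def detect_index(ccov, clusters):
    """Average signed conditional covariance for one item partition.

    ccov[i][j]: pre-estimated conditional covariance of items i and j
                (assumed given; estimating it is outside this sketch).
    clusters[i]: cluster label assigned to item i.
    """
    n = len(clusters)
    total = 0.0
    for i, j in combinations(range(n), 2):
        # Same-cluster pairs contribute +cov, cross-cluster pairs contribute -cov.
        sign = 1.0 if clusters[i] == clusters[j] else -1.0
        total += sign * ccov[i][j]
    return 2.0 * total / (n * (n - 1))


def all_partitions(n):
    """Yield every partition of n items as a list of cluster labels."""
    def extend(labels, max_label):
        if len(labels) == n:
            yield list(labels)
            return
        # A new item may join an existing cluster or open the next new one.
        for lab in range(max_label + 2):
            labels.append(lab)
            yield from extend(labels, max(max_label, lab))
            labels.pop()
    yield from extend([], -1)


def best_partition(ccov):
    """Brute-force maximization; only feasible for a handful of items."""
    return max(all_partitions(len(ccov)), key=lambda p: detect_index(ccov, p))


# Hypothetical 4-item example: items 0-1 and items 2-3 form two clusters.
ccov = [
    [0.00, 0.02, -0.01, -0.01],
    [0.02, 0.00, -0.01, -0.01],
    [-0.01, -0.01, 0.00, 0.02],
    [-0.01, -0.01, 0.02, 0.00],
]
print(best_partition(ccov))
```

Exhaustive enumeration grows with the Bell numbers, so a real test with dozens of items would need a heuristic search instead; the brute-force loop here only makes the objective concrete.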

Evaluation of Histograms Local Features and Dimensionality Reduction for 3D Face Verification

Ammar, Chouchane;Mebarka, Belahcene;Abdelmalik, Ouamane;Salah, Bourennane
- Journal of Information Processing Systems / Vol. 12, No. 3 / pp. 468-488 / 2016
The paper proposes a novel framework for 3D face verification using dimensionality reduction based on highly distinctive local features in the presence of illumination and expression variations. Histograms of efficient local descriptors are used to represent the facial images distinctively. For this purpose, different local descriptors are evaluated: Local Binary Patterns (LBP), Three-Patch Local Binary Patterns (TPLBP), Four-Patch Local Binary Patterns (FPLBP), Binarized Statistical Image Features (BSIF) and Local Phase Quantization (LPQ). Furthermore, experiments on the combinations of the four local descriptors at feature level using simple histogram concatenation are provided. The performance of the proposed approach is evaluated with different dimensionality reduction algorithms: Principal Component Analysis (PCA), Orthogonal Locality Preserving Projection (OLPP) and the combined PCA+EFM (Enhanced Fisher linear discriminant Model). Finally, a multi-class Support Vector Machine (SVM) is used as a classifier to carry out the verification between imposters and customers. The proposed method has been tested on the CASIA-3D face database, and the experimental results show that it achieves a high verification performance.
https://doi.org/10.3745/JIPS.02.0037

The Impact of the PCA Dimensionality Reduction for CNN based Hyperspectral Image Classification (CNN 기반 초분광 영상 분류를 위한 PCA 차원축소의 영향 분석)

곽태홍;송아람;김용일
- 대한원격탐사학회지 (Korean Journal of Remote Sensing) / Vol. 35, No. 6_1 / pp. 959-971 / 2019
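The Convolutional Neural Network (CNN), one of the representative deep learning techniques, can extract high-level spatial-spectral features, and its application to hyperspectral image classification is being actively studied. However, the high spectral dimensionality of hyperspectral images increases the time and complexity of training, so existing deep-learning-based hyperspectral image classification studies have applied Principal Component Analysis (PCA) for dimensionality reduction. PCA transforms the data onto the axes of independent principal components and can therefore compress the spectral dimension efficiently, but it may cause a loss of spectral information. Although it is clear that the use of PCA affects the accuracy and time of CNN training, few studies have analyzed this effect. The purpose of this study is to quantitatively analyze the effect of PCA-based spectral dimensionality reduction on CNNs and to propose an appropriate way of applying PCA for efficient hyperspectral image classification. To this end, hyperspectral images were reduced with PCA and fed to CNN models while the size of the reduced dimension was varied. In addition, to analyze the sensitivity to PCA according to the type of convolution operation in the model, a 2D-CNN and a 3D-CNN were applied and compared. The experimental results were analyzed in terms of classification accuracy, training time, variance ratio, and the training process. The reduction was most efficient when the reduced dimension equaled the number of principal components giving a variance ratio of 99.7-99.8%. With the 3D kernel, unlike the 2D-CNN, the classification accuracy on the original image was higher than that of the PCA-CNN, which indicates that the dimensionality reduction effect of PCA is relatively small for 3D kernels.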
https://doi.org/10.7780/kjrs.2019.35.6.1.7
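The abstract above selects the reduced dimension by the variance ratio retained by the principal components. A minimal sketch of that selection rule follows, assuming the covariance eigenvalues have already been computed; the eigenvalues shown are invented for illustration and the function name is hypothetical, not taken from the paper.

```python
def components_for_variance(eigenvalues, target_ratio):
    """Return the smallest k whose k largest eigenvalues reach target_ratio
    of the total variance."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    if total <= 0:
        raise ValueError("eigenvalues must have a positive sum")
    cumulative = 0.0
    for k, value in enumerate(ordered, start=1):
        cumulative += value
        if cumulative / total >= target_ratio:
            return k
    return len(ordered)


# Hypothetical eigenvalues of a spectral covariance matrix (not real data).
eigenvalues = [9000, 600, 300, 80, 15, 5]
print(components_for_variance(eigenvalues, 0.997))
```

With these made-up values the first three components explain 99.0% of the variance and the first four explain 99.8%, so a 99.7% target keeps four components.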

A Refinement on DETECT for Polytomous Test Data

Kim, Hae-Rim
- Communications for Statistical Applications and Methods / Vol. 13, No. 3 / pp. 467-477 / 2006
DETECT, a multidimensionality-detecting procedure based on conditional covariances between items, is extended and refined to handle polytomous item data as well as binary data. A large body of simulation studies shows extraordinary performance of DETECT both in enumerating degrees of multidimensionality in a test and in discovering dimensionally distinctive item clusters. A real-data study also provides very meaningful results, making DETECT a strong dimensionality assessment tool for test data analysis.
https://doi.org/10.5351/CKSS.2006.13.3.467

Major SNP Marker Identification with MDR and CART Application

Lee, Jea-Young;Choi, Yu-Mi
- Communications for Statistical Applications and Methods / Vol. 15, No. 2 / pp. 265-271 / 2008
It is commonly believed that human diseases and economic traits of livestock are caused not by single genes acting alone, but by multiple genes interacting with one another. This issue is difficult to study because of the limitations of parametric statistical methods for gene effects. We therefore introduce multifactor dimensionality reduction (MDR) as a method for reducing the dimensionality of multilocus information. The MDR method is nonparametric (i.e., no hypothesis about the value of a statistical parameter is made), model-free (i.e., it assumes no particular inheritance model), and directly applicable to case-control studies. Application of the MDR method revealed the best model with an interaction effect between the SNPs SNP1 and SNP3, while only one main effect, that of SNP1, was statistically significant for LMA (p < 0.01) under a general linear mixed model.
https://doi.org/10.5351/CKSS.2008.15.2.265

EFMDR-Fast: An Application of Empirical Fuzzy Multifactor Dimensionality Reduction for Fast Execution

Leem, Sangseob;Park, Taesung
- Genomics & Informatics / Vol. 16, No. 4 / pp. 37.1-37.3 / 2018
Gene-gene interaction is a key factor for explaining missing heritability. Many methods have been proposed to identify gene-gene interactions. Multifactor dimensionality reduction (MDR) is a well-known method for the detection of gene-gene interactions by reduction from genotypes of single-nucleotide polymorphism combinations to a binary variable with a value of high risk or low risk. This method has been extended in many ways, each with its own specific objective. Among those extensions, fuzzy-MDR uses fuzzy set theory for the membership of high risk or low risk and increases the detection rates of gene-gene interactions. Empirical fuzzy MDR (EFMDR) extends fuzzy-MDR by using a maximum likelihood estimator as a new membership function. However, EFMDR is relatively slow because it is implemented in the R scripting language. Therefore, in this study, we implemented EFMDR using Rcpp (C++ package) for faster execution. Our faster implementation of EFMDR, called EFMDR-Fast, is about 800 times faster than EFMDR written in R script only.
https://doi.org/10.5808/GI.2018.16.4.e37
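The reduction step that the MDR abstracts above describe (genotype combinations collapsed to a binary high-risk or low-risk variable) can be sketched for a single pair of SNPs. This is plain MDR, not fuzzy-MDR or EFMDR, and it omits the cross-validation that selects among models. The threshold, which labels a cell high risk when its case-to-control ratio exceeds the overall ratio, is a common convention assumed here; the data and function names are hypothetical.

```python
from collections import defaultdict


def mdr_labels(genotypes, status):
    """Label each observed genotype combination 'high' or 'low' risk.

    genotypes: one tuple of SNP genotypes per individual, e.g. (0, 1).
    status:    1 for a case, 0 for a control.
    """
    n_cases = sum(status)
    n_controls = len(status) - n_cases
    if n_cases == 0 or n_controls == 0:
        raise ValueError("need at least one case and one control")
    counts = defaultdict(lambda: [0, 0])  # cell -> [controls, cases]
    for cell, s in zip(genotypes, status):
        counts[cell][s] += 1
    labels = {}
    for cell, (controls, cases) in counts.items():
        # cases/controls > n_cases/n_controls, cross-multiplied so that
        # a cell with zero controls needs no division.
        labels[cell] = "high" if cases * n_controls > controls * n_cases else "low"
    return labels


def reduce_to_binary(genotypes, labels):
    """Replace each individual's genotype combination by 1 (high) or 0 (low)."""
    return [1 if labels[cell] == "high" else 0 for cell in genotypes]


# Made-up data for two SNPs in eight individuals.
genotypes = [(0, 0), (0, 0), (0, 1), (0, 1), (1, 1), (1, 1), (1, 1), (0, 0)]
status = [0, 0, 1, 0, 1, 1, 0, 0]
labels = mdr_labels(genotypes, status)
print(reduce_to_binary(genotypes, labels))
```

The two-dimensional genotype table becomes a single binary predictor, which is the sense in which the method reduces dimensionality.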

Identification of epistasis in ischemic stroke using multifactor dimensionality reduction and entropy decomposition

Park, Jung-Dae;Kim, Youn-Young;Lee, Chae-Young
- BMB Reports / Vol. 42, No. 9 / pp. 617-622 / 2009
We investigated the genetic associations of ischemic stroke by identifying epistasis of its heterogeneous subtypes such as small vessel occlusion (SVO) and large artery atherosclerosis (LAA). Epistasis was analyzed with 24 genes in 207 controls and 271 patients (SVO = 110, LAA = 95) using multifactor dimensionality reduction and entropy decomposition. The multifactor dimensionality reduction analysis with any of 1- to 4-locus models showed no significant association with LAA (P > 0.05). The analysis of SVO, however, revealed a significant association in the best 3-locus model with P10L of TGF-$\beta$1, C1013T of SPP1, and R485K of F5 (testing balanced accuracy = 63.17%, P < 0.05). Subsequent entropy analysis also revealed that such heterogeneity was present: quite a large entropy was estimated among the 3 loci for SVO (5.43%), but only a relatively small entropy was estimated for LAA (1.81%). This suggests that the synergistic epistasis model might contribute specifically to the pathogenesis of SVO, which implies a different etiopathogenesis of the ischemic stroke subtypes.
https://doi.org/10.5483/BMBRep.2009.42.9.617

SAR Recognition of Target Variants Using Channel Attention Network without Dimensionality Reduction (차원축소 없는 채널집중 네트워크를 이용한 SAR 변형표적 식별)

박지훈;최여름;채대영;임호
- 한국군사과학기술학회지 (Journal of the Korea Institute of Military Science and Technology) / Vol. 25, No. 3 / pp. 219-230 / 2022
In implementing a robust automatic target recognition (ATR) system with synthetic aperture radar (SAR) imagery, one of the most important issues is accurate classification of target variants, which are the same targets with different serial numbers, configurations, versions, etc. In this paper, a deep learning network with channel attention modules is proposed to cope with the recognition problem for target variants, based on previous research findings that the channel attention mechanism selectively emphasizes the features useful for target recognition. Unlike other existing attention methods, this paper employs channel attention modules without dimensionality reduction along the channel direction, so that direct correspondence between feature map channels is preserved and the features valuable for recognizing SAR target variants can be effectively derived. Experiments with the public benchmark dataset demonstrate that the proposed scheme is superior to networks with other existing channel attention modules.
https://doi.org/10.9766/KIMST.2022.25.3.219

Search results: 162 items; processing time: 0.019 s
