• 제목/요약/키워드: methods of data analysis

검색결과 19,201건 처리시간 0.05초

Classification of Microarray Gene Expression Data by MultiBlock Dimension Reduction

  • Oh, Mi-Ra;Kim, Seo-Young;Kim, Kyung-Sook;Baek, Jang-Sun;Son, Young-Sook
    • Communications for Statistical Applications and Methods
    • /
    • 제13권3호
    • /
    • pp.567-576
    • /
    • 2006
  • In this paper, we applied the multiblock dimension reduction methods to the classification of tumor based on microarray gene expressions data. This procedure involves clustering selected genes, multiblock dimension reduction and classification using linear discrimination analysis and quadratic discrimination analysis.

RAM 분석 정확도 향상을 위한 야전운용 데이터의 이상값과 결측값 처리 방안 (Method of Processing the Outliers and Missing Values of Field Data to Improve RAM Analysis Accuracy)

  • 김인석;정원
    • 한국신뢰성학회지:신뢰성응용연구
    • /
    • 제17권3호
    • /
    • pp.264-271
    • /
    • 2017
  • Purpose: Field operation data contains missing values or outliers due to various causes of the data collection process, so caution is required when utilizing RAM analysis results by field operation data. The purpose of this study is to present a method to minimize the RAM analysis error of the field data to improve the accuracy. Methods: Statistical methods are presented for processing of the outliers and the missing values of the field operating data, and after analyzing the RAM, the differences between before and after applying the technique are discussed. Results: The availability is estimated to be lower by 6.8 to 23.5% than that before processing, and it is judged that the processing of the missing values and outliers greatly affect the RAM analysis result. Conclusion: RAM analysis of OO weapon system was performed and suggestions for improvement of RAM analysis were presented through comparison with the new and current method. Data analysis results without appropriate treatment of error values may result in incorrect conclusions leading to inappropriate decisions and actions.

페널티 방법을 이용한 주성분분석 연구 (A study on principal component analysis using penalty method)

  • 박철용
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권4호
    • /
    • pp.721-731
    • /
    • 2017
  • 이 연구에서는 Lasso 페널티 방법을 이용한 주성분분석 방법을 소개한다. 주성분분석에 Lasso 페널티를 적용하는 방법으로 흔히 사용되는 방법은 크게 두 가지가 있다. 첫 번째 방법은 주성분을 반응변수로 놓고 원 자료행렬을 설명변수로 하는 회귀분석의 회귀계수를 이용하여 최적의 선형결 합 벡터를 구할 때 Lasso 페널티 (일반적으로 elastic net 페널티)를 부과하는 방법이다. 두 번째 방법은 원자료행렬을 비정칙값 분해로 근사하고 남은 잔차행렬에 Lasso 페널티를 부과하여 최적의 선형결합 벡터를 구하는 방법이다. 이 연구에서는 주성분 분석에 Lasso 페널티를 부과하는 이 두 가지 방법들을 자세하게 개관하는데, 이 방법들은 변수 숫자가 표본크기보다 큰 경우에도 적용가능한 장점이 있다. 또한 실제 자료분석에서 R 프로그램을 통해 두 방법을 적용하고 그 결과를 비교한다. 구체적으로 변수 숫자가 표본크기보다 큰 Ahamad (1967)의 crime 자료에 적용한다.

An Identification of Outlying Cells in Contingency Table via Correspondence Analysis Map

  • Hong, Chong Sun;Lee, Jong Cheol
    • Communications for Statistical Applications and Methods
    • /
    • 제8권1호
    • /
    • pp.39-49
    • /
    • 2001
  • When an appropriate model is fitted to explain a certain categorical data, outlying cell detection plays very important role to reduce the lack of fit. There exist many statistical methods to identify outlying cells in contingency table. In this paper, correspondence analysis is applied to identify one or two outlying cells. When corresponding relationships between categories of the row and columns are explored, we find that outlying cells could be identified via the correspondence analysis map.

  • PDF

ONE TYPE OF EDDY DEVELOPMENT IN THE NORTHEASTERN KUROSHIO BRANCH

  • Bulatov, Nafanail V.;Kapshiter, Alexander V.;Obukhova, Natalya G.
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2006년도 Proceedings of ISRS 2006 PORSEC Volume II
    • /
    • pp.926-929
    • /
    • 2006
  • Some features of vertical structure of the frontal interaction zone of the warm Kuroshio Current and cold Oyashio Current are known from 1930 from analysis of ship data. Ship data however do not allow carrying out the area detailed survey opposite to satellite infrared (IR) observations which possess by high spatial and temporal resolution. Analysis of NOAA AVHRR IR images demonstrated that process of formation and development of the Kuroshio warm core rings is highly complex. They are formed as a result of development of anticyclonic meanders of the warm Kuroshio waters and spin off them from the current. Joint analysis of thermal infrared images and altimetry data has also indicated that interaction of eddies to the frontal zone plays a crucial role in formation of large eddies moving to the Southern Kuril region.

  • PDF

An integrated approach for structural health monitoring using an in-house built fiber optic system and non-parametric data analysis

  • Malekzadeh, Masoud;Gul, Mustafa;Kwon, Il-Bum;Catbas, Necati
    • Smart Structures and Systems
    • /
    • 제14권5호
    • /
    • pp.917-942
    • /
    • 2014
  • Multivariate statistics based damage detection algorithms employed in conjunction with novel sensing technologies are attracting more attention for long term Structural Health Monitoring of civil infrastructure. In this study, two practical data driven methods are investigated utilizing strain data captured from a 4-span bridge model by Fiber Bragg Grating (FBG) sensors as part of a bridge health monitoring study. The most common and critical bridge damage scenarios were simulated on the representative bridge model equipped with FBG sensors. A high speed FBG interrogator system is developed by the authors to collect the strain responses under moving vehicle loads using FBG sensors. Two data driven methods, Moving Principal Component Analysis (MPCA) and Moving Cross Correlation Analysis (MCCA), are coded and implemented to handle and process the large amount of data. The efficiency of the SHM system with FBG sensors, MPCA and MCCA methods for detecting and localizing damage is explored with several experiments. Based on the findings presented in this paper, the MPCA and MCCA coupled with FBG sensors can be deemed to deliver promising results to detect both local and global damage implemented on the bridge structure.

마케팅 리서치에서 다중측정방법에 관한 실증적 연구 (A Case Study of Fashion Marketing Research using Multiple Methods)

  • 박혜정;김혜정;이영주;임숙자
    • 복식문화연구
    • /
    • 제10권6호
    • /
    • pp.601-616
    • /
    • 2002
  • Qualitative research is a method widely used in marketing research. However, the method has seldom been used in fashion marketing research in Korea. The purpose of this study was to prove that using both qualitative and quantitative research methods in main stage is much useful than using qualitative research method only in exploratory stage. Qualitative data were gathered by conducting Focus Group Interview(FGI) with 48 college students. Quantitative data were gathered by surveying college students, and 487 questionnaires were used in the statistical analysis. The data were analyzed using content analysis, mean, standard deviation, and t-test. As a result, FGI, one of the tools used in qualitative research methods, was proved to be useful in revealing consumers´deep emotional needs as well as purchase motives. FGI also revealed information which quantitative research method tools such as survey could have missed. Therefore, it is best to use multiple methods-simultaneous use of quantitative and qualitative methods-to understand fast changing consumers´needs and purchase motives.

  • PDF

VNURBS기반의 다차원 불균질 볼륨 객체의 표현: 모델링 및 응용 (Volumetric NURBS Representation of Multidimensional and Heterogeneous Objects: Modeling and Applications)

  • 박상근
    • 한국CDE학회논문집
    • /
    • 제10권5호
    • /
    • pp.314-327
    • /
    • 2005
  • This paper describes the volumetric data modeling and analysis methods that employ volumetric NURBS or VNURBS that represents heterogeneous objects or fields in multidimensional space. For volumetric data modeling, we formulate the construction algorithms involving the scattered data approximation and the curvilinear grid data interpolation. And then the computational algorithms are presented for the geometric and mathematical analysis of the volume data set with the VNURBS model. Finally, we apply the modeling and analysis methods to various field applications including grid generation, flow visualization, implicit surface modeling, and image morphing. Those application examples verify the usefulness and extensibility of our VNUBRS representation in the context of volume modeling and analysis.

A Comparative Study of Medical Data Classification Methods Based on Decision Tree and System Reconstruction Analysis

  • Tang, Tzung-I;Zheng, Gang;Huang, Yalou;Shu, Guangfu;Wang, Pengtao
    • Industrial Engineering and Management Systems
    • /
    • 제4권1호
    • /
    • pp.102-108
    • /
    • 2005
  • This paper studies medical data classification methods, comparing decision tree and system reconstruction analysis as applied to heart disease medical data mining. The data we study is collected from patients with coronary heart disease. It has 1,723 records of 71 attributes each. We use the system-reconstruction method to weight it. We use decision tree algorithms, such as induction of decision trees (ID3), classification and regression tree (C4.5), classification and regression tree (CART), Chi-square automatic interaction detector (CHAID), and exhausted CHAID. We use the results to compare the correction rate, leaf number, and tree depth of different decision-tree algorithms. According to the experiments, we know that weighted data can improve the correction rate of coronary heart disease data but has little effect on the tree depth and leaf number.

State Analysis and Location Tracking Technology through EEG and Position Data Analysis

  • Jo, Guk-Han;Song, Young-Joon
    • 한국정보기술학회 영문논문지
    • /
    • 제8권2호
    • /
    • pp.27-39
    • /
    • 2018
  • In this paper, we describe the algorithms, EEG classification methods, and position data analysis methods using EEG and ADS1299 sensors. In addition, it is necessary to manage the amount of real-time data of location data and EEG data and to extract data efficiently. To do this, we explain the process of extracting important information from a vast amount of data through a cloud server. The electrical signals extracted from the brain are measured to determine the psychological state and health status, and the measured positions can be collected using the position sensor and triangulation method.