• Title/Summary/Keyword: Statistical data

Search Result 15,004, Processing Time 0.037 seconds

Quantitative Analysis for Plasma Etch Modeling Using Optical Emission Spectroscopy: Prediction of Plasma Etch Responses

  • Jeong, Young-Seon;Hwang, Sangheum;Ko, Young-Don
    • Industrial Engineering and Management Systems
    • /
    • v.14 no.4
    • /
    • pp.392-400
    • /
    • 2015
  • Monitoring of plasma etch processes for fault detection is one of the hallmark procedures in semiconductor manufacturing. Optical emission spectroscopy (OES) has been considered as a gold standard for modeling plasma etching processes for on-line diagnosis and monitoring. However, statistical quantitative methods for processing the OES data are still lacking. There is an urgent need for a statistical quantitative method to deal with high-dimensional OES data for improving the quality of etched wafers. Therefore, we propose a robust relevance vector machine (RRVM) for regression with statistical quantitative features for modeling etch rate and uniformity in plasma etch processes by using OES data. For effectively dealing with the OES data complexity, we identify seven statistical features for extraction from raw OES data by reducing the data dimensionality. The experimental results demonstrate that the proposed approach is more suitable for high-accuracy monitoring of plasma etch responses obtained from OES.

Analysis of various statistical techniques used in the articles published during last 19 years in The Journal of Korean Acupuncture & Moxibusition Society (침구학회지 논문에 응용된 통계방식에 관한 연구 -1984 창간호부터 2002년 19권 6호까지 19년간-)

  • Lee, Seung-deok
    • Journal of Acupuncture Research
    • /
    • v.20 no.1
    • /
    • pp.144-158
    • /
    • 2003
  • This study was carried out to investigate what kinds of statistical techniques have been used to analyze data from oriental medicine research, For study, 551 original articles which used statistical techniques in their data analysis were selected form the articles published in The journal of Korean Acupuncture & Moxibustion Society(JKAMS) between 1984 to 2002. among them, 122 articles used descriptive statistics while 429 articles used inferential statistics for data analysis. For that 429 articles, t-test (189 articles), analysis fo variance (111 articles), chi-square test (14 articles), correlation (10 articles), regression analysis (4 articles), factor analysis(5 articles), or nonparametric test (23 articles) were chose to analyze the data. Nonparametric approach has substantial power in case data do not meet the assumption of normality. This method is not only easy to use ut also provides measures of the statistical variation of nominal and ordinal scale. This study shows that more and more recent papers use nonparametric test compared to the old articles. nine different statistical software or packages (SAS, SPSS, Statview, Minitab, Sigma plot, ISP, Graphpad prism, Excel, Access) have been used in the articles published JKMAS. High level statistical techniques such as SAS, SPSS, and Statview are user friendly and used most for acupuncture and Moxibustion research. Including tables and plots in an article facilitates understanding family process data from a descriptive standpoint, minimized erroneous statistical conclusions, and clarifies theoretically important relationships among variables. Table and plots have been used 500 and 233 articles, respectively. A computer procedure is proposed and illustrated with statistical packages using SAS, SPSS, Statview and ISP.

  • PDF

Statistical Literacy of Fifth and Sixth Graders for the Data Presentation Task Based on the Speculative Data Generation Process (가상적 자료 생성 과정에 기반을 둔 자료 표현 과제에 대한 초등학교 5, 6학년 학생들의 통계적 소양)

  • Moon, Eun-Hye;Lee, Kwangho
    • Education of Primary School Mathematics
    • /
    • v.21 no.4
    • /
    • pp.397-413
    • /
    • 2018
  • The purpose of this study is to analyze the level of statistical literacy among fifth and sixth graders in the data presentation task based on the speculative data generation process. For the research, the data presentation tasks based on the speculative data generation process was designed and statistical literacy standards for evaluating the student's level was presented based on prior studies. It is meaningful that the stepwise presentation of the students' statistical literacy and analysis of their developmental patterns can help them to find their current position and reach a higher level of performance. In this study, the standard of statistical literacy level was clarified based on the previous research, and a new perspective was presented about the data presentation instruction in the statistical education by analyzing the students' responses by each level.

Iterative integrated imputation for missing data and pathway models with applications to breast cancer subtypes

  • Linder, Henry;Zhang, Yuping
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.4
    • /
    • pp.411-430
    • /
    • 2019
  • Tumor development is driven by complex combinations of biological elements. Recent advances suggest that molecularly distinct subtypes of breast cancers may respond differently to pathway-targeted therapies. Thus, it is important to dissect pathway disturbances by integrating multiple molecular profiles, such as genetic, genomic and epigenomic data. However, missing data are often present in the -omic profiles of interest. Motivated by genomic data integration and imputation, we present a new statistical framework for pathway significance analysis. Specifically, we develop a new strategy for imputation of missing data in large-scale genomic studies, which adapts low-rank, structured matrix completion. Our iterative strategy enables us to impute missing data in complex configurations across multiple data platforms. In turn, we perform large-scale pathway analysis integrating gene expression, copy number, and methylation data. The advantages of the proposed statistical framework are demonstrated through simulations and real applications to breast cancer subtypes. We demonstrate superior power to identify pathway disturbances, compared with other imputation strategies. We also identify differential pathway activity across different breast tumor subtypes.

Optimal Designs for Multivariate Nonparametric Kernel Regression with Binary Data

  • Park, Dong-Ryeon
    • Communications for Statistical Applications and Methods
    • /
    • v.2 no.2
    • /
    • pp.243-248
    • /
    • 1995
  • The problem of optimal design for a nonparametric regression with binary data is considered. The aim of the statistical analysis is the estimation of a quantal response surface in two dimensions. Bias, variance and IMSE of kernel estimates are derived. The optimal design density with respect to asymptotic IMSE is constructed.

  • PDF

An Analysis on Error Types of Graphs for Statistical Literacy Education: Ethical Problems at Data Analysis in the Statistical Problem Solving (통계적 소양 교육을 위한 그래프 오류 유형 분석: 자료 분석 단계에서의 통계 윤리 문제)

  • Tak, Byungjoo;Kim, Dabin
    • Journal of Elementary Mathematics Education in Korea
    • /
    • v.24 no.1
    • /
    • pp.1-30
    • /
    • 2020
  • This study was carried out in order to identify the error types of statistical graphs for statistical literacy education. We analyze the meaning of using graphs in statistical problem solving, and identify categories, frequencies, and contexts as the components of statistical graphs. Error types of representing categories and frequencies make statistics consumers see incorrect distributions of data by subjective point of view of statistics producers and visual illusion. Error types of providing contexts hinder the interpretation of statistical information by concealing or twisting the contexts of data. Moreover, the findings show that tasks provide standardized frame already for drawing graphs in order to avoid errors and pay attention to the process of drawing the graph rather than statistical literacy for analyzing data. We suggest some implications about statistical literacy education, ethical problems, and knowledge for teaching to be considered when teaching the statistical graph in elementary mathematics classes.

Analyzing seventh graders' statistical thinking through statistical processes by phases and instructional settings (통계적 과정의 학습에서 나타난 중학교 1학년 학생들의 단계별·수업 형태별 통계적 사고 분석)

  • Kim, Ga Young;Kim, Rae Young
    • The Mathematical Education
    • /
    • v.58 no.3
    • /
    • pp.459-481
    • /
    • 2019
  • This study aims to investigate students' statistical thinking through statistical processes in different instructional settings: Teacher-centered instruction vs. student-centered learning. We first developed instructional materials that allowed students to experience all the processes of statistics, including data collection, data analysis, data representation, and interpretation of the results. Using the instructional materials for four classes, we collected and analyzed the data from 57 seventh graders' discourse and artifacts from two different instructional settings using the analytic framework generated on the basis of literature review. The results showed that students felt difficulty particularly in the process of data collection and graph representations. In addition, even though data description has been heavily emphasized for data analysis in statistics education, it is surprisingly discovered that students had a hard time to understand the relationship between data and representations. Also, there were relationships between students' statistical thinking and instructional settings. Even though both groups of students showed difficulty in data collection and graph representations of the data, there were significant differences between the groups in terms of their performance. Whereas students from student-centered learning class outperformed in making decisions considering verification and justification, students from teacher-centered lecture class did better in problems requiring accuracy than the counterpart. The results from the study provide meaningful implications on developing curriculum and instructional methods for statistics education.

Cubic normal distribution and its significance in structural reliability

  • Zhao, Yan-Gang;Lu, Zhao-Hui
    • Structural Engineering and Mechanics
    • /
    • v.28 no.3
    • /
    • pp.263-280
    • /
    • 2008
  • Information on the distribution of the basic random variable is essential for the accurate analysis of structural reliability. The usual method for determining the distributions is to fit a candidate distribution to the histogram of available statistical data of the variable and perform approximate goodness-of-fit tests. Generally, such candidate distribution would have parameters that may be evaluated from the statistical moments of the statistical data. In the present paper, a cubic normal distribution, whose parameters are determined using the first four moments of available sample data, is investigated. A parameter table based on the first four moments, which simplifies parameter estimation, is given. The simplicity, generality, flexibility and advantages of this distribution in statistical data analysis and its significance in structural reliability evaluation are discussed. Numerical examples are presented to demonstrate these advantages.

Effects of Spreadsheet-used Instruction on Statistical Thinking and Attitude (스프래드시트를 활용한 수엽이 통계적 사고 및 태도에 미치는 효과)

  • Lee, Jong-Hak;Kim, Won-Kyoung
    • The Mathematical Education
    • /
    • v.50 no.2
    • /
    • pp.185-212
    • /
    • 2011
  • The purpose of this study is to analyze whether spreadsheet-used instruction can improve statistical thinking ability and attitude and also to identify what characteristics of statistical thinking is constructed. For this study, a subject of 2 classes were randomly selected among the 12 classes of the 11th grader in D high school and designated one class as the experimental group and the other class as the control group. Eight hours of the spread sheet-used instruction and the traditional textbook-oriented instruction had been carried out in each class. The research findings are as follows. First, the spread sheet-used instruction is shown to be more effective in enhancing statistical thinking than the traditional textbook-oriented instruction. Second, the spread sheet-used instruction is shown to be more effective in improving statistical attitude than the traditional textbook-oriented instruction. Third, students have shown the various characteristics of statistical thinking in the data descriptive process, data arrange-summary process, data representing process, and data analying process through the spread sheet-used instructions. Hence, the spread sheet-used instruction is recommended in teaching statistics.

A Proposal of Some Analysis Methods for Discovery of User Information from Web Data

  • Ahn, JeongYong;Han, Kyung Soo
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.1
    • /
    • pp.281-289
    • /
    • 2001
  • The continuous growth in the use of the World Wide Web is creating the data with very large scale and different types. Analyzing such data can help to determine the life time value of users, evaluate the effectiveness of web sites, and design marketing strategies and services. In this paper, we propose some analysis methods for web data and present an example of a prototypical web data analysis.

  • PDF