• Title/Summary/Keyword: Statistical data interpretation

Search Result 174, Processing Time 0.025 seconds

Tree-Structured Nonlinear Regression

  • Chang, Young-Jae;Kim, Hyeon-Soo
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.5
    • /
    • pp.759-768
    • /
    • 2011
  • Tree algorithms have been widely developed for regression problems. One of the good features of a regression tree is the flexibility of fitting because it can correctly capture the nonlinearity of data well. Especially, data with sudden structural breaks such as the price of oil and exchange rates could be fitted well with a simple mixture of a few piecewise linear regression models. Now that split points are determined by chi-squared statistics related with residuals from fitting piecewise linear models and the split variable is chosen by an objective criterion, we can get a quite reasonable fitting result which goes in line with the visual interpretation of data. The piecewise linear regression by a regression tree can be used as a good fitting method, and can be applied to a dataset with much fluctuation.

A Study on the Calculation and Provision of Accruals-Quality by Big Data Real-Time Predictive Analysis Program

  • Shin, YeounOuk
    • International journal of advanced smart convergence
    • /
    • v.8 no.3
    • /
    • pp.193-200
    • /
    • 2019
  • Accruals-Quality(AQ) is an important proxy for evaluating the quality of accounting information disclosures. High-quality accounting information will provide high predictability and precision in the disclosure of earnings and will increase the response to stock prices. And high Accruals-Quality, such as mitigating heterogeneity in accounting information interpretation, provides information usefulness in capital markets. The purpose of this study is to suggest how AQ, which represents the quality of accounting information disclosure, is transformed into digitized data in real-time in combination with IT information technology and provided to financial analyst's information environment in real-time. And AQ is a framework for predictive analysis through big data log analysis system. This real-time information from AQ will help financial analysts to increase their activity and reduce information asymmetry. In addition, AQ, which is provided in real time through IT information technology, can be used as an important basis for decision-making by users of capital market information, and is expected to contribute in providing companies with incentives to voluntarily improve the quality of accounting information disclosure.

Assessment through Statistical Methods of Water Quality Parameters(WQPs) in the Han River in Korea

  • Kim, Jae Hyoun
    • Journal of Environmental Health Sciences
    • /
    • v.41 no.2
    • /
    • pp.90-101
    • /
    • 2015
  • Objective: This study was conducted to develop a chemical oxygen demand (COD) regression model using water quality monitoring data (January, 2014) obtained from the Han River auto-monitoring stations. Methods: Surface water quality data at 198 sampling stations along the six major areas were assembled and analyzed to determine the spatial distribution and clustering of monitoring stations based on 18 WQPs and regression modeling using selected parameters. Statistical techniques, including combined genetic algorithm-multiple linear regression (GA-MLR), cluster analysis (CA) and principal component analysis (PCA) were used to build a COD model using water quality data. Results: A best GA-MLR model facilitated computing the WQPs for a 5-descriptor COD model with satisfactory statistical results ($r^2=92.64$,$Q{^2}_{LOO}=91.45$,$Q{^2}_{Ext}=88.17$). This approach includes variable selection of the WQPs in order to find the most important factors affecting water quality. Additionally, ordination techniques like PCA and CA were used to classify monitoring stations. The biplot based on the first two principal components (PCs) of the PCA model identified three distinct groups of stations, but also differs with respect to the correlation with WQPs, which enables better interpretation of the water quality characteristics at particular stations as of January 2014. Conclusion: This data analysis procedure appears to provide an efficient means of modelling water quality by interpreting and defining its most essential variables, such as TOC and BOD. The water parameters selected in a COD model as most important in contributing to environmental health and water pollution can be utilized for the application of water quality management strategies. At present, the river is under threat of anthropogenic disturbances during festival periods, especially at upstream areas.

Statistical Consideration on the Resources of the Countries in the World (세계 각국의 자원에 대한 통계적 고찰)

  • Huh, Moon-Yul;Choi, Byong-Su;Lee, Seung-Chun
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.1
    • /
    • pp.41-57
    • /
    • 2009
  • The paper investigates the resources of the 232 countries based on the 39 resources of these countries. The data used in this work is from various sources like UN, CIA, World bank, OECD reports and the home pages of each country. The purpose of the study is to evaluate what resources are most influential to the wealth of a country, to the well-bring of the country, or the status of the country's development. For this, data visualization method is applied. Data visualization technique, although powerful for exploratory purposes, is dependent upon the users expertize and the interpretation is also dependent on the of the users. For objective methods of investigation, mutual information based on the Shanon's entropy theory is applied here. All the statistical methods employed in this paper are processed with DAVIS (Huh and Song, 2002)

Changes in Statistical Knowledge and Experience of Data-driven Decision-making of Pre-service Teachers who Participated in Data Analysis Projects (데이터 분석 프로젝트 참여한 예비 교사의 통계적 지식에 대한 변화와 데이터 기반 의사 결정의 경험)

  • Suh, Heejoo;Han, Sunyoung
    • Communications of Mathematical Education
    • /
    • v.35 no.2
    • /
    • pp.153-172
    • /
    • 2021
  • Various competencies such as critical thinking, systems thinking, problem solving competence, communication skill, and data literacy are likely to be required in the 4th industrial revolution. The competency regarding data literacy is one of those competencies. To nurture citizens who will live in the future, it is timely to consider research on teacher education for supporting teachers' development of statistical thinking as well as statistical knowledge. Therefore, in this study we developed and implemented a data analysis project for pre-service teachers to understand their changes in statistical knowledge in addition to their experiences of data-driven decision making process that required them utilizing their statistical thinking. We used a mixed method (i.e., sequential explanatory design) research to analyze the quantitative and qualitative data collected. The findings indicated that pre-service teachers have low knowledge level of their understanding on the relationship between population means and sample means, and estimation of the population mean and its interpretation. When it comes to the data-driven decision making process, we found that the pre-service teachers' experiences varied even when they worked as a small group for the project. We end this paper by presenting implications of the study for the fields of teacher education and statistics education.

Digital Image Processing of Side Scan Sonar for Underwater Man-made Structure (수중 인공구조물에 대한 사이드스캔소나 탐사자료의 영상처리)

  • Shin, Sung-Ryul;Lim, Min-Hyuk;Kim, Kwang-Eun
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.33 no.2
    • /
    • pp.344-354
    • /
    • 2009
  • Side scan sonar using acoustic wave plays a very important role in the underwater, sea floor, and shallow marine geologic survey. In this study, we have acquired side scan sonar data for the underwater man-made structures, artificial reefs and fishing grounds, installed and distributed in the survey area. We applied digital image processing techniques to side scan sonar data in order to improve and enhance an image quality. We carried out digital image processing with various kinds of filtering in spatial domain and frequency domain. We tested filtering parameters such as kernel size, differential operator, and statistical value. We could easily estimate the conditions, distribution and environment of artificial structures through the interpretation of side scan sonar.

THE NUMERICAL IMPLEMENTATION OF RISK

  • Lee, Chun-Jin
    • Journal of applied mathematics & informatics
    • /
    • v.2 no.2
    • /
    • pp.53-62
    • /
    • 1995
  • If one is to estimate environmemtal risk based on data or predict risk based on expert opinion the parameter environmental risk musk be defined precisely so that when data becomes available the numerical values of the estimates and/or prediction can be evaluated. Also the definitionmust be precise so that it may be successfully used in regulatory and litigation activities. The presentation is a develop-ment of a definition which lends to statistical analysis and to inference in addition lends to ease of engineering interpretation. Various impli-cations and useful extensions in measuring numerically for two or more dimensional mixed effects of several toxicants could be developed in further research.

A Study on the Realities and Activation of Community Participation of Young Farmers (영농청소년의 지역사회참여실태 및 활성화 방안)

  • Lee, Chae-Shik;Park, Eun-Shik
    • Journal of Agricultural Extension & Community Development
    • /
    • v.14 no.2
    • /
    • pp.395-415
    • /
    • 2007
  • The purposes of this study were to investigate the realities of community participation of young farmers and to suggest measures to activate community participation. The data were collected from 234 young farmers from rural Korea. With SPSS 13.0 program for Windows, Frequency, t-test, ANOVA, LSD for post-hoc interpretation and Factor Analysis were employed to analyze the data with statistical significance level of .05. The main results of the study and suggestions were as follows: 1) Young farmers were more likely to participate in watching television on community, discussion with others and internet search for community, while, rural youths were less likely to participate in contacting government and parliament. 2) Difficulties of community participations of young farmers were lack of time and insufficient information about participatory activities. The study suggested that young farmers should get more opportunities to participate in diverse types of active opportunities and practical information.

  • PDF

A STUDY ON SYNTHETIC GENERATION OF MONTHLY STREAMFLOW BY BIVARIATE ANALYSIS (BIVARIATE ANALYSIS에 의한 월류량에 모의발생에 관한 연구)

  • Seo, Byeong-Ha;Yun, Yong-Nam;Gang, Gwan-Won
    • Water for future
    • /
    • v.12 no.2
    • /
    • pp.63-69
    • /
    • 1979
  • The sequences of monthly streamflows constitute a non-statonary time series. The purely stochastic model has been applied to data generation of non-stationary time series. Tow different mothods--single site and multisite generation--have been used on the hydrologic time series. In this study the synthetic generation method by bivariate analysis, studied by Thomas Fiering, one of multi-site models, has been applied to the historical data on monthly streamflows at two sites in Nakdong River, and also for validity of this model the single site Thomas Fiering model applied. Through statistical analysis it has been shown that the performance of bivariate Thomas Fiering model was better than that of the other. By comparison of mean and standard deviaion between the historical and the generated, and cross correlogram interpretation, it has been known that the model used herein has good performance to simultaneously generate the monthly streamflows at two sites in a river hasin.

  • PDF

Landsilde Analysis of Yongin Area Using Spatial Database (공간 데이터베이스를 이용한 1991년 용인지역 산사태 분석)

  • 이사로;민경덕
    • Economic and Environmental Geology
    • /
    • v.33 no.4
    • /
    • pp.321-332
    • /
    • 2000
  • The purpose of this study is to analyze landslide that occurred in Yongin area in 1991 using spatial database. For this, landslide locations are detected from aerial photographs interpretation and field survey. The locations of landslide, topography, soil, forest and geology were constructed to spatial database using Geographic Information System (GIS). To establish occurrence factors of landslide, slope, aspect and curvature of topography were calculated from the topographic database. Texture, material, drainage and effective thickness of soil were extracted from the soil database, and type, age, diameter and density of wood were extracted from the forest database. Lithology was extracted from the geological database, and land use was classified from the TM satellite image. Landslide was analyzed using spatial correlation between the landslide and the landslide occurrence factors by bivariate probability methods. GIS was used to analyze vast data efficiently and statistical programs were used to maintain specialty and accuracy. The result can be used to prevention of hazard, land use planning and construction planning as basic data.

  • PDF