• Title/Summary/Keyword: Data Bias

Analysis of periodontal data using mixed effects models

  • Cho, Young Il;Kim, Hae-Young
    • Journal of Periodontal and Implant Science / v.45 no.1 / pp.2-7 / 2015
  • A fundamental problem in analyzing complex multilevel-structured periodontal data is the violation of independence among the observations, which is an assumption in traditional statistical models (e.g., analysis of variance and ordinary least squares regression). In many cases, aggregation (i.e., mean or sum scores) has been employed to overcome this problem. However, the aggregation approach still exhibits certain limitations, such as a loss of power and detailed information, no cross-level relationship analysis, and the potential for creating an ecological fallacy. In order to handle multilevel-structured data appropriately, mixed effects models have been introduced and employed in dental research using periodontal data. The use of mixed effects models might account for the potential bias due to the violation of the independence assumption as well as provide accurate estimates.

Design of a ParamHub for Machine Learning in a Distributed Cloud Environment

  • Su-Yeon Kim;Seok-Jae Moon
    • International Journal of Internet, Broadcasting and Communication / v.16 no.2 / pp.161-168 / 2024
  • As the size of big data models grows, distributed training is emerging as an essential element for large-scale machine learning tasks. In this paper, we propose ParamHub for distributed data training. During the training process, this agent utilizes the provided data to adjust various conditions of the model's parameters, such as the model structure, learning algorithm, hyperparameters, and bias, aiming to minimize the error between the model's predictions and the actual values. Furthermore, it operates autonomously, collecting and updating data in a distributed environment, thereby reducing the burden of load balancing that occurs in a centralized system. Through communication between agents, resource management and learning processes can be coordinated, enabling efficient management of distributed data and resources. This approach enhances the scalability and stability of distributed machine learning systems while providing the flexibility to be applied in various learning environments.

An Approach for the NHPP Software Reliability Model Using Erlang Distribution (어랑 분포를 이용한 NHPP 소프트웨어 신뢰성장 모형에 관한 연구)

  • Kim Hee-Cheul;Choi Yue-Soon;Park Jong-Goo
    • Journal of the Korea Institute of Information and Communication Engineering / v.10 no.1 / pp.7-14 / 2006
  • The finite failure NHPP models proposed in the literature exhibit either constant, monotonically increasing, or monotonically decreasing failure occurrence rates per fault. In this paper, we propose the Erlang reliability model, which can capture the increasing nature of the failure occurrence rate per fault. Equations to estimate the parameters of the Erlang finite failure NHPP model based on failure data collected in the form of inter-failure times are developed. To select the shape parameter of the Erlang distribution, we used a goodness-of-fit test. A data set whose underlying failure process could not be adequately described by the existing models motivated the development of the Erlang model. An analysis of this failure data set using arithmetic and Laplace trend tests, a goodness-of-fit test, and bias tests is presented.

Experience Way of Artificial Intelligence PLAY Educational Model for Elementary School Students

  • Lee, Kibbm;Moon, Seok-Jae
    • International Journal of Internet, Broadcasting and Communication / v.12 no.4 / pp.232-237 / 2020
  • Given the recent pace of development and expansion of Artificial Intelligence (AI) technology, its influence and ripple effects on every part of our lives will be very large and will spread rapidly. The National Artificial Intelligence R&D Strategy, published in 2019, emphasizes the importance of artificial intelligence education for K-12 students. It also mentions STEM education, an AI convergence curriculum, and a budget for supporting the development of teaching materials and tools. However, a new type of curriculum must be created at a time when no artificial intelligence curriculum has existed before, with many attempts and discussions moving very fast in all countries from almost the same starting line. Also, there are no suitable instructors for K-12 students, and it is difficult to make K-12 students understand the concept of AI. In particular, it is difficult to teach elementary school students through professional programming in AI education. It is also difficult to learn tools that can teach AI concepts. In this paper, we propose an educational model for elementary school students to improve their understanding of AI through play or experience. This is an experiential education model that combines exploratory learning and discovery learning, using multi-intelligence and the PLAY teaching-learning model, to understand the importance of data training or the data required for AI education. This educational model is designed to teach how a computer that knows only binary numbers recognizes images through UA. Through code.org, students learn about AI robots, with the activities configured so that they come to understand data bias through play. In addition, students train a computer on images directly through TeachableMachine, a tool capable of supervised learning, to understand the concepts of dataset, learning process, and accuracy, and a process of AI inference is proposed.

NHPP Software Reliability Model based on Generalized Gamma Distribution (일반화 감마 분포를 이용한 NHPP 소프트웨어 신뢰도 모형에 관한 연구)

  • Kim, Hee-Cheul
    • Journal of the Korea Society of Computer and Information / v.10 no.6 s.38 / pp.27-36 / 2005
  • Finite failure NHPP models presented in the literature exhibit either constant, monotonically increasing, or monotonically decreasing failure occurrence rates per fault. This paper proposes a reliability model using the generalized gamma distribution, which can capture the monotonically increasing (or monotonically decreasing) nature of the failure occurrence rate per fault. Equations to estimate the parameters of the generalized gamma finite failure NHPP model based on failure data collected in the form of inter-failure times are developed. To select the shape parameter of the generalized gamma distribution, a special pattern was used. A data set whose underlying failure process could not be adequately described by the known models motivated the development of the gamma or Weibull model. An analysis of the failure data set for the generalized gamma model using arithmetic and Laplace trend tests, a goodness-of-fit test, and bias tests is presented.

Estimation of Launch Vehicle Tracking Error due to Radio Refraction (레이다 전파굴절에 의한 발사체 추적오차 추정)

  • Seo, Gwang-Gyo;Kim, Yoonsoo;Shin, Vladimir;Song, Ha-Ryong;Choi, Yong-Tae
    • Journal of the Korean Society for Aeronautical & Space Sciences / v.45 no.12 / pp.1076-1083 / 2017
  • This paper discusses error estimation in radar measurement data obtained while tracking a launch vehicle. It is known that typical radar measurement data consist of the true positional or orientation information on the vehicle being tracked, random noise, and a deterministic bias due to radio refraction. Unlike previous research, this paper proposes a tracking-error (mainly bias) estimation method based solely on a single radar measurement, without the aid of other measurements such as GPS. The proposed method has been verified with real measurement data obtained while tracking the KSLV-I launch vehicle.

The Analysis of Changma Structure using Radiosonde Observational Data from KEOP-2007: Part I. the Assessment of the Radiosonde Data (KEOP-2007 라디오존데 관측자료를 이용한 장마 특성 분석: Part I. 라디오존데 관측 자료 평가 분석)

  • Kim, Ki-Hoon;Kim, Yeon-Hee;Chang, Dong-Eon
    • Atmosphere / v.19 no.2 / pp.213-226 / 2009
  • In order to investigate the characteristics of Changma over the Korean peninsula, the KEOP-2007 IOP (Intensive Observing Period) was conducted from 15 June 2007 to 15 July 2007. The KEOP-2007 IOP comprised radiosonde observations (RAOB) of high spatial and temporal resolution from three special stations (Munsan, Haenam, and Ieodo) of the National Institute of Meteorological Research, five operational stations (Sokcho, Baengnyeongdo, Pohang, Heuksando, and Gosan) of the Korea Meteorological Administration (KMA), and two operational stations (Osan and Gwangju) of the Korean Air Force (KAF), using four different types of radiosonde sensors. The error statistics of the radiosonde sensors were investigated using a quality control check. The minimum and maximum error frequencies occur for the RS92-SGP and RS1524L sensors, respectively. The error frequency of DFM-06 tends to increase below 200 hPa, whereas RS80-15L and RS1524L show the opposite tendency. In particular, the error frequency of RS1524L tends to increase rapidly over 200 hPa. Compared with ECMWF (European Centre for Medium-Range Weather Forecasts) analysis data and precipitable water vapor from GPS, the radiosondes show systematic warm biases in temperature and dry biases in relative humidity. The maximum and minimum systematic biases occur for the DFM-06 and RS92-SGP sensors in the case of temperature, and for RS80-15L and DFM-06 in the case of relative humidity. The systematic warm and dry biases of all sensors tend to be larger during daytime than nighttime because the air temperature around the sensor increases due to solar heating during daytime. Systematic biases of the radiosondes are affected by the sensor type and the height of the sun, but random errors are more correlated with the moisture conditions at each observation station.

An Uncertainty Assessment for Annual Variability of Precipitation Simulated by AOGCMs Over East Asia (AOGCM에 의해 모의된 동아시아지역의 강수 연변동성에 대한 불확실성 평가)

  • Shin, Jinho;Lee, Hyo-Shin;Kim, Minji;Kwon, Won-Tae
    • Atmosphere / v.20 no.2 / pp.111-130 / 2010
  • An uncertainty assessment for precipitation datasets simulated by Atmosphere-Ocean Coupled General Circulation Models (AOGCMs) is conducted to provide a reliable climate scenario over East Asia. Most of the results overestimate precipitation compared to the observational data (wet bias) in spring, fall, and winter, while they underestimate precipitation (dry bias) in summer in East Asia. Models with higher spatial resolution perform better in simulating precipitation. To assess the uncertainty of spatiotemporal precipitation in East Asia, cyclostationary empirical orthogonal function (CSEOF) analysis is applied. The annual cycle of precipitation obtained from the CSEOF analysis accounts for the largest share of the total variability. A comparison between the annual cycles of observed and modeled precipitation anomalies shows distinct differences: 1) The positive precipitation anomalies of the multi-model ensemble (MME) of 20 models (hereafter MME20) in summer are located farther north than in the observational data, so the MME20 cannot explain summer monsoon rainfall across Korea and Japan. 2) The onset of the summer monsoon over the Korean peninsula in MME20 occurs earlier than the observed one. These differences show the uncertainty of modeled precipitation. The comparison also provides criteria, based on the annual cycle and the correlation between modeled and observational data, that help to select the best models and generate a new MME that is better than the MME20. The spatiotemporal deviation of precipitation is significantly associated with lower-level circulations. In particular, lower-level moisture transport from the warm pool of the western Pacific and the corresponding moisture convergence are strongly associated with summer rainfall. These lower-level circulations, which are physically consistent with precipitation, give insight into why the behavior of individually modeled precipitation differs from that of observations in the East Asian monsoon.

Verification of the Global Numerical Weather Prediction Using SYNOP Surface Observation Data (SYNOP 지상관측자료를 활용한 수치모델 전구 예측성 검증)

  • Lee, Eun-Hee;Choi, In-Jin;Kim, Ki-Byung;Kang, Jeon-Ho;Lee, Juwon;Lee, Eunjeong;Seol, Kyung-Hee
    • Atmosphere / v.27 no.2 / pp.235-249 / 2017
  • This paper describes a methodology for verifying the near-surface predictability of numerical weather prediction models against surface synoptic weather station network (SYNOP) observations. The verification variables included in this tool are temperature, wind, humidity-related variables, total cloud cover, and surface pressure. SYNOP observations that have been quality controlled through the pre-processing for data assimilation are used. To account for the difference in topographic height between observation and model grid points, vertical interpolation or extrapolation is applied for temperature, humidity, and surface pressure verification. This verification algorithm is applied to medium-range forecasts by a global forecasting model developed by the Korea Institute of Atmospheric Prediction Systems, in order to measure the near-surface predictability of the model and to evaluate the capability of the developed verification tool. It is found that the verification of near-surface prediction against SYNOP observations is consistent with the verification of the upper atmosphere against global radiosonde observations, suggesting the reliability of those data and demonstrating the importance of verification against in-situ measurements as well. Although verifying modeled total cloud cover against observations might be limited by the different definitions used in the model and the observations, the tool can still diagnose the relative bias of model predictability, such as regional reliability and the diurnal evolution of the bias.

The Effect of Managerial Ability on Analysts' Earnings Forecast (경영자 능력이 재무분석가 이익예측 정보에 미치는 영향)

  • Park, Bo-Young
    • Management & Information Systems Review / v.35 no.4 / pp.213-227 / 2016
  • This study examines the effects of managerial ability on information asymmetry. We use analyst forecast errors as a proxy for information asymmetry, because analysts are regarded as efficient users of firm-level data. The sample consists of 2,246 non-banking firm-years listed on the Korea Stock Exchange (KOSPI) during the period 2000 to 2013. We measure managerial ability using DEA (Data Envelopment Analysis), following Demerjian et al. (2012). Using those measures, we examine the effects of managerial ability on analysts' earnings forecast errors and analysts' earnings forecast bias. The results of this study are as follows. First, we find that managerial ability is positively associated with analysts' earnings forecast accuracy. Second, we show that firms with higher managerial ability tend to have lower optimistic errors in analysts' earnings forecasts. This study could be useful for outside stakeholders in understanding the importance of managerial ability.
