• Title/Summary/Keyword: 편향된 데이터

Search Result 160, Processing Time 0.024 seconds

Shrinkage Structure of Ridge Partial Least Squares Regression

  • Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.2
    • /
    • pp.327-344
    • /
    • 2007
  • Ridge partial least squares regression (RPLS) is a regression method which can be obtained by combining ridge regression and partial least squares regression and is intended to provide better predictive ability and less sensitive to overfitting. In this paper, explicit expressions for the shrinkage factor of RPLS are developed. The structure of the shrinkage factor is explored and compared with those of other biased regression methods, such as ridge regression, principal component regression, ridge principal component regression, and partial least squares regression using a near infrared data set.

  • PDF

Face Recognition Network using gradCAM (gradCam을 사용한 얼굴인식 신경망)

  • Chan Hyung Baek;Kwon Jihun;Ho Yub Jung
    • Smart Media Journal
    • /
    • v.12 no.2
    • /
    • pp.9-14
    • /
    • 2023
  • In this paper, we proposed a face recognition network which attempts to use more facial features awhile using smaller number of training sets. When combining the neural network together for face recognition, we want to use networks that use different part of the facial features. However, the network training chooses randomly where these facial features are obtained. Other hand, the judgment basis of the network model can be expressed as a saliency map through gradCAM. Therefore, in this paper, we use gradCAM to visualize where the trained face recognition model has made a observations and recognition judgments. Thus, the network combination can be constructed based on the different facial features used. Using this approach, we trained a network for small face recognition problem. In an simple toy face recognition example, the recognition network used in this paper improves the accuracy by 1.79% and reduces the equal error rate (EER) by 0.01788 compared to the conventional approach.

Measurement of Refractive Index of Liquids by the Maximum and Minimum Deviated Laser Beam (레이저광의 최대.최소 편향법을 이용한 액체의 굴절률 측정)

  • Lee, Jae-Ran;Kim, Sok-Won;Lee, Yong-San
    • Korean Journal of Optics and Photonics
    • /
    • v.19 no.3
    • /
    • pp.182-186
    • /
    • 2008
  • The prism spectrometer is a standard device for the measurement of refractive index; it is used in undergraduate laboratories. Typically, however, lots of attention is required in the alignment, and the accuracy of the obtained refractive index is not so high in spite of the durability of the device. The maximum and minimum deviation method, which compensates the disadvantages of the prism spectrometer, can be composed cost effectively using a length marking tape and a rotating platform. It can measure the refractive indices accurately by utilizing a wide screen. In this study, the equal sided hollow prism whose length is 26 mm was fabricated and measured the refractive indices of seven kind of liquids (pure water, $C_3H_5(OH)_2$, $CCl_4$, $C_6H_4NH_2$, $CS_2$, $C_6H_4(CH_3)_2)$ by using the prism spectrometer and maximum and minimum deviated laser beam method at the wavelengths of He-Ne laser (${\lambda}$= 632.8 nm) and YVO4 laser (${\lambda}$= 532 nm). The result shows that the data obtained by the latter method are more accurate and precise than those obtained by the former device.

BigData Research in Information Systems : Focusing on Journal Articles about Information Systems (정보시스템 분야의 빅데이터 연구 흐름 분석 : Information Systems 관련 저널을 중심으로)

  • Park, Kyungbo;Kim, Juyeong;Kim, Han-Min
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.9 no.6
    • /
    • pp.681-689
    • /
    • 2019
  • The 46th Davos Forum of the World Economic Forum (WEF) predicts the continued growth of the 4th industry in the future. Currently, the 4th industry is attracting attention in various academic and practical fields. As a core technology of the 4th industry, Big Data is regarded as a major resource to lead the 4th industrial revolution along with artificial intelligence. As the growing interest in Big Data, researches on it are actively being done. However, literature studies on existing Big Data are focused on qualitative research, and quantitative research is insufficient. Therefore, this study aims to analyze the big data research flow in MIS field and to make academic thirst for quantification. This study has collected 145 abstracts of big data papers published in major journals in MIS field and confirmed that a majority of papers are published in Decision Support Systems Journal. Text mining and text network analysis were performed only for DSS journals to eliminate bias. As a result of the analysis, it was found out that researches on combining big data in the management field between 2012 and 2014, and researches on system development and analysis method for using big data from 2015 to 2017 were conducted.

Pairwise Neural Networks for Predicting Compound-Protein Interaction (약물-표적 단백질 연관관계 예측모델을 위한 쌍 기반 뉴럴네트워크)

  • Lee, Munhwan;Kim, Eunghee;Kim, Hong-Gee
    • Korean Journal of Cognitive Science
    • /
    • v.28 no.4
    • /
    • pp.299-314
    • /
    • 2017
  • Predicting compound-protein interactions in-silico is significant for the drug discovery. In this paper, we propose an scalable machine learning model to predict compound-protein interaction. The key idea of this scalable machine learning model is the architecture of pairwise neural network model and feature embedding method from the raw data, especially for protein. This method automatically extracts the features without additional knowledge of compound and protein. Also, the pairwise architecture elevate the expressiveness and compact dimension of feature by preventing biased learning from occurring due to the dimension and type of features. Through the 5-fold cross validation results on large scale database show that pairwise neural network improves the performance of predicting compound-protein interaction compared to previous prediction models.

Estimating Simulation Parameters for Kint Fabrics from Static Drapes (정적 드레이프를 이용한 니트 옷감의 시뮬레이션 파라미터 추정)

  • Ju, Eunjung;Choi, Myung Geol
    • Journal of the Korea Computer Graphics Society
    • /
    • v.26 no.5
    • /
    • pp.15-24
    • /
    • 2020
  • We present a supervised learning method that estimates the simulation parameters required to simulate the fabric from the static drape shape of a given fabric sample. The static drape shape was inspired by Cusick's drape, which is used in the apparel industry to classify fabrics according to their mechanical properties. The input vector of the training model consists of the feature vector extracted from the static drape and the density value of a fabric specimen. The output vector consists of six simulation parameters that have a significant influence on deriving the corresponding drape result. To generate a plausible and unbiased training data set, we first collect simulation parameters for 400 knit fabrics and generate a Gaussian Mixed Model (GMM) generation model from them. Next, a large number of simulation parameters are randomly sampled from the GMM model, and cloth simulation is performed for each sampled simulation parameter to create a virtual static drape. The generated training data is fitted with a log-linear regression model. To evaluate our method, we check the accuracy of the training results with a test data set and compare the visual similarity of the simulated drapes.

A Data Analysis and Visualization of AI Ethics -Focusing on the interactive AI service 'Lee Luda'- (인공지능 윤리 인식에 대한 데이터 분석 및 시각화 연구 -대화형 인공지능 서비스 '이루다'를 중심으로-)

  • Lee, Su-Ryeon;Choi, Eun-Jung
    • Journal of Digital Convergence
    • /
    • v.20 no.2
    • /
    • pp.269-275
    • /
    • 2022
  • As artificial intelligence services targeting humans increase, social demands are increasing that artificial intelligence should also be made on an ethical basis. Following this trend, the government and businesses are preparing policies and norms related to artificial intelligence ethics. In order to establish reasonable policies and norms, the first step is to understand the public's perceptions. In this paper, social data and news comments were collected and analyzed to understand the public's perception related to artificial intelligence and ethics. Interest analysis, emotional analysis, and discourse analysis were performed and visualized on the collected datasets. As a result of the analysis, interest in "artificial intelligence ethics" and "artificial intelligence" favorability showed an inversely proportional correlation. As a result of discourse analysis, the biggest issue was "personal information leakage," and it also showed a discourse on contamination and deflection of learning data and whether computer-made artificial intelligence should be given a legal personality. This study can be used as data to grasp the public's perception when preparing artificial intelligence ethical norms and policies.

On variable bandwidth Kernel Regression Estimation (변수평활량을 이용한 커널회귀함수 추정)

  • Seog, Kyung-Ha;Chung, Sung-Suk;Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.2
    • /
    • pp.179-188
    • /
    • 1998
  • Local polynomial regression estimation is the most popular one among kernel type regression estimator. In local polynomial regression function esimation bandwidth selection is crucial problem like the kernel estimation. When the regression curve has complicated structure variable bandwidth selection will be appropriate. In this paper, we propose a variable bandwidth selection method fully data driven. We will choose the bandwdith by selecting minimising estiamted MSE which is estimated by the pilot bandwidth study via croos-validation method. Monte carlo simulation was conducted in order to show the superiority of proposed bandwidth selection method.

  • PDF

A Design and Implementation of Computer-based Test System (컴퓨터기반 시험 시스템 설계 및 구축)

  • Cho Sung-Ho
    • The Journal of the Korea Contents Association
    • /
    • v.5 no.1
    • /
    • pp.1-8
    • /
    • 2005
  • E-learning is the application of e-business technology and services to teaching and learning. It use of new multimedia technologies and Internet to improved the qualify of learning by facilitating access to remote resources and services. In this paper, we show a computer-based test system, which is carefully designed and implemented. The system consists of a contents delivery mechanism, computer-adaptive test algorithm, and review engine. In this papepr, we describe what are points to be considered when design and implementing a computer-based test system. In addition, this paper shows how to control the bias value for computer-adaptive algorithm using real data.

  • PDF

Selection of Vertiport Location, Route Setting and Operating Time Analysis of Urban Air Mobility in Metropolitan Area (수도권 도심항공 모빌리티 수직이착륙장 위치 선정, 경로 설정 및 운행 소요시간 분석)

  • Oh, Jae-Seok;Hwang, Ho-Yon
    • Journal of Advanced Navigation Technology
    • /
    • v.24 no.5
    • /
    • pp.358-367
    • /
    • 2020
  • With the increases of average commuting time of office workers in the Seoul metropolitan area and the cost of traffic congestion on roads, the need for new transportation is increasing and urban air mobility (UAM) is emerging as an alternative. Therefore, in this paper, the vertiport locations were selected and routes were established using population, traffic and commuting data of Seoul and Gyeonggi Province. Vector thrust type and multicopter type of eVTOL compatible for UAM were selected by analyzing the types of eVTOLand time required for selected routes was calculated. In addition, the time required when we utilize other transportations was compared with UAM. Finally, it was verified that the commuting time can be sharply reduced when we use UAM.