• Title/Summary/Keyword: statistical learning

Search Result 1,324, Processing Time 0.028 seconds

Comparative study of data augmentation methods for fake audio detection (음성위조 탐지에 있어서 데이터 증강 기법의 성능에 관한 비교 연구)

  • KwanYeol Park;Il-Youp Kwak
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.2
    • /
    • pp.101-114
    • /
    • 2023
  • The data augmentation technique is effectively used to solve the problem of overfitting the model by allowing the training dataset to be viewed from various perspectives. In addition to image augmentation techniques such as rotation, cropping, horizontal flip, and vertical flip, occlusion-based data augmentation methods such as Cutmix and Cutout have been proposed. For models based on speech data, it is possible to use an occlusion-based data-based augmentation technique after converting a 1D speech signal into a 2D spectrogram. In particular, SpecAugment is an occlusion-based augmentation technique for speech spectrograms. In this study, we intend to compare and study data augmentation techniques that can be used in the problem of false-voice detection. Using data from the ASVspoof2017 and ASVspoof2019 competitions held to detect fake audio, a dataset applied with Cutout, Cutmix, and SpecAugment, an occlusion-based data augmentation method, was trained through an LCNN model. All three augmentation techniques, Cutout, Cutmix, and SpecAugment, generally improved the performance of the model. In ASVspoof2017, Cutmix, in ASVspoof2019 LA, Mixup, and in ASVspoof2019 PA, SpecAugment showed the best performance. In addition, increasing the number of masks for SpecAugment helps to improve performance. In conclusion, it is understood that the appropriate augmentation technique differs depending on the situation and data.

Development of a Water Quality Indicator Prediction Model for the Korean Peninsula Seas using Artificial Intelligence (인공지능 기법을 활용한 한반도 해역의 수질평가지수 예측모델 개발)

  • Seong-Su Kim;Kyuhee Son;Doyoun Kim;Jang-Mu Heo;Seongeun Kim
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.29 no.1
    • /
    • pp.24-35
    • /
    • 2023
  • Rapid industrialization and urbanization have led to severe marine pollution. A Water Quality Index (WQI) has been developed to allow the effective management of marine pollution. However, the WQI suffers from problems with loss of information due to the complex calculations involved, changes in standards, calculation errors by practitioners, and statistical errors. Consequently, research on the use of artificial intelligence techniques to predict the marine and coastal WQI is being conducted both locally and internationally. In this study, six techniques (RF, XGBoost, KNN, Ext, SVM, and LR) were studied using marine environmental measurement data (2000-2020) to determine the most appropriate artificial intelligence technique to estimate the WOI of five ecoregions in the Korean seas. Our results show that the random forest method offers the best performance as compared to the other methods studied. The residual analysis of the WQI predicted score and actual score using the random forest method shows that the temporal and spatial prediction performance was exceptional for all ecoregions. In conclusion, the RF model of WQI prediction developed in this study is considered to be applicable to Korean seas with high accuracy.

Study of Smart Integration processing Systems for Sensor Data (센서 데이터를 위한 스마트 통합 처리 시스템 연구)

  • Ji, Hyo-Sang;Kim, Jae-Sung;Kim, Ri-Won;Kim, Jeong-Joon;Han, Ik-Joo;Park, Jeong-Min
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.7 no.8
    • /
    • pp.327-342
    • /
    • 2017
  • In this paper, we introduce an integrated processing system of smart sensor data for IoT service which collects sensor data and efficiently processes it. Based on the technology of collecting sensor data to the development of the IoT field and sending it to the network · Based on the receiving technology, as various projects such as smart homes, autonomous running vehicles progress, the sensor data is processed and effectively An autonomous control system to utilize has been a problem. However, since the data type of the sensor for monitoring the autonomous control system varies according to the domain, a sensor data integration processing system applying the autonomous control system to various different domains is necessary. Therefore, in this paper, we introduce the Smart Sensor Data Integrated Processing System, apply it and use the window as a reference to process internal and external sensor data 1) receiveData, 2) parseData, 3) addToDatabase 3 With the process of the stage, we provide and implement the automatic window opening / closing system "Smart Window" which ventilates to create a comfortable indoor environment by autonomous control system. As a result, standby information is collected and monitored, and machine learning for performing statistical analysis and better autonomous control based on the stored data is made possible.

Age classification of emergency callers based on behavioral speech utterance characteristics (발화행태 특징을 활용한 응급상황 신고자 연령분류)

  • Son, Guiyoung;Kwon, Soonil;Baik, Sungwook
    • The Journal of Korean Institute of Next Generation Computing
    • /
    • v.13 no.6
    • /
    • pp.96-105
    • /
    • 2017
  • In this paper, we investigated the age classification from the speaker by analyzing the voice calls of the emergency center. We classified the adult and elderly from the call center calls using behavioral speech utterances and SVM(Support Vector Machine) which is a machine learning classifier. We selected two behavioral speech utterances through analysis of the call data from the emergency center: Silent Pause and Turn-taking latency. First, the criteria for age classification selected through analysis based on the behavioral speech utterances of the emergency call center and then it was significant(p <0.05) through statistical analysis. We analyzed 200 datasets (adult: 100, elderly: 100) by the 5 fold cross-validation using the SVM(Support Vector Machine) classifier. As a result, we achieved 70% accuracy using two behavioral speech utterances. It is higher accuracy than one behavioral speech utterance. These results can be suggested age classification as a new method which is used behavioral speech utterances and will be classified by combining acoustic information(MFCC) with new behavioral speech utterances of the real voice data in the further work. Furthermore, it will contribute to the development of the emergency situation judgment system related to the age classification.

Effects of Personal Protective Equipment Practice Education on the Effectiveness of Repeated Learning and Satisfaction (개인보호구 실습교육의 반복학습 효과와 만족도에 미치는 영향)

  • Dae Jin Jo;Won Souk Eoh
    • Journal of Korean Society of Occupational and Environmental Hygiene
    • /
    • v.33 no.2
    • /
    • pp.156-170
    • /
    • 2023
  • Objectives: This study conducted practical training to improve the proper usage of personal protective equipment(PPE), which greatly impacts workplace safety and health management. Personal protective equipment education was conducted through active participation, without theoretical modules, and aimed to identify the effects of repeated practical education and determine ways to increase participant satisfaction. Methods: Study data were analyzed using the IBM SPSS Statistics ver.29 software. First, participants' general characteristics were analyzed with frequency analysis. Second, the normality and equality of variances (Leven's test) were tested for the dependent variables prior to statistical analyses to determine the use of parametric tests. In general, normality is assumed when the sample size is 30 or more per the central limit theorem (Park et al., 2014). As our sample size of health management workers was 43, normality can be assumed. However, to ensure rigor of the study, we examined skewness and kurtosis. The results confirmed that the data were normally distributed. Third, the effects of repeated PPE training were analyzed using paired t-tests. Fourth, differences in satisfaction with PPE training according to the safety and health job position and safety and health certification were analyzed with t-test and Welch's t-test. For parameters that did not meet the assumption of equal variances, the Welch's t-test was performed. Results: Repeated PPE training improved the educational outcomes, and the improvements were significant in the 1st and 2nd respiratory PPE and safety and hygiene PPE training evaluations (p<.001). In terms of safety and health job position, repeated training led to improvements in educational outcomes, with significant improvements observed among supervisors and specialized health management institution workers in the 1st and 2nd training evaluations (p<.005). In terms of safety certification, repeated training led to improvements in educational outcomes, with significant improvements observed among both certified and non-certified individuals (p<.005). Regarding satisfaction with PPE training according to safety and health job positions, specialized health management institution workers showed greater satisfaction than supervisors, with significant differences in the satisfaction for expertise of lecture, work relevance, and lecturer's attitude (p<.001). Regarding satisfaction with PPE training according to safety and health certification, satisfaction was higher among certified individuals, with significant differences in satisfaction for work relevance and lecture attitude (p<.05) Conclusions: PPE education should be recommended to be provided as practical training. Repeated training can enhance educational outcomes for individuals with inadequate knowledge and understanding of PPE prior to education. For individuals with high levels of pre-existing knowledge and understanding of PPE, the results show that various training experiences should be provided to enhance their satisfaction. Therefore, it suggests that the workplace should actively seek educational media and methods to acquire expertise and skills in wearing personal protective equipment and improve the ability to use

The Effect of Engineering Design Based Ocean Clean Up Lesson on STEAM Attitude and Creative Engineering Problem Solving Propensity (공학설계기반 오션클린업(Ocean Clean-up) 수업이 STEAM태도와 창의공학적 문제해결성향에 미치는 효과)

  • DongYoung Lee;Hyojin Yi;Younkyeong Nam
    • Journal of the Korean earth science society
    • /
    • v.44 no.1
    • /
    • pp.79-89
    • /
    • 2023
  • The purpose of this study was to investigate the effects of engineering design-based ocean cleanup classes on STEAM attitudes and creative engineering problem-solving dispositions. Furthermore, during this process, we tried to determine interesting points that students encountered in engineering design-based classes. For this study, a science class with six lessons based on engineering design was developed and reviewed by a professor who majored in engineering design, along with five engineering design experts with a master's degree or higher. The subject of the class was selected as the design and implementation of scientific and engineering measures to reduce marine pollution based on the method implemented in an actual Ocean Clean-up Project. The engineering design process utilized the engineering design model presented by NGSS (2013), and was configured to experience redesign through the optimization process. To verify effectiveness, the STEAM attitude questionnaire developed by Park et al. (2019) and the creative engineering problemsolving propensity test tool developed by Kang and Nam (2016) were used. A pre and post t-test was used for statistical analysis for the effectiveness test. In addition, the contents of interesting points experienced by the learners were transcribed after receiving descriptive responses, and were analyzed and visualized through degree centrality analysis. Results confirmed that engineering design in science classes had a positive effect on both STEAM attitude and creative engineering problem-solving disposition (p< .05). In addition, as a result of unstructured data analysis, science and engineering knowledge, engineering experience, and cooperation and collaboration appeared as factors in which learners were interested in learning, confirming that engineering experience was the main factor.

Implementation of reliable dynamic honeypot file creation system for ransomware attack detection (랜섬웨어 공격탐지를 위한 신뢰성 있는 동적 허니팟 파일 생성 시스템 구현)

  • Kyoung Wan Kug;Yeon Seung Ryu;Sam Beom Shin
    • Convergence Security Journal
    • /
    • v.23 no.2
    • /
    • pp.27-36
    • /
    • 2023
  • In recent years, ransomware attacks have become more organized and specialized, with the sophistication of attacks targeting specific individuals or organizations using tactics such as social engineering, spear phishing, and even machine learning, some operating as business models. In order to effectively respond to this, various researches and solutions are being developed and operated to detect and prevent attacks before they cause serious damage. In particular, honeypots can be used to minimize the risk of attack on IT systems and networks, as well as act as an early warning and advanced security monitoring tool, but in cases where ransomware does not have priority access to the decoy file, or bypasses it completely. has a disadvantage that effective ransomware response is limited. In this paper, this honeypot is optimized for the user environment to create a reliable real-time dynamic honeypot file, minimizing the possibility of an attacker bypassing the honeypot, and increasing the detection rate by preventing the attacker from recognizing that it is a honeypot file. To this end, four models, including a basic data collection model for dynamic honeypot generation, were designed (basic data collection model / user-defined model / sample statistical model / experience accumulation model), and their validity was verified.

Mediating Effect of Professional Identity on the Relationship between Job- and Organization- related Factors and Job Satisfaction among Social Workers in Senior Welfare Facilities (노인생활시설 사회복지사들의 직무 및 조직특성과 직무만족도의 관계에서 전문직업적 정체성의 매개효과)

  • Cha, Myeong Jin;Je, Seok Bong
    • 한국노년학
    • /
    • v.29 no.2
    • /
    • pp.669-682
    • /
    • 2009
  • The purpose of this study was to explore the role of professional identity as mediating variable in the relationship between job- and organization- related factors and job satisfaction. This study surveyed social workers who worked at 24 senior welfare facilities in Daegu·Gyeoungbuk province from Aug. 1. to Aug. 30. 2006. A total of 137 questionnaires were collected using on-site survey (response rate 76.7%). Statistical analyses were performed using SPSS 12.0 for Windows. Descriptive analysis and frequency analysis were performed on overall measurement items and hierarchical regression analysis was conducted to test the mediating effect of professional identity. The reliability of statements was acceptable since the coefficient alphas were > .70. Results of hierarchical regression showed that professional identity was verified as a partial mediator in the relationship between factors related with job and organization and job satisfaction. As the population ages, there will be an increasing need for professional social workers effectively to work with and help care for the elderly. This study highlighted that job- and organization- related factors, namely self-regulations and social supports, are significantly related with job satisfaction of social workers. Especially, such effect was more significantly apparent in high professional identity which is playing a partial mediator. This result implies that there is potential to change work environments of social workers ensuring a delegation of power and responsibility. Therefore, efforts should be made to improve the promotion system and connect social worker as servant with community through diverse service learning programs.

CNN Model for Prediction of Tensile Strength based on Pore Distribution Characteristics in Cement Paste (시멘트풀의 공극분포특성에 기반한 인장강도 예측 CNN 모델)

  • Sung-Wook Hong;Tong-Seok Han
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.36 no.5
    • /
    • pp.339-346
    • /
    • 2023
  • The uncertainties of microstructural features affect the properties of materials. Numerous pores that are randomly distributed in materials make it difficult to predict the properties of the materials. The distribution of pores in cementitious materials has a great influence on their mechanical properties. Existing studies focus on analyzing the statistical relationship between pore distribution and material responses, and the correlation between them is not yet fully determined. In this study, the mechanical response of cementitious materials is predicted through an image-based data approach using a convolutional neural network (CNN), and the correlation between pore distribution and material response is analyzed. The dataset for machine learning consists of high-resolution micro-CT images and the properties (tensile strength) of cementitious materials. The microstructures are characterized, and the mechanical properties are evaluated through 2D direct tension simulations using the phase-field fracture model. The attributes of input images are analyzed to identify the spot with the greatest influence on the prediction of material response through CNN. The correlation between pore distribution characteristics and material response is analyzed by comparing the active regions during the CNN process and the pore distribution.

A Groundwater Potential Map for the Nakdonggang River Basin (낙동강권역의 지하수 산출 유망도 평가)

  • Soonyoung Yu;Jaehoon Jung;Jize Piao;Hee Sun Moon;Heejun Suk;Yongcheol Kim;Dong-Chan Koh;Kyung-Seok Ko;Hyoung-Chan Kim;Sang-Ho Moon;Jehyun Shin;Byoung Ohan Shim;Hanna Choi;Kyoochul Ha
    • Journal of Soil and Groundwater Environment
    • /
    • v.28 no.6
    • /
    • pp.71-89
    • /
    • 2023
  • A groundwater potential map (GPM) was built for the Nakdonggang River Basin based on ten variables, including hydrogeologic unit, fault-line density, depth to groundwater, distance to surface water, lineament density, slope, stream drainage density, soil drainage, land cover, and annual rainfall. To integrate the thematic layers for GPM, the criteria were first weighted using the Analytic Hierarchical Process (AHP) and then overlaid using the Technique for Ordering Preferences by Similarity to Ideal Solution (TOPSIS) model. Finally, the groundwater potential was categorized into five classes (very high (VH), high (H), moderate (M), low (L), very low (VL)) and verified by examining the specific capacity of individual wells on each class. The wells in the area categorized as VH showed the highest median specific capacity (5.2 m3/day/m), while the wells with specific capacity < 1.39 m3/day/m were distributed in the areas categorized as L or VL. The accuracy of GPM generated in the work looked acceptable, although the specific capacity data were not enough to verify GPM in the studied large watershed. To create GPMs for the determination of high-yield well locations, the resolution and reliability of thematic maps should be improved. Criterion values for groundwater potential should be established when machine learning or statistical models are used in the GPM evaluation process.