• Title/Summary/Keyword: Linear predictive model

Search Result 288, Processing Time 0.024 seconds

Classification of Whale Sounds using LPC and Neural Networks (신경망과 LPC 계수를 이용한 고래 소리의 분류)

  • An, Woo-Jin;Lee, Eung-Jae;Kim, Nam-Gyu;Chong, Ui-Pil
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.18 no.2
    • /
    • pp.43-48
    • /
    • 2017
  • The underwater transients signals contain the characteristics of complexity, time varying, nonlinear, and short duration. So it is very hard to model for these signals with reference patterns. In this paper we separate the whole length of signals into some short duration of constant length with overlapping frame by frame. The 20th LPC(Linear Predictive Coding) coefficients are extracted from the original signals using Durbin algorithm and applied to neural network. The 65% of whole signals were learned and 35% of the signals were tested in the neural network with two hidden layers. The types of the whales for sound classification are Blue whale, Dulsae whale, Gray whale, Humpback whale, Minke whale, and Northern Right whale. Finally, we could obtain more than 83% of classification rate from the test signals.

  • PDF

Prognostic Role of Circulating Tumor Cells in the Pulmonary Vein, Peripheral Blood, and Bone Marrow in Resectable Non-Small Cell Lung Cancer

  • Lee, Jeong Moon;Jung, Woohyun;Yum, Sungwon;Lee, Jeong Hoon;Cho, Sukki
    • Journal of Chest Surgery
    • /
    • v.55 no.3
    • /
    • pp.214-224
    • /
    • 2022
  • Background: Studies of the prognostic role of circulating tumor cells (CTCs) in early-stage non-small cell lung cancer (NSCLC) are still limited. This study investigated the prognostic power of CTCs from the pulmonary vein (PV), peripheral blood (PB), and bone marrow (BM) for postoperative recurrence in patients who underwent curative resection for NSCLC. Methods: Forty patients who underwent curative resection for NSCLC were enrolled. Before resection, 10-mL samples were obtained of PB from the radial artery, blood from the PV of the lobe containing the tumor, and BM aspirates from the rib. A microfabricated filter was used for CTC enrichment, and immunofluorescence staining was used to identify CTCs. Results: The pathologic stage was stage I in 8 patients (20%), II in 15 (38%), III in 14 (35%), and IV in 3 (8%). The median number of PB-, PV-, and BM-CTCs was 4, 4, and 5, respectively. A time-dependent receiver operating characteristic curve analysis showed that PB-CTCs had excellent predictive value for recurrence-free survival (RFS), with the highest area under the curve at each time point (first, second, and third quartiles of RFS). In a multivariate Cox proportional hazard regression model, PB-CTCs were an independent risk factor for recurrence (hazard ratio, 10.580; 95% confidence interval, 1.637-68.388; p<0.013). Conclusion: The presence of ≥4 PB-CTCs was an independent poor prognostic factor for RFS, and PV-CTCs and PB-CTCs had a positive linear correlation in patients with recurrence.

Development and Validation of an Integrated Healthy Workplace Management Model in Taiwan

  • Fu-Li Chen;Peter Y. Chen;Chi-Chen Chen;Tao-Hsin Tung
    • Safety and Health at Work
    • /
    • v.13 no.4
    • /
    • pp.394-400
    • /
    • 2022
  • Background: Impacts of exposure are generally monitored and recorded after injuries or illness occur. Yet, absence of conventional after-the-effect impacts (i.e., lagging indicators), tend to focus on physical health and injuries, and fail to inform if workers are not exposed to safety and health hazards. In contrast to lagging indicators, leading indicators are proactive, preventive, and predictive indexes that offer insights how effective safety and health. The present study is to validate an extended Voluntary Protection Programs (VPP) that consists of six leading indicators. Methods: Questionnaires were distributed to 13 organizations (response rate = 93.1%, 1,439 responses) in Taiwan. Cronbach α, multiple linear regression and canonical correlation were used to test the reliability of the extended Voluntary Protection Programs (VPP) which consists of six leading indicators (safe climate, transformational leadership, organizational justice, organizational support, hazard prevention and control, and training). Criteria-related validation strategy was applied to examine relationships of six leading indicators with six criteria (perceived health, burnout, depression, job satisfaction, job performance, and life satisfaction). Results: The results showed that the Cronbach's α of six leading indicators ranged from 0.87 to 0.92. The canonical correlation analysis indicated a positive correlation between the six leading indicators and criteria (1st canonical function: correlation = 0.647, square correlation = 0.419, p < 0.001). Conclusions: The present study validates the extended VPP framework that focuses on promoting safety and physical and mental health. Results further provides applications of the extended VPP framework to promote workers' safety and health.

Comparison of Solar Power Generation Forecasting Performance in Daejeon and Busan Based on Preprocessing Methods and Artificial Intelligence Techniques: Using Meteorological Observation and Forecast Data (전처리 방법과 인공지능 모델 차이에 따른 대전과 부산의 태양광 발전량 예측성능 비교: 기상관측자료와 예보자료를 이용하여)

  • Chae-Yeon Shim;Gyeong-Min Baek;Hyun-Su Park;Jong-Yeon Park
    • Atmosphere
    • /
    • v.34 no.2
    • /
    • pp.177-185
    • /
    • 2024
  • As increasing global interest in renewable energy due to the ongoing climate crisis, there is a growing need for efficient technologies to manage such resources. This study focuses on the predictive skill of daily solar power generation using weather observation and forecast data. Meteorological data from the Korea Meteorological Administration and solar power generation data from the Korea Power Exchange were utilized for the period from January 2017 to May 2023, considering both inland (Daejeon) and coastal (Busan) regions. Temperature, wind speed, relative humidity, and precipitation were selected as relevant meteorological variables for solar power prediction. All data was preprocessed by removing their systematic components to use only their residuals and the residual of solar data were further processed with weighted adjustments for homoscedasticity. Four models, MLR (Multiple Linear Regression), RF (Random Forest), DNN (Deep Neural Network), and RNN (Recurrent Neural Network), were employed for solar power prediction and their performances were evaluated based on predicted values utilizing observed meteorological data (used as a reference), 1-day-ahead forecast data (referred to as fore1), and 2-day-ahead forecast data (fore2). DNN-based prediction model exhibits superior performance in both regions, with RNN performing the least effectively. However, MLR and RF demonstrate competitive performance comparable to DNN. The disparities in the performance of the four different models are less pronounced than anticipated, underscoring the pivotal role of fitting models using residuals. This emphasizes that the utilized preprocessing approach, specifically leveraging residuals, is poised to play a crucial role in the future of solar power generation forecasting.

Design of Data-centroid Radial Basis Function Neural Network with Extended Polynomial Type and Its Optimization (데이터 중심 다항식 확장형 RBF 신경회로망의 설계 및 최적화)

  • Oh, Sung-Kwun;Kim, Young-Hoon;Park, Ho-Sung;Kim, Jeong-Tae
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.60 no.3
    • /
    • pp.639-647
    • /
    • 2011
  • In this paper, we introduce a design methodology of data-centroid Radial Basis Function neural networks with extended polynomial function. The two underlying design mechanisms of such networks involve K-means clustering method and Particle Swarm Optimization(PSO). The proposed algorithm is based on K-means clustering method for efficient processing of data and the optimization of model was carried out using PSO. In this paper, as the connection weight of RBF neural networks, we are able to use four types of polynomials such as simplified, linear, quadratic, and modified quadratic. Using K-means clustering, the center values of Gaussian function as activation function are selected. And the PSO-based RBF neural networks results in a structurally optimized structure and comes with a higher level of flexibility than the one encountered in the conventional RBF neural networks. The PSO-based design procedure being applied at each node of RBF neural networks leads to the selection of preferred parameters with specific local characteristics (such as the number of input variables, a specific set of input variables, and the distribution constant value in activation function) available within the RBF neural networks. To evaluate the performance of the proposed data-centroid RBF neural network with extended polynomial function, the model is experimented with using the nonlinear process data(2-Dimensional synthetic data and Mackey-Glass time series process data) and the Machine Learning dataset(NOx emission process data in gas turbine plant, Automobile Miles per Gallon(MPG) data, and Boston housing data). For the characteristic analysis of the given entire dataset with non-linearity as well as the efficient construction and evaluation of the dynamic network model, the partition of the given entire dataset distinguishes between two cases of Division I(training dataset and testing dataset) and Division II(training dataset, validation dataset, and testing dataset). A comparative analysis shows that the proposed RBF neural networks produces model with higher accuracy as well as more superb predictive capability than other intelligent models presented previously.

Implementing an Adaptive Neuro-Fuzzy Model for Emotion Prediction Based on Heart Rate Variability(HRV) (심박변이도를 이용한 적응적 뉴로 퍼지 감정예측 모형에 관한 연구)

  • Park, Sung Soo;Lee, Kun Chang
    • Journal of Digital Convergence
    • /
    • v.17 no.1
    • /
    • pp.239-247
    • /
    • 2019
  • An accurate prediction of emotion is a very important issue for the sake of patient-centered medical device development and emotion-related psychology fields. Although there have been many studies on emotion prediction, no studies have applied the heart rate variability and neuro-fuzzy approach to emotion prediction. We propose ANFEP(Adaptive Neuro Fuzzy System for Emotion Prediction) HRV. The ANFEP bases its core functions on an ANFIS(Adaptive Neuro-Fuzzy Inference System) which integrates neural networks with fuzzy systems as a vehicle for training predictive models. To prove the proposed model, 50 participants were invited to join the experiment and Heart rate variability was obtained and used to input the ANFEP model. The ANFEP model with STDRR and RMSSD as inputs and two membership functions per input variable showed the best results. The result out of applying the ANFEP to the HRV metrics proved to be significantly robust when compared with benchmarking methods like linear regression, support vector regression, neural network, and random forest. The results show that reliable prediction of emotion is possible with less input and it is necessary to develop a more accurate and reliable emotion recognition system.

A Study of Anomaly Detection for ICT Infrastructure using Conditional Multimodal Autoencoder (ICT 인프라 이상탐지를 위한 조건부 멀티모달 오토인코더에 관한 연구)

  • Shin, Byungjin;Lee, Jonghoon;Han, Sangjin;Park, Choong-Shik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.57-73
    • /
    • 2021
  • Maintenance and prevention of failure through anomaly detection of ICT infrastructure is becoming important. System monitoring data is multidimensional time series data. When we deal with multidimensional time series data, we have difficulty in considering both characteristics of multidimensional data and characteristics of time series data. When dealing with multidimensional data, correlation between variables should be considered. Existing methods such as probability and linear base, distance base, etc. are degraded due to limitations called the curse of dimensions. In addition, time series data is preprocessed by applying sliding window technique and time series decomposition for self-correlation analysis. These techniques are the cause of increasing the dimension of data, so it is necessary to supplement them. The anomaly detection field is an old research field, and statistical methods and regression analysis were used in the early days. Currently, there are active studies to apply machine learning and artificial neural network technology to this field. Statistically based methods are difficult to apply when data is non-homogeneous, and do not detect local outliers well. The regression analysis method compares the predictive value and the actual value after learning the regression formula based on the parametric statistics and it detects abnormality. Anomaly detection using regression analysis has the disadvantage that the performance is lowered when the model is not solid and the noise or outliers of the data are included. There is a restriction that learning data with noise or outliers should be used. The autoencoder using artificial neural networks is learned to output as similar as possible to input data. It has many advantages compared to existing probability and linear model, cluster analysis, and map learning. It can be applied to data that does not satisfy probability distribution or linear assumption. In addition, it is possible to learn non-mapping without label data for teaching. However, there is a limitation of local outlier identification of multidimensional data in anomaly detection, and there is a problem that the dimension of data is greatly increased due to the characteristics of time series data. In this study, we propose a CMAE (Conditional Multimodal Autoencoder) that enhances the performance of anomaly detection by considering local outliers and time series characteristics. First, we applied Multimodal Autoencoder (MAE) to improve the limitations of local outlier identification of multidimensional data. Multimodals are commonly used to learn different types of inputs, such as voice and image. The different modal shares the bottleneck effect of Autoencoder and it learns correlation. In addition, CAE (Conditional Autoencoder) was used to learn the characteristics of time series data effectively without increasing the dimension of data. In general, conditional input mainly uses category variables, but in this study, time was used as a condition to learn periodicity. The CMAE model proposed in this paper was verified by comparing with the Unimodal Autoencoder (UAE) and Multi-modal Autoencoder (MAE). The restoration performance of Autoencoder for 41 variables was confirmed in the proposed model and the comparison model. The restoration performance is different by variables, and the restoration is normally well operated because the loss value is small for Memory, Disk, and Network modals in all three Autoencoder models. The process modal did not show a significant difference in all three models, and the CPU modal showed excellent performance in CMAE. ROC curve was prepared for the evaluation of anomaly detection performance in the proposed model and the comparison model, and AUC, accuracy, precision, recall, and F1-score were compared. In all indicators, the performance was shown in the order of CMAE, MAE, and AE. Especially, the reproduction rate was 0.9828 for CMAE, which can be confirmed to detect almost most of the abnormalities. The accuracy of the model was also improved and 87.12%, and the F1-score was 0.8883, which is considered to be suitable for anomaly detection. In practical aspect, the proposed model has an additional advantage in addition to performance improvement. The use of techniques such as time series decomposition and sliding windows has the disadvantage of managing unnecessary procedures; and their dimensional increase can cause a decrease in the computational speed in inference.The proposed model has characteristics that are easy to apply to practical tasks such as inference speed and model management.

Laryngeal Cancer Screening using Cepstral Parameters (켑스트럼 파라미터를 이용한 후두암 검진)

  • 이원범;전경명;권순복;전계록;김수미;김형순;양병곤;조철우;왕수건
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.14 no.2
    • /
    • pp.110-116
    • /
    • 2003
  • Background and Objectives : Laryngeal cancer discrimination using voice signals is a non-invasive method that can carry out the examination rapidly and simply without giving discomfort to the patients. n appropriate analysis parameters and classifiers are developed, this method can be used effectively in various applications including telemedicine. This study examines voice analysis parameters used for laryngeal disease discrimination to help discriminate laryngeal diseases by voice signal analysis. The study also estimates the laryngeal cancer discrimination activity of the Gaussian mixture model (GMM) classifier based on the statistical modelling of voice analysis parameters. Materials and Methods : The Multi-dimensional voice program (MDVP) parameters, which have been widely used for the analysis of laryngeal cancer voice, sometimes fail to analyze the voice of a laryngeal cancer patient whose cycle is seriously damaged. Accordingly, it is necessary to develop a new method that enables an analysis of high reliability for the voice signals that cannot be analyzed by the MDVP. To conduct the experiments of laryngeal cancer discrimination, the authors used three types of voices collected at the Department of Otorhinorlaryngology, Pusan National University Hospital. 50 normal males voice data, 50 voices of males with benign laryngeal diseases and 105 voices of males laryngeal cancer. In addition, the experiment also included 11 voices data of males with laryngeal cancer that cannot be analyzed by the MDVP, Only monosyllabic vowel /a/ was used as voice data. Since there were only 11 voices of laryngeal cancer patients that cannot be analyzed by the MDVP, those voices were used only for discrimination. This study examined the linear predictive cepstral coefficients (LPCC) and the met-frequency cepstral coefficients (MFCC) that are the two major cepstrum analysis methods in the area of acoustic recognition. Results : The results showed that this met frequency scaling process was effective in acoustic recognition but not useful for laryngeal cancer discrimination. Accordingly, the linear frequency cepstral coefficients (LFCC) that excluded the met frequency scaling from the MFCC was introduced. The LFCC showed more excellent discrimination activity rather than the MFCC in predictability of laryngeal cancer. Conclusion : In conclusion, the parameters applied in this study could discriminate accurately even the terminal laryngeal cancer whose periodicity is disturbed. Also it is thought that future studies on various classification algorithms and parameters representing pathophysiology of vocal cords will make it possible to discriminate benign laryngeal diseases as well, in addition to laryngeal cancer.

  • PDF

A study on the factors to affect the career success among workers with disabilities (지체장애근로자의 직업성공 요인에 관한 연구)

  • Lee, Dal-Yob
    • 한국사회복지학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.185-216
    • /
    • 2003
  • This study was aimed at investigating important factors influencing career success among regular workers. The current researcher scrutinized the degree to which variables and factors affect the career success and occupational turnover rates of the research participants. At the same tune, two hypothetical path models established by the researcher were examined using linear multiple regression methods and the LISREL. After examining the differences among the factors of career success, a comparison was made between the disabled worker group and the non-disabled worker group. A questionnaire using the 5-point Likert scale was distributed to a group of 374 workers with disabilities and 463 workers without disabilities. For the data analysis purpose, the structural equation model, factor analysis, correlation analysis, and multiple regression analysis were carried out. The results of this study ran be summarized as follows. First, the results of factor analysis showed important categories of conceptual themes of career success. The initial conceptual factor model did not accord with the empirical one. A three-factorial model revealed categories of personal, family, and organizational factor respectively. The personal factor was composed of the self-esteem and self-efficiency. The family factor was consisted of the multi-roles stress and the number of children. Finally, the organizational factor was composed of the capacity for utilizing resources, networking, and the frequency of mentoring. In addition, the total 10 sub areas of career success were divided by two important aspects; the subjective career success and the objective career success. Second, both research participant groups seemed to be influenced by their occupational types. However, all predictive variables excluding the wage rate and the average length of work years had significant impact on job success for the disabled work group, while all the variables excluding the frequency of advice and length of working years had significant impact on job success for the non-disabled worker group. Third, the turnover rate was significantly influenced by the age and the experience of turnover of the research participants. However, the number of co-workers was the strongest predictive variable for the worker group with disabilities, but the occupation choice variable for the worker group without disabilities. For the disabled worker group, the turnover rate was differently influenced by the type of occupation, the length of working years, while multi-role stress and the average working years at the time of turnover for the worker group without disabilities. Fifth, as a result of verifying the hypothetical path model, it showed that the first model was somewhat proper and could predict the career success on both research participant groups. In the second model, the Chi-square, the degree of freedom (($x^2=64.950$, df=61, P=0.341), and the adjusted Goodness of Fit Index (AGFI) were .964, and the Comparative Fit Index (CFI) were .997, and the Root Mean Squared Residual (RMR) was respectively. .038. The model was best fitted and could predict the career success more highly because the goodness of fit index in the whole models was within the allowed range. In conclusion, the following research implications can be suggested. First, the occupational type of research participants was one of the most important variables to predict the career success for both research participant groups. It means that people with disabilities require human development services including education. They need to improve themselves in this knowledge-based society. Furthermore, for maintaining the career success, people with disabilities should be approached by considering the subjective career success aspects including wages and the promotion opportunities than the objective career success aspects.

  • PDF

A Comparative Study on the Methods for Weighting the Dimensions of Customer Satisfaction with Importance Perceived by Customers (고객만족도 조사도구의 차원별 가중치 부여방법 비교)

  • Kang, Myunggeun;Cho, Woohyun;Lee, Sunhee;Choi, Kuison;Mooon, Kitae
    • Quality Improvement in Health Care
    • /
    • v.7 no.2
    • /
    • pp.230-242
    • /
    • 2000
  • Background : The measuring instruments for customer satisfaction in hospitals are often composed of some dimensions reflecting the conceptive complexity of them. Then, overall satisfaction would be expected to be equal the 'weighted' sum of scores by dimensions because the importance rated by customers may be different across the dimensions. But the issue of how to weight the dimensions with importance is not yet solved. We examined 3 sets of weighting methods as to make effect on predictive power against overall satisfaction. Methods : We conducted a survey included 483 subjects who had visited or admitted to a university hospital, using the short form questionnaire being developed by The Korean Society of Quality Assurance in Health Care for out-patient and in-patient. By using a multiple linear regression model, we compared among changes of explanatory powers against overall satisfaction as dependent variable after weighting 4 dimensions of the survey questionnaire as independent variables with importance scores of dimensions perceived by consumers. And we compared the feasibility of each weighting, methods by checking missing cases. Results : There were no weighting methods increasing the explanatory power after applying them. The method of absolute scoring was found higher explanatory-power than others, but this finding had no statistical significance. Regarding the number of missing value, method of absolutely scoring had the least cases. Conclusion : Our findings suggested that weighting the dimensions with importance might have little significance in the cases of scales having items highly correlated, such as consumers' satisfaction. Though asking with items to be answered absolutely, customers might be rating relatively in some degree and this method produced least missing cases. Considering these points, in the cases when weighting the dimensions with importance would be required, we suggest that weighting method by absolute scoring might be better than others.

  • PDF