• Title/Summary/Keyword: Statistical Relational Learning

Search Result 6, Processing Time 0.021 seconds

First-Order Logic Generation and Weight Learning Method in Markov Logic Network Using Association Analysis (연관분석을 이용한 마코프 논리네트워크의 1차 논리 공식 생성과 가중치 학습방법)

  • Ahn, Gil-Seung;Hur, Sun
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.38 no.1
    • /
    • pp.74-82
    • /
    • 2015
  • Two key challenges in statistical relational learning are uncertainty and complexity. Standard frameworks for handling uncertainty are probability and first-order logic respectively. A Markov logic network (MLN) is a first-order knowledge base with weights attached to each formula and is suitable for classification of dataset which have variables correlated with each other. But we need domain knowledge to construct first-order logics and a computational complexity problem arises when calculating weights of first-order logics. To overcome these problems we suggest a method to generate first-order logics and learn weights using association analysis in this study.

Area-wise relational knowledge distillation

  • Sungchul Cho;Sangje Park;Changwon Lim
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.5
    • /
    • pp.501-516
    • /
    • 2023
  • Knowledge distillation (KD) refers to extracting knowledge from a large and complex model (teacher) and transferring it to a relatively small model (student). This can be done by training the teacher model to obtain the activation function values of the hidden or the output layers and then retraining the student model using the same training data with the obtained values. Recently, relational KD (RKD) has been proposed to extract knowledge about relative differences in training data. This method improved the performance of the student model compared to conventional KDs. In this paper, we propose a new method for RKD by introducing a new loss function for RKD. The proposed loss function is defined using the area difference between the teacher model and the student model in a specific hidden layer, and it is shown that the model can be successfully compressed, and the generalization performance of the model can be improved. We demonstrate that the accuracy of the model applying the method proposed in the study of model compression of audio data is up to 1.8% higher than that of the existing method. For the study of model generalization, we demonstrate that the model has up to 0.5% better performance in accuracy when introducing the RKD method to self-KD using image data.

Middle School Students' Analogical Transfer in Algebra Word Problem Solving (중학생을 대상으로 한 대수 문장제 해결에서의 유추적 전이)

  • 이종희;김진화;김선희
    • The Mathematical Education
    • /
    • v.42 no.3
    • /
    • pp.353-368
    • /
    • 2003
  • Analogy, based on a similarity, is to infer the properties of the similar object from properties of an object. It can be a very useful thinking tool for learning mathematical patterns and laws, noticing on relational properties among various situations. The purpose of this study, when manipulating hint condition, figure and table conditions and the amount of original learning by using algebra word problems, is to verify the effects of analogical transfer in solving equivalent, isomorphic and similar problems according to the similarity of source problems and target ones. Five study questions were set up for the above purpose. It was 354 first grade students of S and G middle schools in Seoul that were experimented for this study. The data was processed by MANOVA analysis of statistical program, SPSS 10.0. The results of this studies would indicate that most of the students would be poor at solving isomorphic and similar problems in the performance of analogical transfer according to the similarity of source and target problems. Hints, figure and table conditions did not facilitate the analogical transfer. Merely, on the condition that amount of teaming was increased, analogical transfer of the students was facilitated. Therefore, it is necessary to have students do much more analogical problem-solving experience to improve their analogical reasoning ability through the instruction program development in the educational fields.

  • PDF

A Study of the Correlation between Social Support and Stress in the Pregnant Women (임신부가 지각한 사회적 지지와 스트레스에 관한 상관관계 연구)

  • 서연옥
    • Journal of Korean Academy of Nursing
    • /
    • v.17 no.2
    • /
    • pp.153-161
    • /
    • 1987
  • This descriptive-correlational study was conducted to recognize the relational between the Perceived social support and stress in pregnant women. The subject were comprised of 154 patients who were at 24 weeks over, and data were collected by way of questionnairs at K-university hospital in Seoul during 23 days (November 21, 1986 to December 13, 1986). The result of this study may be summerized as follows. 1) The degrees of the support from situation perceived by pregnant women were: the minimal 20 feints, the maximal 61 points and the mean 40.42 feints. 2) The degrees of stress perceived by pregnant women were: the minimal 32 points, the maximal 109 points and the mean 65.49 points. 3) The results of hypotheses were: the first hypothesis was not supported that the higher the degree of social support from situation perceived by pregnant women, the lower that of stress(r= -0.01129, p=0.8895) the second was supported that the higher the degree if support perceived indirectly by pregnant women, the lower that of stress(r= -0.23832, p=0.0029). the third was supported that the higher the degree of support perceived directly by pregnant women, the lower that of stress. (r= -0.36019, p=0.0001) 4) The fourth shows that in the relationships between the characteristics of socio-popualtion and obstetrics and the degree of social support, there was significant difference; i) the support from situation differs in learning (t= -2.178, p<.05), and satisfaction of marriage (F=5.06, p<.01) ii) the support perceived indirectly differs in learning (t= -3.065, p<.01), month of pregnant(F= 2.78, p <.05), satisfaction of marriage (F=20.08, p<.001) and forms of family (t=2.11, p<.05) iii) the support perceived directly differs in satisfaction of marriage (F=21.00, p <.001) and forms of family (t=2.11, p<.05) 5) the fifth shows that in the relationship between the characteristics of sociopopulation and obstetrics and stress, the degree of satisfaction of marriage only shows the significant statistical difference(F= 0.40, p<.01) 6) The sixth shows that the factor affecting the stress of pregnant women was the support perceived directly and its explanatory power was 12.9%.

  • PDF

Online news-based stock price forecasting considering homogeneity in the industrial sector (산업군 내 동질성을 고려한 온라인 뉴스 기반 주가예측)

  • Seong, Nohyoon;Nam, Kihwan
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.1-19
    • /
    • 2018
  • Since stock movements forecasting is an important issue both academically and practically, studies related to stock price prediction have been actively conducted. The stock price forecasting research is classified into structured data and unstructured data, and it is divided into technical analysis, fundamental analysis and media effect analysis in detail. In the big data era, research on stock price prediction combining big data is actively underway. Based on a large number of data, stock prediction research mainly focuses on machine learning techniques. Especially, research methods that combine the effects of media are attracting attention recently, among which researches that analyze online news and utilize online news to forecast stock prices are becoming main. Previous studies predicting stock prices through online news are mostly sentiment analysis of news, making different corpus for each company, and making a dictionary that predicts stock prices by recording responses according to the past stock price. Therefore, existing studies have examined the impact of online news on individual companies. For example, stock movements of Samsung Electronics are predicted with only online news of Samsung Electronics. In addition, a method of considering influences among highly relevant companies has also been studied recently. For example, stock movements of Samsung Electronics are predicted with news of Samsung Electronics and a highly related company like LG Electronics.These previous studies examine the effects of news of industrial sector with homogeneity on the individual company. In the previous studies, homogeneous industries are classified according to the Global Industrial Classification Standard. In other words, the existing studies were analyzed under the assumption that industries divided into Global Industrial Classification Standard have homogeneity. However, existing studies have limitations in that they do not take into account influential companies with high relevance or reflect the existence of heterogeneity within the same Global Industrial Classification Standard sectors. As a result of our examining the various sectors, it can be seen that there are sectors that show the industrial sectors are not a homogeneous group. To overcome these limitations of existing studies that do not reflect heterogeneity, our study suggests a methodology that reflects the heterogeneous effects of the industrial sector that affect the stock price by applying k-means clustering. Multiple Kernel Learning is mainly used to integrate data with various characteristics. Multiple Kernel Learning has several kernels, each of which receives and predicts different data. To incorporate effects of target firm and its relevant firms simultaneously, we used Multiple Kernel Learning. Each kernel was assigned to predict stock prices with variables of financial news of the industrial group divided by the target firm, K-means cluster analysis. In order to prove that the suggested methodology is appropriate, experiments were conducted through three years of online news and stock prices. The results of this study are as follows. (1) We confirmed that the information of the industrial sectors related to target company also contains meaningful information to predict stock movements of target company and confirmed that machine learning algorithm has better predictive power when considering the news of the relevant companies and target company's news together. (2) It is important to predict stock movements with varying number of clusters according to the level of homogeneity in the industrial sector. In other words, when stock prices are homogeneous in industrial sectors, it is important to use relational effect at the level of industry group without analyzing clusters or to use it in small number of clusters. When the stock price is heterogeneous in industry group, it is important to cluster them into groups. This study has a contribution that we testified firms classified as Global Industrial Classification Standard have heterogeneity and suggested it is necessary to define the relevance through machine learning and statistical analysis methodology rather than simply defining it in the Global Industrial Classification Standard. It has also contribution that we proved the efficiency of the prediction model reflecting heterogeneity.

Usefulness of Data Mining in Criminal Investigation (데이터 마이닝의 범죄수사 적용 가능성)

  • Kim, Joon-Woo;Sohn, Joong-Kweon;Lee, Sang-Han
    • Journal of forensic and investigative science
    • /
    • v.1 no.2
    • /
    • pp.5-19
    • /
    • 2006
  • Data mining is an information extraction activity to discover hidden facts contained in databases. Using a combination of machine learning, statistical analysis, modeling techniques and database technology, data mining finds patterns and subtle relationships in data and infers rules that allow the prediction of future results. Typical applications include market segmentation, customer profiling, fraud detection, evaluation of retail promotions, and credit risk analysis. Law enforcement agencies deal with mass data to investigate the crime and its amount is increasing due to the development of processing the data by using computer. Now new challenge to discover knowledge in that data is confronted to us. It can be applied in criminal investigation to find offenders by analysis of complex and relational data structures and free texts using their criminal records or statement texts. This study was aimed to evaluate possibile application of data mining and its limitation in practical criminal investigation. Clustering of the criminal cases will be possible in habitual crimes such as fraud and burglary when using data mining to identify the crime pattern. Neural network modelling, one of tools in data mining, can be applied to differentiating suspect's photograph or handwriting with that of convict or criminal profiling. A case study of in practical insurance fraud showed that data mining was useful in organized crimes such as gang, terrorism and money laundering. But the products of data mining in criminal investigation should be cautious for evaluating because data mining just offer a clue instead of conclusion. The legal regulation is needed to control the abuse of law enforcement agencies and to protect personal privacy or human rights.

  • PDF