• Title/Summary/Keyword: Learning Structure

Search Result 2,210, Processing Time 0.027 seconds

The development of four efficient optimal neural network methods in forecasting shallow foundation's bearing capacity

  • Hossein Moayedi;Binh Nguyen Le
    • Computers and Concrete
    • /
    • v.34 no.2
    • /
    • pp.151-168
    • /
    • 2024
  • This research aimed to appraise the effectiveness of four optimization approaches - cuckoo optimization algorithm (COA), multi-verse optimization (MVO), particle swarm optimization (PSO), and teaching-learning-based optimization (TLBO) - that were enhanced with an artificial neural network (ANN) in predicting the bearing capacity of shallow foundations located on cohesionless soils. The study utilized a database of 97 laboratory experiments, with 68 experiments for training data sets and 29 for testing data sets. The ANN algorithms were optimized by adjusting various variables, such as population size and number of neurons in each hidden layer, through trial-and-error techniques. Input parameters used for analysis included width, depth, geometry, unit weight, and angle of shearing resistance. After performing sensitivity analysis, it was determined that the optimized architecture for the ANN structure was 5×5×1. The study found that all four models demonstrated exceptional prediction performance: COA-MLP, MVO-MLP, PSO-MLP, and TLBO-MLP. It is worth noting that the MVO-MLP model exhibited superior accuracy in generating network outputs for predicting measured values compared to the other models. The training data sets showed R2 and RMSE values of (0.07184 and 0.9819), (0.04536 and 0.9928), (0.09194 and 0.9702), and (0.04714 and 0.9923) for COA-MLP, MVO-MLP, PSO-MLP, and TLBO-MLP methods respectively. Similarly, the testing data sets produced R2 and RMSE values of (0.08126 and 0.07218), (0.07218 and 0.9814), (0.10827 and 0.95764), and (0.09886 and 0.96481) for COA-MLP, MVO-MLP, PSO-MLP, and TLBO-MLP methods respectively.

A new surrogate method for the neutron kinetics calculation of nuclear reactor core transients

  • Xiaoqi Li;Youqi Zheng;Xianan Du;Bowen Xiao
    • Nuclear Engineering and Technology
    • /
    • v.56 no.9
    • /
    • pp.3571-3584
    • /
    • 2024
  • Reactor core transient calculation is very important for the reactor safety analysis, in which the kernel is neutron kinetics calculation by simulating the variation of neutron density or thermal power over time. Compared with the point kinetics method, the time-space neutron kinetics calculation can provide accurate variation of neutron density in both space and time domain. But it consumes a lot of resources. It is necessary to develop a surrogate model that can quickly obtain the temporal and spatial variation information of neutron density or power with acceptable calculation accuracy. This paper uses the time-varying characteristics of power to construct a time function, parameterizes the time-varying characteristics which contains the information about the spatial change of power. Thereby, the amount of targets to predict in the space domain is compressed. A surrogate method using the machine learning is proposed in this paper. In the construction of a neural network, the input is processed by a convolutional layer, followed by a fully connected layer or a deconvolution layer. For the problem of time sequence disturbance, a structure combining convolutional neural network and recurrent neural network is used. It is verified in the tests of a series of 1D, 2D and 3D reactor models. The predicted values obtained using the constructed neural network models in these tests are in good agreement with the reference values, showing the powerful potential of the surrogate models.

Investigating the Cognitive Process of a Student's Modeling on a Modeling-Emphasized Argument-Based General Chemistry Experiment (모델링을 강조한 논의 기반 일반화학실험에서 학생들의 모델링에 대한 인지과정 탐색)

  • Lee, Dongwon;Cho, Hey Sook;Nam, Jeonghee
    • Journal of The Korean Association For Science Education
    • /
    • v.35 no.2
    • /
    • pp.313-323
    • /
    • 2015
  • The purpose of this study is to investigate the cognitive process of student's modeling on a modeling-emphasized argument-based general chemistry experiment. The participants were twenty-one freshman students. Six topics were carried out during the first semester and semi-structured interview was implemented at the end of the semester. Semi-structured interview questions were used to elicit elements of effective model, modeling strategies, difficulties that students have experienced during modeling, and resolving the difficulties that students have experienced during modeling. All student interview data were collected and transcribed. The results of this study are summarized as follows: (1) Elements of effective model were considered to be visual expression, persuasive explanation, and rhetorical structure. (2) Modeling strategies included arranging important keywords or writing the outline, and during the modeling process, students used various data, suggested data after reconstructing, suggested definitions and explanations of core concepts, used meta-cognition, and considering rhetorical structure. (3) Difficulties students have experienced during modeling could be categorized as lack of modeling strategy and understanding. (4) Resolving difficulties students have experienced during modeling could be categorized as modeling strategy and understanding. Students learn the strategy by feedback, modeling experience, evaluation of experimental report, models which they constructed previously and references, and the understanding of contents were achieved through arguments which occurred during classes and during the process of writing the experimental reports. These results suggest that when using modeling in teaching and learning, the argument-based learning strategy can be effective in enhancing students' modeling by helping them to understand meta-modeling with scientific concepts.

A View about Li(理) and Ki(氣) of Hayasi Razan(林羅山) (하야시 라잔(林羅山)의 이기관(理氣觀))

  • Lee, Yongsoo
    • The Journal of Korean Philosophical History
    • /
    • no.31
    • /
    • pp.347-374
    • /
    • 2011
  • Along with Hujiwara Seika(藤原惺窩), Hayashi Razan(林羅山) is called the founder of the Japanese Confucianism in the Eto(江戶) era. And it is necessary for us to grasp that how Razan understand the theory of I-Ki(理氣論), then we can investigate the characteristics of his thought. In ordinary, people understand that the theory of I-Ki, as a completed view of the world, is integration of the structure of theory of the neo-Confucianism. So a certain thinker's ideological attitude is determined according to how people understand the theory. And then we can grasp the structure of his view of the world and human. Therefore, the purpose of this paper is to study how Razan had understanded the I(理) and Ki(氣). In spite of a scholar of Zhu Xi(朱熹), Razan didn't accept Zhu's view of I-Ki, he seem to lean toward the view of Wang Yangmings'(王陽明) in the his early learning days. But that doesn't mean he is a scholar of doctrine of Wang Yangming. When he meets the logical contradiction under the process of investigating the problem of Sein and Sollen, he just only to explain it with logic of Ki(氣) which is closed by mind. Meanwhile if we suppose I(理) is pure goodness and there is no things outside of I(理), if so Razan doubts about that where is the root of evil and he try to investigate the answer. In his latter years, Razan takes Zhu Xi's doctrine again get out of the mental attitude to the view of I-Ki(理氣). The outcome of precedent study about Razan points a fact that Razan needs a little more digging into the ieda of 'Fact and Sollen' which had been the reason of ideal confusion of him. But his ideal confusion is not the point of issue. Point is that Razan had understanded I-Ki(理氣) with monistic of Shim(心) in his early years. As a result, that bring about the outcome which exclude ontological thinking, and had come to grips with aspects of Sollen of all things in understanding of the doctrine of Zhu Xi. And I think that is the clue to understanding of Razan's learning.

Data collection strategy for building rainfall-runoff LSTM model predicting daily runoff (강수-일유출량 추정 LSTM 모형의 구축을 위한 자료 수집 방안)

  • Kim, Dongkyun;Kang, Seokkoo
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.10
    • /
    • pp.795-805
    • /
    • 2021
  • In this study, after developing an LSTM-based deep learning model for estimating daily runoff in the Soyang River Dam basin, the accuracy of the model for various combinations of model structure and input data was investigated. A model was built based on the database consisting of average daily precipitation, average daily temperature, average daily wind speed (input up to here), and daily average flow rate (output) during the first 12 years (1997.1.1-2008.12.31). The Nash-Sutcliffe Model Efficiency Coefficient (NSE) and RMSE were examined for validation using the flow discharge data of the later 12 years (2009.1.1-2020.12.31). The combination that showed the highest accuracy was the case in which all possible input data (12 years of daily precipitation, weather temperature, wind speed) were used on the LSTM model structure with 64 hidden units. The NSE and RMSE of the verification period were 0.862 and 76.8 m3/s, respectively. When the number of hidden units of LSTM exceeds 500, the performance degradation of the model due to overfitting begins to appear, and when the number of hidden units exceeds 1000, the overfitting problem becomes prominent. A model with very high performance (NSE=0.8~0.84) could be obtained when only 12 years of daily precipitation was used for model training. A model with reasonably high performance (NSE=0.63-0.85) when only one year of input data was used for model training. In particular, an accurate model (NSE=0.85) could be obtained if the one year of training data contains a wide magnitude of flow events such as extreme flow and droughts as well as normal events. If the training data includes both the normal and extreme flow rates, input data that is longer than 5 years did not significantly improve the model performance.

GEase-K: Linear and Nonlinear Autoencoder-based Recommender System with Side Information (GEase-K: 부가 정보를 활용한 선형 및 비선형 오토인코더 기반의 추천시스템)

  • Taebeom Lee;Seung-hak Lee;Min-jeong Ma;Yoonho Cho
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.167-183
    • /
    • 2023
  • In the recent field of recommendation systems, various studies have been conducted to model sparse data effectively. Among these, GLocal-K(Global and Local Kernels for Recommender Systems) is a research endeavor combining global and local kernels to provide personalized recommendations by considering global data patterns and individual user characteristics. However, due to its utilization of kernel tricks, GLocal-K exhibits diminished performance on highly sparse data and struggles to offer recommendations for new users or items due to the absence of side information. In this paper, to address these limitations of GLocal-K, we propose the GEase-K (Global and EASE kernels for Recommender Systems) model, incorporating the EASE(Embarrassingly Shallow Autoencoders for Sparse Data) model and leveraging side information. Initially, we substitute EASE for the local kernel in GLocal-K to enhance recommendation performance on highly sparse data. EASE, functioning as a simple linear operational structure, is an autoencoder that performs highly on extremely sparse data through regularization and learning item similarity. Additionally, we utilize side information to alleviate the cold-start problem. We enhance the understanding of user-item similarities by employing a conditional autoencoder structure during the training process to incorporate side information. In conclusion, GEase-K demonstrates resilience in highly sparse data and cold-start situations by combining linear and nonlinear structures and utilizing side information. Experimental results show that GEase-K outperforms GLocal-K based on the RMSE and MAE metrics on the highly sparse GoodReads and ModCloth datasets. Furthermore, in cold-start experiments divided into four groups using the GoodReads and ModCloth datasets, GEase-K denotes superior performance compared to GLocal-K.

An Intelligent Decision Support System for Selecting Promising Technologies for R&D based on Time-series Patent Analysis (R&D 기술 선정을 위한 시계열 특허 분석 기반 지능형 의사결정지원시스템)

  • Lee, Choongseok;Lee, Suk Joo;Choi, Byounggu
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.79-96
    • /
    • 2012
  • As the pace of competition dramatically accelerates and the complexity of change grows, a variety of research have been conducted to improve firms' short-term performance and to enhance firms' long-term survival. In particular, researchers and practitioners have paid their attention to identify promising technologies that lead competitive advantage to a firm. Discovery of promising technology depends on how a firm evaluates the value of technologies, thus many evaluating methods have been proposed. Experts' opinion based approaches have been widely accepted to predict the value of technologies. Whereas this approach provides in-depth analysis and ensures validity of analysis results, it is usually cost-and time-ineffective and is limited to qualitative evaluation. Considerable studies attempt to forecast the value of technology by using patent information to overcome the limitation of experts' opinion based approach. Patent based technology evaluation has served as a valuable assessment approach of the technological forecasting because it contains a full and practical description of technology with uniform structure. Furthermore, it provides information that is not divulged in any other sources. Although patent information based approach has contributed to our understanding of prediction of promising technologies, it has some limitations because prediction has been made based on the past patent information, and the interpretations of patent analyses are not consistent. In order to fill this gap, this study proposes a technology forecasting methodology by integrating patent information approach and artificial intelligence method. The methodology consists of three modules : evaluation of technologies promising, implementation of technologies value prediction model, and recommendation of promising technologies. In the first module, technologies promising is evaluated from three different and complementary dimensions; impact, fusion, and diffusion perspectives. The impact of technologies refers to their influence on future technologies development and improvement, and is also clearly associated with their monetary value. The fusion of technologies denotes the extent to which a technology fuses different technologies, and represents the breadth of search underlying the technology. The fusion of technologies can be calculated based on technology or patent, thus this study measures two types of fusion index; fusion index per technology and fusion index per patent. Finally, the diffusion of technologies denotes their degree of applicability across scientific and technological fields. In the same vein, diffusion index per technology and diffusion index per patent are considered respectively. In the second module, technologies value prediction model is implemented using artificial intelligence method. This studies use the values of five indexes (i.e., impact index, fusion index per technology, fusion index per patent, diffusion index per technology and diffusion index per patent) at different time (e.g., t-n, t-n-1, t-n-2, ${\cdots}$) as input variables. The out variables are values of five indexes at time t, which is used for learning. The learning method adopted in this study is backpropagation algorithm. In the third module, this study recommends final promising technologies based on analytic hierarchy process. AHP provides relative importance of each index, leading to final promising index for technology. Applicability of the proposed methodology is tested by using U.S. patents in international patent class G06F (i.e., electronic digital data processing) from 2000 to 2008. The results show that mean absolute error value for prediction produced by the proposed methodology is lower than the value produced by multiple regression analysis in cases of fusion indexes. However, mean absolute error value of the proposed methodology is slightly higher than the value of multiple regression analysis. These unexpected results may be explained, in part, by small number of patents. Since this study only uses patent data in class G06F, number of sample patent data is relatively small, leading to incomplete learning to satisfy complex artificial intelligence structure. In addition, fusion index per technology and impact index are found to be important criteria to predict promising technology. This study attempts to extend the existing knowledge by proposing a new methodology for prediction technology value by integrating patent information analysis and artificial intelligence network. It helps managers who want to technology develop planning and policy maker who want to implement technology policy by providing quantitative prediction methodology. In addition, this study could help other researchers by proving a deeper understanding of the complex technological forecasting field.

Query-based Answer Extraction using Korean Dependency Parsing (의존 구문 분석을 이용한 질의 기반 정답 추출)

  • Lee, Dokyoung;Kim, Mintae;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.161-177
    • /
    • 2019
  • In this paper, we study the performance improvement of the answer extraction in Question-Answering system by using sentence dependency parsing result. The Question-Answering (QA) system consists of query analysis, which is a method of analyzing the user's query, and answer extraction, which is a method to extract appropriate answers in the document. And various studies have been conducted on two methods. In order to improve the performance of answer extraction, it is necessary to accurately reflect the grammatical information of sentences. In Korean, because word order structure is free and omission of sentence components is frequent, dependency parsing is a good way to analyze Korean syntax. Therefore, in this study, we improved the performance of the answer extraction by adding the features generated by dependency parsing analysis to the inputs of the answer extraction model (Bidirectional LSTM-CRF). The process of generating the dependency graph embedding consists of the steps of generating the dependency graph from the dependency parsing result and learning the embedding of the graph. In this study, we compared the performance of the answer extraction model when inputting basic word features generated without the dependency parsing and the performance of the model when inputting the addition of the Eojeol tag feature and dependency graph embedding feature. Since dependency parsing is performed on a basic unit of an Eojeol, which is a component of sentences separated by a space, the tag information of the Eojeol can be obtained as a result of the dependency parsing. The Eojeol tag feature means the tag information of the Eojeol. The process of generating the dependency graph embedding consists of the steps of generating the dependency graph from the dependency parsing result and learning the embedding of the graph. From the dependency parsing result, a graph is generated from the Eojeol to the node, the dependency between the Eojeol to the edge, and the Eojeol tag to the node label. In this process, an undirected graph is generated or a directed graph is generated according to whether or not the dependency relation direction is considered. To obtain the embedding of the graph, we used Graph2Vec, which is a method of finding the embedding of the graph by the subgraphs constituting a graph. We can specify the maximum path length between nodes in the process of finding subgraphs of a graph. If the maximum path length between nodes is 1, graph embedding is generated only by direct dependency between Eojeol, and graph embedding is generated including indirect dependencies as the maximum path length between nodes becomes larger. In the experiment, the maximum path length between nodes is adjusted differently from 1 to 3 depending on whether direction of dependency is considered or not, and the performance of answer extraction is measured. Experimental results show that both Eojeol tag feature and dependency graph embedding feature improve the performance of answer extraction. In particular, considering the direction of the dependency relation and extracting the dependency graph generated with the maximum path length of 1 in the subgraph extraction process in Graph2Vec as the input of the model, the highest answer extraction performance was shown. As a result of these experiments, we concluded that it is better to take into account the direction of dependence and to consider only the direct connection rather than the indirect dependence between the words. The significance of this study is as follows. First, we improved the performance of answer extraction by adding features using dependency parsing results, taking into account the characteristics of Korean, which is free of word order structure and omission of sentence components. Second, we generated feature of dependency parsing result by learning - based graph embedding method without defining the pattern of dependency between Eojeol. Future research directions are as follows. In this study, the features generated as a result of the dependency parsing are applied only to the answer extraction model in order to grasp the meaning. However, in the future, if the performance is confirmed by applying the features to various natural language processing models such as sentiment analysis or name entity recognition, the validity of the features can be verified more accurately.

Major Class Recommendation System based on Deep learning using Network Analysis (네트워크 분석을 활용한 딥러닝 기반 전공과목 추천 시스템)

  • Lee, Jae Kyu;Park, Heesung;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.95-112
    • /
    • 2021
  • In university education, the choice of major class plays an important role in students' careers. However, in line with the changes in the industry, the fields of major subjects by department are diversifying and increasing in number in university education. As a result, students have difficulty to choose and take classes according to their career paths. In general, students choose classes based on experiences such as choices of peers or advice from seniors. This has the advantage of being able to take into account the general situation, but it does not reflect individual tendencies and considerations of existing courses, and has a problem that leads to information inequality that is shared only among specific students. In addition, as non-face-to-face classes have recently been conducted and exchanges between students have decreased, even experience-based decisions have not been made as well. Therefore, this study proposes a recommendation system model that can recommend college major classes suitable for individual characteristics based on data rather than experience. The recommendation system recommends information and content (music, movies, books, images, etc.) that a specific user may be interested in. It is already widely used in services where it is important to consider individual tendencies such as YouTube and Facebook, and you can experience it familiarly in providing personalized services in content services such as over-the-top media services (OTT). Classes are also a kind of content consumption in terms of selecting classes suitable for individuals from a set content list. However, unlike other content consumption, it is characterized by a large influence of selection results. For example, in the case of music and movies, it is usually consumed once and the time required to consume content is short. Therefore, the importance of each item is relatively low, and there is no deep concern in selecting. Major classes usually have a long consumption time because they have to be taken for one semester, and each item has a high importance and requires greater caution in choice because it affects many things such as career and graduation requirements depending on the composition of the selected classes. Depending on the unique characteristics of these major classes, the recommendation system in the education field supports decision-making that reflects individual characteristics that are meaningful and cannot be reflected in experience-based decision-making, even though it has a relatively small number of item ranges. This study aims to realize personalized education and enhance students' educational satisfaction by presenting a recommendation model for university major class. In the model study, class history data of undergraduate students at University from 2015 to 2017 were used, and students and their major names were used as metadata. The class history data is implicit feedback data that only indicates whether content is consumed, not reflecting preferences for classes. Therefore, when we derive embedding vectors that characterize students and classes, their expressive power is low. With these issues in mind, this study proposes a Net-NeuMF model that generates vectors of students, classes through network analysis and utilizes them as input values of the model. The model was based on the structure of NeuMF using one-hot vectors, a representative model using data with implicit feedback. The input vectors of the model are generated to represent the characteristic of students and classes through network analysis. To generate a vector representing a student, each student is set to a node and the edge is designed to connect with a weight if the two students take the same class. Similarly, to generate a vector representing the class, each class was set as a node, and the edge connected if any students had taken the classes in common. Thus, we utilize Node2Vec, a representation learning methodology that quantifies the characteristics of each node. For the evaluation of the model, we used four indicators that are mainly utilized by recommendation systems, and experiments were conducted on three different dimensions to analyze the impact of embedding dimensions on the model. The results show better performance on evaluation metrics regardless of dimension than when using one-hot vectors in existing NeuMF structures. Thus, this work contributes to a network of students (users) and classes (items) to increase expressiveness over existing one-hot embeddings, to match the characteristics of each structure that constitutes the model, and to show better performance on various kinds of evaluation metrics compared to existing methodologies.

Development and Application of STEAM Education Program for Informal Science Learning in Elementary School: Focused on Theme of 'Light' (초등학교 비형식 과학 교육을 위한 융합인재교육(STEAM) 프로그램의 개발 및 적용 - '빛' 주제를 중심으로)

  • Lee, Hyonyong;Baek, Soyeon;Lee, Hyundong
    • Journal of the Korean Society of Earth Science Education
    • /
    • v.10 no.2
    • /
    • pp.122-139
    • /
    • 2017
  • The purposes of this study are to develop the STEAM program grounded on curriculum and to investigate educational effects of the developed program on students' attitude of science and science self-efficacy by application to elementary informal science education environment. In order to develop this program, the literature reviews were conducted and then STEAM education program based on the theme 'light' is developed. The developed program was revised and complemented through preliminary applications and consulting with experts, and applied to 65 students. A single group pre-post paired t-test was conducted through the students' attitude of science and science self-efficacy test. The semi-structure interviews were used to gather focused and additional data. The results of this study were as follows: firstly, STEAM education program was developed with the theme 'light' for elementary students in order to increase their interest related to real life. Secondly, the results indicated that the program was statistically significant on the attitude of science for the group of third and fourth graders. However, the effects of science self-efficacy did not appear a significant result for the third and fourth graders. They expressed one possible reason. The theme of light was not familiar with them because the theme was scheduled to teach in the second semester of the fourth graders. Some of students in this group did have a chance to learn the theme. Thirdly, the program was very effective for the fifth and sixth graders on their attitude of science and science self-efficacy. In conclusion, STEAM education program developed with the theme of light is contributed to elementary students' attitude of science in the informal science education. Students' learning experiences of relevant concepts can influence on students' science self-efficacy. It could be very important factor to consider students' grade level and previous learning experiences when the educational programs develop.