• Title/Summary/Keyword: learning sources

Search Result 348, Processing Time 0.028 seconds

Automatic Categorization of Islamic Jurisprudential Legal Questions using Hierarchical Deep Learning Text Classifier

  • AlSabban, Wesam H.;Alotaibi, Saud S.;Farag, Abdullah Tarek;Rakha, Omar Essam;Al Sallab, Ahmad A.;Alotaibi, Majid
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.9
    • /
    • pp.281-291
    • /
    • 2021
  • The Islamic jurisprudential legal system represents an essential component of the Islamic religion, that governs many aspects of Muslims' daily lives. This creates many questions that require interpretations by qualified specialists, or Muftis according to the main sources of legislation in Islam. The Islamic jurisprudence is usually classified into branches, according to which the questions can be categorized and classified. Such categorization has many applications in automated question-answering systems, and in manual systems in routing the questions to a specialized Mufti to answer specific topics. In this work we tackle the problem of automatic categorisation of Islamic jurisprudential legal questions using deep learning techniques. In this paper, we build a hierarchical deep learning model that first extracts the question text features at two levels: word and sentence representation, followed by a text classifier that acts upon the question representation. To evaluate our model, we build and release the largest publicly available dataset of Islamic questions and answers, along with their topics, for 52 topic categories. We evaluate different state-of-the art deep learning models, both for word and sentence embeddings, comparing recurrent and transformer-based techniques, and performing extensive ablation studies to show the effect of each model choice. Our hierarchical model is based on pre-trained models, taking advantage of the recent advancement of transfer learning techniques, focused on Arabic language.

Development of Data Visualized Web System for Virtual Power Forecasting based on Open Sources based Location Services using Deep Learning (오픈소스 기반 지도 서비스를 이용한 딥러닝 실시간 가상 전력수요 예측 가시화 웹 시스템)

  • Lee, JeongHwi;Kim, Dong Keun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.8
    • /
    • pp.1005-1012
    • /
    • 2021
  • Recently, the use of various location-based services-based location information systems using maps on the web has been expanding, and there is a need for a monitoring system that can check power demand in real time as an alternative to energy saving. In this study, we developed a deep learning real-time virtual power demand prediction web system using open source-based mapping service to analyze and predict the characteristics of power demand data using deep learning. In particular, the proposed system uses the LSTM(Long Short-Term Memory) deep learning model to enable power demand and predictive analysis locally, and provides visualization of analyzed information. Future proposed systems will not only be utilized to identify and analyze the supply and demand and forecast status of energy by region, but also apply to other industrial energies.

Enhancing Smart Grid Efficiency through SAC Reinforcement Learning: Renewable Energy Integration and Optimal Demand Response in the CityLearn Environment (SAC 강화 학습을 통한 스마트 그리드 효율성 향상: CityLearn 환경에서 재생 에너지 통합 및 최적 수요 반응)

  • Esanov Alibek Rustamovich;Seung Je Seong;Chang-Gyoon Lim
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.1
    • /
    • pp.93-104
    • /
    • 2024
  • Demand response is a strategy that encourages customers to adjust their consumption patterns at times of peak demand with the aim to improve the reliability of the power grid and minimize expenses. The integration of renewable energy sources into smart grids poses significant challenges due to their intermittent and unpredictable nature. Demand response strategies, coupled with reinforcement learning techniques, have emerged as promising approaches to address these challenges and optimize grid operations where traditional methods fail to meet such kind of complex requirements. This research focuses on investigating the application of reinforcement learning algorithms in demand response for renewable energy integration. The objectives include optimizing demand-side flexibility, improving renewable energy utilization, and enhancing grid stability. The results emphasize the effectiveness of demand response strategies based on reinforcement learning in enhancing grid flexibility and facilitating the integration of renewable energy.

A Study on Educational Tasks about the Succession Patterns of Dietary Culture in Korea and Japan (식생활문화(食生活文化) 계승(繼承)의 현상(現狀)에 관한 한(韓).일(日) 양국(兩國)의 교육적(敎育的)인 과제연구(課題硏究))

  • Kim, Hye-Ja;Haruta, Kazuko
    • Journal of the Korean Society of Food Culture
    • /
    • v.8 no.4
    • /
    • pp.337-348
    • /
    • 1993
  • This study was carried out to investigate the succession patterns of dietary culture and to find out all the educational problems with female college students in both countries as the central figure. The results are as follows. The degree of knowledge acquisition about food of annual custom is 58% in Korea and 72% in Japan. What the rate of knowledge acquisition is high among both countries’ similar food of annual custom are ${\ulcorner}Seolnal(Gantan){\lrcorner}$, ${\ulcorner}Sambok(Doyonohi){\lrcorner}$, and ${\ulcorner}Chuseok(Tsukimi){\lrcorner}$. Cooking experience of festive food is 45% in Korea and 58% in Japan. Among both countries' common festive food what cooking experience is high in Korea are ${\ulcorner}Seolnal{\lrcorner}$ and ${\ulcorner}Chuseok{\lrcorner}$, which are over 97%. In Japan those are ${\ulcorner}Gantan{\lrcorner}$ and ${\ulcorner}Tsukimi{\lrcorner}$, which are over 80%. Regarding learning experience of festive food ${\ulcorner}Seolnal{\lrcorner}$ and ${\ulcorner}Gantan{\lrcorner}$ are beyond 80% and ${\ulcorner}Chuseok{\lrcorner}$ is 88%. In Japan ${\ulcorner}Tsukimi{\lrcorner}$ is 71% and ${\ulcorner}Omisoka{\lrcorner}$ is 85%. The learning sources of food of annual custom are parents and schools in common, and Korea has another learning sources, mass communication. Festive food that is cooked shows much similarity between two countries, but each country has originality. As common food of annual custom ${\ulcorner}Seolnal{\lrcorner}$ has nine kinds of food, ${\ulcorner}Sambok{\lrcorner}$ has three kinds, and ${\ulcorner}Chuseok{\lrcorner}$ has five kinds in Korea In Japan ${\ulcorner}Gantan{\lrcorner}$ has fourteen kinds of food, ${\ulcorner}Doyonohi{\lrcorner}$ has three kinds, and ${\ulcorner}Tsukimi{\lrcorner}$ has five kinds. The successive consciousness about food of annual custom is concentrated on a specific food in Korea. And Japanese consciousness is shown as an expansion-type on diverse food. Korean successive consciousness is 69.4% and Japanese consciousness is 82%. The higher the rate of knowledge acquisition, cooking experience, and learning experience are in both countries, the higher successive consciousness is. So we must note for the importance of home and school’s education.

  • PDF

Machine learning-based Fine Dust Prediction Model using Meteorological data and Fine Dust data (기상 데이터와 미세먼지 데이터를 활용한 머신러닝 기반 미세먼지 예측 모형)

  • KIM, Hye-Lim;MOON, Tae-Heon
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.24 no.1
    • /
    • pp.92-111
    • /
    • 2021
  • As fine dust negatively affects disease, industry and economy, the people are sensitive to fine dust. Therefore, if the occurrence of fine dust can be predicted, countermeasures can be prepared in advance, which can be helpful for life and economy. Fine dust is affected by the weather and the degree of concentration of fine dust emission sources. The industrial sector has the largest amount of fine dust emissions, and in industrial complexes, factories emit a lot of fine dust as fine dust emission sources. This study targets regions with old industrial complexes in local cities. The purpose of this study is to explore the factors that cause fine dust and develop a predictive model that can predict the occurrence of fine dust. weather data and fine dust data were used, and variables that influence the generation of fine dust were extracted through multiple regression analysis. Based on the results of multiple regression analysis, a model with high predictive power was extracted by learning with a machine learning regression learner model. The performance of the model was confirmed using test data. As a result, the models with high predictive power were linear regression model, Gaussian process regression model, and support vector machine. The proportion of training data and predictive power were not proportional. In addition, the average value of the difference between the predicted value and the measured value was not large, but when the measured value was high, the predictive power was decreased. The results of this study can be developed as a more systematic and precise fine dust prediction service by combining meteorological data and urban big data through local government data hubs. Lastly, it will be an opportunity to promote the development of smart industrial complexes.

A study on performance improvement considering the balance between corpus in Neural Machine Translation (인공신경망 기계번역에서 말뭉치 간의 균형성을 고려한 성능 향상 연구)

  • Park, Chanjun;Park, Kinam;Moon, Hyeonseok;Eo, Sugyeong;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.5
    • /
    • pp.23-29
    • /
    • 2021
  • Recent deep learning-based natural language processing studies are conducting research to improve performance by training large amounts of data from various sources together. However, there is a possibility that the methodology of learning by combining data from various sources into one may prevent performance improvement. In the case of machine translation, data deviation occurs due to differences in translation(liberal, literal), style(colloquial, written, formal, etc.), domains, etc. Combining these corpora into one for learning can adversely affect performance. In this paper, we propose a new Corpus Weight Balance(CWB) method that considers the balance between parallel corpora in machine translation. As a result of the experiment, the model trained with balanced corpus showed better performance than the existing model. In addition, we propose an additional corpus construction process that enables coexistence with the human translation market, which can build high-quality parallel corpus even with a monolingual corpus.

Radionuclide identification based on energy-weighted algorithm and machine learning applied to a multi-array plastic scintillator

  • Hyun Cheol Lee ;Bon Tack Koo ;Ju Young Jeon ;Bo-Wi Cheon ;Do Hyeon Yoo ;Heejun Chung;Chul Hee Min
    • Nuclear Engineering and Technology
    • /
    • v.55 no.10
    • /
    • pp.3907-3912
    • /
    • 2023
  • Radiation portal monitors (RPMs) installed at airports and harbors to prevent illicit trafficking of radioactive materials generally use large plastic scintillators. However, their energy resolution is poor and radionuclide identification is nearly unfeasible. In this study, to improve isotope identification, a RPM system based on a multi-array plastic scintillator and convolutional neural network (CNN) was evaluated by measuring the spectra of radioactive sources. A multi-array plastic scintillator comprising an assembly of 14 hexagonal scintillators was fabricated within an area of 50 × 100 cm2. The energy spectra of 137Cs, 60Co, 226Ra, and 4K (KCl) were measured at speeds of 10-30 km/h, respectively, and an energy-weighted algorithm was applied. For the CNN, 700 and 300 spectral images were used as training and testing images, respectively. Compared to the conventional plastic scintillator, the multi-arrayed detector showed a high collection probability of the optical photons generated inside. A Compton maximum peak was observed for four moving radiation sources, and the CNN-based classification results showed that at least 70% was discriminated. Under the speed condition, the spectral fluctuations were higher than those under dwelling condition. However, the machine learning results demonstrated that a considerably high level of nuclide discrimination was possible under source movement conditions.

Improving the Accuracy of Document Classification by Learning Heterogeneity (이질성 학습을 통한 문서 분류의 정확성 향상 기법)

  • Wong, William Xiu Shun;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.21-44
    • /
    • 2018
  • In recent years, the rapid development of internet technology and the popularization of smart devices have resulted in massive amounts of text data. Those text data were produced and distributed through various media platforms such as World Wide Web, Internet news feeds, microblog, and social media. However, this enormous amount of easily obtained information is lack of organization. Therefore, this problem has raised the interest of many researchers in order to manage this huge amount of information. Further, this problem also required professionals that are capable of classifying relevant information and hence text classification is introduced. Text classification is a challenging task in modern data analysis, which it needs to assign a text document into one or more predefined categories or classes. In text classification field, there are different kinds of techniques available such as K-Nearest Neighbor, Naïve Bayes Algorithm, Support Vector Machine, Decision Tree, and Artificial Neural Network. However, while dealing with huge amount of text data, model performance and accuracy becomes a challenge. According to the type of words used in the corpus and type of features created for classification, the performance of a text classification model can be varied. Most of the attempts are been made based on proposing a new algorithm or modifying an existing algorithm. This kind of research can be said already reached their certain limitations for further improvements. In this study, aside from proposing a new algorithm or modifying the algorithm, we focus on searching a way to modify the use of data. It is widely known that classifier performance is influenced by the quality of training data upon which this classifier is built. The real world datasets in most of the time contain noise, or in other words noisy data, these can actually affect the decision made by the classifiers built from these data. In this study, we consider that the data from different domains, which is heterogeneous data might have the characteristics of noise which can be utilized in the classification process. In order to build the classifier, machine learning algorithm is performed based on the assumption that the characteristics of training data and target data are the same or very similar to each other. However, in the case of unstructured data such as text, the features are determined according to the vocabularies included in the document. If the viewpoints of the learning data and target data are different, the features may be appearing different between these two data. In this study, we attempt to improve the classification accuracy by strengthening the robustness of the document classifier through artificially injecting the noise into the process of constructing the document classifier. With data coming from various kind of sources, these data are likely formatted differently. These cause difficulties for traditional machine learning algorithms because they are not developed to recognize different type of data representation at one time and to put them together in same generalization. Therefore, in order to utilize heterogeneous data in the learning process of document classifier, we apply semi-supervised learning in our study. However, unlabeled data might have the possibility to degrade the performance of the document classifier. Therefore, we further proposed a method called Rule Selection-Based Ensemble Semi-Supervised Learning Algorithm (RSESLA) to select only the documents that contributing to the accuracy improvement of the classifier. RSESLA creates multiple views by manipulating the features using different types of classification models and different types of heterogeneous data. The most confident classification rules will be selected and applied for the final decision making. In this paper, three different types of real-world data sources were used, which are news, twitter and blogs.

Technology Licensing Agreements from an Organizational Learning Perspective

  • Lee, JongKuk;Song, Sangyoung
    • Asia Marketing Journal
    • /
    • v.15 no.3
    • /
    • pp.79-95
    • /
    • 2013
  • New product innovation is a process of embodying new knowledge in a product and technology licensing is getting popular as a means to innovations and introduction of new product to the market in today's competitive global market environment. Incumbents often rely on technology licensing to access new product opportunities created by other firms. Prior research has examined various aspects of technology licensing agreements such as specific contract terms of licensing agreements, e.g., distribution of control rights, exclusivity of licensing agreements, cross-licensing, and the scope of licensing agreements. This study aims to provide answers to an important, but under-researched question: why do some incumbents initiate more licensing agreement for exploratory learning while others do it for exploitative learning along the innovation process? We attempt to extend our knowledge of licensing agreements from an organizational learning perspective. Technology licensing as a specific form of interfirm linkages can be initiated with different learning objectives along the process of new product innovation. The exploratory stages of the innovation process such as discovery or research stages involve extensive searches to create new knowledge or capabilities, whereas the exploitative stages of the innovation process such as application or test stages near the commercialization are more focused on developing specific applications or improving their efficiency or reliability. Thus, different stages of the innovation process generate different types of learning and the resulting technological resources. We examine when incumbents as licensees initiate more licensing agreements for exploratory learning objectives and when more for exploitative learning objectives, focusing on two factors that may influence a firm's formation of exploratory and exploitative licensing agreements: 1) its past radical and incremental innovation experience and 2) its internal investments in R&D and marketing. We develop and test our hypotheses regarding the relationship between a firm's radical and incremental new product experience, R&D investment intensity and marketing investment intensity, and the likelihood of engaging in exploratory and exploitive licensing agreements. Using data collected from various secondary sources (Recap database, Compustat database, and FDA website), we analyzed technology licensing agreements initiated in the biotechnology and pharmaceutical industries from 1988 to 2011. The results of this study show that incumbents initiate exploratory rather than exploitative licensing agreements when they have more radical innovation experience and when they invest in R&D activities more intensively; in contrast, they initiate exploitative rather than exploratory licensing agreements when they have more incremental innovation experience and when they invest in marketing activities more intensively. The findings of this study contribute to the licensing and interfirm cooperation studies. First, this study lays a foundation to understand the organizational learning aspect of technology licensing agreements. Second, this study sheds lights on how a firm's internal investments in R&D and marketing are linked to its tendency to initiate licensing agreements along the innovation process. Finally, the findings of this study provide important insight to managers regarding which technologies to gain via licensing agreements. This study suggests that firms need to consider their internal investments in R&D and marketing as well as their past innovation experiences when they initiate licensing agreements along the process of new product innovation.

  • PDF

Knowledge Transfer between Users and Producers in the Accumulation of Technological Capability

  • Lim, Chai-Sung
    • Journal of Technology Innovation
    • /
    • v.13 no.2
    • /
    • pp.179-205
    • /
    • 2005
  • This study reveals that the user industry has a limited role in being a source of technological capability in the case of the machine tool industry in Korea where the user industry is relatively more advanced than other capital goods industries. This study examines the sources of technological capability in terms of migration of workforces and flow of product development knowledge. Although the capital goods sector is generally regarded as being the sector where user-producer interaction is important, the user industry is not the seed-bed of technological capability for machine development. Users and producers interact in terms of expressing 'needs', mainly in the form of specifications. As a result of receiving unique specifications from users, the producer learns to react by making specific customised special purpose machines. The user's specification could include information o the imported machine originally used. When confronted with technical problems in developing a new machine, the producer accesses foreign sources of knowledge. This study's finding reveals that users of special purpose machines have a significantly clearer role in providing specifications than do users of general purpose machine tools. Most intensive interactive learning between users and producers in the production process is found in special purpose machine tools. From the empirical findings, policy implications are discussed.

  • PDF