• Title/Summary/Keyword: Multi-Model Training

Search Result 352, Processing Time 0.028 seconds

Development of Information Extraction System from Multi Source Unstructured Documents for Knowledge Base Expansion (지식베이스 확장을 위한 멀티소스 비정형 문서에서의 정보 추출 시스템의 개발)

  • Choi, Hyunseung;Kim, Mintae;Kim, Wooju;Shin, Dongwook;Lee, Yong Hun
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.111-136
    • /
    • 2018
  • In this paper, we propose a methodology to extract answer information about queries from various types of unstructured documents collected from multi-sources existing on web in order to expand knowledge base. The proposed methodology is divided into the following steps. 1) Collect relevant documents from Wikipedia, Naver encyclopedia, and Naver news sources for "subject-predicate" separated queries and classify the proper documents. 2) Determine whether the sentence is suitable for extracting information and derive the confidence. 3) Based on the predicate feature, extract the information in the proper sentence and derive the overall confidence of the information extraction result. In order to evaluate the performance of the information extraction system, we selected 400 queries from the artificial intelligence speaker of SK-Telecom. Compared with the baseline model, it is confirmed that it shows higher performance index than the existing model. The contribution of this study is that we develop a sequence tagging model based on bi-directional LSTM-CRF using the predicate feature of the query, with this we developed a robust model that can maintain high recall performance even in various types of unstructured documents collected from multiple sources. The problem of information extraction for knowledge base extension should take into account heterogeneous characteristics of source-specific document types. The proposed methodology proved to extract information effectively from various types of unstructured documents compared to the baseline model. There is a limitation in previous research that the performance is poor when extracting information about the document type that is different from the training data. In addition, this study can prevent unnecessary information extraction attempts from the documents that do not include the answer information through the process for predicting the suitability of information extraction of documents and sentences before the information extraction step. It is meaningful that we provided a method that precision performance can be maintained even in actual web environment. The information extraction problem for the knowledge base expansion has the characteristic that it can not guarantee whether the document includes the correct answer because it is aimed at the unstructured document existing in the real web. When the question answering is performed on a real web, previous machine reading comprehension studies has a limitation that it shows a low level of precision because it frequently attempts to extract an answer even in a document in which there is no correct answer. The policy that predicts the suitability of document and sentence information extraction is meaningful in that it contributes to maintaining the performance of information extraction even in real web environment. The limitations of this study and future research directions are as follows. First, it is a problem related to data preprocessing. In this study, the unit of knowledge extraction is classified through the morphological analysis based on the open source Konlpy python package, and the information extraction result can be improperly performed because morphological analysis is not performed properly. To enhance the performance of information extraction results, it is necessary to develop an advanced morpheme analyzer. Second, it is a problem of entity ambiguity. The information extraction system of this study can not distinguish the same name that has different intention. If several people with the same name appear in the news, the system may not extract information about the intended query. In future research, it is necessary to take measures to identify the person with the same name. Third, it is a problem of evaluation query data. In this study, we selected 400 of user queries collected from SK Telecom 's interactive artificial intelligent speaker to evaluate the performance of the information extraction system. n this study, we developed evaluation data set using 800 documents (400 questions * 7 articles per question (1 Wikipedia, 3 Naver encyclopedia, 3 Naver news) by judging whether a correct answer is included or not. To ensure the external validity of the study, it is desirable to use more queries to determine the performance of the system. This is a costly activity that must be done manually. Future research needs to evaluate the system for more queries. It is also necessary to develop a Korean benchmark data set of information extraction system for queries from multi-source web documents to build an environment that can evaluate the results more objectively.

The way to make training data for deep learning model to recognize keywords in product catalog image at E-commerce (온라인 쇼핑몰에서 상품 설명 이미지 내의 키워드 인식을 위한 딥러닝 훈련 데이터 자동 생성 방안)

  • Kim, Kitae;Oh, Wonseok;Lim, Geunwon;Cha, Eunwoo;Shin, Minyoung;Kim, Jongwoo
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.1-23
    • /
    • 2018
  • From the 21st century, various high-quality services have come up with the growth of the internet or 'Information and Communication Technologies'. Especially, the scale of E-commerce industry in which Amazon and E-bay are standing out is exploding in a large way. As E-commerce grows, Customers could get what they want to buy easily while comparing various products because more products have been registered at online shopping malls. However, a problem has arisen with the growth of E-commerce. As too many products have been registered, it has become difficult for customers to search what they really need in the flood of products. When customers search for desired products with a generalized keyword, too many products have come out as a result. On the contrary, few products have been searched if customers type in details of products because concrete product-attributes have been registered rarely. In this situation, recognizing texts in images automatically with a machine can be a solution. Because bulk of product details are written in catalogs as image format, most of product information are not searched with text inputs in the current text-based searching system. It means if information in images can be converted to text format, customers can search products with product-details, which make them shop more conveniently. There are various existing OCR(Optical Character Recognition) programs which can recognize texts in images. But existing OCR programs are hard to be applied to catalog because they have problems in recognizing texts in certain circumstances, like texts are not big enough or fonts are not consistent. Therefore, this research suggests the way to recognize keywords in catalog with the Deep Learning algorithm which is state of the art in image-recognition area from 2010s. Single Shot Multibox Detector(SSD), which is a credited model for object-detection performance, can be used with structures re-designed to take into account the difference of text from object. But there is an issue that SSD model needs a lot of labeled-train data to be trained, because of the characteristic of deep learning algorithms, that it should be trained by supervised-learning. To collect data, we can try labelling location and classification information to texts in catalog manually. But if data are collected manually, many problems would come up. Some keywords would be missed because human can make mistakes while labelling train data. And it becomes too time-consuming to collect train data considering the scale of data needed or costly if a lot of workers are hired to shorten the time. Furthermore, if some specific keywords are needed to be trained, searching images that have the words would be difficult, as well. To solve the data issue, this research developed a program which create train data automatically. This program can make images which have various keywords and pictures like catalog and save location-information of keywords at the same time. With this program, not only data can be collected efficiently, but also the performance of SSD model becomes better. The SSD model recorded 81.99% of recognition rate with 20,000 data created by the program. Moreover, this research had an efficiency test of SSD model according to data differences to analyze what feature of data exert influence upon the performance of recognizing texts in images. As a result, it is figured out that the number of labeled keywords, the addition of overlapped keyword label, the existence of keywords that is not labeled, the spaces among keywords and the differences of background images are related to the performance of SSD model. This test can lead performance improvement of SSD model or other text-recognizing machine based on deep learning algorithm with high-quality data. SSD model which is re-designed to recognize texts in images and the program developed for creating train data are expected to contribute to improvement of searching system in E-commerce. Suppliers can put less time to register keywords for products and customers can search products with product-details which is written on the catalog.

Demand Characteristics and Analysis of Changes in Spatial Accessibility of Public Sports Facilities (공공체육시설 수요특성 및 공간적 접근성 분석)

  • Kim, Seong-Hee;Kim, Yong-Jin
    • The Journal of the Korea Contents Association
    • /
    • v.17 no.7
    • /
    • pp.283-293
    • /
    • 2017
  • This study analyzed the actual conditions of use of public sports facilities and characteristics of the users of the facilities through surveys and measured the spatial imbalance of the public sports facilities currently supplied by using gravity potential model. This study also suggests evaluation criteria that may be considered for efficient location selection by examining the change of accessibility to the facilities that meet the needs of users in the future. As the results of the questionnaire survey, unlike current usage, the users hoped for badminton, weight training and swimming. And we could confirm the demand for the expansion of the multi - purpose indoor gym which can carry out such activities in the areas. As the result of the analysis on the difference in accessibility of the public sports facility, there were some large variations in the regions. It was found that a balanced supply of facilities was needed in terms of equity. In particular, when analyzing by considering the population estimates of 2025, It is analyzed that the accessibility will be reduced to about 60% compared to that of 2015. In addition, it is evaluated as the best alternative in terms of overall efficiency that the location of the facilities should be in Munsan area where population growth is expected in the future.

Development of Female Entrepreneurial Competency Model (여성 기업가 역량모델 개발)

  • Kim, Miran;Eom, Wooyong
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.17 no.5
    • /
    • pp.133-150
    • /
    • 2022
  • The purpose of this study was to develop a female entrepreneurial competency model. For the purpose, two Focus Group Interviews (FGI) were conducted with seven outstanding female entrepreneurs, and three expert reviews were conducted. In addition, in order to verify the validity of the provisional female entrepreneur competency model derived from the FGI and competency modeling expert review, the female entrepreneur competency model was finally confirmed through a survey of 442 female entrepreneurs. The results were as follows. First, a female entrepreneur competency model consisting of 6 competency groups and 25 competencies of entrepreneurship, emotion, business management, relationship management, strategic management, and multitasking, and 75 behavioral indicators describing each competency was developed. Second, sensibility and multitasking are competencies that reflect the characteristics of female entrepreneurs. In particular, 'social sense', which is the ability to be considerate of others in the emotional competency group and the ability to respond well to subtle nuances, and the multitasking competency group's unique strengths are women's ability to perform various tasks at the same time. The 'work-family control ability' of a female entrepreneur who maintains a balance between 'multi-tasking' and work and family is a representative competency of only female entrepreneurs. Third, the developed female entrepreneurship competency model is meaningful in that it not only increases female entrepreneurial competency so that prospective female entrepreneurs can successfully run a business through entrepreneurship education, but it also makes it easy for existing female entrepreneurs to reflect and improve their competencies. If we provide appropriate training programs to female entrepreneurs based on their competency, it will be possible to effectively enhance the entrepreneurial competency, which is the key to strengthening the competitiveness of female entrepreneurs. The female entrepreneur competency model developed through this study can provide a basis for future research on competency diagnosis and education needs analysis.

Feasibility of Deep Learning Algorithms for Binary Classification Problems (이진 분류문제에서의 딥러닝 알고리즘의 활용 가능성 평가)

  • Kim, Kitae;Lee, Bomi;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.1
    • /
    • pp.95-108
    • /
    • 2017
  • Recently, AlphaGo which is Bakuk (Go) artificial intelligence program by Google DeepMind, had a huge victory against Lee Sedol. Many people thought that machines would not be able to win a man in Go games because the number of paths to make a one move is more than the number of atoms in the universe unlike chess, but the result was the opposite to what people predicted. After the match, artificial intelligence technology was focused as a core technology of the fourth industrial revolution and attracted attentions from various application domains. Especially, deep learning technique have been attracted as a core artificial intelligence technology used in the AlphaGo algorithm. The deep learning technique is already being applied to many problems. Especially, it shows good performance in image recognition field. In addition, it shows good performance in high dimensional data area such as voice, image and natural language, which was difficult to get good performance using existing machine learning techniques. However, in contrast, it is difficult to find deep leaning researches on traditional business data and structured data analysis. In this study, we tried to find out whether the deep learning techniques have been studied so far can be used not only for the recognition of high dimensional data but also for the binary classification problem of traditional business data analysis such as customer churn analysis, marketing response prediction, and default prediction. And we compare the performance of the deep learning techniques with that of traditional artificial neural network models. The experimental data in the paper is the telemarketing response data of a bank in Portugal. It has input variables such as age, occupation, loan status, and the number of previous telemarketing and has a binary target variable that records whether the customer intends to open an account or not. In this study, to evaluate the possibility of utilization of deep learning algorithms and techniques in binary classification problem, we compared the performance of various models using CNN, LSTM algorithm and dropout, which are widely used algorithms and techniques in deep learning, with that of MLP models which is a traditional artificial neural network model. However, since all the network design alternatives can not be tested due to the nature of the artificial neural network, the experiment was conducted based on restricted settings on the number of hidden layers, the number of neurons in the hidden layer, the number of output data (filters), and the application conditions of the dropout technique. The F1 Score was used to evaluate the performance of models to show how well the models work to classify the interesting class instead of the overall accuracy. The detail methods for applying each deep learning technique in the experiment is as follows. The CNN algorithm is a method that reads adjacent values from a specific value and recognizes the features, but it does not matter how close the distance of each business data field is because each field is usually independent. In this experiment, we set the filter size of the CNN algorithm as the number of fields to learn the whole characteristics of the data at once, and added a hidden layer to make decision based on the additional features. For the model having two LSTM layers, the input direction of the second layer is put in reversed position with first layer in order to reduce the influence from the position of each field. In the case of the dropout technique, we set the neurons to disappear with a probability of 0.5 for each hidden layer. The experimental results show that the predicted model with the highest F1 score was the CNN model using the dropout technique, and the next best model was the MLP model with two hidden layers using the dropout technique. In this study, we were able to get some findings as the experiment had proceeded. First, models using dropout techniques have a slightly more conservative prediction than those without dropout techniques, and it generally shows better performance in classification. Second, CNN models show better classification performance than MLP models. This is interesting because it has shown good performance in binary classification problems which it rarely have been applied to, as well as in the fields where it's effectiveness has been proven. Third, the LSTM algorithm seems to be unsuitable for binary classification problems because the training time is too long compared to the performance improvement. From these results, we can confirm that some of the deep learning algorithms can be applied to solve business binary classification problems.

A Comparison of American and Korean Experimental Studies on Positive Behavior Support within a Multi-Tiered System of Supports (다층지원체계 중심의 긍정적 행동지원에 관한 한국과 미국의 실험연구 비교분석)

  • Chang, Eun Jin;Lee, Mi-Young;Jeong, Jae-Woo;ChoBlair, Kwang-Sun;Lee, Donghyung;Song, Wonyoung;Han, Miryeung
    • Korean Journal of School Psychology
    • /
    • v.15 no.3
    • /
    • pp.399-431
    • /
    • 2018
  • The purpose of this study was to summarize the empirical literature on implementation of positive behavior support (PBS) within a multi-tiered system of supports in American and Korean schools and to compare its key features and outcomes in an attempt to suggest future directions for development of a Korean school-wide PBS model and implementation manuals as well as directions for future research. Twenty-four American articles and 11 Korean articles (total 35 articles) that reported the outcomes of implementation of PBS at a tier 1 and/or tier 2, or tier 3 level and that met established inclusion criteria were analyzed using systematic procedures. Comparisons were made in the areas of key features and outcomes of PBS in addition to general methodology (e.g., participants, design, implementation duration, dependent measures) at each tier of PBS. The results indicated that positive outcomes for student behavior and other areas were reported across tiers in all American and Korean studies. At the tier 1 level, teaching expectations and rules were the primary focus of PBS in American and Korean schools. However, Korean schools focused on modifying the school and classroom environments and teaching social skills whereas American schools focused on teacher training on standardized interventions or curricular by experts and teacher support during implementation of PBS. At the tier 2 level, more American studies reported implementation of tier 2 interventions within school-wide PBS, and Check/In Check/Out (CICO) was found to be the most commonly used tier 2 intervention. The results also indicated that in comparison to Korean schools, American schools were more likely to use systematic screening tools or procedures to identify students who need tier 2 interventions and more likely to promote parental involvement with implementing interventions. At the tier 3 level, more Korean studies reported the outcomes of individualized interventions, but more American studies reported that designing individualized intervention plans based on comprehensive functional behavior assessment results and establishment of systematic screening systems were focused when implementing individualized interventions. Furthermore, few Korean studies reported the assessment of procedural integrity, social validity, and contextual fit in implementing PBS across tiers, indicating the need for development of valid instruments that could be used in assessing these areas. Based on these results, limitations of the study and suggestions for future research are discussed.

Wildfire Severity Mapping Using Sentinel Satellite Data Based on Machine Learning Approaches (Sentinel 위성영상과 기계학습을 이용한 국내산불 피해강도 탐지)

  • Sim, Seongmun;Kim, Woohyeok;Lee, Jaese;Kang, Yoojin;Im, Jungho;Kwon, Chunguen;Kim, Sungyong
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.5_3
    • /
    • pp.1109-1123
    • /
    • 2020
  • In South Korea with forest as a major land cover class (over 60% of the country), many wildfires occur every year. Wildfires weaken the shear strength of the soil, forming a layer of soil that is vulnerable to landslides. It is important to identify the severity of a wildfire as well as the burned area to sustainably manage the forest. Although satellite remote sensing has been widely used to map wildfire severity, it is often difficult to determine the severity using only the temporal change of satellite-derived indices such as Normalized Difference Vegetation Index (NDVI) and Normalized Burn Ratio (NBR). In this study, we proposed an approach for determining wildfire severity based on machine learning through the synergistic use of Sentinel-1A Synthetic Aperture Radar-C data and Sentinel-2A Multi Spectral Instrument data. Three wildfire cases-Samcheok in May 2017, Gangreung·Donghae in April 2019, and Gosung·Sokcho in April 2019-were used for developing wildfire severity mapping models with three machine learning algorithms (i.e., Random Forest, Logistic Regression, and Support Vector Machine). The results showed that the random forest model yielded the best performance, resulting in an overall accuracy of 82.3%. The cross-site validation to examine the spatiotemporal transferability of the machine learning models showed that the models were highly sensitive to temporal differences between the training and validation sites, especially in the early growing season. This implies that a more robust model with high spatiotemporal transferability can be developed when more wildfire cases with different seasons and areas are added in the future.

Efficiently Development Plan from the User's Need Analysis of the Army Tactical C4I(ATCIS) System (지상전술 C4I(ATCIS)체계 운용자 요구분석을 통한 효율적 발전 방안)

  • Park, Chang-Woon;Yang, Hae-Sool
    • The Journal of the Korea Contents Association
    • /
    • v.8 no.5
    • /
    • pp.246-259
    • /
    • 2008
  • This study was to minimize the trial and error in the primary step of the C4I system(ATCIS) of the each army corps on the front line, and test the economy and efficiency was tested by reviewing related papers and the system characteristics of other countries. The relationship was researched by analyzing the collected survey data and survey data related to the user's requirement level such as the army standards, that is, commonality, timeliness, simplification, automaticity, field availability and viability, multi-stage security and interoperability, unification. The result showed that the C4I system was efficiently operated through the system reliability for the specification of the system and operation manual, maneuverability and security, adaptability of the war field and system support and management, and good education and training about system operation, and less system maintenance and supplementary element. As a result, the development plan confirmed that the continuous operator education and the construction of the maintenance, and the upgrade digitalization(C4ISR+D) with the korean characteristics based on IT of network systems, and system development of the measurement model of the operator performance must be continuously supplemented in the near future.

Argovian Cantonal School in Aarau and Albert Einstein I (칸톤학교 아라우와 아인슈타인 I)

  • Chung, Byung Hoon
    • Journal of The Korean Association For Science Education
    • /
    • v.39 no.2
    • /
    • pp.233-248
    • /
    • 2019
  • This study shows that the Argovian Cantonal School in Aarau, Switzerland, which Albert Einstein attended from 1895 to 1896, had been closely related to the ideological education controversy in German Gymnasium throughout the 19th century. Due to this controversy, Einstein hardly received a formal science education in Bavaria. Despite the lack of formal education in Germany, he had a habit of self-studying from an early age and continued with this practice all through his life. He had a hard time at the authoritarian school in Munich, but at the democratic school in Aarau, where freedom and autonomy were secured, he was able to achieve emotional stability. For a long time, the city Aarau prevailed as a location of tolerance and multi-culturalism, without religious, regional, and national discrimination. This was possible due to the influence of external and unrestricted social mobility, as well as the Enlightenment from France. As a result, this small public school was able to acquire a mass of qualified human resources from outside of Switzerland. As a consequence of the controversy regarding the educational ideology, the Cantonal School adopted practical thoughts and the Enlightenment that fit the spirit of the times. The school consisted of two independent educational organizations: the Gymnasium, where the 'neuhumanistsch' education for the elite training was conducted, and the 'Gewerbeschule', where a more realistic education system was set up to suit the citizen life. In particular, after 1835, the Gymnasium changed gradually from the pure humanistic education to the 'utraquistisch' ways by introducing practical subjects such as natural history. Thereafter, the Cantonal School became an institution that was able to achieve a genuine humanity, academic, and civic life education. Einstein, who attended the 'technische Abteilung' of the 'Gewerbeschule,' considered this school as a role model of an institution that realized true democracy, and that left an unforgettable impression on him.

A Comparison of Pan-sharpening Algorithms for GK-2A Satellite Imagery (천리안위성 2A호 위성영상을 위한 영상융합기법의 비교평가)

  • Lee, Soobong;Choi, Jaewan
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.40 no.4
    • /
    • pp.275-292
    • /
    • 2022
  • In order to detect climate changes using satellite imagery, the GCOS (Global Climate Observing System) defines requirements such as spatio-temporal resolution, stability by the time change, and uncertainty. Due to limitation of GK-2A sensor performance, the level-2 products can not satisfy the requirement, especially for spatial resolution. In this paper, we found the optimal pan-sharpening algorithm for GK-2A products. The six pan-sharpening methods included in CS (Component Substitution), MRA (Multi-Resolution Analysis), VO (Variational Optimization), and DL (Deep Learning) were used. In the case of DL, the synthesis property based method was used to generate training dataset. The process of synthesis property is that pan-sharpening model is applied with Pan (Panchromatic) and MS (Multispectral) images with reduced spatial resolution, and fused image is compared with the original MS image. In the synthesis property based method, fused image with desire level for user can be produced only when the geometric characteristics between the PAN with reduced spatial resolution and MS image are similar. However, since the dissimilarity exists, RD (Random Down-sampling) was additionally used as a way to minimize it. Among the pan-sharpening methods, PSGAN was applied with RD (PSGAN_RD). The fused images are qualitatively and quantitatively validated with consistency property and the synthesis property. As validation result, the GSA algorithm performs well in the evaluation index representing spatial characteristics. In the case of spectral characteristics, the PSGAN_RD has the best accuracy with the original MS image. Therefore, in consideration of spatial and spectral characteristics of fused image, we found that PSGAN_RD is suitable for GK-2A products.