• Title/Summary/Keyword: 추론 검증

Search Result 460, Processing Time 0.028 seconds

An Effective Segmentation Scheme for Korean Sentence Classification tasks (한국어 문장 분류 태스크에서의 효과적 분절 전략)

  • Kim, Jin-Sung;Kim, Gyeong-Min;Son, Junyoung;Lim, Heuiseok
    • Annual Conference on Human and Language Technology
    • /
    • 2021.10a
    • /
    • pp.173-177
    • /
    • 2021
  • 분절을 통한 양질의 입력 자질을 구성하는 것은 언어모델의 문장에 대한 이해도를 높이기 위한 필수적인 단계이다. 분절은 문장의 의미를 이해하는 데 있어 중요한 역할을 하기 때문이다. 따라서, 한국어 문장 분류 태스크를 수행함에 있어 한국어의 특징에 맞는 분절 기법을 선택하는 것은 필수적이다. 명확한 판단 기준 마련을 위해, 우리는 한국어 문장 분류 태스크에서 가장 효과적인 분절 기법이 무엇인지 감성 분석, 자연어 추론, 텍스트 간 의미적 유사성 판단 태스크를 통해 검증한다. 이 때 비교할 분절 기법의 유형 분류 기준은 언어학적 단위에 따라 어절, 형태소, 음절, 자모 네 가지로 설정하며, 분절 기법 외의 다른 실험 환경들은 동일하게 설정하여 분절 기법이 문장 분류 성능에 미치는 영향만을 측정하도록 한다. 실험 결과에 따르면 자모 단위의 분절 기법을 적용한 모델이 평균적으로 가장 높은 성능을 보여주며, 반복 실험 간 편차가 적어 일관적인 성능 결과를 기록함을 확인할 수 있다.

  • PDF

Ko-ATOMIC 2.0: Constructing Commonsense Knowledge Graph in Korean (Ko-ATOMIC 2.0: 한국어 상식 지식 그래프 구축)

  • Jaewook Lee;Jaehyung Seo;Dahyun Jung;Chanjun Park;Imatitikua Aiyanyo;Heuiseok Lim
    • Annual Conference on Human and Language Technology
    • /
    • 2023.10a
    • /
    • pp.319-323
    • /
    • 2023
  • 일반 상식 기반의 지식 그래프는 대규모 코퍼스에 포함되어 있는 일반 상식을 수집하고 구조화하는 지식의 표현 방법이다. 일반 상식 기반의 지식 그래프는 코퍼스 내에 포함되어 있는 다양한 일반 상식의 형태와 관계를 모델링하며, 주로 질의응답 시스템, 상식 추론 등의 자연어처리 하위 작업에 활용할 수 있다. 가장 잘 알려진 일반 상식 기반의 지식 그래프로는 ConceptNet [1], ATOMIC [2]이 있다. 하지만 한국어 기반의 일반 상식 기반의 지식 그래프에 대한 연구가 존재했지만, 자연어처리 태스크에 활용하기에는 충분하지 않다. 본 연구에서는 대규모 언어 모델과 프롬프트의 활용을 통해 한국어 일반 상식 기반의 지식 그래프를 효과적으로 구축하는 방법론을 제시한다. 또한, 제안하는 방법론으로 구축한 지식 그래프와 기존의 한국어 상식 그래프의 품질을 양적, 질적으로 검증한다.

  • PDF

Necessity of Standardization and Standardized Method for Substances Accounting of Environmental Liability Insurance (환경책임보험 배출 물질 정산의 표준화 필요성 및 산출방법 표준화)

  • Park, Myeongnam;Kim, Chang-wan;Shin, Dongil
    • Journal of the Korean Institute of Gas
    • /
    • v.22 no.5
    • /
    • pp.1-17
    • /
    • 2018
  • Related incidents and accidents are frequent after 2000 years, such as the outbreak of the Taian peninsula crude oil spillage and Gumi hydrofluoric acid leakage accident. In the wake of such environmental pollution accidents, Consensus has been formed to enact legislation on liability for the compensation of environmental pollution in 2014 and the rescue, and has been in force since January 2016. Therefore, in the domestic insurance industry, the introduced environmental liability insurance system needs to be managed through the standardization formula of a new insurance model for managing the environmental risk. This study has been carried out by the emergence of a safe insurance model with a risky nature of the risk type, which is one of the services of the knowledge base. The verification of the six assurance media on the occurrence of environmental pollution such as chemical, waste, marine, soil, etc. is expressed through semantic interoperability through this possible ontology. The insurance model was designed and presented by deducing the relationship between the amount of money and the amount of money that was written in the area of existing expertise, In order to exclude the possible consequences, the concept of abstract is conceptualized in the form of a customer, and a plan for the future development of an ontology-based decision support system is proposed to reduce the cost and resources consumed every year. It is expected that standardization of the verification standard of the mass of mass will minimize errors and reduce the time and resources required for verification.

Identification of Wells Effect and Effects of Risk Perception of Wrong Verdict (평결 판단에서 웰스효과의 확인과 평결 오류 위험성 지각의 영향)

  • Dong-Heon Seok;Mi-Jin Kim
    • Korean Journal of Culture and Social Issue
    • /
    • v.19 no.2
    • /
    • pp.159-178
    • /
    • 2013
  • The purpose of this study was to 1) replicate the Wells effect(i.e., reluctance to rule against the Defendant solely on the basis of probabilistic evidence) in Korea and 2) examine the validity of an Alternative explanation(i.e., perception of risk of wrong verdict). In study 1(n=46), mock jurors in the tire-tracks condition were reluctant to rule against the defendant based on their perceived probability and this pattern was not resulted in the tire-tracks-belief condition. Therefore, the Wells effect was replicated in Korea. In study 2(n=70), we manipulated the participants' perception of risk of wrong verdict. That is, participants who were assigned in the high risk perception of wrong verdict were informed that if the defendant were found guilty, the defendant would get considerable demage both in finance and reputation of the company. Participants in the low risk perception of wrong verdict condition were informed that these demage would not be great. The results revealed that the Wells effect was pronounced in the high risk perception of wrong verdict condition. That is, participants were more reluctant to rule against the defendant when they perceive the significance of the result of wrong verdict as high. Limitations of the study and the directions for future study were discussed.

  • PDF

Response Modeling for the Marketing Promotion with Weighted Case Based Reasoning Under Imbalanced Data Distribution (불균형 데이터 환경에서 변수가중치를 적용한 사례기반추론 기반의 고객반응 예측)

  • Kim, Eunmi;Hong, Taeho
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.29-45
    • /
    • 2015
  • Response modeling is a well-known research issue for those who have tried to get more superior performance in the capability of predicting the customers' response for the marketing promotion. The response model for customers would reduce the marketing cost by identifying prospective customers from very large customer database and predicting the purchasing intention of the selected customers while the promotion which is derived from an undifferentiated marketing strategy results in unnecessary cost. In addition, the big data environment has accelerated developing the response model with data mining techniques such as CBR, neural networks and support vector machines. And CBR is one of the most major tools in business because it is known as simple and robust to apply to the response model. However, CBR is an attractive data mining technique for data mining applications in business even though it hasn't shown high performance compared to other machine learning techniques. Thus many studies have tried to improve CBR and utilized in business data mining with the enhanced algorithms or the support of other techniques such as genetic algorithm, decision tree and AHP (Analytic Process Hierarchy). Ahn and Kim(2008) utilized logit, neural networks, CBR to predict that which customers would purchase the items promoted by marketing department and tried to optimized the number of k for k-nearest neighbor with genetic algorithm for the purpose of improving the performance of the integrated model. Hong and Park(2009) noted that the integrated approach with CBR for logit, neural networks, and Support Vector Machine (SVM) showed more improved prediction ability for response of customers to marketing promotion than each data mining models such as logit, neural networks, and SVM. This paper presented an approach to predict customers' response of marketing promotion with Case Based Reasoning. The proposed model was developed by applying different weights to each feature. We deployed logit model with a database including the promotion and the purchasing data of bath soap. After that, the coefficients were used to give different weights of CBR. We analyzed the performance of proposed weighted CBR based model compared to neural networks and pure CBR based model empirically and found that the proposed weighted CBR based model showed more superior performance than pure CBR model. Imbalanced data is a common problem to build data mining model to classify a class with real data such as bankruptcy prediction, intrusion detection, fraud detection, churn management, and response modeling. Imbalanced data means that the number of instance in one class is remarkably small or large compared to the number of instance in other classes. The classification model such as response modeling has a lot of trouble to recognize the pattern from data through learning because the model tends to ignore a small number of classes while classifying a large number of classes correctly. To resolve the problem caused from imbalanced data distribution, sampling method is one of the most representative approach. The sampling method could be categorized to under sampling and over sampling. However, CBR is not sensitive to data distribution because it doesn't learn from data unlike machine learning algorithm. In this study, we investigated the robustness of our proposed model while changing the ratio of response customers and nonresponse customers to the promotion program because the response customers for the suggested promotion is always a small part of nonresponse customers in the real world. We simulated the proposed model 100 times to validate the robustness with different ratio of response customers to response customers under the imbalanced data distribution. Finally, we found that our proposed CBR based model showed superior performance than compared models under the imbalanced data sets. Our study is expected to improve the performance of response model for the promotion program with CBR under imbalanced data distribution in the real world.

Development of Expert system for Plant Construction Project Management (플랜트 건설 공사를 위한 사업관리 전문가 시스템의 개발)

  • 김우주;최대우;김정수
    • Journal of Information Technology Application
    • /
    • v.2 no.1
    • /
    • pp.1-24
    • /
    • 2000
  • Project management in the Construction field inherently has more uncertainty and more risks relative to ones from other area. This is the very reason for why project management is recognized as the important task to construction companies. For getting better performance in the project management, we need a system that keeps the consistencies in a automatic or semi-automatic manner through the project management stages like as project definition stage, project planning stage, project design and implementation stage. But since the early stages such as definition and planning stages has many unstructured features and also are dependent to unique expertise or experience of a specific company, we have difficulty providing systematic support for the task of these stages. This kind of problem becomes harder to solve especially in the plant construction domain that is our target domain. Therefore, in this paper, we propose and also implement a systematic approach to resolve the problem mentioned for the early project management stages in the plant construction domain. The results of our approach can be used not only for the purpose of the early project management stages but also can be used automatically as an input to commercial project management tools for the middle project management stages. Because of doing in this way, the construction project can be consistently managed from the definition to implementation stage in a seamless manner. For achieving this purpose, we adopt knowledge based inference, CBR, and neural network as major methodologies and we also applied our approach to two real world cases, power plant and drainage treatment plant cases from a leading construction company in Korea. Since these two application cases showed us very successful results, we can say our approach was validated successfully to the plant construction area. Finally, we believe our approach will contribute to many project management problems from more broader construction area.

  • PDF

Exploring Pre-Service Science Teachers' Positioning and Epistemic Understanding in a Course about Designing Inquiry-Based Lessons (탐구 수업 설계 강좌에서 예비 중등 과학 교사의 위치짓기와 인식적 이해 탐색)

  • Ha, Heesoo;Kang, Eunhee;Kim, Heui-Baik
    • Journal of The Korean Association For Science Education
    • /
    • v.40 no.3
    • /
    • pp.307-320
    • /
    • 2020
  • This study explores how the positioning of two pre-service science teachers (PSTs) is reflected in their different epistemic understandings of inquiry-based lessons. We collected the PSTs' products during their design and enactment of an inquiry-based lesson and recorded their practices in the enacted lesson. Interviews were recorded and transcribed for analysis. The results indicate that one PST, Dohyung was positioned as a subject of evaluation throughout the course and the other, Jinwoo, was positioned as a preservice teacher and a subject of evaluation. Their positions were reflected in their epistemic understandings of inquiry-based lessons, which were developed when designing these lessons. During lesson design, both PSTs showed a shared understanding; they explained inquiry-based lessons as students setting and evaluating hypotheses under teachers' guidance. However, as they faced unexpected situations during lesson enactment, they developed different epistemic understandings. To receive a good grade, Dohyung showed a strong preference for anticipating situations that could occur in class and planning responses to them. He understood inquiry-based lessons as ones in which students conduct experiments to produce results expected by the teacher. On the other hand, Jinwoo emphasized the reasoning process based on students' prior knowledge and explained inquiry-based lessons as ones in which students construct new knowledge through a scientific reasoning process based on their knowledge. The findings of this study will contribute to developing strategies to support PSTs' development of their epistemic understandings of knowledge construction in inquiry-based lessons.

S-FDS : a Smart Fire Detection System based on the Integration of Fuzzy Logic and Deep Learning (S-FDS : 퍼지로직과 딥러닝 통합 기반의 스마트 화재감지 시스템)

  • Jang, Jun-Yeong;Lee, Kang-Woon;Kim, Young-Jin;Kim, Won-Tae
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.4
    • /
    • pp.50-58
    • /
    • 2017
  • Recently, some methods of converging heterogeneous fire sensor data have been proposed for effective fire detection, but the rule-based methods have low adaptability and accuracy, and the fuzzy inference methods suffer from detection speed and accuracy by lack of consideration for images. In addition, a few image-based deep learning methods were researched, but it was too difficult to rapidly recognize the fire event in absence of cameras or out of scope of a camera in practical situations. In this paper, we propose a novel fire detection system combining a deep learning algorithm based on CNN and fuzzy inference engine based on heterogeneous fire sensor data including temperature, humidity, gas, and smoke density. we show it is possible for the proposed system to rapidly detect fire by utilizing images and to decide fire in a reliable way by utilizing multi-sensor data. Also, we apply distributed computing architecture to fire detection algorithm in order to avoid concentration of computing power on a server and to enhance scalability as a result. Finally, we prove the performance of the system through two experiments by means of NIST's fire dynamics simulator in both cases of an explosively spreading fire and a gradually growing fire.

Development of an Artificial Neural Expert System for Rational Determination of Lateral Earth Pressure Coefficient (합리적인 측압계수 결정을 위한 인공신경 전문가 시스템의 개발)

  • 문상호;문현구
    • Journal of the Korean Geotechnical Society
    • /
    • v.15 no.1
    • /
    • pp.99-112
    • /
    • 1999
  • By using 92 values of lateral earth pressure coefficient(K) measured in Korea, the tendency of K with varying depth is analyzed and compared with the range of K defined by Hoek and Brown. The horizontal stress is generally larger than the vertical stress in Korea : About 84 % of K values are above 1. In this study, the theory of elasto-plasticity is applied to analyze the variation of K values, and the results are compared with those of numerical analysis. This reveals that the erosion, sedimentation and weathering of earth crust are important factors in the determination of K values. Surface erosion, large lateral pressure and good rock mass increase the K values, but sedimentation decreases the K values. This study enable us to analyze the effects of geological processes on the K values, especially at shallow depth where underground excavation takes place. A neural network expert system using multi-layer back-propagation algorithm is developed to predict the K values. The neural network model has a correlation coefficient above 0.996 when it is compared with measured data. The comparison with 9 measured data which are not included in the back-propagation learning has shown an average inference error of 20% and the correlation coefficient above 0.95. The expert system developed in this study can be used for reliable determination of K values.

  • PDF

A Profit Prediction Model in the International Construction Market - focusing on Small and Medium Sized Construction Companies (CBR을 활용한 해외건설 수익성 예측 모델 개발 - 중소·중견기업을 중심으로 -)

  • Hwang, Geon Wook;Jang, woosik;Park, Chan-Young;Han, Seung-Heon;Kim, Jong Sung
    • Korean Journal of Construction Engineering and Management
    • /
    • v.16 no.4
    • /
    • pp.50-59
    • /
    • 2015
  • While the international construction industry for Korean companies have grown in market size exponentially in the recent years, the profit rate of small and medium sized construction companies (SMCCs) are incomparably lower than those of large construction companies. Furthermore, small and medium size companies, especially subcontractor, lacks the judgement of project involvement appropriateness, which leads to an unpredictable profit rate. Therefore, this research aims to create a profit rate prediction model for the international construction project focusing on SMCCs. First, the factors that influence the profit rate and the area of profit zone are defined by using a total of 8,637 projects since the year 1965. Seconds, an extensive literature review is conducted to derive 10 influencing factors. Multiple regression analysis and corresponding judgement technique are used to derive the weight of each factor. Third, cased based reasoning (CBR) methodology is applied to develop the model for profit rate analysis in the project participation review stage. Using 120 validation data set, the developed model showed 11% (14 data sets) of error rate for type 1 and type 2 error. In utilizing the result, project decision makers are able to make decision based on authentic results instead of intuitive based decisions. The model additionally give guidance to the Korean subcontractors when advancing into the international construction based on the model result that shows the profit distribution and checks in advance for the quality of the project to secure a sound profit in each project.