통합 검색 | Korea Science

Improving methods for normalizing biomedical text entities with concepts from an ontology with (almost) no training data at BLAH5 the CONTES

Ferre, Arnaud;Ba, Mouhamadou;Bossy, Robert
- Genomics & Informatics
- /
- 제17권2호
- /
- pp.20.1-20.5
- /
- 2019
Entity normalization, or entity linking in the general domain, is an information extraction task that aims to annotate/bind multiple words/expressions in raw text with semantic references, such as concepts of an ontology. An ontology consists minimally of a formally organized vocabulary or hierarchy of terms, which captures knowledge of a domain. Presently, machine-learning methods, often coupled with distributional representations, achieve good performance. However, these require large training datasets, which are not always available, especially for tasks in specialized domains. CONTES (CONcept-TErm System) is a supervised method that addresses entity normalization with ontology concepts using small training datasets. CONTES has some limitations, such as it does not scale well with very large ontologies, it tends to overgeneralize predictions, and it lacks valid representations for the out-of-vocabulary words. Here, we propose to assess different methods to reduce the dimensionality in the representation of the ontology. We also propose to calibrate parameters in order to make the predictions more accurate, and to address the problem of out-of-vocabulary words, with a specific method.
https://doi.org/10.5808/GI.2019.17.2.e20 인용 PDF KSCI

Digital Transformation in Summer Training Process at King Abdulaziz University: Action Design Research in Practice

Bahaddad, Adel;Bitar, Hind
- International Journal of Computer Science & Network Security
- /
- 제22권7호
- /
- pp.171-180
- /
- 2022
In the knowledge development of online assessment in learning management systems (LMSs), many assessments are evaluated weekly in the summer training course for undergraduate students in the Faculty of Computing and Information Technology at King Abdul-Aziz University in Saudi Arabia. The number of performance assessments in the summer training course reaches 15 weeks. Many of them, however, are sent or done informally or through unreliable ways and cannot be verified by third parties. Therefore, applying the concept of digital transformation is essential. This research study reported herein used the action design research (ADR) method to build a new information technology system that could assist in the digital transformation. An electronic platform was designed, developed, implemented, and evaluated using the ADR method so that the main people involved in the summer training process (i.e., students, academic supervisors, and administrators) would have a high level of satisfaction with it. The study was conducted on 452 students, 105 academic supervisors, and 15 administrative staff and was conducted during the summer semester of 2020. All the training processes were digitally transformed and automated to control and raise the level and reliability of the training. All involved people were satisfied, thus, shifting the process to be in a digital form assist in achieving the high-level goal.
https://doi.org/10.22937/IJCSNS.2022.22.7.21 인용 PDF KSCI

Comparison of machine learning algorithms to evaluate strength of concrete with marble powder

Sharma, Nitisha;Upadhya, Ankita;Thakur, Mohindra S.;Sihag, Parveen
- Advances in materials Research
- /
- 제11권1호
- /
- pp.75-90
- /
- 2022
In this paper, functionality of soft computing algorithms such as Group method of data handling (GMDH), Random forest (RF), Random tree (RT), Linear regression (LR), M5P, and artificial neural network (ANN) have been looked out to predict the compressive strength of concrete mixed with marble powder. Assessment of result suggests that, the overall performance of ANN based model gives preferable results over the different applied algorithms for the estimate of compressive strength of concrete. The results of coefficient of correlation were maximum in ANN model (0.9139) accompanied through RT with coefficient of correlation (CC) value 0.8241 and minimum root mean square error (RMSE) value of ANN (4.5611) followed by RT with RMSE (5.4246). Similarly, other evaluating parameters like, Willmott's index and Nash-sutcliffe coefficient value of ANN was 0.9458 and 0.7502 followed by RT model (0.8763 and 0.6628). The end result showed that, for both subsets i.e., training and testing subset, ANN has the potential to estimate the compressive strength of concrete. Also, the results of sensitivity suggest that the water-cement ratio has a massive impact in estimating the compressive strength of concrete with marble powder with ANN based model in evaluation with the different parameters for this data set.
https://doi.org/10.12989/amr.2022.11.1.075 인용 KSCI

A Classification Model for Predicting the Injured Body Part in Construction Accidents in Korea

Lim, Jiseon;Cho, Sungjin;Kang, Sanghyeok
- 국제학술발표논문집
- /
- The 9th International Conference on Construction Engineering and Project Management
- /
- pp.230-237
- /
- 2022
It is difficult to predict industrial accidents in the construction industry because many accident factors, such as human-related factors and environment-related factors, affect the accidents. Many studies have analyzed the severity of injuries and types of accidents; however, there were few studies on the prediction of injured body parts. This study aims to develop a classification model to predict the part of the injured body based on accident-related factors. Construction accident cases from June 2018 to July 2021 provided by the Korea Construction Safety Management Integrated Information were collected through web crawling and then preprocessed. A naïve Bayes classifier, one of the supervised learning algorithms, was employed to construct a classification model of the injured body part, which has four categories: 1) torso, 2) upper extremity, 3) head, and 4) lower extremity. The predictor variables are accident type, type of work, facility type, injury source, and activity type. As a result, the average accuracy for each injured body part was 50.4%. The accuracy of the upper extremity and lower extremity was relatively higher than the cases of the torso and head. Unlike the other classifications, such as spam mail filtering, a naïve Bayes classifier does not provide a good classification performance in construction accidents. The reasons are discussed in the study. Based on the results of this study, more detailed guidelines for construction safety management can be provided, which help establish safety measures at the construction site.
PDF

SHAP를 이용한 이미지 어노테이션 자동화 프로세스 연구 (A Study on Image Annotation Automation Process using SHAP for Defect Detection)

정진형;심현수;김용수
- 산업경영시스템학회지
- /
- 제46권1호
- /
- pp.76-83
- /
- 2023
Recently, the development of computer vision with deep learning has made object detection using images applicable to diverse fields, such as medical care, manufacturing, and transportation. The manufacturing industry is saving time and money by applying computer vision technology to detect defects or issues that may occur during the manufacturing and inspection process. Annotations of collected images and their location information are required for computer vision technology. However, manually labeling large amounts of images is time-consuming, expensive, and can vary among workers, which may affect annotation quality and cause inaccurate performance. This paper proposes a process that can automatically collect annotations and location information for images using eXplainable AI, without manual annotation. If applied to the manufacturing industry, this process is thought to save the time and cost required for image annotation collection and collect relatively high-quality annotation information.
https://doi.org/10.11627/jksie.2023.46.1.076 인용 PDF

토픽모델링을 활용한 무역분야 연구동향 분석 (A Study on the Research Trends in Int'l Trade Using Topic modeling)

이지훈;김정숙
- 무역학회지
- /
- 제45권3호
- /
- pp.55-69
- /
- 2020
This study examines the research trends and knowledge structure of international trade studies using topic modeling method, which is one of the main methodologies of text mining. We collected and analyzed English abstracts of 1,868 papers of three Korean major journals in the area of international trade from 2003 to 2019. We used the Latent Dirichlet Allocation(LDA), an unsupervised machine learning algorithm to extract the latent topics from the large quantity of research abstracts. 20 topics are identified without any prior human judgement. The topics reveal topographical maps of research in international trade and are representative and meaningful in the sense that most of them correspond to previously established sub-topics in trade studies. Then we conducted a regression analysis on the document-topic distributions generated by LDA to identify hot and cold topics. We discovered 2 hot topics(internationalization capacity and performance of export companies, economic effect of trade) and 2 cold topics(exchange rate and current account, trade finance). Trade studies are characterized as a interdisciplinary study of three agendas(i.e. international economy, International Business, trade practice), and 20 topics identified can be grouped into these 3 agendas. From the estimated results of the study, we find that the Korean government's active pursuit of FTA and consequent necessity of capacity building in Korean export firms lie behind the popularity of topic selection by the Korean researchers in the area of int'l trade.
https://doi.org/10.22659/KTRA.2020.45.3.55 인용 PDF

Prediction of karst sinkhole collapse using a decision-tree (DT) classifier

Boo Hyun Nam;Kyungwon Park;Yong Je Kim
- Geomechanics and Engineering
- /
- 제36권5호
- /
- pp.441-453
- /
- 2024
Sinkhole subsidence and collapse is a common geohazard often formed in karst areas such as the state of Florida, United States of America. To predict the sinkhole occurrence, we need to understand the formation mechanism of sinkhole and its karst hydrogeology. For this purpose, investigating the factors affecting sinkholes is an essential and important step. The main objectives of the presenting study are (1) the development of a machine learning (ML)-based model, namely C5.0 decision tree (C5.0 DT), for the prediction of sinkhole susceptibility, which accounts for sinkhole/subsidence inventory and sinkhole contributing factors (e.g., geological/hydrogeological) and (2) the construction of a regional-scale sinkhole susceptibility map. The study area is east central Florida (ECF) where a cover-collapse type is commonly reported. The C5.0 DT algorithm was used to account for twelve (12) identified hydrogeological factors. In this study, a total of 1,113 sinkholes in ECF were identified and the dataset was then randomly divided into 70% and 30% subsets for training and testing, respectively. The performance of the sinkhole susceptibility model was evaluated using a receiver operating characteristic (ROC) curve, particularly the area under the curve (AUC). The C5.0 model showed a high prediction accuracy of 83.52%. It is concluded that a decision tree is a promising tool and classifier for spatial prediction of karst sinkholes and subsidence in the ECF area.
https://doi.org/10.12989/gae.2024.36.5.441 인용

메타인지 발달을 위한 인지적 도제 기반의 로봇 프로그래밍 교수.학습 모형 개발 (Development of a Robot Programming Instructional Model based on Cognitive Apprenticeship for the Enhancement of Metacognition)

연혜진;조미헌
- 정보교육학회논문지
- /
- 제18권2호
- /
- pp.225-234
- /
- 2014
로봇 프로그래밍은 주어진 과제를 해결하기 위한 알고리즘을 계획하고, 그 알고리즘을 구현하며, 그 결과를 로봇이라는 매체를 통해서 쉽게 확인하고 오류를 수정할 수 있도록 한다. 따라서 로봇 프로그래밍은 반성적 사고에 기반을 둔 문제해결의 과정이며, 학생들의 메타인지와 밀접히 관련된다. 이에 본 연구는 학생의 메타인지 발달을 위한 로봇 프로그래밍 교수 학습 모형을 개발하는 것을 목적으로 수행되었다. 로봇 프로그래밍 교수 학습의 단계를 '학습과제 탐구', '교사의 모델링', '과제 수행 계획 및 시각화', '과제 수행', '자기 평가 및 강화' 등과 같은 5가지로 나누고, 각 단계의 활동들을 메타인지 주요 전략들(계획, 모니터링, 조절, 평가)와 연계하였다. 또한 학생들의 프로그래밍 활동과 메타인지 전략의 활용을 지원하기 위하여 인지적 도제를 기반으로 '모델링', '코칭', '스캐폴딩'과 같은 전략들을 교수 학습 모델과 연계하여 명시하였다. 이와 더불어서, 메타인지 활동을 지원하기 위해서 자기질문법을 도입하여, 학생들이 로봇 프로그래밍 활동의 각 단계별로 사용할 수 있는 자기질문 등을 제시하였다.
https://doi.org/10.14352/jkaie.2014.18.2.225 인용 PDF KSCI

텍스트 마이닝을 활용한 지역 특성 기반 도시재생 유형 추천 시스템 제안 (Suggestion of Urban Regeneration Type Recommendation System Based on Local Characteristics Using Text Mining)

김익준;이준호;김효민;강주영
- 지능정보연구
- /
- 제26권3호
- /
- pp.149-169
- /
- 2020
현 정부의 주요 국책사업 중 하나인 도시재생 뉴딜사업은 매년 100 곳씩, 5년간 500곳을대상으로 50조를 투자하여 낙후된 지역을 개발하는 것으로 언론과 지자체의 높은 이목이 집중되고 있다. 그러나, 현재 이 사업모델은 면적 규모에 따라 "우리동네 살리기, 주거정비지원형, 일반근린형, 중심시가지형, 경제기반형" 등 다섯 가지로 나뉘어 추진되어 그 지역 본래의 특성을 반영하지 못하고 있다. 국내 도시재생 성공 키워드는 "주민 참여", "지역특화" "부처협업", "민관협력"이다. 성공 키워드에 따르면 지자체에서 정부에게 도시재생 사업을 제안할 때 지역주민, 민간기업의 도움과 함께 도시의 특성을 정확히 이해하고 도시의 특성에 어울리는 방향으로 사업을 추진하는 것이 가장 중요하다는 것을 알 수 있다. 또한 도시재생 사업 후 발생하는 부작용 중 하나인 젠트리피케이션 문제를 고려하면 그 지역 특성에 맞는 도시재생 유형을 선정하여 추진하는 것이 중요하다. 이에 본 연구는 '도시재생 뉴딜 사업' 방법론의 한계점을 보완하기 위해, 기존 서울시가 지역 특성에 기반하여 추진하고 있는 "2025 서울시 도시재생 전략계획"의 도시재생 유형을 참고하여 도시재생 사업지에 맞는 도시재생 유형을 추천하는 시스템을 머신러닝 알고리즘을 활용하여 제안하고자 한다. 서울시 도시재생 유형은 "저이용저개발, 쇠퇴낙후, 노후주거, 역사문화자원 특화" 네 가지로 분류된다 (Shon and Park, 2017). 지역 특성을 파악하기 위해 총 4가지 도시재생 유형에 대해 사업이 진행된 22개의 지역에 대한 뉴스 미디어 10만여건의 텍스트 데이터를 수집하였다. 수집된 텍스트를 이용하여 도시재생 유형에 따른 지역별 주요 키워드를 도출하고 토픽모델링을 수행하여 유형별 차이가 있는 지 탐색해 보았다. 다음 단계로 주어진 텍스트를 기반으로 도시재생 유형을 추천하는 추천시스템 구축을 위해 텍스트 데이터를 벡터로 변환하여 머신러닝 분류모델을 개발하였고, 이를 검증한 결과 97% 정확도를 보였다. 따라서 본 연구에서 제안하는 추천 시스템은 도시재생 사업을 진행하는 과정에서 신규 사업지의 지역 특성에 기반한 도시재생 유형을 추천할 수 있을 것으로 기대된다.
https://doi.org/10.13088/jiis.2020.26.3.169 인용 PDF KSCI

Comparison of three boosting methods in parent-offspring trios for genotype imputation using simulation study

Mikhchi, Abbas;Honarvar, Mahmood;Kashan, Nasser Emam Jomeh;Zerehdaran, Saeed;Aminafshar, Mehdi
- Journal of Animal Science and Technology
- /
- 제58권1호
- /
- pp.1.1-1.6
- /
- 2016
Background: Genotype imputation is an important process of predicting unknown genotypes, which uses reference population with dense genotypes to predict missing genotypes for both human and animal genetic variations at a low cost. Machine learning methods specially boosting methods have been used in genetic studies to explore the underlying genetic profile of disease and build models capable of predicting missing values of a marker. Methods: In this study strategies and factors affecting the imputation accuracy of parent-offspring trios compared from lower-density SNP panels (5 K) to high density (10 K) SNP panel using three different Boosting methods namely TotalBoost (TB), LogitBoost (LB) and AdaBoost (AB). The methods employed using simulated data to impute the un-typed SNPs in parent-offspring trios. Four different datasets of G1 (100 trios with 5 k SNPs), G2 (100 trios with 10 k SNPs), G3 (500 trios with 5 k SNPs), and G4 (500 trio with 10 k SNPs) were simulated. In four datasets all parents were genotyped completely, and offspring genotyped with a lower density panel. Results: Comparison of the three methods for imputation showed that the LB outperformed AB and TB for imputation accuracy. The time of computation were different between methods. The AB was the fastest algorithm. The higher SNP densities resulted the increase of the accuracy of imputation. Larger trios (i.e. 500) was better for performance of LB and TB. Conclusions: The conclusion is that the three methods do well in terms of imputation accuracy also the dense chip is recommended for imputation of parent-offspring trios.
https://doi.org/10.1186/s40781-015-0081-1 인용 PDF KSCI

검색결과 568건 처리시간 0.029초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)