• Title/Summary/Keyword: labeling data

Search Result 478, Processing Time 0.028 seconds

Assessment of Risk Levels in Cut-Slope Using Dimensionality Reduction and Clustering Analysis (차원축소와 클러스터링 분석을 활용한 도로비탈면 위험등급 산정)

  • Seo, Seunghwan;Kim, Gunwoong;Woo, Younghoon;Park, Byungsuk;Kim, Juhyong;Kim, Seung-Hyun;Chung, Moonkyung
    • Journal of the Korean Geotechnical Society
    • /
    • v.40 no.5
    • /
    • pp.113-129
    • /
    • 2024
  • This study reclassifies the risk levels of cut-slopes and addresses the limitations inherent in existing evaluation methods using road slope maintenance data. Conventional risk assessment predominantly relies on subjective expert judgment, resulting in issues of consistency and reliability. To mitigate these limitations, this study applies dimensionality reduction techniques, specifically Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA), followed by K-means clustering, to classify new risk levels. The clustering results using PCA demonstrated more distinct cluster separation compared to LDA, and also showed superior performance in terms of the silhouette coefficient and other clustering metrics. This suggests that the existing risk level labels may not adequately capture the underlying data structure. Furthermore, the inconsistency observed between LDA-based clustering results and current risk labels indicates potential reliability issues in the present labeling approach. To resolve this, new risk levels were assigned using PCA and K-means clustering, with cluster risk levels evaluated based on risk scores. A quantitative analysis of key risk factors was also conducted to establish criteria for risk classification and assess the impact of each variable on the different risk levels. This study proposes a data-driven, objective, and quantitative approach to risk level evaluation, aiming to improve the efficiency and reliability of road slope management.

An Outlier Detection Using Autoencoder for Ocean Observation Data (해양 이상 자료 탐지를 위한 오토인코더 활용 기법 최적화 연구)

  • Kim, Hyeon-Jae;Kim, Dong-Hoon;Lim, Chaewook;Shin, Yongtak;Lee, Sang-Chul;Choi, Youngjin;Woo, Seung-Buhm
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.33 no.6
    • /
    • pp.265-274
    • /
    • 2021
  • Outlier detection research in ocean data has traditionally been performed using statistical and distance-based machine learning algorithms. Recently, AI-based methods have received a lot of attention and so-called supervised learning methods that require classification information for data are mainly used. This supervised learning method requires a lot of time and costs because classification information (label) must be manually designated for all data required for learning. In this study, an autoencoder based on unsupervised learning was applied as an outlier detection to overcome this problem. For the experiment, two experiments were designed: one is univariate learning, in which only SST data was used among the observation data of Deokjeok Island and the other is multivariate learning, in which SST, air temperature, wind direction, wind speed, air pressure, and humidity were used. Period of data is 25 years from 1996 to 2020, and a pre-processing considering the characteristics of ocean data was applied to the data. An outlier detection of actual SST data was tried with a learned univariate and multivariate autoencoder. We tried to detect outliers in real SST data using trained univariate and multivariate autoencoders. To compare model performance, various outlier detection methods were applied to synthetic data with artificially inserted errors. As a result of quantitatively evaluating the performance of these methods, the multivariate/univariate accuracy was about 96%/91%, respectively, indicating that the multivariate autoencoder had better outlier detection performance. Outlier detection using an unsupervised learning-based autoencoder is expected to be used in various ways in that it can reduce subjective classification errors and cost and time required for data labeling.

Developing a Korean Standard Brain Atlas on the basis of Statistical and Probabilistic Approach and Visualization tool for Functional image analysis (확률 및 통계적 개념에 근거한 한국인 표준 뇌 지도 작성 및 기능 영상 분석을 위한 가시화 방법에 관한 연구)

  • Koo, B.B.;Lee, J.M.;Kim, J.S.;Lee, J.S.;Kim, I.Y.;Kim, J.J.;Lee, D.S.;Kwon, J.S.;Kim, S.I.
    • The Korean Journal of Nuclear Medicine
    • /
    • v.37 no.3
    • /
    • pp.162-170
    • /
    • 2003
  • The probabilistic anatomical maps are used to localize the functional neuro-images and morphological variability. The quantitative indicator is very important to inquire the anatomical position of an activated legion because functional image data has the low-resolution nature and no inherent anatomical information. Although previously developed MNI probabilistic anatomical map was enough to localize the data, it was not suitable for the Korean brains because of the morphological difference between Occidental and Oriental. In this study, we develop a probabilistic anatomical map for Korean normal brain. Normal 75 blains of T1-weighted spoiled gradient echo magnetic resonance images were acquired on a 1.5-T GESIGNA scanner. Then, a standard brain is selected in the group through a clinician searches a brain of the average property in the Talairach coordinate system. With the standard brain, an anatomist delineates 89 regions of interest (ROI) parcellating cortical and subcortical areas. The parcellated ROIs of the standard are warped and overlapped into each brain by maximizing intensity similarity. And every brain is automatically labeledwith the registered ROIs. Each of the same-labeled region is linearly normalize to the standard brain, and the occurrence of each legion is counted. Finally, 89 probabilistic ROI volumes are generated. This paper presents a probabilistic anatomical map for localizing the functional and structural analysis of Korean normal brain. In the future, we'll develop the group specific probabilistic anatomical maps of OCD and schizophrenia disease.

지노믹트리 Microarray 토탈솔루션

  • O Tae-Jeong
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2006.02a
    • /
    • pp.46-55
    • /
    • 2006
  • (주)지노믹트리는 DNA 마이크로어레이 기술을 기반으로 하는 분자진단회사로서, 다음의 세가지 사업에 전력하고 있다. 첫째는 독창적이며 특화된 바이오마커 발굴기술 (MAGIC system)을 바탕으로 각종 암진단을 위한 바이오마커 개발연구 두 번째는 당사의 원천 기술인 다중동시검출 시스템을 이용한 질병 진단 시스템 및 증폭시스템 세 번째는 마이크로어레이 기술을 이용한 유전자 발현 분석, Array CGH, DNA 메틸레이션 분석 그리고 miRNA 검출 등의 지노믹스시대의 연구를 위한 토탈솔루션을 제공하고 있다. 지난 5년간의 마이크로어레이 기반기술을 이용한 자체연구 활동을 수행하면서 축적된 마이크로어레이 관련기술 노-하우들을 국내 마이크로어레이 연구자들에게 공급하기 위하여 노력하고 있다. 특히 당사의 지노믹서비스 부문은 유전자 발현 분석 솔루션 제공을 위해서 자체적으로 제작하여 공급하고 있는 human cDNA(17K/25K) 및 rat cDNA (5.0K) 마이크로어레이, Human (22K) 및 mouse (10K) 올리고뉴클레오타이드 마이크로 어레이 그리고 미생물 연구를 위한 대장균 (6K) 및 폐렴균 (2.2K) 올리고뉴클레오타이드 마이크로어레이 제공 및 이를 이용한 유전자 발현 분석 서비스를 제공하고 있다. 체적으로 제작되는 마이크로어레이 서비스는 2001년 도입한 ISO9001 품질인증시스템의 기반하에서 제작부터 생산까지의 엄격한 품질관리 과정을 거쳐서 고품질의 마이크로어레이를 이용한 분석서비스를 제공 하고 있다. 또한 고객요구형 서비스를 위하여 국외 유수의 마이크로어레이 회사 (Agilent, Microarray Inc, TIGR, Eurogentec 등)의 whole genome 기반의 마이크로어레이 제품을 이용한 분석서비스를 제공하고 있으며 마이크로어레이 실험을 위해서 필수적으로 이용되고 있는 시약 (labeling kit), 마이크로어레이 hybridization을 위한 hardware (hybridization chamber, hnay centrifuge)등을 자체적으로 개발하여 공급하고 있다. DNA copy number 측정을 위한 Array CGH 분석을 위해서는 자체적으로 제작공구하고 있는 human cDNA 마이크로어레이 (17K/25K) 그기고 rat (5.0K) 마이크로어레이를 이용한 분석서비스 및 whole genome 기반의 Agilent 올리고뉴클레오타이드 CGH 어레이 (44K, 35Kb resolution)를 이용한 분석서비스를 제공하고 있다. Epigenetic study를 하는 연구자들을 위한 메틸레이션 마이크로어레이 분석 서비스를 제공하고 있다. 기존분석법인 Bisulfite 처리기반의 분석이 아닌 enzyme digestion후 PCR 증폭방법을 이용한 분석방법을 이용함으로써, bisulfite 처리에 의한 DNA 손실문제를 최소화 하였다. 현재 50개의 문헌을 통해 잘 보고된 메틸레이션 유전자들에 대한 분석서비스를 제공하고 있으며, 지속적으로 표적컨텐츠의 숫자를 증가시킬 예정이다. 최근 많은 연구자들의 관심을 끌고 있는 micro RNA 검출을 위한 DNA 마이크로어레이 서비스를 제공할 예정이다 (2006년 3월 출시). 현재 까지 알려진 약 320개의 모든 miRNA를 탑재하고 있는 소형 DNA 마이크로어레이를 이용한 분석서비스로서 1장의 마이크로어레이 실험을 통하여 알려진 모든 miRNA의 비교분석이 가능하다. 마이크로어레이 실험 뿐만 아니라 data 분석을 위한 software도 상당히 중요한 비중을 차지하고 있다 이를 위하여 (주)지노믹트리는 Agilent에서 개발한 GeneSpring GX (유전자 발현 분석), Signet (마이크로어레이 database) 및 GeneSpring GT (SNP 분석)를 공급하고 있다. 통계적인 기반 지식의 없은 일반 user들을 위한 간편하면서도 종합적인 기능을 포함하고 있는 우수한 프로그램으로 이미 국제적으로 많은 인정을 받고 있다. (주)지노믹트리는 국내외 많은 연구자들의 경제적, 시간적 연구여건을 고려한 마이크로어레이 토탈솔루션을 제공하고 있으며, 실험 분석에서 data 마이닝 그리고 마이크로어레이 실험 디자인에 이르는 토탈솔루션을 제공하고 있다.

  • PDF

Analysis of Access Authorization Conflict for Partial Information Hiding of RDF Web Document (RDF 웹 문서의 부분적인 정보 은닉과 관련한 접근 권한 충돌 문제의 분석)

  • Kim, Jae-Hoon;Park, Seog
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.18 no.2
    • /
    • pp.49-63
    • /
    • 2008
  • RDF is the base ontology model which is used in Semantic Web defined by W3C. OWL expands the RDF base model by providing various vocabularies for defining much more ontology relationships. Recently Jain and Farkas have suggested an RDF access control model based on RDF triple. Their research point is to introduce an authorization conflict problem by RDF inference which must be considered in RDF ontology data. Due to the problem, we cannot adopt XML access control model for RDF, although RDF is represented by XML. However, Jain and Farkas did not define the authorization propagation over the RDF upper/lower ontology concepts when an RDF authorization is specified. The reason why the authorization specification should be defined clearly is that finally, the authorizatin conflict is the problem between the authorization propagation in specifying an authorization and the authorization propagation in inferencing authorizations. In this article, first we define an RDF access authorization specification based on RDF triple in detail. Next, based on the definition, we analyze the authoriztion conflict problem by RDF inference in detail. Next, we briefly introduce a method which can quickly find an authorization conflict by using graph labeling techniques. This method is especially related with the subsumption relationship based inference. Finally, we present a comparison analysis with Jain and Farkas' study, and some experimental results showing the efficiency of the suggested conflict detection method.

Outlier Detection and Labeling of Ship Main Engine using LSTM-AutoEncoder (LSTM-AutoEncoder를 활용한 선박 메인엔진의 이상 탐지 및 라벨링)

  • Dohee Kim;Yeongjae Han;Hyemee Kim;Seong-Phil Kang;Ki-Hun Kim;Hyerim Bae
    • The Journal of Bigdata
    • /
    • v.7 no.1
    • /
    • pp.125-137
    • /
    • 2022
  • The transportation industry is one of the important industries due to the geographical requirements surrounded by the sea on three sides of Korea and the problem of resource poverty, which relies on imports for most of its resource consumption. Among them, the proportion of the shipping industry is large enough to account for most of the transportation industry, and maintenance in the shipping industry is also important in improving the operational efficiency and reducing costs of ships. However, currently, inspections are conducted every certain period of time for maintenance of ships, resulting in time and cost, and the cause is not properly identified. Therefore, in this study, the proposed methodology, LSTM-AutoEncoder, is used to detect abnormalities that may cause ship failure by considering the time of actual ship operation data. In addition, clustering is performed through clustering, and the potential causes of ship main engine failure are identified by grouping outlier by factor. This enables faster monitoring of various information on the ship and identifies the degree of abnormality. In addition, the current ship's fault monitoring system will be equipped with a concrete alarm point setting and a fault diagnosis system, and it will be able to help find the maintenance time.

Service Quality Evaluation based on Social Media Analytics: Focused on Airline Industry (소셜미디어 어낼리틱스 기반 서비스품질 평가: 항공산업을 중심으로)

  • Myoung-Ki Han;Byounggu Choi
    • Information Systems Review
    • /
    • v.24 no.1
    • /
    • pp.157-181
    • /
    • 2022
  • As competition in the airline industry intensifies, effective airline service quality evaluation has become one of the main challenges. In particular, as big data analytics has been touted as a new research paradigm, new research on service quality measurement using online review analysis has been attempted. However, these studies do not use review titles for analysis, relyon supervised learning that requires a lot of human intervention in learning, and do not consider airline characteristics in classifying service quality dimensions.To overcome the limitations of existing studies, this study attempts to measure airlines service quality and to classify it into the AIRQUAL service quality dimension using online review text as well as title based on self-trainingand sentiment analysis. The results show the way of effective extracting service quality dimensions of AIRQUAL from online reviews, and find that each service quality dimension have a significant effect on service satisfaction. Furthermore, the effect of review title on service satisfaction is also found to be significant. This study sheds new light on service quality measurement in airline industry by using an advanced analytical approach to analyze effects of service quality on customer satisfaction. This study also helps managers who want to improve customer satisfaction by providing high quality service in airline industry.

Development of surface detection model for dried semi-finished product of Kimbukak using deep learning (딥러닝 기반 김부각 건조 반제품 표면 검출 모델 개발)

  • Tae Hyong Kim;Ki Hyun Kwon;Ah-Na Kim
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.17 no.4
    • /
    • pp.205-212
    • /
    • 2024
  • This study developed a deep learning model that distinguishes the front (with garnish) and the back (without garnish) surface of the dried semi-finished product (dried bukak) for screening operation before transfter the dried bukak to oil heater using robot's vacuum gripper. For deep learning model training and verification, RGB images for the front and back surfaces of 400 dry bukak that treated by data preproccessing were obtained. YOLO-v5 was used as a base structure of deep learning model. The area, surface information labeling, and data augmentation techniques were applied from the acquired image. Parameters including mAP, mIoU, accumulation, recall, decision, and F1-score were selected to evaluate the performance of the developed YOLO-v5 deep learning model-based surface detection model. The mAP and mIoU on the front surface were 0.98 and 0.96, respectively, and on the back surface, they were 1.00 and 0.95, respectively. The results of binary classification for the two front and back classes were average 98.5%, recall 98.3%, decision 98.6%, and F1-score 98.4%. As a result, the developed model can classify the surface information of the dried bukak using RGB images, and it can be used to develop a robot-automated system for the surface detection process of the dried bukak before deep frying.

Biodistribution of $^{99m}Tc$-Lactosylated Serum Albumin in Mice with Diethylnitrosamine or Thiacetamide Induced Liver Injury (Diethylnitrosamine 및 Thioacetamide 유발 간손상 생쥐에서의 $^{99m}Tc$-Lactosylated Serum Albumin의 체내 분포상)

  • Whang, Jae-Seok;Ahn, Byeong-Cheol;Sung, Young-Ok;Seo, Ji-Hyoung;Bae, Jin-Ho;Jeong, Shin-Young;Yoo, Jung-Soo;Jeong, Jae-Min;Lee, Jae-Tae;Lee, Kyu-Bo
    • The Korean Journal of Nuclear Medicine
    • /
    • v.39 no.3
    • /
    • pp.200-208
    • /
    • 2005
  • Purpose: Tc-99m labeled diethylenetriaminepentaacctic acid (DTPA)-coupled galactosylated human serum albumin (GSA) is a currently used imaging agent for asialoglycoprotein receptor (ASGPR) of the liver, but, it has several shortcomings. Recently a new ASGPR imaging agent, $^{99m}Tc$-lactosylated human serum albumin (LSA), with simple labeling procedure, high labeling efficiency, high stability was developed. In order to assess the feasibility of the $^{99m}Tc$-LSA as a ASGPR imaging radiopharmaceuticals, we performed biodistribution study of the tracer in liver injured mice model and the results were compared with histolgic data. Materals and Methods: To induce hepatic damage in ICR mice, diethylnitrosamine (DEN) ($60mg/kg/week{\times}5time$, low dose or $180mg/kg/week{\times}2times$, high dose) and thioacetamide (TAA) ($50mg/kg{\times}1time$) were administrated intraperitoneally. Degree of liver damage was evaluated by tissue hematoxilin-eosin stain, and expression of asialoglycoprotein receptor (ASGPR) was assessed by immunohistochemistry using ASGPR antibody. $^{99m}Tc$-LSA was intravenously administrated via tail vein in DEN or TAA treated mice, and biodistribution study of the tracer was also performed. Results: DEN treated mice showed ballooning of hepatocyte and inflammatory cell infiltration in low dose group and severe hapatocyte necrosis in high dose group, and low dose group showed higher ASGPR staining than control mice in immunohistochemical staining. TAA treated mice showed severe hepatic necrosis. $^{99m}Tc$-LSA Biodistribution study showed that mice with hepatic necrosis induced by high dose DEN or TAA revealed higher blood activity and lower liver activity than control mice, due to slow clearance of the tracer by the liver. The degree of liver uptake was inversely correlated with the degree of histologic liver damage. But low dose DEN treated mice with mild hepatic injury showed normal blood clearance and hepatic activity, partly due to overexpression of ASGPR in mice with mild degree hepatic injury. Conclusion: Liver uptake of $^{99m}Tc$-LSA was inversely correlated with degree of histologic hepatic injury in DEN and TAA treated mice. These results support that $^{99m}Tc$-LSA can be used to evaluate the liver status in liver disease patients.

Development of Intelligent Severity of Atopic Dermatitis Diagnosis Model using Convolutional Neural Network (합성곱 신경망(Convolutional Neural Network)을 활용한 지능형 아토피피부염 중증도 진단 모델 개발)

  • Yoon, Jae-Woong;Chun, Jae-Heon;Bang, Chul-Hwan;Park, Young-Min;Kim, Young-Joo;Oh, Sung-Min;Jung, Joon-Ho;Lee, Suk-Jun;Lee, Ji-Hyun
    • Management & Information Systems Review
    • /
    • v.36 no.4
    • /
    • pp.33-51
    • /
    • 2017
  • With the advent of 'The Forth Industrial Revolution' and the growing demand for quality of life due to economic growth, needs for the quality of medical services are increasing. Artificial intelligence has been introduced in the medical field, but it is rarely used in chronic skin diseases that directly affect the quality of life. Also, atopic dermatitis, a representative disease among chronic skin diseases, has a disadvantage in that it is difficult to make an objective diagnosis of the severity of lesions. The aim of this study is to establish an intelligent severity recognition model of atopic dermatitis for improving the quality of patient's life. For this, the following steps were performed. First, image data of patients with atopic dermatitis were collected from the Catholic University of Korea Seoul Saint Mary's Hospital. Refinement and labeling were performed on the collected image data to obtain training and verification data that suitable for the objective intelligent atopic dermatitis severity recognition model. Second, learning and verification of various CNN algorithms are performed to select an image recognition algorithm that suitable for the objective intelligent atopic dermatitis severity recognition model. Experimental results showed that 'ResNet V1 101' and 'ResNet V2 50' were measured the highest performance with Erythema and Excoriation over 90% accuracy, and 'VGG-NET' was measured 89% accuracy lower than the two lesions due to lack of training data. The proposed methodology demonstrates that the image recognition algorithm has high performance not only in the field of object recognition but also in the medical field requiring expert knowledge. In addition, this study is expected to be highly applicable in the field of atopic dermatitis due to it uses image data of actual atopic dermatitis patients.

  • PDF