• Title/Summary/Keyword: 평가 일치도

Search Result 2,354, Processing Time 0.025 seconds

Skill Assessments for Evaluating the Performance of the Hydrodynamic Model (해수유동모델 검증을 위한 오차평가방법 비교 연구)

  • Kim, Tae-Yun;Yoon, Han-Sam
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.14 no.2
    • /
    • pp.107-113
    • /
    • 2011
  • To evaluate the performance of the hydrodynamic model, we introduced 10 skill assessments that are assorted by two groups: quantitative skill assessments (Absolute Average Error or AAE, Root Mean Squared Error or RMSE, Relative Absolute Average Error or RAAE, Percentage Model Error or PME) and qualitative skill assessments (Correlation Coefficient or CC, Reliability Index or RI, Index of Agreement or IA, Modeling Efficiency or MEF, Cost Function or CF, Coefficient of Residual Mass or CRM). These skill assessments were applied and calculated to evaluate the hydrodynamic modeling at one of Florida estuaries for water level, current, and salinity as comparing measured and simulated values. We found that AAE, RMSE, RAAE, CC, IA, MEF, CF, and CRM are suitable for the error assessment of water level and current, and AAE, RMSE, RAAE, PME, CC, RI, IA, CF, and CRM are good at the salinity error assessment. Quantitative and qualitative skill assessments showed the similar trend in terms of the classification for good and bad performance of model. Furthermore, this paper suggested the criteria of the "good" model performance for water level, current, and salinity. The criteria are RAAE < 10%, CC > 0.95, IA > 0.98, MEF > 0.93, CF < 0.21 for water level, RAAE < 20%, CC > 0.7, IA > 0.8, MEF > 0.5, CF < 0.5 for current, and RAAE < 10%, PME < 10%, CC > 0.9, RI < 1.15, CF < 0.1 for salinity.

Collection of Korean Emotional Speech Database from Actors (배우에 의한 한국어 정서음성 데이터베이스 수집)

  • Jo Cheolwoo;Bak Il-suh;Lee Yongju;Kim Bongwan
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.45-48
    • /
    • 2004
  • 본 논문에서는 한국어 정서음성 데이터베이스를 수집하는 과정을 기술하고 및 데이터베이스의 특성에 관해서 논의한다. 데이터베이스는 배우로부터 수집되었으며 주관적 평가에 의해 평가되었다. 배우는 남녀 각 3인씩 총 6인이며, 6가지 정서상태에 의해 10개의 문장을 발성하였고 20명의 평가자가 음성에 포함된 정서상태를 독립적으로 평가하였다. 작성된 데이터베이스는 임의제시 방법에 의한 주관적 평가결과 $80\%$이상의 일치도를 얻었다.

  • PDF

Feature Extraction and Similarity Measure Function Define For Beauty Evaluation of Korean Character (한글의 미적 평가를 위한 특징 추출 및 유사도 함수 정의)

  • 한군희;오명관;이형우;전병민
    • The Journal of the Korea Contents Association
    • /
    • v.2 no.1
    • /
    • pp.59-67
    • /
    • 2002
  • This study pre-processed the characters, performed the feature extraction for the beauty evaluation, and then defined the similarity function. It suggested the definition of the similarity function, and the extraction of the features of character elements. it experimented how much the various input character patterns were similar with the standard character patterns, found their results were almost similar with the expected ones and the results of beauty evaluation on general people through the questionaire with the results of the methods suggested here.

  • PDF

Constructing an Evaluation Set for Korean Sentiment Analysis Systems Incorporating the Category and the Strength of Sentiment (감성 강도를 고려한 감성 분석 평가집합 구축)

  • Kim, Do-Yeon;Wu, Yong;Park, Hyuk-Ro
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.11
    • /
    • pp.30-38
    • /
    • 2012
  • Sentiment analysis is concerned with extracting and analyzing different kinds of user sentiment expressed in a variety of social media such as blog and twitter. Although sentiment analysis techniques are actively studied for these days, evaluation sets are not developed yet for Korean sentiment analysis. In this paper, we constructed an evaluation set for Korean sentiment analysis. To evaluate sentiment analysis systems more throughly, each sentence in our evaluation set is tagged with the polarity of the sentiment as well as the category and the strength of the sentiment. We divide kinds of sentiment into 7 positive categories and 15 negative categories. Each category is given the strength of the sentiment from 1 to 3. Our evaluation set consists of 3,270 sentences extracted from various social media. For each sentence, 5 human taggers assigned the category and the strength of the sentiment expressed in the sentence. The ratio of inter-taggers agreement was 93% in the polarity, 70% in the category, 58% in the strength of sentiment. The ratio of inter-taggers agreement our evaluation set is a bit higher than other evaluation sets developed for German and Spanish. This result shows our evaluation set can be used as a reliable resource for the evaluation of sentiment analysis systems.

Validity and Reliability of Cognitive Performance Scale in Long Term Care Hospital in Korea (인지수행척도(Cognitive Performance Scale)의 타당도와 신뢰도)

  • Lee, Ji Yun;Kim, Sun Min;Kim, A Reum
    • 한국노년학
    • /
    • v.30 no.1
    • /
    • pp.81-91
    • /
    • 2010
  • The purpose of this study was to test a validity and reliability of Cognitive Performance Scale(CPS), a cognitive measure generated from 5 items(comatose status, decision making, short-term memory, making self understood, and eating). Method: 393 patients in 2 hospitals for the elderly with dementia were measured with CPS by two nurses independently. The inter-rater agreement was tested by comparing two scores. The CPS score was compared with GDS, which was measured by doctors and nurses, and MMSE score which was drawn from the claim data of Health Insurance Review & Assessment Service. Result: The correlation coefficient between CPS and GDS was 0.742(p<0.0001), CPS and MMSE was -0.794(p<0.0001). The Cronbach's coefficient alpha of CPS was 0.742, Kappa value was 0.772~1.000. The CPS showed high validity and reliability in long term care hospitals of Korea.

A Study on the Assessment of Location Environment of a Regional Central Library - The Case of Daegu Metropolitan City - (지역대표도서관 입지환경 평가에 관한 연구 - 대구광역시를 중심으로 -)

  • Yoo, Jae-Woo;Kim, Sin-Young
    • Journal of Korean Library and Information Science Society
    • /
    • v.46 no.4
    • /
    • pp.427-450
    • /
    • 2015
  • This study examines locational environment factors for the construction of a Daegu metropolitan main library. We presented 12 evaluation factors in 4 large categories(social, physical, economical, and business promotion condition) and evaluated 7 proposed sites for Daegu metropolitan central library based on expert group verification. To the end, the results of assessment between researchers and expert group were arranged to coincide with each other. So, we secured the objectivity and validity. The result of this study can be used for assessing the site environment for a metropolitan central library and other public libraries.

Study for Reliability of Interpretation of the Three Phase Bone Scintigraphy in Patients with Post-traumatic Complex Regional Pain Syndrome (외상 후 복합부위통증증후군 환자에서 시행한 삼상 뼈 스캔의 판독 신뢰도에 관한 연구)

  • Park, Jung-Mi;Kim, Seon-Jung;Chung, Seung-Hyun;Lee, Yong-Taek
    • Nuclear Medicine and Molecular Imaging
    • /
    • v.42 no.1
    • /
    • pp.44-51
    • /
    • 2008
  • Purpose: We performed this study to evaluate reliability on interpretation of three phase bone scintigraphy (TPBS) in patients with post-traumatic complex regional pain syndrome (PT-CRPS). Methods: Based on International Association for the Study of Pain guideline in 1994, 34 patients with PT-CRPS were selected for this study. Two nuclear medicine physicians evaluated identical TPBS according to the uptake pattern, extent and intensity of the lesion, and their agreements (kappa values) were analysed. The final diagnosis based on arbitrary criteria of each physician were compared with those obtained by the criteria for PT-CRPS established in this study, which are hyperactivity on all phases (criteria 1), hyperactivity of whole joints on delayed phase (criteria 2), and hyperactivity of either whole or FDGal joints on delayed phase (criteria 3). Results: Intra-observer agreements were good for uptake pattern, intensity, and extent on TPBS. Inter-observer agreements were also good, except extent on blood pool phase (0.55). The inter-observer agreements on final diagnosis improved when criteria 1-3 were applied (0.77-0.88), compared to when physician's own criteria were used (0.63). Those also improved from 0.29 to 0.47-0.82 for acute stage, and from 0.37 to 1.0 for chronic stage. The sensitivities of chronic stage were relatively lower to those of acute stage. Conclusions: Inter-observer's variations in diagnosis of the patients with PT-CRPS using TPBS were observed. These results were attributed to different criteria set by observers. In order to improve agreement on interpretation of TPBS, common positive criteria should be established, especially considering uptake pattern and clinical stages.

Evaluation of Site-dependent Ductility Factors for Elastic Perfectly Plastic SDOF Systems (토질조건에 따른 탄소성 단자유도 구조물의 연성계수 평가)

  • Kang, Cheol-Kyu;Choi, Byong-Jeong
    • Journal of the Earthquake Engineering Society of Korea
    • /
    • v.8 no.4
    • /
    • pp.11-20
    • /
    • 2004
  • This paper suggests the site-dependent ductility factor which is a key component of response modification factor(R). To compute the ductility factor, a group of 1,860 ground motions recorded from 47 earthquake was considered. Based on the local site conditions at the recording station, ground motions were classified into four groups according to average shear wave velocity. This site classification was consistent with site categories of the UBC(1997), NEHRP(1997) and IBC 2000(1997). Based on the results of regression analysis, a simplified equations were proposed to compute site-dependent ductility factors. The proposed equations were relatively simple and provide a good estimation of mean ductility factors. Based on the proposed equation, ductility factors considering the site conditions can be evaluated in accordance with the present building codes.

Shade Comparative Analysis of Natural tooth Measured by Visual and Two Colorimeters(ShadeEye®,Shadepilot®) (2종 측색기와 시각을 이용한 자연치아의 색조 비교 분석)

  • An, Jin-Hee;Choi, Mee-Ra;Shim, Hye-Won
    • Journal of Dental Rehabilitation and Applied Science
    • /
    • v.29 no.1
    • /
    • pp.81-93
    • /
    • 2013
  • The objectives were to evaluate the accuracy of shade selection by human visual system(VS) and 2 different colorimeters ($ShadeEye^{(R)}$(SE) and Shadepilot (SP)). Maxillary anterior teeth of 30 volunteers which had no caries or restorations were included in the study. Firstly, the accordance in shade selection by 3 dentists and 2 colorimeters was investigated. Secondly, the color of the teeth were measured by 1 observer's naked eye and 2 colorimeters under different illumination conditions (Sunny versus cloudy day). Additionally testing of inter-observer variability selected colors by 2 novice and 2 experienced dentists were compared. For comparing visual and 2 different colorimeters, SP(60%) showed significantly highest rate of accordance than the visual (23.3%) or SE (16.7%) and lowest mean ${\Delta}E$ ($2.62{\pm}0.74$ versus $3.83{\pm}1.38$;SE or $4.04{\pm}1.61$;VS)(p<0.001). If accuracy of shade selection were measured using VS, the mean ${\Delta}E$ value of cloudy day was higher than that of sunny day ($4.35{\pm}1.70$ versus $3.53{\pm}1.31$; p<0.001). There were no significant difference of the mean ${\Delta}E$ value between sunny and cloudy day in both SE and SP. Inter- observer repeatability was higher in 2 experienced group (73.3%) than novice group (36.7%). The mean ${\Delta}E$ of experienced group was lower than that of novice group ($3.60{\pm}1.47$ versus $4.70{\pm}1.67$; p<0.001). Colorimeters (SE or SP) is more accurate and more reproducible compared with human shade assessment. Using visual system may be limited by cloudy and inexperience of tester, then more experience and using colorimeters may be helpful of raising the accurate repeatability of shade selection.

Study on Standardization of Korean Version of Psychiatric Diagnostic Screening Questionnaire(K-PDSQ) (한국판 정신장애 진단 선별 질문지의 표준화 연구)

  • Choi, Hyeong-Keun;Jung, Sung-Won;Jo, Hyun-Ju;Kim, Jeong-Bum;Jung, Chul-Ho
    • Anxiety and mood
    • /
    • v.9 no.1
    • /
    • pp.31-37
    • /
    • 2013
  • Objective : The PDSQ is a brief and psychometrically strong self-report scale designed to screen for common DSM-IV Axis I disorders in clinical settings. In this study, the K-PDSQ was compared with the M.I.N.I.-Plus (Mini-International Neuropsychiatric Interview-Plus) for diagnostic validity and availability of the K-PDSQ as a part of standardization of the K-PDSQ. Methods : The 640 patients were evaluated with the K-PDSQ and the M.I.N.I.-Plus. Diagnosing with the M.I.N.I.-Plus, the diagnostic correspondence, administering time, sensitivity, specificity, ROC curve, and AUC of the K-PDSQ were evaluated. Results : For the diagnostic correspondence of the K-PDSQ, Cohen's kappa coefficient was .66 between the K-PDSQ and the M.I.N.I.-Plus. The administering time of the K-PDSQ was $18.2{\pm}11.80$ minutes. Both sensitivity and specificity of the K-PDSQ were higher: the mean sensitivity across 10 subscales of K-PDSQ was 86%; the mean specificity was 84%. All AUCs of each subscale were above .80, which were statistically significant. Conclusion : The K-PDSQ is valid and available as a diagnostic screening tool. It will be widely used in clinical settings for screening DSM-IV Axis I diagnosis because of its simplicity and high reliability.