• Title/Summary/Keyword: correct classification rate

Search Result 107, Processing Time 0.027 seconds

An Extraction Algorithm of Compound Field-associated Terms for Korean Document Classifications (한글문서 분류용으로 이용할 복합어로 구성된 분야연상어의 추출법)

  • Lee, Samuel Sang-kon
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.7
    • /
    • pp.636-649
    • /
    • 2005
  • Field-associated Terms itself have field Information. So, they determine field of document just like when human being perceives field. In case of Korean, we organized and experimented them by collecting approximately IS,999 document banks that are classified into 180 fields. We obtained high precision of extraction that 88,782 single field-associated terms are contracted into 8,405 ones thus recording compression rate as approximately 9$\%$ and recall as above 0.77 (average 0.85), precision as above 0.90 (average 0.94). By applying established field-associated terms to initial determination for document classification and comparing it with filed determination by human being, we got correct answers above approximately 90$\%$. We can use results of research as fundamental research for initial stage and apply it document retrieval between multilingual environment thus utilizing it as fundamental research for multilingual information retrieval.

Study on evaluating the significance of 3D nuclear texture features for diagnosis of cervical cancer (자궁경부암 진단을 위한 3차원 세포핵 질감 특성값 유의성 평가에 관한 연구)

  • Choi, Hyun-Ju;Kim, Tae-Yun;Malm, Patrik;Bengtsson, Ewert;Choi, Heung-Kook
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.10
    • /
    • pp.83-92
    • /
    • 2011
  • The aim of this study is to evaluate whether 3D nuclear chromatin texture features are significant in recognizing the progression of cervical cancer. In particular, we assessed that our method could detect subtle differences in the chromatin pattern of seemingly normal cells on specimens with malignancy. We extracted nuclear texture features based on 3D GLCM(Gray Level Co occurrence Matrix) and 3D Wavelet transform from 100 cell volume data for each group (Normal, LSIL and HSIL). To evaluate the feasibility of 3D chromatin texture analysis, we compared the correct classification rate for each of the classifiers using them. In addition to this, we compared the correct classification rates for the classifiers using the proposed 3D nuclear texture features and the 2D nuclear texture features which were extracted in the same way. The results showed that the classifier using the 3D nuclear texture features provided better results. This means our method could improve the accuracy and reproducibility of quantification of cervical cell.

Non-Contact Gesture Recognition Algorithm for Smart TV Using Electric Field Disturbance (전기장 왜란을 이용한 비접촉 스마트 TV 제스처 인식 알고리즘)

  • Jo, Jung-Jae;Kim, Young-Chul
    • Journal of Korea Multimedia Society
    • /
    • v.17 no.2
    • /
    • pp.124-131
    • /
    • 2014
  • In this paper, we propose the non-contact gesture recognition algorithm using 4- channel electrometer sensor array. ELF(Extremely Low Frequency) EMI and PLN are minimized because ambient electromagnetic noise around sensors has a significant impact on entire data in indoor environments. In this study, we transform AC-type data into DC-type data by applying a 10Hz LPF as well as a maximum buffer value extracting algorithm considering H/W sampling rate. In addition, we minimize the noise with the Kalman filter and extract 2-dimensional movement information by taking difference value between two cross-diagonal deployed sensors. We implemented the DTW gesture recognition algorithm using extracted data and the time delayed information of peak values. Our experiment results show that average correct classification rate is over 95% on five-gesture scenario.

A Study on the Rainfall-Runoff Analysis of Using Satellite Image (위성영상정보를 이용한 강우유출 해석에 관한 연구)

  • Park, Young-Kee;Lee, Jeung-Seok;Park, Jeong-Gyu
    • Journal of Environmental Science International
    • /
    • v.19 no.1
    • /
    • pp.115-124
    • /
    • 2010
  • Urban watershed can be found in the visible changes in technology, the most realistic satellite images is to use the data. Satellite image data on the indicators for progress on the nature of the change of land use is consistent and repetitive information, regular observation makes possible the detailed analysis of space-time. These remote sensing techniques and the type of course and, by using the time series history, the past, the dynamic model and the randomized prediction methodology for the conversion process if the city and river basin cooperation of the space changes effectively will be able to extrapolate. For each of the main changes in river flow, depending on the area of urbanization as determined according to reproduce the duration of the relationship between the urbanization of the area and runoff can be represented as a linear polynomial expression was, if a linear expression in the two fast slew rate of 0.858 to 0.861 showed up, and fast slew rate of 0.934 to 0.974 for the polynomial are reported. Change of land use changes in the watershed of the flow is one of the most affecting elements. Therefore, changes in land use of the correct classification of rivers is a more accurate calculation of the amount of the floodgate. In particular, using the Landsat images through the image of the land use category, land use past data and calculated using the Markov Chain model and predict the future land use plan in the water control project will be used for large likely.

Development of a Driver Safety Information Service Model Using Point Detectors at Signalized Intersections (지점검지자료 기반 신호교차로 운전자 안전서비스 개발)

  • Jang, Jeong-A;Choe, Gi-Ju;Mun, Yeong-Jun
    • Journal of Korean Society of Transportation
    • /
    • v.27 no.5
    • /
    • pp.113-124
    • /
    • 2009
  • This paper suggests a new approach for providing information for driver safety at signalized intersections. Particularly dangerous situations at signalized intersections such as red-light violations, accelerating through yellow intervals, red-light running, and stopping abruptly due to the dilemma zone problem are considered in this study. This paper presents the development of a dangerous vehicle determination algorithm by collecting real-time vehicle speeds and times from multiple point detectors when the vehicles are traveling during phase-change. For an evaluation of this algorithm, VISSIM is used to perform a real-time multiple detection situation by changing the input data such as various inflow-volume, design speed change, driver perception, and response time. As a result the correct-classification rate is approximately 98.5% and the prediction rate of the algorithm is approximately 88.5%. This paper shows the sensitivity results by changing the input data. This result showed that the new approach can be used to improve safety for signalized intersections.

Estimation of the Percent of the Vote by Adjustment of Voter Turnout in Election Polls (선거여론조사에서 투표율 반영을 통한 득표율 추정)

  • Kim, Jeonghoon;Han, Sang-Tae;Kang, Hyuncheol
    • Journal of the Korean Data Analysis Society
    • /
    • v.20 no.6
    • /
    • pp.2873-2881
    • /
    • 2018
  • It is very important to obtain objective and credible information through election polls in order to contribute to the correct voting behavior of the voters or to establish appropriate election strategies for candidates or political parties. Therefore, many related organizations such as political parties, media organizations, and research institutions have been making efforts to improve the accuracy of the results of the polls and the election prediction. Kim et al. (2017) analyzed whether the non-response group responded that there is no support candidate in the election survey to increase the accuracy of the estimation of the vote rate. As a result, it has been confirmed that the accuracy of the estimation of the vote rate can be significantly improved by performing an appropriate classification on the non-response layer. In this study, we propose a method to estimate the turnout by each strata (sex, age group) under the condition that the total turnout rate is given for a specific district (region) and propose a procedure to predict the vote rate by reflecting the turnout. In addition, case studies were conducted using data gathered through telephone interviews for the 20th National Assembly elections in 2016.

A Case Study of the Error of Paleontology Exhibition Datas in the Natural History Museums of Korea (한국 자연사박물관 내 고생물학 전시자료들의 오류발생에 관한 사례연구)

  • Ko, Ju Yeong
    • Journal of the Korean earth science society
    • /
    • v.36 no.3
    • /
    • pp.236-245
    • /
    • 2015
  • This study investigated the errors in presenting paleontology exhibition data in 9 natural history museums for 2 years and two months from 15, Aug. 2013 to 25, March 2015. It was found that seven natural history museums presented 28 difference cases of data in error. The purpose of this study was to investigate why the errors occurred and how to prevent the errors from occurring and finally how to correct the errors earlier. For this purpose, this study review related literatures using conference proceedings, books, conducted a survey via natural history museums. Results suggested five ways to correct errors in the future. First, it is suggested that the authorities of the museum increase the number of curators and have specialists participate in excavation and maintenance, research, preparation of the exhibition data through a collaboration with universities and research institutes. Second, it is also suggested that the authorities establish the classification system to use in the exhibition process and secure a job for their maintenance specialists. Third, the authorities of museum should put an examination process in place as a system by inviting the external experts into the exhibition process and also establish a process of collecting errors identified by any museum visitors. Fourth, the authorities of museum should make an efforts to increase the participating rate of correcting errors through SNS, Docent, and educational programs among the community members and students. Fifth, they also should use mass media to show and present the research-proven figures of paleontological fossils, which hopefully helps resolve issues of the prior unchanging cultural inertia.

Survey of Knowledge on Hypertension among the Parents of Elementary School Students (초등학생 학부모의 고혈압 관련 지식에 관한 조사)

  • Kim, Jin-Soon
    • Journal of agricultural medicine and community health
    • /
    • v.30 no.1
    • /
    • pp.29-38
    • /
    • 2005
  • Objectives: Hypertension is the most important risk factors for the cerebrovascular diseases, and also for coronary heart diseases, it is therefore very important that the people have a knowledge on nature of hypertension and it's high risk in order to prevent and detect the hypertension as early as possible. Methods: This study was done to find out the knowledge on hypertension of 434 parents of elementary school students from Kimjae city, Jonbuk province, they were parents in grade 4, 5 and 6 attending two elementary schools. The survey took 10 days from November 20 to November 30, 2003. Results: first, The highest correct answer(94.5%) was "obesity is risk factors for hypertension", followed by "hypertension is closely related with hereditary factors(91.0%) and "high sodium intake is associated with high blood pressure"(85.7%). The lowest correct answer(77.4%) was the classification of blood pressure level between normal and high. Second, Rate of blood pressure measurement for fathers was 53.7% and 54.8% in mothers. Awareness of own blood pressure by fathers was 84.1 %, while 91.1% by mothers. Third, According to blood pressure level reported by parents, fathers with normal blood pressure was 59.2%, high normal blood pressure was 12.2%, while hypertension was 28.6%. It revealed that prevalence of hypertension of fathers was higher than mother (normal: 74.5%, high normal: 7.7%, hypertension: 18.2%). Conclusions: From the results of this study, it is important to strengthen the health education about hypertension for community people and also school students.

  • PDF

A study on the behavior of cosmetic customers (화장품구매 자료를 통한 고객 구매행태 분석)

  • Cho, Dae-Hyeon;Kim, Byung-Soo;Seok, Kyung-Ha;Lee, Jong-Un;Kim, Jong-Sung;Kim, Sun-Hwa
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.4
    • /
    • pp.615-627
    • /
    • 2009
  • In micro marketing promotion, it is important to know the behavior of customers. In this study we are interested in the forecasting of repurchase of customers from customers' behavior. By analyzing the cosmetic transaction data we derive some variables which play an important role in the knowledge of the customers' behavior and in the modeling of repurchase. As modeling tools we use the decision tree, logistic regression and neural network model. Finally we decide to use the decision tree as a final model since it yields the smallest RASE (root average squared error) and the greatest correct classification rate.

  • PDF

A Empirical Study on the Relevance of Technology Finance Supporting Business for Technologically Innovative SMEs (혁신형 중소기업 기술금융 지원사업의 적절성에 대한 실증연구)

  • Sung, Oong-Hyun
    • Journal of Korea Technology Innovation Society
    • /
    • v.16 no.1
    • /
    • pp.303-322
    • /
    • 2013
  • A relevance of supporting business of technology financing for technologically innovative SMEs is strongly required for its continuous expansion and development. This study analyzes empirically whether the selection of recipient firms from technology financing have been performed in accordance with its objectives and purposes. Results show that the probability of receiving technology financing is more likely to increase with higher technology rankings and higher operating income ratio. On the other hand, the probability of obtaining financing might be decreased gradually, as the size of capital and age of the firm are increasing. Results also show that technology rankings and firm's major characteristics are found to affect significantly on the decision-making of technology financing. Several useful comments are suggested to improve the relevance of the technology financing since the correct classification rate, which explains the appropriateness of the model, is not at high level. In addition, technology rankings are not uncorrelated with the amount of financing in regression analysis. These research results will contribute to ensure the appropriateness and credibility of the technology financing decision-making.

  • PDF