• Title/Summary/Keyword: document categorization

Search results: 73

A Study on the Present Situation of Landscape Management System through Analysis of the Landscape Review Results - Focused on Jeju Special Self-Governing Province Landscape Review- (경관 심의결과 분석을 통한 경관관리제도의 현황에 대한 연구 - 제주특별자치도 경관 심의를 중심으로 -)

  • Park, Hye-Jung;Park, Chul-Min
    • Journal of the Korean Institute of Rural Architecture / v.20 no.4 / pp.9-17 / 2018
  • This study suggests ways to improve the Landscape Review system and the Landscape Management System of Jeju Special Self-Governing Province by analyzing the landscape review results and the province's ordinance. To this end, we reviewed prior studies and the laws and ordinances related to landscape, and analyzed 318 landscape review cases conducted since 2010, item by item, together with the review outcomes. The main results are as follows. First, Jeju Special Self-Governing Province, which currently operates a strengthened ordinance for reviewing development projects, is experiencing problems such as projects being built incorrectly because the legal basis for follow-up management after the landscape review is weak. Second, the province expects more efficient management through an expanded scope of landscape review. Third, 57.7% of the items reviewed were passed, and the pass rate was lowest for development projects, at 41.9%. Fourth, an item-by-item categorization of the review comments showed that 'Landscape Control Guideline' and 'Document not completed' account for relatively high shares. Thus, although eight years have passed since the Landscape Management System and the Landscape Review began, the system has not yet reached sufficient institutional stability, and the Landscape Control Guideline needs to be made easier to understand.

An Analytical Study on Automatic Classification of Domestic Journal articles Based on Machine Learning (기계학습에 기초한 국내 학술지 논문의 자동분류에 관한 연구)

  • Kim, Pan Jun
    • Journal of the Korean Society for Information Management / v.35 no.2 / pp.37-62 / 2018
  • This study examined the factors affecting the performance of machine-learning-based automatic classification of domestic journal articles in the field of LIS. In particular, focusing on the performance of automatically assigning class labels to articles in the "Journal of the Korean Society for Information Management", I investigated the characteristics of the key factors (weighting schemes, training set size, classification algorithms, label assigning methods) through a variety of experiments. The results show that it is effective to apply each factor appropriately according to the classification environment and the characteristics of the document set, and that fairly good performance can be obtained with a simpler model. In addition, the classification of domestic journal articles can be regarded as multi-label classification, which assigns more than one category to a specific article. Therefore, considering this environment, I proposed an optimal classification model that uses a simple, fast classification algorithm and a small training set.

An Analytical Study on Performance Factors of Automatic Classification based on Machine Learning (기계학습에 기초한 자동분류의 성능 요소에 관한 연구)

  • Kim, Pan Jun
    • Journal of the Korean Society for Information Management / v.33 no.2 / pp.33-59 / 2016
  • This study examined the factors affecting the performance of machine-learning-based automatic classification of domestic conference papers. In particular, focusing on the performance of automatically assigning class labels to papers in the Proceedings of the Conference of Korean Society for Information Management using the Rocchio algorithm, I investigated the characteristics of the key factors (classifier formation methods, training set size, weighting schemes, label assigning methods) through a variety of experiments. The results show that it is more effective to apply appropriate parameters ($\beta$, $\lambda$) and training set size (more than 5 years) according to the classification environment and the properties of the document set. I also found that, when performance is equivalent, using simpler methods (single weighting schemes) is very efficient. Finally, because the classification of domestic papers corresponds to multi-label classification, which assigns more than one label to an article, it is necessary to develop an optimal classification model based on the characteristics of the key factors in consideration of this environment.
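The abstract names the Rocchio algorithm and multi-label assignment but gives no formulas. As a rough illustration only, the sketch below builds one prototype per label in the textbook Rocchio form (beta times the mean of the positive examples minus gamma times the mean of the negative examples) and assigns every label whose cosine similarity clears a threshold. The `gamma` parameter, the raw term-frequency weighting, the threshold value, and the toy documents are assumptions made for this example; the paper's λ is not defined in the abstract and is not modeled here.

```python
import math
from collections import Counter, defaultdict


def tf_vector(tokens):
    """Raw term-frequency vector (one simple, single weighting scheme)."""
    return dict(Counter(tokens))


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train_rocchio(docs, beta=1.0, gamma=0.25):
    """docs: list of (tokens, set_of_labels). Returns {label: prototype}.

    beta and gamma are illustrative defaults, not values from the paper.
    """
    labels = set().union(*(label_set for _, label_set in docs))
    prototypes = {}
    for label in labels:
        pos = [tf_vector(t) for t, ls in docs if label in ls]
        neg = [tf_vector(t) for t, ls in docs if label not in ls]
        proto = defaultdict(float)
        for vec in pos:
            for term, w in vec.items():
                proto[term] += beta * w / len(pos)
        for vec in neg:
            for term, w in vec.items():
                proto[term] -= gamma * w / len(neg)
        # Clip negative weights to zero, a common Rocchio convention.
        prototypes[label] = {t: w for t, w in proto.items() if w > 0}
    return prototypes


def assign_labels(tokens, prototypes, threshold=0.3):
    """Multi-label assignment: keep every label scoring at or above threshold."""
    vec = tf_vector(tokens)
    scores = {label: cosine(vec, p) for label, p in prototypes.items()}
    chosen = {label for label, s in scores.items() if s >= threshold}
    # Fall back to the single best label so every document gets one.
    return chosen or {max(scores, key=scores.get)}


# Hypothetical toy training set; the labels "IR" and "LIB" are placeholders.
toy_docs = [
    (["retrieval", "query", "index"], {"IR"}),
    (["query", "ranking", "retrieval"], {"IR"}),
    (["library", "catalog", "metadata"], {"LIB"}),
    (["metadata", "retrieval", "catalog"], {"IR", "LIB"}),
]
prototypes = train_rocchio(toy_docs)
```

Thresholding is only one possible label assigning method; taking the top-k labels per document is a common alternative, and the abstract does not say which methods were compared.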

Properties of the Twenty-seven Pulses in DongUiBoGam Based on the Eight Important Pulses (팔요맥을 중심으로 살펴본 『동의보감』 27맥 속성 연구)

  • Lee, Taehyung;Jung, Won-Mo;Go, Byeongho;Park, Hi-Joon;Kim, Namil;Chae, Younbyoung
    • Korean Journal of Acupuncture / v.32 no.4 / pp.151-159 / 2015
  • Objectives: Pulse diagnosis is considered particularly important among the several methods of diagnosis in DongUiBoGam. Despite its importance, the many and varied pulse descriptions have made pulse diagnosis difficult to learn and practice. In this article, we analyzed the properties of the twenty-seven pulses using pulse diagnosis cases from DongUiBoGam to enable a practical understanding of pulse diagnosis. Methods: We constructed four axes from the eight important pulses and analyzed the properties of the twenty-seven pulses through the relationship between the four pairs of important pulses and the twenty-seven pulses. To quantify the relevance of the important pulses to the twenty-seven pulses, we used the term frequency-inverse document frequency (TF-IDF) method. Results: We were able to derive the properties of the twenty-seven pulses along the four axes. We also used the analysis results to reexamine the DongUiBoGam categorization into the seven exterior pulses and the eight interior pulses, as well as the similar pulses. Conclusions: The eight important pulses allowed us to understand the properties of the twenty-seven pulses more specifically and to see the relationships among the twenty-seven pulses on each axis. However, because the number of pulse diagnosis cases in this research was insufficient, further research with more sources, such as other traditional medical records or present-day clinical records, is required.
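The abstract states that TF-IDF was used to quantify relevance but does not say how documents and terms were defined. A minimal sketch of the standard weighting, assuming each of the twenty-seven pulses is treated as a "document" whose "terms" are the important pulses that co-occur with it in the cases, might look like this. The identifiers `pulse_A`, `imp_1`, and so on are hypothetical placeholders, not data from the study.

```python
import math
from collections import Counter


def tf_idf(documents):
    """documents: {doc_id: list of terms}. Returns {doc_id: {term: weight}}.

    tf  = occurrences of the term in the document / document length
    idf = log(number of documents / number of documents containing the term)
    """
    n_docs = len(documents)
    doc_freq = Counter()
    for terms in documents.values():
        doc_freq.update(set(terms))  # count each term once per document

    weights = {}
    for doc_id, terms in documents.items():
        counts = Counter(terms)
        total = len(terms)
        weights[doc_id] = {
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
    return weights


# Hypothetical co-occurrence lists; the names are placeholders only.
toy_cases = {
    "pulse_A": ["imp_1", "imp_1", "imp_3"],
    "pulse_B": ["imp_2", "imp_3"],
    "pulse_C": ["imp_1", "imp_3", "imp_4"],
}
toy_weights = tf_idf(toy_cases)
```

In this toy data `imp_3` co-occurs with every pulse, so its weight is zero everywhere: a term shared by all documents does not help distinguish any of them, which is the property that makes TF-IDF useful for ranking relevance.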

Improving Classification Accuracy in Hierarchical Trees via Greedy Node Expansion

  • Byungjin Lim;Jong Wook Kim
    • Journal of the Korea Society of Computer and Information / v.29 no.6 / pp.113-120 / 2024
  • With the advancement of information and communication technology, we can easily generate various forms of data in our daily lives. To efficiently manage such a large amount of data, systematic classification into categories is essential. For effective search and navigation, data is organized into a tree-like hierarchical structure known as a category tree, which is commonly seen in news websites and Wikipedia. As a result, various techniques have been proposed to classify large volumes of documents into the terminal nodes of category trees. However, document classification methods using category trees face a problem: as the height of the tree increases, the number of terminal nodes multiplies exponentially, which increases the probability of misclassification and ultimately leads to a reduction in classification accuracy. Therefore, in this paper, we propose a new node expansion-based classification algorithm that satisfies the classification accuracy required by the application, while enabling detailed categorization. The proposed method uses a greedy approach to prioritize the expansion of nodes with high classification accuracy, thereby maximizing the overall classification accuracy of the category tree. Experimental results on real data show that the proposed technique provides improved performance over naive methods.
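The abstract describes the greedy idea only in outline, so the following is a loose sketch rather than the authors' algorithm. It assumes each node carries a hypothetical `share` (fraction of all documents under it) and `accuracy` (estimated accuracy when that node is used as a terminal category), starts from the root, and repeatedly expands whichever node keeps the weighted overall accuracy highest, stopping when no expansion stays at or above the required accuracy. The `Node` class, the accuracy model, and the toy tree are all assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    share: float      # hypothetical: fraction of all documents under this node
    accuracy: float   # hypothetical: accuracy when used as a terminal category
    children: list = field(default_factory=list)


def overall_accuracy(frontier):
    """Document-weighted accuracy over the current terminal categories."""
    return sum(node.share * node.accuracy for node in frontier)


def greedy_expand(root, required_accuracy):
    """Expand nodes greedily while overall accuracy stays >= required_accuracy."""
    frontier = [root]
    while True:
        best = None
        for node in frontier:
            if not node.children:
                continue
            # Candidate frontier: replace this node with its children.
            candidate = [n for n in frontier if n is not node] + node.children
            acc = overall_accuracy(candidate)
            if acc >= required_accuracy and (best is None or acc > best[0]):
                best = (acc, candidate)
        if best is None:
            return frontier  # no admissible expansion remains
        frontier = best[1]


# Hypothetical category tree with made-up shares and accuracies.
toy_tree = Node("root", 1.0, 1.0, [
    Node("news", 0.6, 0.95, [
        Node("politics", 0.3, 0.90),
        Node("sports", 0.3, 0.92),
    ]),
    Node("science", 0.4, 0.90, [
        Node("physics", 0.2, 0.60),
        Node("biology", 0.2, 0.70),
    ]),
])
result = greedy_expand(toy_tree, required_accuracy=0.85)
```

With the toy numbers, the accurate "news" branch is split into finer categories while the less accurate "science" branch stays coarse, which is the trade-off between detail and accuracy that the abstract describes.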

The Bethesda System 2001 Workshop Report (The Bethesda System 2001의 최신지견)

  • Hong, Eun-Kyung;Nam, Jong-Hee;Park, Moon-Hyang
    • The Korean Journal of Cytopathology / v.12 no.1 / pp.1-15 / 2001
  • The Bethesda System (TBS) was first developed in 1988 to improve the communication of cytopathologic findings to the referring physician in unambiguous diagnostic terms. The terminology used in this reporting system should reflect the current understanding of the pathogenesis of cervical/vaginal disease, so the framework of the reporting system should be flexible enough to accommodate advances in medicine, including virology, molecular biology, and pathology. Three years after the introduction of TBS, the second Bethesda workshop was held to set or amend the diagnostic criteria for each category of TBS. TBS 1991 is now widely used. The third Bethesda workshop, The Bethesda System 2001 Workshop, was held at the National Cancer Institute in Bethesda, Maryland, from April 30 to May 2, 2001. Again, the goals of this workshop were to promote effective and clear communication of cervical cytopathology results to clinicians and to provide them with the information needed to make appropriate decisions about diagnosis and treatment. Nine forum groups were formed, and Web-based bulletin board discussions were held between October 2000 and the first week of April 2001. On the basis of the bulletin board comments and discussions, the forum moderators recommended revised terminology at the workshop. Lively discussion followed the forum moderators' presentations during the workshop. Terms that confused clinicians and provided no additional information for patient management were deleted to clarify cervicovaginal cytology results, and the addition of any information related to patient management was encouraged. Accordingly, 'Satisfactory for evaluation but limited by...' in the 'Specimen Adequacy' category was deleted. The term 'Unsatisfactory' was further specified as 'Specimen rejected' and 'Specimen processed and examined, but unsatisfactory'. The terms 'Benign Cellular Change' and 'Within Normal Limits' were combined and renamed 'Negative for intraepithelial lesion or malignancy'. In the General Categorization, a new category, 'Other', was added, and the presence of 'Endometrial cells' in women over 40 years old can be indicated. Although the category 'Benign Cellular Change' was deleted, the organisms or reactive changes of this category can be listed in the descriptive diagnoses. The terms ASCUS and AGUS were changed to atypical squamous cell and atypical glandular cell, respectively. The diagnostic term 'Adenocarcinoma in situ', which is highly reproducible with reliable diagnostic criteria, was newly added. The category of hormonal evaluation was deleted. Criteria for liquid-based specimens were discussed. Reporting by computer-assisted cytology was discussed, and terminology for automated review was newly added. This is not the final edition of Bethesda 2001. The final document can be prepared before the ASCCP meeting in September 2001, at which the Consensus Guidelines for the Management of Cytology Abnormalities and Cervical Precursors will be developed.

Design and Implementation of Web-based Problem Management System for CT Radiological Technologist Education (CT 전문방사선사 교육을 위한 웹기반 문항관리 시스템의 설계 및 구현)

  • Shin Yong-Won;Koo Bong-Oh;Shim Choon-Bo
    • The Journal of the Korea Contents Association / v.5 no.1 / pp.27-35 / 2005
  • Despite the rapid progress of information technology in the medical and health fields, problem sets for the education of radiological technologists are still developed and managed manually and offline using document editors. In this study, a web-based problem management system is designed and implemented. The system can efficiently manage and present various kinds of problem sets for integrated education and personal licensing without time and space limitations, in order to improve the efficiency of supplementary training and to help candidates obtain the professional license for CT radiological technologists. The proposed system is composed of an administration module and a user module. The former supports functions such as problem creation, problem categorization, user management, and adjustment of leveled assessment. The latter provides functions such as examination application, problem retrieval, personal score retrieval, and viewing of interpretations. In addition, the system is expected to be useful and practical because it provides problem interpretations and an analysis of score results after the examination is taken. It can improve the learning ability of, and information exchange among, those preparing for the CT professional radiological technologist licensing examination.

A Study on Evaluation System of River Levee Safety Map to Improve Maintenance Efficiency and Disaster Responsiveness (하천제방의 유지관리 효율성 및 재해 대응성 향상을 위한 하천제방 안전도맵 평가체계 연구)

  • Kim, Jin-Man;Moon, In-Jong;Yoon, Kwang-Seok;Kim, Soo-Young
    • Journal of the Korea Academia-Industrial cooperation Society / v.19 no.9 / pp.20-29 / 2018
  • Owing to climate change and recent flood events, flood damage caused by river levee collapse and overflow is on the rise in Korea, making it necessary to enhance river levee maintenance technologies to deal with various flood damage scenarios. This paper proposes an evaluation system for a river levee safety map to improve maintenance efficiency and disaster responsiveness. A river levee safety map, which indicates sliding, piping, visual inspection, scouring, and the safety index of levee fill material on a GIS map, enables dangerous zones to be identified visually and proactive measures to be developed. This will maximize river levee maintenance efficiency and is a break from traditional practice, in which restoration measures are taken only after damage has occurred. This study adds scouring and levee fill material to the previously proposed sliding, piping, and visual inspection items. The research activities include 1) categorization of scouring and levee fill material based on document and data examination, 2) evaluation of sliding and piping at 5 locations on the left levee of the Nam River according to the duration of the flood water level, and 3) evaluation of the characteristics of scouring and levee fill material at 9 locations on the left and right levees of the Nam River. The river levee safety map proposed in this study is expected to be more useful and practical, but further study is required on the manual for river management organizations, repair and reinforcement methods, and budget.

Categorization of POIs Using Word and Context information (관심 지점 명칭의 단어와 문맥 정보를 활용한 관심 지점의 분류)

  • Choi, Su Jeong;Park, Seong-Bae
    • Journal of the Korean Institute of Intelligent Systems / v.24 no.5 / pp.470-476 / 2014
  • A point of interest (POI) is a specific point location such as a cafe, a gallery, a shop, or a park. It consists of a name, a category, a location, and so on. This information is necessary for location-based applications, and the category is the most basic of all. However, category information should be gathered automatically because gathering it manually is costly. In this paper, we propose a novel method that estimates the category of a POI automatically using inner words and local context. An inner word is a word contained in a POI's name. Names sometimes expose category information, so they are used as inner word information when estimating the category. Local context information consists of the words around a POI's name in a document that mentions the name; this context also carries information for estimating the category. The proposed method is evaluated on two data sets. According to the experimental results, the model that combines inner words and local context achieves higher accuracy than the models that use either one alone.
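The two kinds of evidence described in the abstract can be made concrete with a small feature-extraction sketch. It is only an illustration under assumptions: the whitespace-and-punctuation tokenizer, the window size, and the function name `poi_features` are hypothetical, and the abstract does not describe the classifier that consumes these features.

```python
import re


def tokenize(text):
    """Lowercase word tokenizer (an assumption; the paper's is not specified)."""
    return re.findall(r"\w+", text.lower())


def poi_features(name, document, window=3):
    """Return (inner_words, context_words) for a POI name in a document.

    inner_words:   the words that make up the POI's name
    context_words: up to `window` words on each side of every mention
    """
    inner = tokenize(name)
    if not inner:
        return inner, []
    tokens = tokenize(document)
    context = []
    n = len(inner)
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == inner:
            context.extend(tokens[max(0, i - window):i])   # words before
            context.extend(tokens[i + n:i + n + window])   # words after
    return inner, context


# Hypothetical example sentence and POI name.
inner_words, context_words = poi_features(
    "Blue Bottle Cafe",
    "We had a latte at Blue Bottle Cafe before visiting the gallery.",
)
```

Here the inner word "cafe" hints at the category directly, while context words such as "latte" supply evidence when the name itself is uninformative, which is why combining the two can outperform either alone.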

Primary Management Factors for Collaboration among Participants in Technical Proposal Tendering (기술제안입찰 참여자간의 협업지원을 위한 중점협업관리요소 도출)

  • Koo, Seonkeun;Lim, Susang;Yoon, Yousang;Han, Sangwon;Hyun, Changtaek
    • Korean Journal of Construction Engineering and Management / v.17 no.5 / pp.3-12 / 2016
  • Recently, the government has moved to expand its policy of promoting technical proposal tendering in order to reinforce technical competitiveness. A variety of complicated techniques are applied in technical proposal tendering, and variables can arise in cost, schedule, constructability, and other areas when the techniques are reflected in the design documents; nevertheless, collaboration management among participants is given little weight. This research therefore determines primary management factors and presents a management direction for collaboration among participants. First, the factors hindering collaboration are categorized into five groups: 'Poor work processing', 'Communication gap among participants', 'Lack of understanding about technical proposal tendering', 'Difficulty of decision making', and 'Insufficiency in managing the work data'. Second, a correlation analysis is conducted between the categorized factors and the participants for each task in technical proposal tendering to determine the degree of correlation between the variables. Where the correlation between variables is strong, the hindrance factor is regarded as a primary management factor for collaboration, and finally a management direction is presented for each task.