• Title/Summary/Keyword: Class Number

Performance Improvement of Nearest-neighbor Classification Learning through Prototype Selections (프로토타입 선택을 이용한 최근접 분류 학습의 성능 개선)

  • Hwang, Doo-Sung
    • Journal of the Institute of Electronics Engineers of Korea CI / v.49 no.2 / pp.53-60 / 2012
  • Nearest-neighbor classification predicts the class of an input as the most frequent class among the training data nearest to it. Although nearest-neighbor classification has no training stage, all of the training data are needed at prediction time, and generalization performance depends on the quality of the training data. Therefore, as the training set grows, nearest-neighbor classification requires more memory and more computation time for prediction. In this paper, we propose a prototype selection algorithm that predicts the class of test data using a new set of prototypes consisting of near-boundary training data. Based on Tomek links and a distance metric, the proposed algorithm selects boundary data and decides whether each selected datum is added to the prototype set by considering class and distance relationships. In the experiments, the number of prototypes is much smaller than the size of the original training data, yielding reduced storage and faster prediction in nearest-neighbor classification.
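
The idea of keeping only near-boundary data can be illustrated with a minimal Python sketch. It is not the paper's algorithm, whose selection rules are not given in the abstract. It only finds Tomek links (pairs of points from different classes that are each other's nearest neighbor) and uses the linked points as prototypes for 1-nearest-neighbor prediction. The function names and the toy 1-D data are hypothetical.

```python
import math

def nearest_index(points, i):
    """Index of the point closest to points[i], excluding i itself."""
    best, best_d = None, math.inf
    for j, q in enumerate(points):
        if j != i:
            d = math.dist(points[i], q)
            if d < best_d:
                best, best_d = j, d
    return best

def tomek_links(points, labels):
    """Pairs (i, j), i < j, of mutual nearest neighbors with different labels."""
    links = []
    for i in range(len(points)):
        j = nearest_index(points, i)
        if i < j and labels[i] != labels[j] and nearest_index(points, j) == i:
            links.append((i, j))
    return links

def predict_1nn(prototypes, proto_labels, x):
    """Label of the prototype nearest to x."""
    k = min(range(len(prototypes)), key=lambda i: math.dist(prototypes[i], x))
    return proto_labels[k]

# Toy 1-D data (hypothetical): class A on the left, class B on the right.
points = [(0.0,), (1.0,), (2.0,), (2.5,), (3.5,), (4.5,)]
labels = ["A", "A", "A", "B", "B", "B"]

links = tomek_links(points, labels)
keep = sorted({i for pair in links for i in pair})
prototypes = [points[i] for i in keep]
proto_labels = [labels[i] for i in keep]

print(links)  # the single boundary pair
print(predict_1nn(prototypes, proto_labels, (0.3,)),
      predict_1nn(prototypes, proto_labels, (4.0,)))
```

In this toy case two prototypes stand in for six training points, which is the storage and prediction-time saving the abstract describes. A practical method would need further rules for deciding which boundary points to keep.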

Combined Application of Data Imbalance Reduction Techniques Using Genetic Algorithm (유전자 알고리즘을 활용한 데이터 불균형 해소 기법의 조합적 활용)

  • Jang, Young-Sik;Kim, Jong-Woo;Hur, Joon
    • Journal of Intelligence and Information Systems / v.14 no.3 / pp.133-154 / 2008
  • The data imbalance problem, which can be encountered in data mining classification problems, typically means that one class has far more or far fewer instances than the other classes. To solve the data imbalance problem, a number of techniques have been proposed based on re-sampling with replacement, adjusting decision thresholds, and adjusting the costs of the different classes. In this paper, we study the feasibility of combining the previously proposed techniques, and suggest a combination method that uses a genetic algorithm to find the optimal combination ratio of the techniques. To improve the prediction accuracy of the minority class, we determine the combination ratio using the F-value of the minority class as the fitness function of the genetic algorithm. To compare the performance with that of single techniques and of a matrix-style combination with random percentages, we performed experiments using four public datasets that are commonly used to compare methods for the data imbalance problem. The experimental results demonstrate the usefulness of the proposed method.
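
As a rough illustration of the fitness measure mentioned above, the following Python sketch computes the F-value of the minority class from true and predicted labels. It assumes the standard F-measure definition with an optional `beta` weight. How the paper encodes combination ratios in the genetic algorithm is not stated in the abstract and is not modeled here. The function name and toy labels are hypothetical.

```python
def minority_f_value(y_true, y_pred, minority, beta=1.0):
    """F-measure of the minority class; returns 0.0 if it is never hit."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == minority and p == minority)
    fp = sum(1 for t, p in pairs if t != minority and p == minority)
    fn = sum(1 for t, p in pairs if t == minority and p != minority)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)

# Toy labels (hypothetical): class 1 is the minority (3 of 10 instances).
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
all_majority = [0] * 10

print(minority_f_value(y_true, y_pred, minority=1))
print(minority_f_value(y_true, all_majority, minority=1))
```

Always predicting the majority class is 70% accurate on this toy data yet scores an F-value of 0, which is one reason a minority-class F-value is a more informative fitness function than accuracy when classes are imbalanced.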

Diet Composition of Coilia nasus in the Coastal Waters off Gori, Korea (고리 주변해역에서 출현하는 웅어 (Coilia nasus)의 위내용물 조성)

  • Baeck, Gun-Wook;Park, Joo-Myun;Choo, Hyun-Gi;Huh, Sung-Hoi
    • Korean Journal of Ichthyology / v.23 no.2 / pp.163-167 / 2011
  • The feeding habits of Coilia nasus were studied using 107 specimens collected from January to December 2005 in the coastal waters off Gori, Korea. The size of C. nasus ranged from 8.4 to 29.5 cm in standard length (SL). C. nasus was a carnivore that mainly consumed shrimps and copepods. Its diet also included small quantities of amphipods, euphausiids and chaetognaths. The graphical method for feeding strategy revealed that C. nasus was a specialized feeder with a narrow niche width. Both the small and large size classes of C. nasus mainly consumed shrimps and copepods, and did not show significant size-related changes in feeding habits. However, the mean number and weight of prey per stomach were higher in the large size class than in the small size class.

A Study on Establishing the School Grouping System of Middle School -Focusing the Middle School in Gwangju Metropolitan City- (중학교 학교군 및 중학구 설정을 위한 조사 연구 -광주광역시 중학교를 중심으로-)

  • Lee, Hwa-Ryoung;Ha, Bong-Woon;Dong, Jae-Wook
    • Journal of the Korean Institute of Educational Facilities / v.18 no.3 / pp.3-11 / 2011
  • This study aims to propose reform measures for the middle school grouping system in Gwangju Metropolitan City, which divides 86 middle schools into 10 clusters and 3 school districts. To this end, it analyzes the current educational environment and student walking distance in each school district, including the number of students per teacher, student density, school size and the gender ratio in classes. It also surveys 5,363 middle school students, 3,966 parents and 1,007 teachers, and evaluates their satisfaction with and needs regarding the student allocation system. The survey and data analysis reveal problems in some school districts: gender imbalance in classes, a preference for private middle schools and inconvenience in commuting to school. To solve these problems, the study suggests alternatives to the current system. First, it sets out basic principles detailed in three action plans, which emphasize adherence to close-range allocation, appropriate school and class sizes, and the equalization of the educational environment. Second, it proposes an information system for managing school districts more objectively and transparently. Finally, it gives a concrete proposal that reorganizes the current 10 school groups into 11. The result is expected to ease the gender imbalance and the concentration of private middle schools, and to improve students' walking conditions on the way to school.

Extraction Method of Significant Clinical Tests Based on Data Discretization and Rough Set Approximation Techniques: Application to Differential Diagnosis of Cholecystitis and Cholelithiasis Diseases (데이터 이산화와 러프 근사화 기술에 기반한 중요 임상검사항목의 추출방법: 담낭 및 담석증 질환의 감별진단에의 응용)

  • Son, Chang-Sik;Kim, Min-Soo;Seo, Suk-Tae;Cho, Yun-Kyeong;Kim, Yoon-Nyun
    • Journal of Biomedical Engineering Research / v.32 no.2 / pp.134-143 / 2011
  • Selecting meaningful clinical tests and their reference values from high-dimensional clinical data with an imbalanced class distribution, in which one class is represented by a large number of examples while the other is represented by only a few, is an important but difficult issue for differential diagnosis between similar diseases. For this purpose, this study introduces methods based on the concepts of the discernibility matrix and discernibility function in rough set theory (RST), combined with two discretization approaches: equal-width and equal-frequency discretization. Here, the discretization approaches are used to define the reference values for clinical tests, and the discernibility matrix and function are used to extract a subset of significant clinical tests from the translated nominal attribute values. To show their applicability to the differential diagnosis problem, we applied them to extract the significant clinical tests and their reference values that distinguish a normal group (N = 351) from an abnormal group (N = 101) with either cholecystitis or cholelithiasis. In addition, we investigated not only the selected significant clinical tests and the variations in their reference values, but also the average predictive performance on four evaluation criteria, i.e., accuracy, sensitivity, specificity, and geometric mean, during 10-fold cross-validation. From the experimental results, we confirmed that the rough set approximation methods based on the two discretization approaches give better results with relative frequency than with absolute frequency in terms of the average geometric mean. This shows that the prediction model using relative frequency can be used effectively in classification and prediction problems on clinical data with an imbalanced class distribution.
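
The two discretization approaches and the geometric-mean criterion named above can be sketched in Python as follows. This is a generic illustration under common textbook definitions, not the paper's implementation. The rough-set discernibility step is omitted, and the function names and sample values are hypothetical.

```python
import math

def equal_width_cuts(values, bins):
    """Cut points that split [min, max] into intervals of equal width."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    return [lo + width * i for i in range(1, bins)]

def equal_frequency_cuts(values, bins):
    """Cut points that put roughly the same number of values in each bin."""
    s = sorted(values)
    n = len(s)
    return [s[(n * i) // bins] for i in range(1, bins)]

def discretize(value, cuts):
    """Bin index of value: the number of cut points it reaches or exceeds."""
    return sum(1 for c in cuts if value >= c)

def geometric_mean(sensitivity, specificity):
    """Geometric mean of sensitivity and specificity."""
    return math.sqrt(sensitivity * specificity)

# Hypothetical test values with one outlier.
values = [1, 2, 3, 4, 5, 6, 7, 8, 100]
width_cuts = equal_width_cuts(values, 3)
freq_cuts = equal_frequency_cuts(values, 3)

print(width_cuts, [discretize(v, width_cuts) for v in values])
print(freq_cuts, [discretize(v, freq_cuts) for v in values])
print(geometric_mean(0.9, 0.4))
```

With an outlier present, equal-width binning places nearly all values in the first interval, while equal-frequency binning spreads them evenly. The geometric mean drops sharply when either sensitivity or specificity is low, which makes it a common criterion for imbalanced classes.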

Clinical Analysis of the Arterial Bypass Surgery for Chronic Ischemia of the Lower Extremities (하지 만성 허혈에 대한 동맥 우회술의 임상적 고찰)

  • 안정태
    • Journal of Chest Surgery / v.28 no.7 / pp.678-683 / 1995
  • Arterial bypass for chronic ischemia of the lower extremities caused by atherosclerosis obliterans has been performed with a number of alternative conduits since 1941 by Kunlin. It is indicated for limb salvage in patients with threatened limb loss, despite several controversies in the surgical treatment of atherosclerosis obliterans. From March 1991 to January 1995, 26 arterial bypasses were performed in 23 patients with chronic ischemia of the lower extremities at our hospital. The mean follow-up period was 18.9 months (range, 4 to 44 months). The mean age was 60.9 years (range, 47 to 76 years), and the highest incidence was in the 6th decade. Twenty-one patients were male and 2 were female. Nineteen of the 23 patients were smokers. The Fontaine clinical classifications were class II (21.7%), class III (34.8%) and class IV (43.5%). Associated conditions were diabetes mellitus (47.8%), hypertension (43.5%), hyperlipidemia (26%), tuberculosis (21.7%), cerebrovascular accident (13.0%) and cardiac diseases (8.7%). The surgical procedures were aorto-single femoral bypass in 4 cases, aorto-bifemoral bypass in 5 cases, aortofemoral & femoropopliteal bypass in 2 cases, femoropopliteal bypass in 10 cases, popliteotibial bypass in 3 cases, and femoropedal bypass (composite graft bypass) in 2 cases. Complications included early thrombosis requiring immediate reoperation in 4 cases, wound infection in 3 cases, hematoma in 3 cases, and so on. The postoperative complication rate was 53.8%. Postoperative patency rates were 84.6% at 6 months, 75.0% at 1 year, 70.0% at 2 years and 66.7% at 3 years. We usually used 6 mm and 8 mm grafts for bypass, and the rate of thrombosis formation was 28.6% (2/7) with 6 mm grafts and 12.5% (2/16) with 8 mm grafts. By graft material, the rate of thrombosis formation was higher in the group using artificial grafts than in the group using autologous saphenous vein (16.6% vs 12.5%). The limb salvage rate was 76.9%. The postoperative mortality rate was 0%.

Linguistic Characteristics of Domestic Men's Formal Wear Brand Names

  • Kwon, Hae-Sook
    • Journal of Fashion Business / v.14 no.6 / pp.11-22 / 2010
  • The main purpose of this research was to examine the linguistic characteristics of domestic men's formal wear brand names. Four linguistic characteristics (language type, combined structure type of language, word class, and length of brand name) were investigated, and differences between brand types were also examined. For sample selection, 209 men's fashion brands were selected from the '2009 Korea Fashion Yearbook', and 25 brands for which adequate information about the brand name or naming could not be collected were excluded. Among the total of 184 men's brand names, 66 men's formal wear brands were selected and studied. For data analysis, quantitative evaluation of frequency and qualitative evaluation were used. The results are as follows. (1) Seven language types were found in domestic men's formal wear brand names. English was used the most, followed by Italian and French. (2) For the combined structure type of brand name language, the single word was used the most, followed by the separately combined word type, the artificially combined word type, and the unified word type. (3) The most frequently used word class was the noun, followed by the phrase, adjective, and verb. Among nouns, 6 different types were found, expressing a person, a concrete entity, an abstract entity, a place, an acronym, and a neologism. Among phrases, only noun phrases appeared; however, 6 out of 20 phrases were abbreviated. All eight adjective brand names implied an attribute of the brand, such as 'Dainty' or 'Solus (Solo)'. (4) Long names were used the most, followed by normal-length and short names. By number of syllables, 4 syllables appeared the most, followed by 3, 5, and 6 syllables, then 2 and 7 syllables (at the same rate), and finally 8 syllables. (5) The comparison across brand types showed differences in language type, combined language structure, and word class, but not in brand name length.

Comparison of Digital Number Distribution Changes of Each Class according to Atmospheric Correction in LANDSAT-5 TM (LANDSAT-5 TM 영상의 대기보정에 따른 클래스별 화소값 분포 변화 비교)

  • Jung, Tae-Woong;Eo, Yang-Dam;Jin, Tailie;Lim, Sang-Boem;Park, Doo-Youl;Park, Hwang-Soo;Piao, Minghe;Park, Wan-Yong
    • Korean Journal of Remote Sensing / v.25 no.1 / pp.11-20 / 2009
  • Due to the increasing frequency of yellow dust, as well as the high rate of precipitation and cloud formation during the Korean summer, atmospheric correction of satellite remote sensing imagery is necessary. This research analyzes the effect that atmospheric correction has on image classification by comparing the DN (digital number) distribution before and after atmospheric correction. The imagery used in the research is LANDSAT-5 TM. For atmospheric correction, the commercial products ATCOR and FLAASH, as well as the COST model released on the internet, were used. The experimental results show that class separability increased in building areas.

A Polynomial Time Algorithm for Edge Coloring Problem (간선 색칠 문제의 다항시간 알고리즘)

  • Lee, Sang-Un
    • Journal of the Korea Society of Computer and Information / v.18 no.11 / pp.159-165 / 2013
  • This paper proposes an O(E) polynomial-time algorithm devised to simultaneously solve the edge-coloring problem and the graph classification problem, both of which remain NP-complete. The proposed algorithm selects an edge connecting maximum- and minimum-degree vertices so as to determine the edge chromatic number $\chi'(G)$. The determined $\chi'(G)$ is in turn either $\Delta(G)$ or $\Delta(G)+1$. The graph is then classified as class 1 if $\chi'(G)=\Delta(G)$ and as class 2 if $\chi'(G)=\Delta(G)+1$. This paper also proves Vizing's planar graph conjecture, which states that all simple planar graphs with maximum degree six or seven are of class one; this closes the remaining possible case, which has been known to be NP-complete.
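
The class 1 / class 2 distinction can be made concrete with a small Python sketch. This is not the O(E) algorithm proposed in the paper, whose steps are not given in the abstract. It is an exponential-time backtracking search, suitable only for tiny graphs, that tests whether a simple graph can be edge-colored with $\Delta(G)$ colors. By Vizing's theorem a simple graph that cannot be needs exactly $\Delta(G)+1$. The function names and example graphs are hypothetical.

```python
def max_degree(edges):
    """Maximum vertex degree of a simple graph given as an edge list."""
    deg = {}
    for u, v in edges:
        deg[u] = deg.get(u, 0) + 1
        deg[v] = deg.get(v, 0) + 1
    return max(deg.values())

def edge_colorable(edges, k):
    """True if the edges can be colored with k colors so that no two
    edges sharing a vertex receive the same color (backtracking)."""
    colors = [None] * len(edges)

    def conflicts(i, c):
        u, v = edges[i]
        # Only edges before i have been assigned a color at this point.
        return any(colors[j] == c and (u in edges[j] or v in edges[j])
                   for j in range(i))

    def solve(i):
        if i == len(edges):
            return True
        for c in range(k):
            if not conflicts(i, c):
                colors[i] = c
                if solve(i + 1):
                    return True
        colors[i] = None
        return False

    return solve(0)

def graph_class(edges):
    """1 if the chromatic index equals the maximum degree, otherwise 2."""
    return 1 if edge_colorable(edges, max_degree(edges)) else 2

triangle = [(0, 1), (1, 2), (0, 2)]                     # max degree 2
k4 = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]   # max degree 3

print(graph_class(triangle), graph_class(k4))
```

The triangle has maximum degree 2 but needs 3 colors, so it is class 2. The complete graph on four vertices splits into three perfect matchings, so 3 colors suffice and it is class 1.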

A Study on the Effectiveness of e-learning video class using the online learning judgement system : Focused on the social studies classes in Elementary school (온라인 학습판단 시스템을 활용한 e-러닝 동영상 수업의 효과연구 : 초등학교 사회과 수업을 중심으로)

  • Kim, Jihyun;Jung, Jaebum;Jo, Jaechoon;Lim, Heuiseok
    • Journal of the Korea Convergence Society / v.10 no.2 / pp.141-148 / 2019
  • The purpose of this study is to analyze and compare the effectiveness of e-learning video lessons in elementary school. In elementary schools, where educational videos are frequently used, learning from video materials is important, but it is difficult for one teacher to assess every student in a classroom. To address this problem in the field, in a fifth-grade elementary school social studies class, the experimental group learned from video material using the online learning judgment system, while the control group learned from video material using the traditional method. The experiment showed that the class using the online learning judgment system was effective in enhancing learners' academic achievement. It also positively influenced learners' learning satisfaction. The difference in teachers' satisfaction was not statistically significant because of the small number of teachers. However, the mean teacher satisfaction in the experimental group was high and the deviation was small.