• Title/Summary/Keyword: Feature evaluation

Search Result 974, Processing Time 0.03 seconds

A Study on the Drug Classification Using Machine Learning Techniques (머신러닝 기법을 이용한 약물 분류 방법 연구)

  • Anmol Kumar Singh;Ayush Kumar;Adya Singh;Akashika Anshum;Pradeep Kumar Mallick
    • Advanced Industrial SCIence
    • /
    • v.3 no.2
    • /
    • pp.8-16
    • /
    • 2024
  • This paper shows the system of drug classification, the goal of this is to foretell the apt drug for the patients based on their demographic and physiological traits. The dataset consists of various attributes like Age, Sex, BP (Blood Pressure), Cholesterol Level, and Na_to_K (Sodium to Potassium ratio), with the objective to determine the kind of drug being given. The models used in this paper are K-Nearest Neighbors (KNN), Logistic Regression and Random Forest. Further to fine-tune hyper parameters using 5-fold cross-validation, GridSearchCV was used and each model was trained and tested on the dataset. To assess the performance of each model both with and without hyper parameter tuning evaluation metrics like accuracy, confusion matrices, and classification reports were used and the accuracy of the models without GridSearchCV was 0.7, 0.875, 0.975 and with GridSearchCV was 0.75, 1.0, 0.975. According to GridSearchCV Logistic Regression is the most suitable model for drug classification among the three-model used followed by the K-Nearest Neighbors. Also, Na_to_K is an essential feature in predicting the outcome.

A Comparative Study on Physics Inquiry Activities of Science Textbooks for Secondary School in Korea and Singapore (한국과 싱가포르의 중학교 과학 교과서의 물리 영역 탐구 활동의 특징 비교)

  • Lee, Jae-Bong;Shin, Kwang-Moon;Park, Jong-Chan;Kim, Dong-Hoon;Lee, Sung-Muk;Kim, Tae-Il
    • Journal of The Korean Association For Science Education
    • /
    • v.27 no.7
    • /
    • pp.547-558
    • /
    • 2007
  • The purpose of this study is to compare inquiry activities in science textbooks' physics contents for Korean secondary schools with those of Singapore in order to provide a reference for further improvement of inquiry activities in Korean science textbooks. We analyzed inquiry activities using the framework of Millar et al.(1998) and Chinn & Malhotra (2002). The results of this study are as follows: There are differences between Korean and Singaporean inquiry activities in the area of 'learning objectives', 'students' thinking activities' and 'degree of openness'. In the area of 'learning objectives', the Korean textbooks have more activities associated with the learning of science content than those in Singaporean, whereas the Singaporean textbooks have more activities associated with the processes of scientific inquiry than in Korean textbooks. In the area of 'students' thinking activities', the Singaporean textbooks have activities like 'test a prediction', which Korean textbooks lack. The 'degree of openness' is higher in Singaporean textbooks than in Korean textbooks. And some differences in the area of 'authentic scientific inquiry' between Korean and Singaporean textbooks were also found. While the Korean textbooks do not have any activities associated with 'generating research questions', the Singaporean ones feature such activities. In the area of 'designing studies', the Singaporean textbooks have activities corresponding to 'selecting variables' and 'controlling variables', while the Korean ones never have such activities. The results of this study imply that it is necessary to balance inquiry activities in the area of 'learning objectives', 'students' thinking activities' and 'degree of openness', and to present activities close to authentic scientific inquiry in inquiry activities in textbooks.

Development of the Cloud Monitoring Program using Machine Learning-based Python Module from the MAAO All-sky Camera Images (기계학습 기반의 파이썬 모듈을 이용한 밀양아리랑우주천문대 전천 영상의 운량 모니터링 프로그램 개발)

  • Gu Lim;Dohyeong Kim;Donghyun Kim;Keun-Hong Park
    • Journal of the Korean earth science society
    • /
    • v.45 no.2
    • /
    • pp.111-120
    • /
    • 2024
  • Cloud coverage is a key factor in determining whether to proceed with observations. In the past, human judgment played an important role in weather evaluation for observations. However, the development of remote and robotic observation has diminished the role of human judgment. Moreover, it is not easy to evaluate weather conditions automatically because of the diverse cloud shapes and their rapid movement. In this paper, we present the development of a cloud monitoring program by applying a machine learning-based Python module "cloudynight" on all-sky camera images obtained at Miryang Arirang Astronomical Observatory (MAAO). The machine learning model was built by training 39,996 subregions divided from 1,212 images with altitude/azimuth angles and extracting 16 feature spaces. For our training model, the F1-score from the validation samples was 0.97, indicating good performance in identifying clouds in the all-sky image. As a result, this program calculates "Cloudiness" as the ratio of the number of total subregions to the number of subregions predicted to be covered by clouds. In the robotic observation, we set a policy that allows the telescope system to halt the observation when the "Cloudiness" exceeds 0.6 during the last 30 minutes. Following this policy, we found that there were no improper halts in the telescope system due to incorrect program decisions. We expect that robotic observation with the 0.7 m telescope at MAAO can be successfully operated using the cloud monitoring program.

Incorporating Social Relationship discovered from User's Behavior into Collaborative Filtering (사용자 행동 기반의 사회적 관계를 결합한 사용자 협업적 여과 방법)

  • Thay, Setha;Ha, Inay;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.1-20
    • /
    • 2013
  • Nowadays, social network is a huge communication platform for providing people to connect with one another and to bring users together to share common interests, experiences, and their daily activities. Users spend hours per day in maintaining personal information and interacting with other people via posting, commenting, messaging, games, social events, and applications. Due to the growth of user's distributed information in social network, there is a great potential to utilize the social data to enhance the quality of recommender system. There are some researches focusing on social network analysis that investigate how social network can be used in recommendation domain. Among these researches, we are interested in taking advantages of the interaction between a user and others in social network that can be determined and known as social relationship. Furthermore, mostly user's decisions before purchasing some products depend on suggestion of people who have either the same preferences or closer relationship. For this reason, we believe that user's relationship in social network can provide an effective way to increase the quality in prediction user's interests of recommender system. Therefore, social relationship between users encountered from social network is a common factor to improve the way of predicting user's preferences in the conventional approach. Recommender system is dramatically increasing in popularity and currently being used by many e-commerce sites such as Amazon.com, Last.fm, eBay.com, etc. Collaborative filtering (CF) method is one of the essential and powerful techniques in recommender system for suggesting the appropriate items to user by learning user's preferences. CF method focuses on user data and generates automatic prediction about user's interests by gathering information from users who share similar background and preferences. Specifically, the intension of CF method is to find users who have similar preferences and to suggest target user items that were mostly preferred by those nearest neighbor users. There are two basic units that need to be considered by CF method, the user and the item. Each user needs to provide his rating value on items i.e. movies, products, books, etc to indicate their interests on those items. In addition, CF uses the user-rating matrix to find a group of users who have similar rating with target user. Then, it predicts unknown rating value for items that target user has not rated. Currently, CF has been successfully implemented in both information filtering and e-commerce applications. However, it remains some important challenges such as cold start, data sparsity, and scalability reflected on quality and accuracy of prediction. In order to overcome these challenges, many researchers have proposed various kinds of CF method such as hybrid CF, trust-based CF, social network-based CF, etc. In the purpose of improving the recommendation performance and prediction accuracy of standard CF, in this paper we propose a method which integrates traditional CF technique with social relationship between users discovered from user's behavior in social network i.e. Facebook. We identify user's relationship from behavior of user such as posts and comments interacted with friends in Facebook. We believe that social relationship implicitly inferred from user's behavior can be likely applied to compensate the limitation of conventional approach. Therefore, we extract posts and comments of each user by using Facebook Graph API and calculate feature score among each term to obtain feature vector for computing similarity of user. Then, we combine the result with similarity value computed using traditional CF technique. Finally, our system provides a list of recommended items according to neighbor users who have the biggest total similarity value to the target user. In order to verify and evaluate our proposed method we have performed an experiment on data collected from our Movies Rating System. Prediction accuracy evaluation is conducted to demonstrate how much our algorithm gives the correctness of recommendation to user in terms of MAE. Then, the evaluation of performance is made to show the effectiveness of our method in terms of precision, recall, and F1-measure. Evaluation on coverage is also included in our experiment to see the ability of generating recommendation. The experimental results show that our proposed method outperform and more accurate in suggesting items to users with better performance. The effectiveness of user's behavior in social network particularly shows the significant improvement by up to 6% on recommendation accuracy. Moreover, experiment of recommendation performance shows that incorporating social relationship observed from user's behavior into CF is beneficial and useful to generate recommendation with 7% improvement of performance compared with benchmark methods. Finally, we confirm that interaction between users in social network is able to enhance the accuracy and give better recommendation in conventional approach.

Rear Vehicle Detection Method in Harsh Environment Using Improved Image Information (개선된 영상 정보를 이용한 가혹한 환경에서의 후방 차량 감지 방법)

  • Jeong, Jin-Seong;Kim, Hyun-Tae;Jang, Young-Min;Cho, Sang-Bok
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.1
    • /
    • pp.96-110
    • /
    • 2017
  • Most of vehicle detection studies using the existing general lens or wide-angle lens have a blind spot in the rear detection situation, the image is vulnerable to noise and a variety of external environments. In this paper, we propose a method that is detection in harsh external environment with noise, blind spots, etc. First, using a fish-eye lens will help minimize blind spots compared to the wide-angle lens. When angle of the lens is growing because nonlinear radial distortion also increase, calibration was used after initializing and optimizing the distortion constant in order to ensure accuracy. In addition, the original image was analyzed along with calibration to remove fog and calibrate brightness and thereby enable detection even when visibility is obstructed due to light and dark adaptations from foggy situations or sudden changes in illumination. Fog removal generally takes a considerably significant amount of time to calculate. Thus in order to reduce the calculation time, remove the fog used the major fog removal algorithm Dark Channel Prior. While Gamma Correction was used to calibrate brightness, a brightness and contrast evaluation was conducted on the image in order to determine the Gamma Value needed for correction. The evaluation used only a part instead of the entirety of the image in order to reduce the time allotted to calculation. When the brightness and contrast values were calculated, those values were used to decided Gamma value and to correct the entire image. The brightness correction and fog removal were processed in parallel, and the images were registered as a single image to minimize the calculation time needed for all the processes. Then the feature extraction method HOG was used to detect the vehicle in the corrected image. As a result, it took 0.064 seconds per frame to detect the vehicle using image correction as proposed herein, which showed a 7.5% improvement in detection rate compared to the existing vehicle detection method.

Analysis of Teachers' Perceptions on the Subject Competencies of Integrated Science (통합과학 교과 역량에 대한 교사들의 인식 분석)

  • Ahn, Yumin;Byun, Taejin
    • Journal of The Korean Association For Science Education
    • /
    • v.40 no.2
    • /
    • pp.97-111
    • /
    • 2020
  • In the 2015 revised curriculum, 'Integrated Science' was established to increase convergent thinking and designated as a common subject for all students to learn, regardless of career. In addition, the 2015 revised curriculum introduced 'competence' as a distinctive feature from the previous curriculum. In the 2015 revised curriculum, competencies are divided into core competencies of cross-curricular character and subject competencies based on academic knowledge and skills of the subject. The science curriculum contains five subject competencies: scientific thinking, scientific inquiry, scientific problem solving, scientific communication, scientific participation and life-long learning. However, the description of competencies in curriculum documents is insufficient, and experts' perceptions of competencies are not uniform. Therefore, this study examines the perceptions of science subjects in science high school teachers by deciding that comprehension of competencies should be preceded in order for competency-based education to be properly applied to school sites. First, we analyzed the relationship between achievement standards and subject competencies of integrated science through the operation of an expert working group with a high understanding of the integrated science achievement standards. Next, 31 high school science teachers examined the perception of the five subject competencies through a descriptive questionnaire. The semantic network analysis has been utilized to analyze the teachers' responses. The results of the analysis showed that the three curriculum competencies of scientific inquiry, scientific communication, scientific participation and life-long learning ability are similar to the definitions of teachers and curriculum documents, but in the case of scientific thinking and scientific problem solving, there are some gaps in perception and definition in curriculum documents. In addition, the results of the comprehensive analysis of teachers' perceptions on the five competencies show that the five curriculum competencies are more relevant than mutually exclusive or independent.

A Comparative Analysis of the Level of Occupational Health : Before and After the Subsidiary Program on Health Care Management of Small Scale Industries (영세사업장 보건관리 지원사업 실시 전후의 산업보건수준 비교 분석)

  • Jung, Hye Sun
    • Korean Journal of Occupational Health Nursing
    • /
    • v.4
    • /
    • pp.58-83
    • /
    • 1995
  • The small scale industries which have less than 30 employees occupy 86.5% of total number of industries in Korea. And though they have higher accident rate and lower environmental condition than big industries, it has been not mandatory to appointing health care manager at factory. So, from 1993, government subsidizes to the health care management of small industries. The purpose of this study is to identify the real feature of health care status in small industries, and to evaluate the level of health care management, before and after the subsidiary program. 65 small plating industries which have been managed by the same health care management support institution in 1993 were selected for study. Of the 65 industries, 3 which have not taken both environmental evaluation and health screening in 1994, and 9 which have closed were excluded from study sample. And the remaining 53 were analyzed by using the results of environmental evaluation and health screening, reported to the Ministry of Labor, before and after the subsidiary program, the analysis was done by the comparison of the two year paired data of the same industry. Over-permissible-limit rate, health screening implementation rate, above grade C rate were calculated and compared. The status of health care management ; 1. Of the sample industries, 96.9% provide protective equipment and 80.0% set up ventilating system. Protective gloves (89.2%) and protective clothing (80.0%) are widely provided, but ear plugs (4.6%) are rarely provided. 21.5% of the protective equipment are well put on, and 40.4% of the ventilating systems function well. 2. In 1993, 35 industries, 53.8% of the sample, checked working environment twice. Over-permissible-limit rates of heavy metal (12.2%), suspended particle (11.1%), noise (5.5%) were high. To put on protective equipment and to set up local ventilating system were pointed out by the examiners. 3. General health screening was done at 63.1% of the sample industries and 35.3% of total workers were examined. Specific health screening was done at 93.8% of the sample industries and 75.4% of workers were examined. 15.5% of workers was provided to be above grade C and to have digestive system disease (43.3%), circulatory disease (18.9%), and hematopoietic disease (14.2%), etc. 4. In 1993, the subsidiary program of health care management was provided in forms of health education, health counseling, and rounding check of working field. And 61.5%, 83.0%, 55.4% of sample industries respectively received it. The average visit per industry was 1.8. Comparisons of the level of occupational health before and after the subsidiary program ; 1. Over-permissible-limit rates of hazardous factors of 1993 and that of 1994 were compared. The rates of suspended particle, noise, organic solvent of 1994 (37.5%, 13.4%, 24.2% respectively) were higher than that of 1993 (25.0%, 6.0%, 6.3% respectively). In the case of acid, there was no difference between the rate of 1993 and that of 1994. Only the rate of heavy metal decreased from 12.9% in 1993 to 3.0% in 1994. 2. General health screening was done at 38.7% of the sample industries in 1993 and at 44.6% in 1994. But the implementation rate of specific health screening decreased from 72.4% in 1993 to 64.6% in 1994. 3. The implementation rate of specific health screening was analyzed by some health factors. The rate of suspended particle increased from 61.8% in 1993 to 91.2% in 1994. But the rates of the others-noise, organic solvent, heavy metal, specific chemical substances-decreased. 4. Above grade C rate in health screening increased from 27.8% in 1993 to 35.5% in 1994. But that of endocrine disorders and pulmonary disease decreased.

  • PDF

Evaluation of Characteristics of Particle Composition and Pollution of Heavy Metals for Bottom Sediments in Cheonsu Bay, Korea -Comparison of the Sediments Environment of Farming Area and Non-farming Area (천수만 해저 퇴적물의 입도특성 및 중금속 오염도 평가 -어장해역과 비어장해역의 퇴적환경 비교-)

  • Kim, Jong-Gu;Jang, Hyo-Sang
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.20 no.4
    • /
    • pp.358-371
    • /
    • 2014
  • For the systematic scientific management in Cheonsu Bay of Korea, this study was conducedt to survey the particle composition, organic matter(I.L.) and heavy metals in farming and non-farming areas. The sediment of study area showed feature mixed property by sand, silt and clay. The farming area showed superior by fine-grained sediment, non-farming area showed superior by coarse-grained sediment. The organic pollution of farming area were appeared to be heavily polluted more than non-farming area. The concentration of total nitorgen in sediment was higher farming area than non-farming area. Also, in the case of heavy metals pollution in sediments, farming area was higher than non-farming area. The correlation analysis among to heavy metals, organic matter and particle size was found to have a good interrelationship. For evaluation of heavy metals pollution of sediments, three criteria are applied, Enrichment Factor(EF), Geoaccumulation index(Igeo) and NOAA criteria for sediment. In the case of EF, Heavy metals pollution was appeared to artificial effect all heavy metals if except Cu. In the case of Geoaccumulation index, Cu, Al, Pb was shown zero grade, that is non polluted group, and Cd, Hg, Cr was shown to 0~1 grade, that is mid polluted group, As was shown to 2 grade, that is moderately polluted group. In the case of NOAA, pollution levels of heavy metals except Cd belonged to a group of ERL(Effect range low)~ERM(Effect range median).

Analysis on the Dosimetric Characteristics of Tangential Breast Intensity Modulated Radiotherapy (유방암의 접선 세기조절 방사선치료 선량 특성 분석)

  • Yoon, Mee Sun;Kim, Yong-Hyeob;Jeong, Jae-Uk;Nam, Taek-Keun;Ahn, Sung-Ja;Chung, Wong-Ki;Song, Ju-Young
    • Progress in Medical Physics
    • /
    • v.23 no.4
    • /
    • pp.219-228
    • /
    • 2012
  • The tangential breast intensity modulated radiotherapy (T-B IMRT) technique, which uses the same tangential fields as conventional 3-dimensional conformal radiotherapy (3D-CRT) plans with physical wedges, was analyzed in terms of the calculated dose distribution feature and dosimetric accuracy of beam delivery during treatment. T-B IMRT plans were prepared for 15 patients with breast cancer who were already treated with conventional 3D-CRT. The homogeneity of the dose distribution to the target volume was improved, and the dose delivered to the normal tissues and critical organs was reduced compared with that in 3D-CRT plans. Quality assurance (QA) plans with the appropriate phantoms were used to analyze the dosimetric accuracy of T-B IMRT. An ionization chamber placed at the hole of an acrylic cylindrical phantom was used for the point dose measurement, and the mean error from the calculated dose was $0.7{\pm}1.4%$. The accuracy of the dose distribution was verified with a 2D diode detector array, and the mean pass rate calculated from the gamma evaluation was $97.3{\pm}2.9%$. We confirmed the advantages of a T-B IMRT in the dose distribution and verified the dosimetric accuracy from the QA performance which should still be regarded as an important process even in the simple technique as T-B IMRT in order to maintain a good quality.

A Clinical Study of Hospitalized Infants 28 to 90 Days of Age with Fever without Source (원인 없는 열로 입원한 생후 28일에서 90일 사이 영아들에 대한 임상적 고찰)

  • Rye, Min Hyuk;Noh, Yn Il;Lee, Seong Hun;Lee, Sun Young;Hur, Nam Jin;Lee, Dong Jin
    • Pediatric Infection and Vaccine
    • /
    • v.8 no.2
    • /
    • pp.191-198
    • /
    • 2001
  • Purpose : The purpose of this study was to investigate clinical features of hospitalized infants 28~90 days of age with fever without source and to analyze those of young febrile infants using risk criteria for serious bacterial infection. Methods : The clinical features of 131 infants 28~90 days of age admitted to the Ulsan Dong-Kang General Hospital Pediatric Department because of fever(temperature ${\geq}38^{\circ}C$ rectally) without source, from January 2000 to December 2000, were investigated by retrospective chart review. The clinical features of 131 febrile infants were analyzed using Rochester criteria. Results : Among 131 cases, there were 60 cases(45.8%) of urinary tract infection, 33 cases (25.2%) of aseptic meningitis, 2 cases(1.5%) of bacteremia and 36 cases(27.5%) of no specific diagnosis. Among 131 cases, there were 57 cases(43.5%) in low risk group and 74 cases(56.5%) in not low risk one by Rochester criteria. A significant difference in the incidence of urinary tract infection, aseptic meningitis and no specific diagnosis was not found between both groups. Male to female ratio was 1.8 : 1. Sex ratio between both groups was not significantly different. Most febrile infant were noted in spring(35.1%) and the summer(36.7%). The peak incidence of aseptic meningitis was noted in May and June. The fever subsided mostly within 48~72 hours after administering antimicrobial agents(61.8~83.2%). A significant difference in duration of fever after administering antimicrobial agents was not found between both groups. Conclusion : A selected group of low risk infants 28~90 days of age with fever without source can be managed as outpatients provided that a thorough initial evaluation is performed, that parents can reliably monitor their infant closely at home and that careful follow up can be assured. Because bag collected specimens were more likely to yield indeterminate urine culture result, a suprapubic or catheter obtained urine specimen for culture is a necessary part of the evaluation of all febrile infants 28~90 days of age. The further prospective study on evaluation and management of young febrile infant should be performed in our hospital.

  • PDF