• Title/Summary/Keyword: Test and Evaluation Item

Search Result 335, Processing Time 0.022 seconds

Factors of Predicting Difficulty of Mathematics Test Items in College Scholastic Ability Test (고등학교 수리영역 시험의 난이도 예측 요인 분석)

  • Ko, Ho-Kyoung;Yi, Hyun-Sook
    • Journal of the Korean School Mathematics Society
    • /
    • v.10 no.1
    • /
    • pp.113-127
    • /
    • 2007
  • This study explored the possibility of building a statistical model predicting difficulty of mathematics test items through the analysis of nation-wide scholastic ability test results for the past 5 years. Multiple linear regression analysis was conducted in predicting difficulty of mathematics test items. We adopted three major areas for independent variables: the content area, the behavior area, and the test item format area, each of which was categorized into more detailed sub-areas. For the dependent variable, the proportion of correct answer was used to represent the item difficulty. Statistically significant independent variables were included in the regression model based on the stepwise selection method. Several important factors affecting difficulty of mathematics test items for each area were identified. R-squares for the final regression model were fairly high, implying that the regression equation can be used to predict difficulty of test items at an acceptable level. Lastly, the regression model was cross-validated using independently collected data. We believe that this study will provide basic but very critical information for predicting the proportion of correct answer by showing the factors that should be considered for developing mathematics test items for the college entrance examination or high school classroom test.

  • PDF

A study for development and validation of the 'course evaluation' scale for learner-centered (학습자 중심의 '강의평가' 도구 개발 및 타당화 연구)

  • Park, Sung-Mi
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.23 no.1
    • /
    • pp.13-22
    • /
    • 2011
  • The purpose of this study was to development and validation of the 'course evaluation' scale for learner-centered in university. The research collected preliminary data from 1,567 university students's responses for item and scale quality analyses, and collected 2,539 university students's for item and scale quality analyses, and 300 university professors's responses for validation. Data were analyzed to obtain item quality, reliability, and validity analysis. The results of the study were as follows; The 'course evaluation' scale for learner-centered in university was defined by 5 factors. The 5 factors were structure and sincerity of lecture, suitability of report and test, level of consulting for student, application of educational media, communication. The results of the confirmatory factor analysis confirmed five sub-scales in the 'course evaluation' scale for learner-centered in university scale. Criterion-related validity evidence was obtained from the correlation analysis as the criterion measures. Cross validity evidence was obtained from the confirmatory factor analysis in university professors.

Psychometric Properties and Item Evaluation of Korean Version of Night Eating Questionnaire (KNEQ) (한국어판 야식증후군 측정도구의 신뢰도, 타당도 및 문항반응이론에 의한 문항분석)

  • Kim, Beomjong;Kim, Inja;Choi, Heejung
    • Journal of Korean Academy of Nursing
    • /
    • v.46 no.1
    • /
    • pp.109-117
    • /
    • 2016
  • Purpose: The aim of this study was to develop a Korean version of Night Eating Questionnaire (KNEQ) and test its psychometric properties and evaluate items according to item response theory. Methods: The 14-item NEQ as a measure of severity of the night eating syndrome was translated into Korean, and then this KNEQ was evaluated. A total of 1171 participants aged 20 to 50 completed the KNEQ on the Internet. To test reliability and validity, Cronbach's alpha, correlation, simple regression, and factor analysis were used. Each item was analyzed according to Rasch-Andrich rating scale model and item difficulty, discrimination, infit/outfit, and point measure correlation were evaluated. Results: Construct validity was evident. Cronbach's alpha was .78. The items of evening hyperphagia and nocturnal ingestion showed high ability in discriminating people with night eating syndrome, while items of morning anorexia and mood/sleep provided relatively little information. The results of item analysis showed that item2 and item7 needed to be revised to improve the reliability of KNEQ. Conclusion: KNEQ is an appropriate instrument to measure severity of night eating syndrome with good validity and reliability. However, further studies are needed to find cut-off scores to screen persons with night eating syndrome.

Exploring the direction of Assessment in Korean High School Mathematics through College Scholastic Ability Test Mathematics Domain Changes (대학수학능력시험 수학 영역의 변화를 통해 살펴본 고등학교 수학 평가의 방향 탐색)

  • Choi, Inseon;Lee, Sehyung;Moon, Duyeol
    • Journal of the Korean School Mathematics Society
    • /
    • v.26 no.2
    • /
    • pp.137-158
    • /
    • 2023
  • This study aimed to analyze the shifts in the mathematics domains of the College Scholastic Ability Test (CSAT) since its inception in 1993, with the intent of identifying improvements for the future. The goal is to provide insights for exploring the direction of assessment in Korean high school mathematics education. To this end, we focused on the test system, content area, and behavioral area within the CSAT mathematics domains. Key findings include: first, the test structure influences the assessment factors and item types, in addition to the examination time and number of items. Second, by analyzing the content area, we established a correlation between the national curriculum and assessment area, and confirmed the importance of setting the assessment area. Third, the examination of the behavioral area tended to the item-type fixation, demonstrating the necessity of the ongoing modifications in evaluation item types. Building upon these findings, we discuss the direction of an evaluation that considers the evolving demands and shifts within mathematics education.

A study on the perception of occupational therapy majors on Cognitive Impairment Screening Test (CIST)

  • Lee, Sun-myung;Chae, Joo-hyun;Sung, I-sul;Lee, Soo-jin;Moon, Soo-bin;Park, Da-hee;Park, So-hyun
    • Journal of Korean Clinical Health Science
    • /
    • v.9 no.2
    • /
    • pp.1493-1501
    • /
    • 2021
  • Purpose: The purpose of this study is to classify the characteristics of each item of CIST evaluation and to find out the degree of recognition of the characteristics of the cognitive tool. Methods: This study was conducted for occupational therapy majors at M University located in Gyeongsangnam-do. The data collection from May to June 2021. Total of 25 copies of the data were finally analyzed, SPSS Statistics 26 was used for data analysis. Results: As a result of the study, the significance level was visual reasoning 1 test strip and the visual reasoning 1 tool. In the relationship between the correspondence 1 figure simulation sheet and the figure simulation tool for each item and statistically significant, and the correspondence 2 visual reasoning 2 sheet. Visual reasoning 2 sheet and visual reasoning tool also showed that was found to be statistically significant. The correlation for visual reasoning 1 sheet and the visual reasoning 1 tool, reasoning 2 tool and visual reasoning 1 sheet, and the visual reasoning 2 tool and the verbal reasoning sheet. Conclusion: In this study, in the CIST items that may be difficult, it is better to attach the actual tool rather than the verbal explanation of the test paper to increase the efficiency of the test and the understanding of subjects with mild cognitive impairment. It was implemented by applying the tool, and it was found that the use of the tool in the visual reasoning item showed a high correlation by item. Furthermore, based on this study, it will be possible to suggest a method to control the difficulty of each subject of the cognitive evaluation tool, and to prepare a standard for future research.

A NEW INDEX OF DIMENSIONALITY - DETECT

  • Kim, Hae-Rim
    • The Pure and Applied Mathematics
    • /
    • v.3 no.2
    • /
    • pp.141-154
    • /
    • 1996
  • A data-driven index of dimensionality for an educational or psychological test - DETECT, short for Dimensionality Evaluation To Enumerate Contributing Traits, is proposed in this paper. It is based on estimated conditional covariances of item pairs, given score on remaining test items. Its purpose is to detect whatever multidimensionality structure exists, especially in the case of approximate simple structure. It does so by assigning items to relatively dimensionally homogeneous clusters via attempted maximization of the DETECT over all possible item cluster partitions. The performance of DETECT is studied through real and simulated data analyses.

  • PDF

Domestic Occupational Therapist Awareness Survey for the Need to Apply Artificial Intelligence Measurement Technology for Clinical Observation Evaluation Based on Sensory Integration (감각통합에 기초한 임상 관찰 평가의 AI 측정 기술 적용 필요성을 위한 국내 작업치료사 인식 조사)

  • Cho, Sun-Young;Jung, Young-Jin;Kim, Jung-Ran
    • Therapeutic Science for Rehabilitation
    • /
    • v.12 no.1
    • /
    • pp.23-35
    • /
    • 2023
  • Objective : This study is to examine the practical use of clinical observational evaluation of sensory integration therapy and the difficulty and importance of measuring results for each sub-item, and through this, to confirm the usefulness of the application of Artificial Intelligence measurement technology in clinical observational measurement and the need for application. Methods : The questionnaire consisted of the actual use of the sensory integration evaluation tool, the difficulty of measurement for each detailed item of clinical observation, the usefulness of AI measurement technology, the importance of evaluation for each detailed item, and the need for developing AI measurement technology. Results : The detailed items that were difficult to measure during clinical observation were the Finger-to-Nose Test and Postural control (71.0%), followed by Eye movement and Protective Extension Test (67.7%). 83.9% of the study subjects answered that it would be useful to apply AI measurement technology when observing images. Postural control (on the ball) (90.3%) was the highest item that answered that AI measurement technology was needed, followed by Eye movement (83.9%), and Prone Extension and Protective Extension Test (77.4%). Conclusion : The results confirmed the desire of therapists that clinical observation is an important evaluation tool in the field of child occupational therapy in Korea.

A Structure of Personalized e-Learning System Using On/Off-line Mixed Estimations Based on Multiple-Choice Items

  • Oh, Yong-Sun
    • International Journal of Contents
    • /
    • v.5 no.1
    • /
    • pp.51-55
    • /
    • 2009
  • In this paper, we present a structure of personalized e-Learning system to study for a test formalized by uniform multiple-choice using on/off line mixed estimations as is the case of Driver :s License Test in Korea. Using the system a candidate can study toward the license through the Internet (and/or mobile instruments) within the personalized concept based on IRT(item response theory). The system accurately estimates user's ability parameter and dynamically offers optimal evaluation problems and learning contents according to the estimated ability so that the user can take possession of the license in shorter time. In order to establish the personalized e-Learning concepts, we build up 3 databases and 2 agents in this system. Content DB maintains learning contents for studying toward the license as the shape of objects separated by concept-unit. Item-bank DB manages items with their parameters such as difficulties, discriminations, and guessing factors, which are firmly related to the learning contents in Content DB through the concept of object parameters. User profile DB maintains users' status information, item responses, and ability parameters. With these DB formations, Interface agent processes user ID, password, status information, and various queries generated by learners. In addition, it hooks up user's item response with Selection & Feedback agent. On the other hand, Selection & Feedback agent offers problems and content objects according to the corresponding user's ability parameter, and re-estimates the ability parameter to activate dynamic personalized learning situation and so forth.

Measuring health activation among foreign students in South Korea: initial evaluation of the feasibility, dimensionality, and reliability of the Consumer Health Activation Index (CHAI)

  • Park, MJ;Jung, Hun Sik
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.3
    • /
    • pp.192-197
    • /
    • 2020
  • Foreign students in South Korea face important challenges when they try to maintain their health. As a measure of their motivation to actively build skills for overcoming those challenges, we evaluated the 10-item Consumer Health Activation Index (CHAI), testing its feasibility, dimensionality, and reliability. There were no missing data, there was no floor effect, and for the total scores the ceiling effect was trivial (< 2%). Results of the Kaiser-Meyer-Olkin test and Bartlett's test of sphericity indicated that the data were suitable for the detection of structure by factor analysis. The results of parallel analysis and the shape of the scree plot supported a two-factor solution. One factor had 3 items concerning "my doctor" and the other factor had the 7 remaining items. Reliability was high for the 10-item CHAI (alpha = 0.856), for the 3-item subscale (alpha = 0.838), and for the 7-item subscale (alpha = 0.857). Reliability could not be improved by deletion of any items. Use of the CHAI to gather data from these foreign students is feasible, and reliable results can be obtained whether one uses the total score from all 10 items or scores from the proposed 7-item and 3-item subscales.

Development of Short Form of the Korean Version- the Boston Naming Test (K-BNT-15) Based on Item Response Theory (문항반응이론을 적용한 한국판 보스톤 이름대기 검사 단축형(K-BNT-15) 개발)

  • Kim, HyangHee;Kim, Soo Ryon
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.12
    • /
    • pp.321-327
    • /
    • 2013
  • Impaired naming difficulty is common in normal elderly as well as in patients with neurological impairment. The 60-item Korean version-Boston Naming Test(K-BNT) is one of the most commonly used test for measuring confrontational naming ability. However, age-related cognitive decline may make the elderly difficult concentrating during the 60-item test, therefore, item reduction of the K-BNT would improve test validity and reliability. Thus, the purpose of this study was to develop a short form of the K-BNT based on Item Response Theory(IRT). Considering item-fit index, sex factor, and item difficulty through Rasch analysis, the 15-item K-BNT(i.e., K-BNT-15) was developed. Via administration of the K-BNT-15, we observed age-related decline in naming ability and significantly different performance between the normal elderly and patients with mild cognitive impairment. This study demonstrates the utility of IRT for developing a short-form language evaluation tool. The K-BNT-15 can be effective as a language screening tool to differentiate between normal aging and pathological diseases.