• Title/Summary/Keyword: Item response theory

Search Result 95, Processing Time 0.027 seconds

Responsiveness Comparisons of Self-Report Versus Therapist-Scored Functional Capacity for Workers With Low Back Pain

  • Choi, Bongsam;Park, So-Yeon
    • Physical Therapy Korea
    • /
    • v.19 no.3
    • /
    • pp.91-97
    • /
    • 2012
  • The primary aim of this study was to compare responsiveness of self-report by worker and therapist-scored functional capacity instrument. Self-report and therapist-scored interval-level person measures and item difficulties were compared at admission and discharge. Therapist and worker ratings were collected on 230 clients from 27 rehabilitation sites using the newly developed Occupational Rehabilitation Data Base (ORDB) functional capacity instrument. ORDB comprises several subscales measuring relevant variables of "a return-to-work model" in work-related rehabilitation clinics. The functional capacity scale deals with 10 DOT job factors. The rating scale categories were 1-severely impaired, 2-moderately impaired, 3-mildly impaired, and 4-not impaired. Only data from clients with low back pain (n=98) with complete data (both admission and discharge scores) were used for the present study. Therapists and workers completed the functional capacity instrument at admission and discharge. Rasch analysis [1-parameter item response theory model (IRT)] was applied to calibrate item difficulty and person ability measure of therapist and workers ratings. Effect sizes for therapist and self-report ratings were slightly different, .69 and .30, respectively. Therapist and worker ratings were more consistent at discharge (r=.54) than at admission (r=.32). Workers have a tendency to be more severe in their ratings (show higher item difficulties) than therapists at admission and discharge. Therapists and workers report similar magnitudes of improvement following treatment program. These findings challenge the belief that injured workers may unreliable source for monitoring therapeutic outcomes. Self-report measures have the advantage of conserving therapist time for treatment (versus evaluation). While the therapist and self-report ratings are comparable at discharge, there is less consistency at admission. Comparable therapist-worker ratings may be achieved by controlling for rating severity using IRT methodologies.

A Method for Developing Items to Assess Earth Science Creativity (지구과학 창의력 평가 문항 개발 방법에 관한 연구)

  • Lee, Hang-Ro
    • Journal of the Korean earth science society
    • /
    • v.24 no.3
    • /
    • pp.150-159
    • /
    • 2003
  • This study suggests methods of assessing scientific creativity and developing items, which can be achieved when both earth science knowledge and general creativity are applied at the same time. According to the results of this study, the cognitive ability gaps between creativity and scientific creativity were clearly defined by the terms' operational definition. Four factors in the Subcategory Of Scientific Creativity-fluency, flexibility, elaboration, and originality-were selected, and the possibility of developing items out of these factors was discovered. The operational definitions of the four factors were given and the criteria for assessment and scoring were set. The validity, reliability, discrimination, and difficulty, which were the conditions required for the assessment instruments, were verified through three field trials of inputting the assessment instruments for scientific creativity. The assessment instruments were composed of 8 items with 2items for each factor. The average item fitness index obtained was 0.99, Cronbach , the item inter-consistency was 0.79,the inter-rater reliability of each item was 0.78, the inter-rater reliability of each factor was 0.75, the item discrimination power was 0.19, and the item difficulty was 0.00. Because the results were within the permitted limit of the conditions required for assessment instruments, the assessment instruments developed for scientific creativity in this study can be said to be very favorable.

Psychometric properties of an instrument 2: structural validity, internal consistency, and cross-cultural validity/measurement invariance (측정도구의 심리계량적 속성 2: 구조타당도, 내적일관성 및 교차문화타당도/측정동일성)

  • Lee, Eun-Hyun
    • Women's Health Nursing
    • /
    • v.27 no.2
    • /
    • pp.69-74
    • /
    • 2021
  • Structural validity, internal consistency, and cross-cultural validity/measurement invariance are psychometric properties of the internal structure of an instrument. In psychometric studies published in Korean nursing journals, structural validity has mainly been assessed using confirmatory factor analysis. Cross-cultural validity/measurement invariance has rarely been evaluated. It is recommended for Korean nursing researchers to evaluate the internal structure of instruments using a greater variety of methods, such as item response theory, Rasch analysis, multi-group confirmatory factor analysis, and differential item functioning.

Out-of-Stock versus Sold-Out: Consumers' Cognitive Processes Triggered by Unavailability Marks in Online Shopping Malls

  • Cheul Rhee;Wooseok Park
    • Asia pacific journal of information systems
    • /
    • v.30 no.2
    • /
    • pp.439-456
    • /
    • 2020
  • In online shopping, "out-of-stock" and "sold-out" are used to indicate product unavailability, and this unavailability and its effects on consumers' behaviors have been studied with great interest for practical purposes. However, few studies have specifically discussed out-of-stock and sold-out products in the same paper. We hypothesized that consumers might cognitively interpret items marked out-of-stock and sold-out differently, and in this paper, we studied these potential differences from the perspectives of consumers' emotions, behaviors, and loyalty based on the stimulus-organism-response framework. In order to explore the differences, we used a multi-method approach that consisted of experiments, surveys, and interviews. Specifically, we built an experimental website on which the same products were categorized as either out-of-stock or sold-out, and we measured the participants' emotions, attitudes, and intentions after the experiment. After two weeks, we conducted interviews to confirm our results and to learn more about consumers' everyday behavior. In the results, males and females demonstrated differences in emotion, behaviors, and loyalty with the interaction effects of an item's being marked out-of-stock versus sold-out. We found that the consumers demonstrated different levels of loyalty based on whether the item was marked out-of-stock or sold-out. We discuss the strategic implications of our findings.

A Web-based Adaptive Testing System to Diagnose Underachievers (학습부진아 진단을 위한 웹 기반 적응형 평가시스템)

  • 김광호;이재무
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.9 no.4
    • /
    • pp.431-438
    • /
    • 2003
  • In this study, we have developed a web-based adaptive testing system using item response theory´s computerized adaptive testing to diagnose underachievers, and to check the evaluation results immediately. Adaptive testing system simple is not the fact that it presents a question to students. It calculates information of a question and presents the question to students. It controls the response of the students under extraction conditions of the next question. It extracts the question which is the most suitable it presents. In this adaptive testing system, you can extract questions according to the level of the students, and adjust the length and the level of the difficulty according to the response of the students.

VA Design of Personalized e-Learning System for the Driver's License Test in Korea (개인 맞춤형 운전면허 학습시스템 설계)

  • Oh, Yong-Sun
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2009.05a
    • /
    • pp.1055-1060
    • /
    • 2009
  • In this paper, we design an e-Learning system for the Driver's License Teste studying through the Internet. The proposed system make users to be arrived at the goal for the license in a shorter time by offering learning contents and items according to the item-responses made by the users based on the Item Response Theory. Moreover we design the scheme to give the optimum items and the most necessary content to the user during the learning procedure in the form of concept-based objects. All the items in the problem bank DB maintain their difficulties, discriminations, and guessing parameters as is the case of 3-parameter logistic model. In addition user profile DB stores users' status informations, item responses, and ability parameters. Using these structures and combining agents, we can offer the optimum learning process or dynamic personalized studying structure to the user. We can construct interface agent and content selection and feedback agent with the DB's described above. User can study without any awareness of system operations or personal fitting scheme.

  • PDF

Study on the Academic Competency Assessment of Herbology Test using Rasch Model (라쉬 모델을 사용한 본초학 시험의 학업역량 분석 연구)

  • Chae, Han;Lee, Soo Jin;Han, Chang-ho;Cho, Young Il;Kim, Hyungwoo
    • The Journal of Korean Medicine
    • /
    • v.43 no.2
    • /
    • pp.27-41
    • /
    • 2022
  • Objectives: There should be an objective analysis on the academic competency for incorporating Computer-based Test (CBT) in the education of traditional Korean medicine (TKM). However, the Item Response Theory (IRT) for analyzing latent competency has not been introduced for its difficulty in calculation, interpretation and utilization. Methods: The current study analyzed responses of 390 students of 8 years to the herbology test with 14 items by utilizing Rasch model, and the characteristics of test and items were evaluated by using characteristic curve, information curve, difficulty, academic competency, and test score. The academic competency of the students across gender and years were presented with scale characteristic curve, Kernel density map, and Wright map, and examined based on T-test and ANOVA. Results: The estimated item, test, and ability parameters based on Rasch model provided reliable information on academic competency, and organized insights on students, test and items not available with test score calculated by the summation of item scores. The test showed acceptable validity for analyzing academic competency, but some of items revealed difficulty parameters to be modified with Wright map. The gender difference was not distinctive, however the differences between test years were obvious with Kernel density map. Conclusion: The current study analyzed the responses in the herbology test for measuring academic competency in the education of TKM using Rasch model, and structured analysis for competency-based Teaching in the e-learning era was suggested. It would provide the foundation for the learning analytics essential for self-directed learning and competency adaptive learning in TKM.

A Consideration about Online Ratings in Internet Shopping Malls (인터넷 쇼핑몰에서 고객의 상품평점에 대한 소고)

  • Jang, Dae-Heung
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.309-315
    • /
    • 2009
  • The degree of the impression about a special commodity in the internet shopping malls depends on the evaluation and the corresponding rating of customers who purchased and used this commodity. We can find the problems in online ratings system of Korean internet shopping malls and suggest the simple solutions.

Analyzing Beaver Challenge Questions as a Computing Computing Assessment Tool : Based on Item Response Theory (컴퓨팅 사고력 평가 도구로써 비버 챌린지 문항 분석: 문항반응이론을 기반으로)

  • Kim, Eun-Ji;Lee, Tae-Wuk
    • Proceedings of The KACE
    • /
    • 2018.01a
    • /
    • pp.107-110
    • /
    • 2018
  • 본 연구에서는 컴퓨팅 사고력 평가도구로써 비버 챌린지 문항을 활용하기 위하여 문항반응이론을 통해 비버 챌린지 문항을 분석하고, 비버 챌린지에서 기존에 제시하는 난이도와 문항반응이론을 통한 난이도 간의 상관관계를 분석하였다. 분석 결과 비버 챌린지는 쉽고 변별력이 높은 검사로 나타났으나, 비버 챌린지에서 제시하는 난이도와 문항반응이론을 통한 난이도 간의 상관관계는 없었다. 난이도에 따라 가점과 감점이 이루어지는 비버 챌린지 채점 기준을 고려할 때 정확한 컴퓨팅 사고력 측정을 위해서는 난이도에 대한 수정 및 보완이 필요하다.

  • PDF

Development and Validation of a Practical Instrument for Injury Prevention: The Occupational Safety and Health Monitoring and Assessment Tool (OSH-MAT)

  • Sun, Yi;Arning, Martin;Bochmann, Frank;Borger, Jutta;Heitmann, Thomas
    • Safety and Health at Work
    • /
    • v.9 no.2
    • /
    • pp.140-143
    • /
    • 2018
  • Background: The Occupational Safety and Health Monitoring and Assessment Tool (OSH-MAT) is a practical instrument that is currently used in the German woodworking and metalworking industries to monitor safety conditions at workplaces. The 12-item scoring system has three subscales rating technical, organizational, and personnel-related conditions in a company. Each item has a rating value ranging from 1 to 9, with higher values indicating higher standard of safety conditions. Methods: The reliability of this instrument was evaluated in a cross-sectional survey among 128 companies and its validity among 30,514 companies. The inter-rater reliability of the instrument was examined independently and simultaneously by two well-trained safety engineers. Agreement between the double ratings was quantified by the intraclass correlation coefficient and absolute agreement of the rating values. The content validity of the OSH-MAT was evaluated by quantifying the association between OSH-MAT values and 5-year average injury rates by Poisson regression analysis adjusted for the size of the companies and industrial sectors. The construct validity of OSH-MAT was examined by principle component factor analysis. Results: Our analysis indicated good to very good inter-rater reliability (intraclass correlation coefficient = 0.64-0.74) of OSH-MAT values with an absolute agreement of between 72% and 81%. Factor analysis identified three component subscales that met exactly the structure theory of this instrument. The Poisson regression analysis demonstrated a statistically significant exposure-response relationship between OSH-MAT values and the 5-year average injury rates. Conclusion: These analyses indicate that OSH-MAT is a valid and reliable instrument that can be used effectively to monitor safety conditions at workplaces.