• Title/Summary/Keyword: test items analysis

Search Result 1,932, Processing Time 0.033 seconds

An Analysis of Paper and Pencil Test Items of Life Science I in High School (고등학교 생명 과학 I의 지필평가 문항 분석)

  • Lee, Donghoon;Jeong, Eunyoung
    • Journal of Science Education
    • /
    • v.38 no.3
    • /
    • pp.670-690
    • /
    • 2014
  • The purpose of this study was to analyze paper and pencil test items of life science I in high school to diagnose problems of the test items developed by teachers, and to provide some implication for better assessment. 690 selection-type items and 162 supply-type items in life science I were collected from 10 general high schools. In the analysis of test items, the ratio of the selection-type item and the supply-type item was 81:19 in the number of items based on item type, while the ratio was 74.4:25.6 in the distribution of marks, indicating that the distribution of marks compared to the number of items was higher in the supply-type items. In the analysis by the Bloom's revised taxonomy of educational objectives, the items of 'conceptual knowledge' in the knowledge and those of 'understanding' in the cognition process were shown most in both the selection-type item and the supply-type item. In the analysis by the science assessment frameworks of NAEA, the items of 'knowledge' were shown 9 times more than those of 'inquiry'. When compared to the level of difficulty presented in the two-way specification table and the percentage of correct answers in the selection-type item, the concurrence was 41.5%. When compared to the ratio of number of items based on the item type of the supply-type items, the short-answer items were 34.0%, the descriptive items were 61.1%, and the drawing items were 4.9%. The drawing items were mainly developed in the unit of 'Cells and Continuity of Life'. When the descriptive items were classified by the acceptance of response, all the items were 'response restricted' type, and the items of 'restricted in content range' type among them were highest. When the items were classified by presentation of data, the items of 'presentation of data' type were highest(65.4%), and when classified by type of question, the items of 'knowledge description' type were highest(80.4%). In conclusion, it is needed to develop items belonging to 'inquiry' area more in the school, and to increase the ratio of the descriptive items, presenting various types of items.

  • PDF

Analysis of Test Items and the Applicants' Responses on the Chemistry Part in the General Science of College Scholastics Ability Test (대학수학능력시험 공통과학 중 화학 영역의 문항 및 응시자 응답 분석)

  • Hong, Mi-Young;Jeon, Kyung-Moon;Lee, Yang-Rak;Yi, Bum-Hong
    • Journal of The Korean Association For Science Education
    • /
    • v.22 no.2
    • /
    • pp.378-386
    • /
    • 2002
  • In this study, the students' responses on the chemistry items of in the general science of College Scholastics Ability Test (CSAT) implemented for the past 3 years since 1999 were investigated. The number of items by content and inquiry process, the average percent correct by content and inquiry process, the distribution of items by the level of percent correct, and the items with high and/or low percent correct were analysed. There were the fewest items in 'environment' area, especially in 'ozon layer', no test item had been made. The most difficult content area was 'acid rain' in 'environment'. By inquiry process, the most number of items belonged to 'analyzing & interpreting data', and 'identifying problems & formulating hypothesis' was the most difficult process. No test item came under the level of 'very difficult', and many items under the 'easy' or 'very easy' level. Students were generally poor at solving test items demanding several concepts, and very good at simply requiring basic concept treated in lower grade. Educational implications are discussed.

Analysis of Science Items of the Japanese National Center Test for University Admissions (일본 대학입시센터시험 이과 문항 분석)

  • Kim, Hyun-Kyung;Kim, Dong-Young;Choi, Hyuk-Joon;Ku, Ja-Ok;Dong, Hyo-Kwan;Shin, Il-Yong;Lee, Yang-Rak
    • Journal of The Korean Association For Science Education
    • /
    • v.30 no.4
    • /
    • pp.452-471
    • /
    • 2010
  • As the Korean College scholastic Ability Test (CSAT) has been implemented for 17 years since 1994, it is becoming more and more difficult to make new items that haven't been previously used to measure students' thinking ability. Therefore, it is necessary to keep conducting research on making new test items that can measure students' scholastic ability reliably. For this reason, multiple choice items on the Japanese university entrance exam, which is a Japanese National Center Test for University Admissions (NCTUA) equivalent of CSAT, were analyzed in order to draw implications for CSAT item development. In this study, we analyzed the Japanese NCTUA administered in January 2009 to investigate the structure of its science test. We also analyzed the NCTUA items by the domains of contents and behaviors, and tried to predict item difficulty from the perspective of Korean applicants. Major findings are as follows: Most NCTUA items measure understanding knowledge or low level thinking ability. Also the alloted time for each item is longer than CSAT. The number of test items, and the number of choice and alloted points for each item are diverse, unlike CSAT. The number of items using real-life materials are much more, but the items are not rigorous in sentence expression compared to CSAT. And the difference of difficulty level among science tests were larger with reference to CSAT. Also science score is required for most applicants regardless whether they are taking liberal arts or going onto the science track.

Development and Validation of the Nurse Needs Satisfaction Scale Based on Maslow's Hierarchy of Needs Theory (Maslow의 욕구위계이론에 근거한 간호사 욕구만족도 측정도구 개발 및 타당화)

  • Kim, Hwa Jin;Shin, Sun Hwa
    • Journal of Korean Academy of Nursing
    • /
    • v.50 no.6
    • /
    • pp.848-862
    • /
    • 2020
  • Purpose: The purpose of this study was to develop an instrument to evaluate the needs satisfaction of nurses and examine its validity and reliability. Methods: The initial items for the instrument were developed through a literature review and interviews, using the conceptual framework of Maslow's hierarchy of needs theory. The initial items were evaluated for content validity by 14 experts. Four hundred and eighty-six clinical nurses participated in this study through offline and online surveys to test the reliability and validity of the instrument. The first evaluation (n = 256) was used for item analysis and exploratory factor analysis, and the second evaluation (n = 230) was used to conduct a confirmatory factor analysis and to assess the criterion-related validity and internal consistency of the instrument. Test-retest reliability was analyzed using data from 30 nurses. Results: The final instrument consisted of 30 items with two sub-factors for five needs that were identified through the confirmatory factor analysis. The criterion-related validity was established using the five need satisfaction measures (r = .56). Cronbach's α for total items was .90, and test-retest reliability was .89. Conclusion: The findings from this study indicate that this instrument has sufficient validity and reliability. This instrument can be used for the development of nursing interventions to improve the needs satisfaction of clinical nurses.

Development of Meaning in Life Scale II (생의 의미 측정도구의 개발 II)

  • Choi Soon-Ock;Kim Sook-Nam;Shin Kyung-Il;Lee Jong-Ji
    • Journal of Korean Academy of Nursing
    • /
    • v.35 no.5
    • /
    • pp.931-942
    • /
    • 2005
  • Purpose: The purpose of this study was to develop a meaning of life scale with high validity and reliability. Method: A conceptual framework composed of 4 phases of meanings of life was identified. And 49 preliminary items on a 4-points scale were developed through content validity. A reliability and validity test of the 49 items was conducted on 564 adults. By means of internal consistency of the 49 items, 1 item was deleted. To verify the 48 items, factor analysis, reliability test, and LISEREL were done. Result: Through exploratory factor analysis of the 48 items, 8 factors were extracted. These factors were labeled as 'self- awareness and self-acceptance', 'hope', 'responsibility awareness', 'love experience', 'self transcendence', 'relation experience', 'self contentedness', and 'Commitment'. Through LISEREL of the 48 items, 2 items were excluded and finally 46 itemsremained. Cronbach's Alpha of the 46 items was .94. The correlation coefficient of the Self-esteem scale was .79. Conclusion: By the above results, the researchers recommend the following: An exploratory study on the variables related to the meaning of life are needed for criterion validity of this scale. Studies on meaning of life of different groupa, and subjects are needed for reverification.

Item Response Analysis on Items Related to Statistical Unit in the National Academic Aptitude Test -Empirical Study for Jellabuk-do Preliminary Testee- (대학수학능력시험의 통계단원 문제에 대한 문항반응분석 - 전북지역 예비 수험생을 대상으로 한 탐색연구 -)

  • Choi, Kyoung-Ho
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.3
    • /
    • pp.327-335
    • /
    • 2010
  • Item response theory provides a fixed results about students, regardless of the item difficulty and discrimina-tion and it is also a kind of item analysis methods which provides the same proper competence scores to students in spite of them taking different test repeatedly. In this paper, we researched item difficulty and item discrimina-tion and analyzed items in the national academic aptitude test which were given from 2000 to 2009 in the past 10 years through item response theory, especially, in connection with given items about statistical unit. As a result, we found that about 60 percents of the items were too difficult for high school students to solve, however, item discrimination proved to be great.

A Study on the Measurement of Clothing Behavior of Elementary School Children (학령기 아동의 의복행동 측정도구 개발에 관한 연구 -4, 5, 6학년 아동을 중심으로-)

  • Lee Myoung Hee
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.11 no.2 s.24
    • /
    • pp.1-11
    • /
    • 1987
  • The purpose of this study was to develope a questionaire measuring clothing behavior of elementary school children. At first, after pretest, the clothing behavior questionaire consisted of 70 items which were devidad. into 7 subscales i.e. Clothing interest. Clothing satisfaction. Clothing management, Clothing sex-role. Clothing comfort. Clothing conformity. and Clothing independence. Each item was rated on a 3-point scale. Samples were 447 boys and girls (4 th, 5 th, 6 th grade) of three elementary schools in Seoul. Korea. The data were analyzed by item analysis and factor analysis. Factor analysis was useful in attempting to establish contruct validity. Item validity was examined based on the correlation coefficients and item discriminating power through the chi-square. Reliability was examined based on the Cronbach's Alpha Reliability Coefficient and test-retest method. With this analysis the subscales were reconstructed to following 6 factors. Clothing conformity items were not clustered by the factor analysis. 52 items of 6 factors selected by the analysis. The factors characteristics were as follows: 1. Clothing interest (10 items) 2. Clothing satisfaction (11 items) 3. Clothing management (8 items) 4. Clothing sex-role (12 items) 5. Clothing comfort (6 items) 6. Clothing independence (5 items)

  • PDF

Development of a Cardiovascular Disease Resilience Scale (심혈관질환용 회복력(Cardiovascular Disease Resilience) 측정도구 개발 및 평가)

  • Shin, Su-Jin
    • Korean Journal of Adult Nursing
    • /
    • v.22 no.2
    • /
    • pp.161-170
    • /
    • 2010
  • Purpose: The purpose of this study was to develop a Cardiovascular Disease Resilience (CDR) scale to evaluate disease specific resilience for recovery. Methods: The study was conducted as follows: items generation, and test of validity and reliability. Items were developed via literature review, review of instruments, and data acquired from the interviews. In order to test validity and reliability, seven panels of experts reviewed the preliminary questionnaire and then data were collected from 550 cardiovascular disease patients. Factor analysis, Pearson correlation, ANOVA, and Cronbach's alpha were used to analyze the data. Results: In the preliminary stage, forty-four items were generated. A reduction to 40 items was accomplished through content validity analysis. Factor analysis extracted 7 factors with a total of 25 items. The CDR items were moderately correlated with the subscales of the CD-RISC (Connor-Davidson Resilience Scale) and the mean score of CDR was associated with quality of life measured with CD-QOL (Cardiovascular Disease Quality of Life). Cronbach's ${\alpha}$=.84. Conclusion: Content validity, construct validity, criterion validity, and reliability of the CDR were established. The CDR is a reliable and valid instrument which the resilience of cardiovascular disease specific recovery state can be evaluated.

Qualitative and Quantitative Analysis of Paper-Pencil Test Items for Exploring its Appropriateness as a Selection Tool of the Gifted in Science (과학 영재 선발 도구로서 지필 검사의 적합성 탐색을 위한 질적 및 양적 문항 분석)

  • Lee, Ki-Young;Dong, Hyo-Kwan;Hong, Jun-Eui;Kim, Hyun-Kyung;Jo, Bong-Jae
    • Journal of The Korean Association For Science Education
    • /
    • v.28 no.1
    • /
    • pp.32-46
    • /
    • 2008
  • The purpose of this study was to analyse the qualitative and quantitative characteristics of paper-pencil tests for exploring its appropriateness as a selection tool of the gifted in science. For this purpose, we developed two (internal and external) item analysis frameworks, and applied these frameworks to analyse qualitative characteristics. Also, we analysed the relationship between two characteristics. The results of analysing qualitative characteristics revealed that the portion of items with acceleration context exceeding middle school curriculum level was relatively large, which caused low content validity. Furthermore, there was considerable deviation in content and context by subject matter and year, which caused test unstability. Items measuring knowledge domain was the most prevalent, and too much weight on data interpretation & analysis domain in inquiry process skills. In case of creativity test, the portion of items measuring convergent thinking was much larger than that of divergent or associative thinking. Most of these items were represented by using pictures and tables rather than using graphs. Item types of multiple-choice and short answers were superior to essay types. Discrimination index, on the whole, was appropriate (above 0.3), but item difficulty showed a vast deviation ($0.01{\sim}0.90$). Correlation coefficients among subject matters and test tools were very low, and test reliabilities were also low. Low item difficulty & high discrimination index item types were distinguishable. Items with acceleration context were more discriminating than enrichment context. Implications of developing quality paper-pencil test items in the selection of gifted students are discussed.

A Study on Comparison of Responses to Short Form Sasang Classification Questionnaire for American (SF_SSCQ-A) : Pilot test (미국인용 체질진단지에 의한 체질별 응답차이에 따른 문항 분석:Pilot test)

  • Lee, Eui-Ju;Yoo, Jung-Hee
    • Journal of Sasang Constitutional Medicine
    • /
    • v.21 no.1
    • /
    • pp.63-78
    • /
    • 2009
  • 1. Purpose This study has focused on response rates of the questionnaire which considered as a basic data to identify constitution for American. 2. Methods By analysing the tendency of the respondents who has defined constitution by clinical diagnosis and comparing of their answers, the result of their constitution analysis by our questionnaire were re-examed. The answer of each question to each constitution were tested how it is relevent to a scale of a constitution. Each item response rate on SF_SSCQ-A was analysed about those who had been tested and diagnosed as Taeyangin, Soyangin, Taeeumin, Soeumin respectively. 3. Results There were the 55 significant items; 13 Taeyangin items, 13 Soyangin items, 20 Taeeumin items, 9 Soeumin items. However, there were the 11 low response rate items (below 10 %) and 4 no response items.

  • PDF