透過您的圖書館登入
IP:3.140.185.147
  • 期刊

科學能力的建構反應評量之發展與信效度分析:以自然科光學為例

Developing and Validating a Constructed-Response Assessment of Scientific Abilities: A Case of the Optics Unit

摘要


由於建構反應試題較選擇題更適於評估學生的高階認知能力,本研究目的係在發展科學能力的建構反應評量,建立評分規準,並分析信度與效度。全評量包含「科學知識的記憶與瞭解」、「科學程序的應用與分析」、「科學邏輯的論證與表達」,以及「問題解決的評估與創造」四個分評量,共計32題。分析結果顯示,評分者內之Cronbach's α與評分者間之Kendall ω和諧係數值均大於.90,表示評分者內與評分者間的一致性良好。再者,評分者嚴苛度之多面向Rasch測量模式之卡方考驗未達顯著水準,表示評分者間的嚴苛度未有差異存在,infit與outfit MNSQ介於1 ± 0.5,顯示無論嚴格或寬鬆的評分者,均能有效區分高、低能力的學生。另RSM與PCM模式比較的卡方考驗達顯著水準,將所估計的Deviance進行BIC轉換,結果發現RSM較為適配,顯示評分者間有相同的評分閾值。此外,全評量之Cronbach's α在.85以上,顯示具有不錯的信度。驗證性因素分析結果顯示,「科學知識的記憶與瞭解」、「科學程序的應用與分析」、「科學邏輯的論證與表達」,以及「問題解決的評估與創造」所檢測四個一階潛在因素,可被二階因素之「科學能力」解釋的變異量分別為.92、.56、.46、.46,實徵資料尚且支持「科學能力的建構反應評量」的理論構念模式,係為一項精確測量科學能力的工具。

並列摘要


This study aimed to develop and validate a constructed-response assessment of scientific abilities and an accompanying rubric. The assessment included 32 open-ended test items that were categorized into four subscales-Remembering and understanding scientific knowledge, application and analysis of scientific procedures, argumentation and expression of scientific logic, and evaluation and innovation during problem solving. The analysis revealed the following results: First, the Cronbach's α values were higher than .90, indicating high intrarater consistency. Second, Kendall's coefficient of concordance was higher than .90 and its p value was less than .001, denoting a consistent scoring pattern between raters. In addition, many-facet Rasch measurement (MFRM) analysis revealed no significant difference in rater severity, whereas a comparison of the rating scale model (RSM) and partial credit model (PCM) indicated that each rater had a unique rating scale structure. The infit and outfit mean squares of the MFRM were 1 ± 0.5, which suggested that both severe and lenient raters could effectively distinguish high and low-ability students. The Deviance values estimated by the RSM and PCM were converted to Bayesian information criterion values, and the RSM was viewed to fit the empirical data appropriately compared with the PCM. Therefore, the severity thresholds of the raters were the same. Third, Cronbach's α coefficients of the four subassessments and the full assessment were higher than .85, indicating that the constructed-response assessment of scientific abilities (CRASA) provided a high internal-consistency reliability. Finally, confirmatory factor analysis revealed acceptable goodness-of-fit for the CRASA. These results suggested that the CRASA is a useful tool for accurately measuring scientific abilities.

參考文獻


林小慧、曾玉村(2017)。科學多重文本閱讀理解評量及規準之建構與信效度分析—以氣候變遷與三峽大壩之間的關係題本為例。教育心理學報,49(2),215-241。【Lin, H.-H., & Tzeng, Y.-T. (2017). Developing and validating a scientific multi-text reading comprehension assessment: Evidence from texts describing relationships between climate changes and the Three Gorges Dam. Bulletin of Educational Psychology, 49(2), 215-241. 】
Bennett, R. E., & Ward, W. C. (1993). Construction versus choice in cognitive measurement: Issues in constructed response, performance testing, and portfolio assessment. Hillsdale, NJ: Lawrence Erlbaum Associates.
Cohen, J. (2013). Statistical power analysis for the behavioral sciences (2nd ed.). Hoboken, NJ: Taylor and Francis.
Eckes, T. (2009). Many-facet Rasch measurement. In S. Takala (Ed.), Reference supplement to the manual for relating language examinations to the Common European Framework of Reference for languages: Learning, teaching, assessment (Section H). Strasbourg, France: Council of Europe/Language Policy Division.
Kuo, C.-Y., Wu, H.-K., Jen, T.-H., & Hsu, Y.-S. (2015). Development and validation of a multimedia-based assessment of scientific inquiry abilities. International Journal of Science Education, 37(14), 2326-2357.

被引用紀錄


何德華、張惠環、許婉儀(2023)。對話者之語言能力與評分嚴苛度對印尼語口語評量成績之影響教育心理學報55(1),25-46。https://doi.org/10.6251/BEP.202309_55(1).0002
謝名娟(2020)。從多層面Rasch模式來檢視不同的評分者等化連結設計對參數估計的影響教育心理學報52(2),415-436。https://doi.org/10.6251/BEP.202012_52(2).0008
林小慧、吳心楷(2019)。科學探究能力評量之標準設定與其效度檢核教育心理學報50(3),473-502。https://doi.org/10.6251/BEP.201903_50(3).0005
林小慧、郭哲宇、吳心楷(2021)。學生學習投入、好奇心、教師集體層級變項與科學探究能力的關係:跨層級調節式中介效果之探討教育科學研究期刊66(2),75-110。https://doi.org/10.6209/JORIES.202106_66(2).0003
劉怡薰、宋曜廷(2020)。中等學校師資生任教學科專門知識檢測機制之探討教育科學研究期刊65(2),167-194。https://doi.org/10.6209/JORIES.202006_65(2).0006

延伸閱讀