Identifying Potentially Flawed Items in the Context of Small Sample IRT Analysis

Panagiotis Fotaris; Theodoros Mastoras; Ioannis Mavridis; Athanasios Manitsaris

University of Brighton

Research output: Contribution to journal › Article › peer-review

Abstract

Although Classical Test Theory has been used by the measurement community for almost a century, Item Response Theory has become commonplace for educational assessment development, evaluation and refinement in recent decades. Its potential for improving test items as well as eliminating the ambiguous or misleading ones is substantial. However, in order to estimate its parameters and produce reliable results, IRT requires a large sample size of examinees, thus limiting its use to large-scale testing programs. Nevertheless, the accuracy of parameter estimates becomes of lesser importance when trying to detect items whose parameters exceed a threshold value. Under this consideration, the present study investigates the application of IRT-based assessment evaluation to small sample sizes through a series of simulations. Additionally, it introduces a set of quality indices, which exhibit the success rate of identifying potentially flawed items in a way that test developers without a significant statistical background can easily comprehend and utilize.
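The threshold idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the two-parameter logistic (2PL) model, the cut-offs `min_a` and `max_abs_b`, the helper names, and all numeric values below are assumptions chosen only for illustration. The sketch flags items whose discrimination `a` or difficulty `b` crosses a cut-off, then reports what share of the truly flawed items the small-sample estimates still catch.

```python
import math


def p_correct(theta, a, b):
    """2PL item response function: P(correct | ability theta)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def flag_items(params, min_a=0.5, max_abs_b=3.0):
    """Indices of items whose (a, b) cross the hypothetical thresholds."""
    return [i for i, (a, b) in enumerate(params)
            if a < min_a or abs(b) > max_abs_b]


def hit_rate(true_params, estimated_params):
    """Share of truly flawed items that the estimates also flag."""
    truly_flawed = set(flag_items(true_params))
    if not truly_flawed:
        return None  # nothing to detect
    detected = truly_flawed & set(flag_items(estimated_params))
    return len(detected) / len(truly_flawed)


# Made-up generating parameters and noisy small-sample estimates (a, b).
true_params = [(1.2, 0.0), (0.3, 0.5), (1.0, 3.5)]
estimated_params = [(1.0, 0.2), (0.45, 0.4), (0.9, 2.8)]

print(flag_items(estimated_params))             # [1]
print(hit_rate(true_params, estimated_params))  # 0.5
```

In this made-up example the second item is flagged even though its estimated discrimination (0.45) is off from the generating value (0.3), which is the sense in which estimate accuracy matters less for threshold detection. The third item is missed because its estimated difficulty falls just under the assumed cut-off.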

Original language	English
Pages (from-to)	31-42
Number of pages	12
Journal	International Journal on Advances in Intelligent Systems
Volume	4
Issue number	1-2
Publication status	Published - 1 Sept 2011

Bibliographical note

Creative Commons "Attribution-Non Commercial-Share Alike" license

Keywords

item response theory
computer-aided assessment
item quality
educational measurement
learning assessment evaluation
e-learning
psychometrics

Access to Document

Identifying potentially-flawed items in the context of small sample IRT analysis (Edit).pdf
Final published version, 1.36 MB
Licence: CC BY-NC-SA

http://www.iariajournals.org/intelligent_systems/intsys_v4_n12_2011_paged.pdf#page=37
Licence: Unspecified

Cite this

@article{cff19627323240aaa61415b2517d291e,

title = "Identifying Potentially Flawed Items in the Context of Small Sample IRT Analysis",

abstract = "Although Classical Test Theory has been used by the measurement community for almost a century, Item Response Theory has become commonplace for educational assessment development, evaluation and refinement in recent decades. Its potential for improving test items as well as eliminating the ambiguous or misleading ones is substantial. However, in order to estimate its parameters and produce reliable results, IRT requires a large sample size of examinees, thus limiting its use to large-scale testing programs. Nevertheless, the accuracy of parameter estimates becomes of lesser importance when trying to detect items whose parameters exceed a threshold value. Under this consideration, the present study investigates the application of IRT-based assessment evaluation to small sample sizes through a series of simulations. Additionally, it introduces a set of quality indices, which exhibit the success rate of identifying potentially flawed items in a way that test developers without a significant statistical background can easily comprehend and utilize.",

keywords = "item response theory, computer-aided assessment, item quality, educational measurement, learning assessment evaluation, e-learning, psychometrics",

author = "Panagiotis Fotaris and Theodoros Mastoras and Ioannis Mavridis and Athanasios Manitsaris",

note = "Creative Commons {"}Attribution-Non Commercial-Share Alike{"} license",

year = "2011",

month = sep,

day = "1",

language = "English",

volume = "4",

pages = "31--42",

journal = "International Journal on Advances in Intelligent Systems",

issn = "1942-2679",

number = "1-2",

}

TY - JOUR

T1 - Identifying Potentially Flawed Items in the Context of Small Sample IRT Analysis

AU - Fotaris, Panagiotis

AU - Mastoras, Theodoros

AU - Mavridis, Ioannis

AU - Manitsaris, Athanasios

N1 - Creative Commons "Attribution-Non Commercial-Share Alike" license

PY - 2011/9/1

Y1 - 2011/9/1

AB - Although Classical Test Theory has been used by the measurement community for almost a century, Item Response Theory has become commonplace for educational assessment development, evaluation and refinement in recent decades. Its potential for improving test items as well as eliminating the ambiguous or misleading ones is substantial. However, in order to estimate its parameters and produce reliable results, IRT requires a large sample size of examinees, thus limiting its use to large-scale testing programs. Nevertheless, the accuracy of parameter estimates becomes of lesser importance when trying to detect items whose parameters exceed a threshold value. Under this consideration, the present study investigates the application of IRT-based assessment evaluation to small sample sizes through a series of simulations. Additionally, it introduces a set of quality indices, which exhibit the success rate of identifying potentially flawed items in a way that test developers without a significant statistical background can easily comprehend and utilize.

KW - item response theory

KW - computer-aided assessment

KW - item quality

KW - educational measurement

KW - learning assessment evaluation

KW - e-learning

KW - psychometrics

M3 - Article

SN - 1942-2679

VL - 4

SP - 31

EP - 42

JO - International Journal on Advances in Intelligent Systems

JF - International Journal on Advances in Intelligent Systems

IS - 1-2

ER -
