1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260
|
\name{anchor}
\alias{anchor}
\alias{anchor.default}
\alias{anchor.formula}
\alias{print.anchor}
\alias{print.summary.anchor}
\alias{summary.anchor}
\title{Anchor Methods for the Detection of Uniform DIF the Rasch Model}
\description{
The \code{anchor} function provides a variety of anchor
methods for the detection of uniform differential item functioning (DIF)
in the Rasch model between two pre-specified groups. These methods can
be divided in an anchor class that determines characteristics of the
anchor method and an anchor selection that determines the ranking order
of candidate anchor items. The aim of the \code{anchor} function is to
provide anchor items for DIF testing, e.g. with
\code{\link{anchortest}}.
}
\usage{
anchor(object, \dots)
\method{anchor}{default}(object, object2,
class = c("constant", "forward"), select = NULL,
length = NULL, range = c(0.1, 0.8), \dots)
\method{anchor}{formula}(formula, data = NULL, subset = NULL,
na.action = NULL, weights = NULL, model = raschmodel, \dots)
}
\arguments{
\item{object, object2}{Fitted model objects of class ``raschmodel''
estimated via conditional maximum likelihood using \code{\link{raschmodel}}.}
\item{\dots}{further arguments passed over to an internal call
of \code{\link{anchor.default}} in the formula method. In the
default method, these additional arguments are currently not being used.}
\item{class}{character. Available anchor classes are the
\code{constant} anchor class implying a constant anchor length
defined by \code{length} and the iterative \code{forward} anchor
class, for an overview see Kopf et al. (2014a).}
\item{select}{character. Several anchor selection strategies are
available: \code{"MTT"}, \code{"MPT"}, \code{"MT"}, \code{"MP"},
\code{"NST"}, \code{"AO"}, \code{"AOP"}. For details see below and
Kopf et al. (2014b). Following their recommendations, the default
\code{selection = "MPT"} is used for \code{class = "constant"} whereas
\code{selection = "MTT"} is used for \code{class = "forward"}.
Moreover, \code{"Gini"}, \code{"CLF"}, \code{"GiniT"}, and \code{"CLFT"}
have been suggested by Strobl et al. (2020) and can be combined
with \code{class = "constant"} and \code{length = 1}. The
\code{select = "Gini"} method will likely become the default in
future versions.}
\item{length}{integer. It pre-defines a maximum anchor length.
Per default, the \code{forward} anchor grows up to the proportion
of currently presumed DIF-free items specified in \code{range}
and the \code{constant} anchor class selects four anchor items,
unless an explicit limiting number is defined in \code{length}
by the user.}
\item{range}{numeric vector of length 2. The first element is
the percentage of first anchor candidates to be excluded for
consideration when the \code{forward} anchor class is used and the
second element determines a percentage of currently presumed DIF-free
items up to which the anchor from the \code{forward} anchor class is
allowed to grow.}
\item{formula}{formula of type \code{y ~ x} where \code{y} specifies a
matrix of dichotomous item responses and \code{x} the grouping
variable, e.g., gender, for which DIF should be tested for.}
\item{data}{a data frame containing the variables of the specified
\code{formula}.}
\item{subset}{logical expression indicating elements or rows to keep:
missing values are taken as false.}
\item{na.action}{a function which indicates what should happen when the data
contain missing values (\code{NA}s).}
\item{weights}{an optional vector of weights (interpreted as case weights).}
\item{model}{an IRT model fitting function with a suitable \code{itempar}
method, by default \code{\link{raschmodel}}.}
}
\details{
The anchor methods provided consist of an anchor class that determines
characteristics of the anchor and an anchor selection that determines
the ranking order of candidate anchor items.
In the \code{constant} anchor class, the anchor length is pre-defined by the
user within the argument \code{length}. The current default is four anchor
items. The iterative \code{forward} class starts with a single anchor item and
includes items in the anchor as long as the anchor length is shorter
than a certain percentage of the number of items that do not display
statistically significant DIF (default: 0.8). Furthermore, a percentage
of first anchor candidates is excluded from consideration (default: 0.1)
and the user is allowed to set a maximum number of anchor items using the
argument \code{length}. A detailed description of the anchor classes can be found
in Kopf et al. (2014a).
In more recent work Strobl et al. (2020) suggest a simpler yet powerful
anchor method based on inequality criteria like the Gini coefficient
or the component loss function (CLF). This can be shown to attain its optimium
for a single-anchor, thus correponding to a \code{constant} class of \code{length}
1. This will likely become the default method in future versions.
Both anchor classes require an explicit anchor selection strategy (as opposed to
the \code{all-other} anchor class which is therefore not included in the
function \code{anchor}). The anchor selection strategy determines the ranking order of
candidate anchor items. In case of two groups, each item \eqn{j, j = 1,
\ldots, k}{j, j = 1, \dots, k} (where \eqn{k}{k} denotes the number of
items in the test) obtains a criterion value \eqn{c_j}{c_j} that is
defined by the anchor selection strategy. The ranking order is
determined by the rank of the criterion value
rank\eqn{(c_j)}{(c_j)}.
The criterion values are defined by the respective anchor selection
strategy as in the following equations, where \eqn{1}{1} denotes the
indicator function, \eqn{t_j}{t_j} denotes the test statistic for item
\eqn{j}{j}, followed by the underlying anchor items in parenthesis,
\eqn{p_j}{p_j} the corresponding p-values (also with anchor items in parenthesis),
\eqn{\lceil 0.5\cdot k\rceil}{<0.5k>} denotes the empirical 50\% quantile
and \eqn{A_{purified}}{A_{purified}} the anchor after purification steps
(a detailed description of the anchor selection strategies implemented in
this function is provided in Kopf et al., 2014b).
All-other selection by Woods (2009), here abbreviated AO:
\deqn{ c_j^{AO} = | t_j (\{1,\ldots,k\}\textbackslash j) |}{%
c_j^{AO} = | t_j ({1,\dots,k}\\j) |}
All-other purified selection by Wang et al. (2012), here abbreviated AOP:
\deqn{ c_j^{AOP} = | t_j ( A_{purified} ) |}{%
c_j^{AOP} = | t_j ( A_{purified} ) |}
Number of significant threshold selection based on Wang et al. (2004), here abbreviated NST:
\deqn{ c_j^{NST} = \sum_{l \in \{1,\ldots,k\} \textbackslash j} 1 \left\{ p_j ( \{l\} ) \leq \alpha \right\}) |}{%
c_j^{NST} = \sum_{l in {1,\dots,k} \\ j} 1 { p_j ({l}) \le \alpha }) |}
Mean test statistic selection by Shih et al. (2009), here abbreviated MT:
\deqn{ c_j^{MT} = \frac{1}{k-1} \sum_{l \in \{1,\ldots,k\} \textbackslash j} \left| t_j ( \{l\}) \right| }{%
c_j^{MT} = 1/(k-1) \sum_{l in {1,\dots,k} \\ j} | t_j ({l}) |}
Mean p-value selection by Kopf et al. (2014b), here abbreviated MP:
\deqn{ c_j^{MP} = - \frac{1}{k-1} \sum_{l \in \{1,\ldots,k\} \textbackslash j} p_j ( \{l\} ) }{%
c_j^{MP} = - 1/(k-1) \sum_{l in {1,\dots,k} \\ j} p_j ({l})}
Mean test statistic threshold selection by Kopf et al. (2014b), here abbreviated MTT:
\deqn{ c_j^{MTT} =
\sum_{l \in \{1,\ldots,k\} \textbackslash j}
1 \left\{ \left| t_j ( \{l\} ) \right| >
\left( \left| \frac{1}{k-1} \sum_{l \in \{ 1, \ldots, k \} \textbackslash j}
t_j ( \{l\} ) \right|
\right)_{\left( \lceil 0.5\cdot k\rceil \right)}
\right\} }{%
c_j^{MTT} =
\sum_{l in {1,\dots,k} \\ j}
1 { | t_j ({l}) | > ( | 1/(k-1) \sum_{l \in { 1,\dots, k } \\ j}
t_j ({l}) | )_{<0.5k>}}
}
Mean p-value threshold selection by Kopf et al. (2014b), here abbreviated MPT:
\deqn{ c_j^{MPT} =
- \sum_{l \in \{1,\ldots,k\} \textbackslash j}
1 \left\{ p_j ( \{l\} ) > \left(
\frac{1}{k-1} \sum_{l \in \{ 1, \ldots, k \} \textbackslash j}
p_j ( \{l\} )
\right)_{ \left( \lceil 0.5\cdot k\rceil \right)}
\right\} }{%
c_j^{MPT} =
- \sum_{l in {1,\dots,k} \\ j}
1 { p_j ({l}) > ( 1/(k-1) \sum_{l in { 1, \dots, k } \\ j}
p_j ({l}))_{<0.5k>}
}
}
Kopf et al. (2015b) recommend to combine the \code{class = "constant"} with
\code{select = "MPT"} and the \code{class = "forward"} with \code{select = "MTT"},
respectively.
The \code{all-other} anchor class (that assumes that DIF is balanced i.e. no
group has an advantage in the test) is here
not considered as explicit anchor selection and, thus, not included
in the \code{anchor} function (but in the \code{\link{anchortest}}
function). Note that the \code{all-other} anchor class requires strong prior
knowledge that DIF is balanced.
}
\value{
An object of class \code{anchor}, i.e. a list including
\item{anchor_items}{the anchor items for DIF analysis}
\item{ranking_order}{a ranking order of candidate anchor items}
\item{criteria}{the criterion values obtained by the respective anchor
selection}
}
\references{
Kopf J, Zeileis A, Strobl C (2015a).
A Framework for Anchor Methods and an Iterative Forward Approach for DIF Detection.
\emph{Applied Psychological Measurement}, \bold{39}(2), 83--103.
\doi{10.1177/0146621614544195}
Kopf J, Zeileis A, Strobl C (2015b).
Anchor Selection Strategies for DIF Analysis: Review, Assessment, and New Approaches.
\emph{Educational and Psychological Measurement}, \bold{75}(1), 22--56.
\doi{10.1177/0013164414529792}
Shih CL, Wang WC (2009).
Differential Item Functioning Detection Using the Multiple Indicators, Multiple Causes Method with a Pure Short Anchor.
\emph{Applied Psychological Measurement}, \bold{33}(3), 184--199.
Strobl C, Kopf J, Kohler L, von Oertzen T, Zeileis A (2020).
Anchor Point Selection -- Scale Alignment Based on an Inequality Criterion.
\emph{Applied Psychological Measurement}, Forthcoming.
\url{https://EconPapers.repec.org/RePEc:inn:wpaper:2018-03}
Wang WC (2004).
Effects of Anchor Item Methods on the Detection of Differential Item Functioning within the Family of Rasch Models.
\emph{Journal of Experimental Education}, \bold{72}(3), 221--261.
Wang WC, Shih CL, Sun GW (2012).
The DIF-Free-then-DIF Strategy for the Assessment of Differential Item Functioning.
\emph{Educational and Psychological Measurement}, \bold{72}(4), 687--708.
Woods C (2009).
Empirical Selection of Anchors for Tests of Differential Item Functioning.
\emph{Applied Psychological Measurement}, \bold{33}(1), 42--57.
}
\seealso{
\code{\link{anchortest}}
}
\examples{
if(requireNamespace("multcomp")) {
## Verbal aggression data
data("VerbalAggression", package = "psychotools")
## Rasch model for the self-to-blame situations; gender DIF test
raschmodels <- with(VerbalAggression, lapply(levels(gender), function(i)
raschmodel(resp2[gender == i, 1:12])))
## four anchor items from constant anchor class using MPT-selection
canchor <- anchor(object = raschmodels[[1]], object2 = raschmodels[[2]],
class = "constant", select = "MPT", length = 4)
canchor
summary(canchor)
## iterative forward anchor class using MTT-selection
set.seed(1)
fanchor <- anchor(object = raschmodels[[1]], object2 = raschmodels[[2]],
class = "forward", select = "MTT", range = c(0.05, 1))
fanchor
## the same using the formula interface
set.seed(1)
fanchor2 <- anchor(resp2[, 1:12] ~ gender , data = VerbalAggression,
class = "forward", select = "MTT", range = c(0.05, 1))
## really the same?
all.equal(fanchor, fanchor2, check.attributes = FALSE)
}
}
\keyword{regression}
|