Nonsymmetric correspondence analysis, for analysing contingency tables with a dependence structure

nsCA(X, ncp = 5, row.sup = NULL,
     col.sup = NULL, quanti.sup = NULL, quali.sup = NULL, 
     graph = FALSE, axes = c(1,2), row.w = NULL)

Arguments

X

a data frame or a table with n rows and p columns, i.e. a contingency table. Predictor variable should be in rows and response variable in columns.

ncp

number of dimensions kept in the results (by default 5)

row.sup

a vector indicating the indexes of the supplementary rows

col.sup

a vector indicating the indexes of the supplementary columns

quanti.sup

a vector indicating the indexes of the supplementary continuous variables

quali.sup

a vector indicating the indexes of the categorical supplementary variables

graph

boolean, if TRUE a graph is displayed

axes

a length 2 vector specifying the components to plot

row.w

an optional row weights (by default, a vector of 1 and each row has a weight equals to its margin); the weights are given only for the active rows

Details

When dealing with a contingency table with a dependence structure, i.e. when the role of the two variables is not symmetrical but, on the contrary, one can be considered as predicting the other, nonsymmetric correspondence analysis (NSCA) can be used to represent the predictive structure in the table and to assess the predictive power of the predictor variable.

Technically, NSCA is very similar to the standard CA, the main difference being that the columns of the contingency table are not weighted by their rarity (i.e. the inverse of the marginal frequencies).

Value

An object of class CA from FactoMineR package, with an additional item :

GK.tau

Goodman and Kruskal tau

References

Kroonenberg P.M. and Lombardo R., 1999, "Nonsymmetric Correspondence Analysis: A Tool for Analysing Contingency Tables with a Dependence Structure", Multivariate Behavioral Research, 34 (3), 367-396.

Author

Nicolas Robette

Note

The code is adapted from the CA function in FactoMineR package.

See also

Examples

data(Music)
# The combination of Gender and Age is the predictor variable
# "Focused" listening to music is the response variable
tab <- with(Music, table(interaction(Gender, Age), OnlyMus))
nsca <- nsCA(tab)
nsca.biplot(nsca)

# Goodman and Kruskal tau
nsca$GK.tau
#> [1] 0.05991598