cluster.cor {psych}R Documentation

Find correlations of composite variables from a larger matrix


Given a n x c cluster definition matrix of -1s, 0s, and 1s (the keys) , and a n x n correlation matrix, find the correlations of the composite clusters. The keys matrix can be entered by hand, copied from the clipboard (read.clipboard), or taken as output from the factor2cluster function. Similar functionality to scoreItems which also gives item by cluster correlations.


cluster.cor(keys, r.mat, correct = TRUE,SMC=TRUE,item.smc=NULL,impute=TRUE)



A matrix of cluster keys


A correlation matrix


TRUE shows both raw and corrected for attenuation correlations


Should squared multiple correlations be used as communality estimates for the correlation matrix?


the smcs of the items may be passed into the function for speed, or calculated if SMC=TRUE


if TRUE, impute missing scale correlations based upon the average interitem correlation, otherwise return NA.


This is one of the functions used in the SAPA ( procedures to form synthetic correlation matrices. Given any correlation matrix of items, it is easy to find the correlation matrix of scales made up of those items. This can also be done from the original data matrix or from the correlation matrix using score.items which is probably preferred.

A typical use in the SAPA project is to form item composites by clustering or factoring (see fa, ICLUST, principal), extract the clusters from these results (factor2cluster), and then form the composite correlation matrix using cluster.cor. The variables in this reduced matrix may then be used in multiple correlatin procedures using mat.regress.

The original correlation is pre and post multiplied by the (transpose) of the keys matrix.

If some correlations are missing from the original matrix this will lead to missing values (NA) for scale intercorrelations based upon those lower level correlations. If impute=TRUE (the default), a warning is issued and the correlations are imputed based upon the average correlations of the non-missing elements of each scale.

Because the alpha estimate of reliability is based upon the correlations of the items rather than upon the covariances, this estimate of alpha is sometimes called “standardized alpha". If the raw items are available, it is useful to compare standardized alpha with the raw alpha found using scoreItems. They will differ substantially only if the items differ a great deal in their variances.



the (raw) correlation matrix of the clusters


standard deviation of the cluster scores


raw correlations below the diagonal, alphas on diagonal, disattenuated above diagonal


The (standardized) alpha reliability of each scale.


Guttman's Lambda 6 reliability estimate is based upon the smcs for each item in a scale. G6 uses the smc based upon the entire item domain.


The average inter item correlation within a scale


How many items are in each cluster?


See SAPA Revelle, W., Wilt, J., and Rosenthal, A. (2010) Personality and Cognition: The Personality-Cognition Link. In Gruszka, A. and Matthews, G. and Szymura, B. (Eds.) Handbook of Individual Differences in Cognition: Attention, Memory and Executive Control, Springer.


Maintainer: William Revelle

See Also

factor2cluster, mat.regress, alpha, and most importantly, scoreItems, which will do all of what cluster.cor does for most users. cluster.cor is an important helper function for iclust


## Not run: 
keys <- matrix(c(1,1,1,0,0,0,0,
colnames(keys) <- c("first","second")
r.mat <- cor(attitude)

## End(Not run)
#       first second
#first    1.0    0.6
#second   0.6    1.0
# first second 
#  2.57   3.01 
#       first second
#first   0.82   0.77
#second  0.60   0.74
# first second 
#     3      4 

[Package psych version 1.4.5 Index]
Part of the Personality Project      Take our Personality Test