Introduction
The corrplot package is a graphical display of a correlation matrix, confidence interval. It also contains some algorithms to do matrix reordering. In addition, corrplot is good at details, including choosing color, text labels, color labels, layout, etc.
Visualization methods
There are seven visualization methods (parameter method
) in corrplot package, named "circle"
, "square"
, "ellipse"
, "number"
, "shade"
, "color"
, "pie"
.
Positive correlations are displayed in blue and negative correlations in red color. Color intensity and the size of the circle are proportional to the correlation coefficients.
## corrplot 0.88 loaded
Layout
There are three layout types (parameter type
): - "full"
(default) : display full correlation matrix - "upper"
: display upper triangular of the correlation matrix - "lower"
: display lower triangular of the correlation matrix
corrplot.mixed()
is a wrapped function for mixed visualization style.
Reorder a correlation matrix
The correlation matrix can be reordered according to the correlation coefficient. This is important to identify the hidden structure and pattern in the matrix. There are four methods in corrplot (parameter order
), named "AOE"
, "FPC"
, "hclust"
, "alphabet"
. More algorithms can be found in seriation package.
You can also reorder the matrix “manually” via function corrMatOrder()
.
"AOE"
is for the angular order of the eigenvectors. It is calculated from the order of the angles \(a_i\),\[ a_i = \begin{cases} \arctan (e_{i2}/e_{i1}), & \text{if $e_{i1}>0$;} \newline \arctan (e_{i2}/e_{i1}) + \pi, & \text{otherwise.} \end{cases} \]
where \(e_1\) and \(e_2\) are the largest two eigenvalues of the correlation matrix. See Michael Friendly (2002) for details.
"FPC"
for the first principal component order."hclust"
for hierarchical clustering order, and"hclust.method"
for the agglomeration method to be used."hclust.method"
should be one of"ward"
,"single"
,"complete"
,"average"
,"mcquitty"
,"median"
or"centroid"
."alphabet"
for alphabetical order.
If using "hclust"
, corrplot()
can draw rectangles around the chart of corrrlation matrix based on the results of hierarchical clustering.
# Change background color to lightblue
corrplot(M, type = "upper", order = "hclust", cl.pos = "n",
col = c("black", "white"), bg = "lightblue")
Using different color spectra
As shown in the above section, the color of the correlogram can be customized. The function colorRampPalette()
is very convenient for generating color spectrum.
## color example, suitable for matrix in [-N, 0] or [0, N]
col0 <- colorRampPalette(c("white", "cyan", "#007FFF", "blue","#00007F"))
## diverging color example, suitable for matrix in [-N, N]
col1 <- colorRampPalette(c("#7F0000", "red", "#FF7F00", "yellow", "white",
"cyan", "#007FFF", "blue", "#00007F"))
col2 <- colorRampPalette(c("#67001F", "#B2182B", "#D6604D", "#F4A582",
"#FDDBC7", "#FFFFFF", "#D1E5F0", "#92C5DE",
"#4393C3", "#2166AC", "#053061"))
col3 <- colorRampPalette(c("red", "white", "blue"))
col4 <- colorRampPalette(c("#7F0000", "red", "#FF7F00", "yellow", "#7FFF7F",
"cyan", "#007FFF", "blue", "#00007F"))
whiteblack <- c("white", "black")
## using these color spectra
corrplot(M, order = "hclust", addrect = 2, col = col1(100))
You can also use the standard color palettes (package grDevices
)
Other option would be to use RcolorBrewer
package.
library(RColorBrewer)
corrplot(M, type = "upper", order = "hclust",
col = brewer.pal(n = 8, name = "RdBu"))
Changing color and rotation of text labels and legend
Parameter cl.*
is for color legend, and tl.*
if for text legend. For the text label, tl.col
(text label color) and tl.srt
(text label string rotation) are used to change text colors and rotations.
Here are some examples.
## remove color legend, text legend and principal diagonal glyph
corrplot(M, order = "AOE", cl.pos = "n", tl.pos = "n", diag = FALSE)
## bottom color legend, diagonal text legend, rotate text label
corrplot(M, order = "AOE", cl.pos = "b", tl.pos = "d")
## a wider color legend with numbers right aligned
corrplot(M, order = "AOE", cl.ratio = 0.2, cl.align = "r")
## text labels rotated 45 degrees
corrplot(M, type = "lower", order = "hclust", tl.col = "black", tl.srt = 45)
Assign and display colors by cl.lim
# when is.corr=TRUE, cl.lim only affect the color legend
# If you change it, the color on matrix is still assigned on [-1, 1]
corrplot(M/2)
# when is.corr=FALSE, cl.lim is also used to assign colors
# if the matrix have both positive and negative values
# the matrix transformation keep every values positive and negative
corrplot(M*2, is.corr = FALSE, cl.lim=c(-2, 2))
Dealing with a non-correlation matrix
# for showing the usage of cl.lim, run with warning
corrplot(M*2, is.corr = FALSE, cl.lim=c(-2, 2) * 2)
## Warning in corrplot(M * 2, is.corr = FALSE, cl.lim = c(-2, 2) * 2): cl.lim
## interval too wide, please set a suitable value
If your matrix is rectangular, you can adjust the aspect ratio with the win.asp
parameter to make the matrix rendered as a square.
Dealing with missing (NA) values
By default, corrplot renders NA values as "?"
characters. Using na.label
parameter, it is possible to use a different value (max. two characters are supported).
Using “plotmath” expressions in labels
Since version 0.78
, it is possible to use plotmath expression in variable names. To activate plotmath rendering, prefix your label with one of the characters ":"
, "="
or "$"
.
M2 <- M[1:5,1:5]
colnames(M2) <- c("alpha", "beta", ":alpha+beta", ":alpha[0]", "=alpha[beta]")
rownames(M2) <- c("$alpha", "$beta", NA, "$alpha[0]", "$alpha[beta]")
corrplot(M2)
Combining correlogram with the significance test
res1 <- cor.mtest(mtcars, conf.level = .95)
res2 <- cor.mtest(mtcars, conf.level = .99)
## specialized the insignificant value according to the significant level
corrplot(M, p.mat = res1$p, sig.level = .2)
## add cross on no significant coefficient
corrplot(M, p.mat = res1$p, order = "hclust", insig = "pch", addrect = 3)
Visualize confidence interval
corrplot(M, low = res1$lowCI, upp = res1$uppCI, order = "hclust",
rect.col = "navy", plotC = "rect", cl.pos = "n")
corrplot(M, p.mat = res1$p, low = res1$lowCI, upp = res1$uppCI,
order = "hclust", pch.col = "red", sig.level = 0.01,
addrect = 3, rect.col = "navy", plotC = "rect", cl.pos = "n")
res1 <- cor.mtest(mtcars, conf.level = .95)
corrplot(M, p.mat = res1$p, insig = "label_sig",
sig.level = c(.001, .01, .05), pch.cex = .9, pch.col = "white")
corrplot(M, p.mat = res1$p, method = "color", type = "upper",
sig.level = c(.001, .01, .05), pch.cex = .9,
insig = "label_sig", pch.col = "white", order = "AOE")
corrplot(M, p.mat = res1$p, insig = "label_sig", pch.col = "white",
pch = "p<.05", pch.cex = .5, order = "AOE")