class: center, middle, title-slide .title[ # Color scales ] .author[ ### Claus O. Wilke ] .date[ ### last updated: 2023-01-23 ] --- # Uses of color in data visualization -- <table style = "border: none; line-height: 2.5;"> <tr style = "background: white;"> <td style = "text-align: left; width: 50%;"> 1. Distinguish categories (qualitative) </td> <td> <img src = "color-scales_files/qualitative.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> </table> --- # Qualitative scale example .center.nogap[ data:image/s3,"s3://crabby-images/fbda2/fbda2ddec4edb1b466528dc9ba565d839ce915e4" alt=""<!-- --> ] Palette name: Okabe-Ito ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Qualitative scale example .center.nogap[ data:image/s3,"s3://crabby-images/689c1/689c1aa9d9e2bb380393215ebd632f97f0a0d048" alt=""<!-- --> ] Palette name: ColorBrewer Set1 ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Qualitative scale example .center.nogap[ data:image/s3,"s3://crabby-images/53060/53060a8cb7d7e18059d73ace055b5848956e69d8" alt=""<!-- --> ] Palette name: ColorBrewer Set3 ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Uses of color in data visualization <table style = "border: none; line-height: 2.5;"> <tr style = "background: white;"> <td style = "text-align: left; width: 50%;"> 1. Distinguish categories (qualitative) </td> <td> <img src = "color-scales_files/qualitative.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 2. Represent numeric values (sequential) </td> <td> <img src = "color-scales_files/sequential.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> </table> --- # Sequential scale example <br> .center[ data:image/s3,"s3://crabby-images/72a59/72a59c1e2d16a19706812a2a77d7637858c62f2c" alt=""<!-- --> ] Palette name: Viridis ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Sequential scale example <br> .center[ data:image/s3,"s3://crabby-images/e8a75/e8a7592b2bff4af52495cbb96d06833dcfbb193b" alt=""<!-- --> ] Palette name: Inferno ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Sequential scale example <br> .center[ data:image/s3,"s3://crabby-images/6cb52/6cb5281c42d8d24d610ad34438600d697696ff9f" alt=""<!-- --> ] Palette name: Cividis ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Uses of color in data visualization <table style = "border: none; line-height: 2.5;"> <tr style = "background: white;"> <td style = "text-align: left; width: 50%;"> 1. Distinguish categories (qualitative) </td> <td> <img src = "color-scales_files/qualitative.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 2. Represent numeric values (sequential) </td> <td> <img src = "color-scales_files/sequential.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 3. Represent numeric values (diverging) </td> <td> <img src = "color-scales_files/diverging.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> </table> --- # Diverging scale example .center.nogap[ data:image/s3,"s3://crabby-images/229ee/229eed9e69c85a75a69fe57bb326d90c605321b8" alt=""<!-- --> ] Palette name: ColorBrewer PiYG ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Diverging scale example .center.nogap[ data:image/s3,"s3://crabby-images/35574/35574dcdd72f2b398f433211403c71e4de3f16b1" alt=""<!-- --> ] Palette name: Carto Earth ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Diverging scale example .center.nogap[ data:image/s3,"s3://crabby-images/17947/17947f67597ccaddd47358628b2a938b90985fc6" alt=""<!-- --> ] Palette name: Blue-Red ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Uses of color in data visualization <table style = "border: none; line-height: 2.5;"> <tr style = "background: white;"> <td style = "text-align: left; width: 50%;"> 1. Distinguish categories (qualitative) </td> <td> <img src = "color-scales_files/qualitative.png" width = 100% style = "text-align: right; vertical-align: middle;"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 2. Represent numeric values (sequential) </td> <td> <img src = "color-scales_files/sequential.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 3. Represent numeric values (diverging) </td> <td> <img src = "color-scales_files/diverging.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 4. Highlight </td> <td> <img src = "color-scales_files/highlight.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> </table> --- # Highlight example .center[ data:image/s3,"s3://crabby-images/5a6a5/5a6a52be822eb1959f836cd33ff89f7463959b86" alt=""<!-- --> ] Palette name: Grays with accents ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Highlight example .center[ data:image/s3,"s3://crabby-images/6cd7c/6cd7c4f03dbb383a05b90d11d4ade03b37071a3f" alt=""<!-- --> ] Palette name: Okabe-Ito accent ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Highlight example .center[ data:image/s3,"s3://crabby-images/0fe8e/0fe8e15ff6dc630eb005a2382851a9d73a52a06e" alt=""<!-- --> ] Palette name: ColorBrewer accent ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- # Uses of color in data visualization <table style = "border: none; line-height: 2.5;"> <tr style = "background: white;"> <td style = "text-align: left; width: 50%;"> 1. Distinguish categories (qualitative) </td> <td> <img src = "color-scales_files/qualitative.png" width = 100% style = "text-align: right; vertical-align: middle;"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 2. Represent numeric values (sequential) </td> <td> <img src = "color-scales_files/sequential.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 3. Represent numeric values (diverging) </td> <td> <img src = "color-scales_files/diverging.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> <tr style = "background: white;"> <td style = "text-align: left;"> 4. Highlight </td> <td> <img src = "color-scales_files/highlight.png" width = 100% style = "text-align: right; vertical-align: middle"></img> </td> </tr> </table> [//]: # "segment ends here" --- class: center middle ## Color scales in **ggplot2** --- ## Getting the data The `temps_months` dataset: .tiny-font[ ```r temps_months <- read_csv("https://wilkelab.org/DSC385/datasets/tempnormals.csv") %>% group_by(location, month_name) %>% summarize(mean = mean(temperature)) %>% mutate( month = factor( month_name, levels = c("Jan", "Feb", "Mar", "Apr", "May", "Jun", "Jul", "Aug", "Sep", "Oct", "Nov", "Dec") ), location = factor( location, levels = c("Death Valley", "Houston", "San Diego", "Chicago") ) ) %>% select(-month_name) ``` ] --- ## Getting the data The `popgrowth` dataset: .tiny-font[ ```r US_census <- read_csv("https://wilkelab.org/DSC385/datasets/US_census.csv") US_regions <- read_csv("https://wilkelab.org/DSC385/datasets/US_regions.csv") popgrowth <- left_join(US_census, US_regions) %>% group_by(region, division, state) %>% summarize( pop2000 = sum(pop2000, na.rm = TRUE), pop2010 = sum(pop2010, na.rm = TRUE), popgrowth = (pop2010-pop2000)/pop2000, .groups = "drop" ) %>% mutate(region = factor(region, levels = c("West", "South", "Midwest", "Northeast"))) ``` ] --- ## **ggplot2** color scale functions are a bit of a mess -- .small-font.center[ Scale function | Aesthetic | Data type | Palette type :----------- | :---------- | :------------ | :------------ `scale_color_hue()` | `color` | discrete | qualitative ] --- ## **ggplot2** color scale functions are a bit of a mess .small-font.center[ Scale function | Aesthetic | Data type | Palette type :----------- | :---------- | :------------ | :------------ `scale_color_hue()` | `color` | discrete | qualitative `scale_fill_hue()` | `fill ` | discrete | qualitative ] --- ## **ggplot2** color scale functions are a bit of a mess .small-font.center[ Scale function | Aesthetic | Data type | Palette type :----------- | :---------- | :------------ | :------------ `scale_color_hue()` | `color` | discrete | qualitative `scale_fill_hue()` | `fill ` | discrete | qualitative `scale_color_gradient()` | `color` | continuous | sequential ] --- ## **ggplot2** color scale functions are a bit of a mess .small-font.center[ Scale function | Aesthetic | Data type | Palette type :----------- | :---------- | :------------ | :------------ `scale_color_hue()` | `color` | discrete | qualitative `scale_fill_hue()` | `fill ` | discrete | qualitative `scale_color_gradient()` | `color` | continuous | sequential `scale_color_gradient2()` | `color` | continuous | diverging ] --- ## **ggplot2** color scale functions are a bit of a mess .small-font.center[ Scale function | Aesthetic | Data type | Palette type :----------- | :---------- | :------------ | :------------ `scale_color_hue()` | `color` | discrete | qualitative `scale_fill_hue()` | `fill ` | discrete | qualitative `scale_color_gradient()` | `color` | continuous | sequential `scale_color_gradient2()` | `color` | continuous | diverging `scale_fill_viridis_c()` | `color` | continuous | sequential `scale_fill_viridis_d()` | `fill` | discrete | sequential `scale_color_brewer()` | `color` | discrete | qualitative, diverging, sequential `scale_fill_brewer()` | `fill` | discrete | qualitative, diverging, sequential `scale_color_distiller()` | `color` | continuous | qualitative, diverging, sequential ] ... and there are many many more --- ## Examples .small-font[ ```r ggplot(temps_months, aes(x = month, y = location, fill = mean)) + geom_tile(width = 0.95, height = 0.95) + coord_fixed(expand = FALSE) + theme_classic() # no fill scale defined, default is scale_fill_gradient() ``` ] .center[ data:image/s3,"s3://crabby-images/6555e/6555ecb601c0c02127cdc2b2218819fdf15109eb" alt=""<!-- --> ] --- ## Examples .small-font[ ```r ggplot(temps_months, aes(x = month, y = location, fill = mean)) + geom_tile(width = 0.95, height = 0.95) + coord_fixed(expand = FALSE) + theme_classic() + scale_fill_gradient() ``` ] .center[ data:image/s3,"s3://crabby-images/191ba/191ba34414bc83693ebbf6a7a0e09c013b776b51" alt=""<!-- --> ] --- ## Examples .small-font[ ```r ggplot(temps_months, aes(x = month, y = location, fill = mean)) + geom_tile(width = 0.95, height = 0.95) + coord_fixed(expand = FALSE) + theme_classic() + scale_fill_viridis_c() ``` ] .center[ data:image/s3,"s3://crabby-images/1bf66/1bf66cb1c0e0d3590d403da5b0ff150f579025e1" alt=""<!-- --> ] --- ## Examples .small-font[ ```r ggplot(temps_months, aes(x = month, y = location, fill = mean)) + geom_tile(width = 0.95, height = 0.95) + coord_fixed(expand = FALSE) + theme_classic() + scale_fill_viridis_c(option = "B", begin = 0.15) ``` ] .center[ data:image/s3,"s3://crabby-images/bc2c4/bc2c48c728fda34244dc4115be5076fa4e9c6ade" alt=""<!-- --> ] --- ## Examples .small-font[ ```r ggplot(temps_months, aes(x = month, y = location, fill = mean)) + geom_tile(width = 0.95, height = 0.95) + coord_fixed(expand = FALSE) + theme_classic() + scale_fill_distiller(palette = "YlGnBu") ``` ] .center[ data:image/s3,"s3://crabby-images/e7012/e7012e96a4c298a9d1f9cab7b605876ca713e776" alt=""<!-- --> ] --- ## The **colorspace** package creates some order Scale name: `scale_<aesthetic>_<datatype>_<colorscale>()` -- .small-font[ - `<aesthetic>`: name of the aesthetic (`fill`, `color`, `colour`) - `<datatype>`: type of variable plotted (`discrete`, `continuous`, `binned`) - `<colorscale>`: type of the color scale (`qualitative`, `sequential`, `diverging`, `divergingx`) ] -- .small-font.center[ Scale function | Aesthetic | Data type | Palette type :----------- | :-------- | :--------- | :------------ `scale_color_discrete_qualitative()` | `color` | discrete | qualitative `scale_fill_continuous_sequential()` | `fill` | continuous | sequential `scale_colour_continous_divergingx()` | `colour` | continuous | diverging ] --- ## Examples .tiny-font[ ```r ggplot(temps_months, aes(x = month, y = location, fill = mean)) + geom_tile(width = 0.95, height = 0.95) + coord_fixed(expand = FALSE) + theme_classic() + scale_fill_continuous_sequential(palette = "YlGnBu", rev = FALSE) ``` ] .center[ data:image/s3,"s3://crabby-images/ccfbf/ccfbfda1707448749315ae984cdf0a673344d880" alt=""<!-- --> ] --- ## Examples .tiny-font[ ```r ggplot(temps_months, aes(x = month, y = location, fill = mean)) + geom_tile(width = 0.95, height = 0.95) + coord_fixed(expand = FALSE) + theme_classic() + scale_fill_continuous_sequential(palette = "Viridis", rev = FALSE) ``` ] .center[ data:image/s3,"s3://crabby-images/7f3d7/7f3d7cc9256c6b5c632c14a75a03d669b6a12b24" alt=""<!-- --> ] --- ## Examples .tiny-font[ ```r ggplot(temps_months, aes(x = month, y = location, fill = mean)) + geom_tile(width = 0.95, height = 0.95) + coord_fixed(expand = FALSE) + theme_classic() + scale_fill_continuous_sequential(palette = "Inferno", begin = 0.15, rev = FALSE) ``` ] .center[ data:image/s3,"s3://crabby-images/92b1c/92b1cc2272936352415f9ead8558acc1f6d72bb1" alt=""<!-- --> ] --- .tiny-font.nogap[ ```r colorspace::hcl_palettes(type = "sequential", plot = TRUE) # all sequential palettes ``` .center[ data:image/s3,"s3://crabby-images/0e2f9/0e2f9e12451d09075636cab1144a74847cad2a8d" alt=""<!-- --> ] ] --- .tiny-font.nogap[ ```r colorspace::hcl_palettes(type = "diverging", plot = TRUE, n = 9) # all diverging palettes ``` .center[ data:image/s3,"s3://crabby-images/ac946/ac946a31654db07f486d6ef9412096be80f86b73" alt=""<!-- --> ] ] --- .tiny-font.nogap[ ```r colorspace::divergingx_palettes(plot = TRUE, n = 9) # all divergingx palettes ``` .center[ data:image/s3,"s3://crabby-images/28401/28401092f4d83cc3db68cbf6b080aed4146215a5" alt=""<!-- --> ] ] [//]: # "segment ends here" --- class: center middle ## Setting colors manually for discrete, qualitative scales --- ## Discrete, qualitative scales are best set manually .tiny-font[ ```r ggplot(popgrowth, aes(x = pop2000, y = popgrowth, color = region)) + geom_point() + scale_x_log10() # no color scale defined, default is scale_color_hue() ``` ] .center[ data:image/s3,"s3://crabby-images/d6655/d6655f536f7e3ecc99190511ad9c54f67283a162" alt=""<!-- --> ] --- ## Discrete, qualitative scales are best set manually .tiny-font[ ```r ggplot(popgrowth, aes(x = pop2000, y = popgrowth, color = region)) + geom_point() + scale_x_log10() + scale_color_hue() ``` ] .center[ data:image/s3,"s3://crabby-images/24a50/24a5020780d4d098723fa017e16e9156ee7a54f9" alt=""<!-- --> ] --- ## Discrete, qualitative scales are best set manually .tiny-font[ ```r library(ggthemes) # for scale_color_colorblind() ggplot(popgrowth, aes(x = pop2000, y = popgrowth, color = region)) + geom_point() + scale_x_log10() + scale_color_colorblind() # uses Okabe-Ito colors ``` ] .center[ data:image/s3,"s3://crabby-images/73b54/73b54431047dfdd93914b65d294f196fc504c8dd" alt=""<!-- --> ] --- ## Discrete, qualitative scales are best set manually .tiny-font[ ```r ggplot(popgrowth, aes(x = pop2000, y = popgrowth, color = region)) + geom_point() + scale_x_log10() + scale_color_manual( values = c(West = "#E69F00", South = "#56B4E9", Midwest = "#009E73", Northeast = "#F0E442") ) ``` ] .center[ data:image/s3,"s3://crabby-images/4fd3e/4fd3e9be9f9e011925bddad17da83d9c59b064b1" alt=""<!-- --> ] --- ## Okabe-Ito RGB codes .center[ <img src = "https://clauswilke.com/dataviz/pitfalls_of_color_use_files/figure-html/palette-Okabe-Ito-1.png", width = 100%></img> ] .tiny-font[ Name | Hex code | R, G, B (0-255) :---------- | :------- | :-------- orange | #E69F00 | 230, 159, 0 sky blue | #56B4E9 | 86, 180, 233 bluish green | #009E73 | 0, 158, 115 yellow | #F0E442 | 240, 228, 66 blue | #0072B2 | 0, 114, 178 vermilion | #D55E00 | 213, 94, 0 reddish purple | #CC79A7 | 204, 121, 167 black | #000000 | 0, 0, 0 ] ??? Figure from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) [//]: # "segment ends here" --- ## Further reading - Fundamentals of Data Visualization: [Chapter 4: Color scales](https://clauswilke.com/dataviz/color-basics.html) - Fundamentals of Data Visualization: [Figure 19.10: Okabe-Ito color palette](https://clauswilke.com/dataviz/color-pitfalls.html#fig:palette-Okabe-Ito) - **ggplot2** book: [Colour scales and legends](https://ggplot2-book.org/scale-colour.html) - **ggplot2** reference documentation: [Scales](https://ggplot2.tidyverse.org/reference/index.html#section-scales) - **colorspace** package: [HCL-Based Color Scales for ggplot2](https://colorspace.r-forge.r-project.org/articles/ggplot2_color_scales.html)