class: center, middle, title-slide .title[ # Coordinate systems and axes ] .author[ ### Claus O. Wilke ] .date[ ### last updated: 2023-01-19 ] --- ## Most data visualizations use Cartesian coordinates .center[ data:image/s3,"s3://crabby-images/9c7d3/9c7d38523d17bcba4801454d10658679990c0561" alt=""<!-- --> ] ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- ## Changing units does not change the plot .pull-left[ data:image/s3,"s3://crabby-images/7d6ec/7d6ec394d103b15983d101c2afa948a887e6264d" alt=""<!-- --> ] ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) -- .pull-right[ data:image/s3,"s3://crabby-images/84d9a/84d9abac66c235f249b614a6481eb65c9a410c32" alt=""<!-- --> ] --- ## If scale units are unrelated, aspect ratio is arbitrary .center[ data:image/s3,"s3://crabby-images/cb8a4/cb8a43a9791e4a3208dad8d5f64e988a4728f6a7" alt=""<!-- --> ] ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- class: center middle ## Non-linear scales and coordinate systems --- ## Logarithmic scales (log scales) .small-text[ Visualize these five values: 1, 3.16, 10, 31.6, 100 ] -- .center.nogap[ data:image/s3,"s3://crabby-images/ec94b/ec94b363f482e281edacf5418762f8fca15f5f85" alt=""<!-- --> ] -- .center.nogap[ data:image/s3,"s3://crabby-images/ad9a2/ad9a2ca943585cc449e335fbc19719f47751a197" alt=""<!-- --> ] -- .center.nogap[ data:image/s3,"s3://crabby-images/c943b/c943bcf1476d59275d16b7abff188b835363a13d" alt=""<!-- --> ] --- ## Example: Population number of Texas counties A linear scale emphasizes large counties .center.nogap[ data:image/s3,"s3://crabby-images/17215/17215ae61729515d60088d82ea42351895c57b36" alt=""<!-- --> ] ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- ## Example: Population number of Texas counties A log scale shows symmetry around the median .center.nogap[ data:image/s3,"s3://crabby-images/ebff9/ebff919a312beddc8448aa63817f5b2d7a02e7db" alt=""<!-- --> ] ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) --- ## Nonlinear coordinate systems: Polar coordinates .pull-left[ data:image/s3,"s3://crabby-images/f7cdf/f7cdfa452604e5ba0b2944f3bad465b6889cc8e8" alt=""<!-- --> ] ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) -- .pull-right[ data:image/s3,"s3://crabby-images/bef09/bef097b8a832b58c490d32d7176e8b1821e84536" alt=""<!-- --> ] --- ## Cartesian vs polar example .pull-left[ data:image/s3,"s3://crabby-images/6efb1/6efb18d67bbd54cf5157e39ee2d5d8ec7e0a3c2b" alt=""<!-- --> ] ??? Figure redrawn from [Claus O. Wilke. Fundamentals of Data Visualization. O'Reilly, 2019.](https://clauswilke.com/dataviz) -- .pull-right[ data:image/s3,"s3://crabby-images/0bcfe/0bcfee1c602b47cac60326d6576ace0000697557" alt=""<!-- --> ] [//]: # "segment ends here" --- class: center middle ## Scales and coordinate systems in **ggplot2** --- ## Getting the data The `boxoffice` dataset: .tiny-font[ ```r boxoffice <- tibble( rank = 1:5, title = c("Star Wars", "Jumanji", "Pitch Perfect 3", "Greatest Showman", "Ferdinand"), amount = c(71.57, 36.17, 19.93, 8.81, 7.32) # million USD ) ``` ] -- The `tx_counties` dataset: .tiny-font[ ```r tx_counties <- read_csv("https://wilkelab.org/DSC385/datasets/US_census.csv") %>% filter(state == "Texas") %>% mutate(popratio = pop2010/median(pop2010)) %>% arrange(desc(popratio)) %>% mutate(index = 1:n()) ``` ] --- ## Getting the data The `temperatures` dataset: .tiny-font[ ```r temperatures <- read_csv("https://wilkelab.org/DSC385/datasets/tempnormals.csv") %>% mutate( location = factor( location, levels = c("Death Valley", "Houston", "San Diego", "Chicago") ) ) %>% select(location, station_id, day_of_year, month, temperature) ``` ] --- ## Scale functions customize the x and y axes Recall the box-office example from a prior lecture .pull-left.tiny-font[ ```r ggplot(boxoffice) + aes(amount, fct_reorder(title, amount)) + geom_col() ``` ] -- .pull-right[ data:image/s3,"s3://crabby-images/ad217/ad21730f1914b2ccb0125d95f9a399ee86d1d263" alt=""<!-- --> ] --- ## Scale functions customize the x and y axes Add scale functions (no change in figure so far) .pull-left.tiny-font[ ```r ggplot(boxoffice) + aes(amount, fct_reorder(title, amount)) + geom_col() + * scale_x_continuous() + * scale_y_discrete() ``` ] .pull-right[ data:image/s3,"s3://crabby-images/2bbb7/2bbb77853af1585ec4a0c922a6ae8e8917e258c2" alt=""<!-- --> ] --- ## Scale functions customize the x and y axes The parameter `name` sets the axis title .pull-left.tiny-font[ ```r ggplot(boxoffice) + aes(amount, fct_reorder(title, amount)) + geom_col() + scale_x_continuous( * name = "weekend gross (million USD)" ) + scale_y_discrete( * name = NULL # no axis title ) ``` ] .pull-right[ data:image/s3,"s3://crabby-images/b82bd/b82bd31d241d464d147f2d97f3f6365768a79abc" alt=""<!-- --> ] Note: We could do the same with `xlab()` and `ylab()` --- ## Scale functions customize the x and y axes The parameter `limits` sets the scale limits .pull-left.tiny-font[ ```r ggplot(boxoffice) + aes(amount, fct_reorder(title, amount)) + geom_col() + scale_x_continuous( name = "weekend gross (million USD)", * limits = c(0, 80) ) + scale_y_discrete( name = NULL ) ``` ] .pull-right[ data:image/s3,"s3://crabby-images/d95c0/d95c003b76895c8be0d6a31754568b6f045a13bb" alt=""<!-- --> ] Note: We could do the same with `xlim()` and `ylim()` --- ## Scale functions customize the x and y axes The parameter `breaks` sets the axis tick positions .pull-left.tiny-font[ ```r ggplot(boxoffice) + aes(amount, fct_reorder(title, amount)) + geom_col() + scale_x_continuous( name = "weekend gross (million USD)", limits = c(0, 80), * breaks = c(0, 25, 50, 75) ) + scale_y_discrete( name = NULL ) ``` ] .pull-right[ data:image/s3,"s3://crabby-images/15d50/15d5077d1bab49a08cdde038bb8710e52f239f75" alt=""<!-- --> ] --- ## Scale functions customize the x and y axes The parameter `labels` sets the axis tick labels .pull-left.tiny-font[ ```r ggplot(boxoffice) + aes(amount, fct_reorder(title, amount)) + geom_col() + scale_x_continuous( name = "weekend gross", limits = c(0, 80), breaks = c(0, 25, 50, 75), * labels = c("0", "$25M", "$50M", "$75M") ) + scale_y_discrete( name = NULL ) ``` ] .pull-right[ data:image/s3,"s3://crabby-images/56cae/56cae2b6e71cdf888245b5fded3cc155dcf33fc1" alt=""<!-- --> ] --- ## Scale functions customize the x and y axes The parameter `expand` sets the axis expansion .pull-left.tiny-font[ ```r ggplot(boxoffice) + aes(amount, fct_reorder(title, amount)) + geom_col() + scale_x_continuous( name = "weekend gross", limits = c(0, 80), breaks = c(0, 25, 50, 75), labels = c("0", "$25M", "$50M", "$75M"), * expand = expansion(mult = c(0, 0.06)) ) + scale_y_discrete( name = NULL ) ``` ] .pull-right[ data:image/s3,"s3://crabby-images/8e5fe/8e5fe75107a90b05fed6f820d16949d613e10f11" alt=""<!-- --> ] --- ## Scale functions define transformations .pull-left.nogap[ Linear y scale: .tiny-font[ ```r ggplot(tx_counties) + aes(x = index, y = popratio) + geom_point() + * scale_y_continuous() ``` data:image/s3,"s3://crabby-images/9d9dc/9d9dccb622b56cc3ce58fc9744b300d4042995fc" alt=""<!-- --> ]] -- .pull-right.nogap[ Log y scale: .tiny-font[ ```r ggplot(tx_counties) + aes(x = index, y = popratio) + geom_point() + * scale_y_log10() ``` data:image/s3,"s3://crabby-images/e70bc/e70bc8bb25ddc8d517409485a97a26b11e848b39" alt=""<!-- --> ]] --- ## Parameters work the same for all scale functions .pull-left.nogap.tiny-font[ ```r ggplot(tx_counties) + aes(x = index, y = popratio) + geom_point() + scale_y_continuous( name = "population number / median", breaks = c(0, 100, 200), labels = c("0", "100", "200") ) ``` data:image/s3,"s3://crabby-images/516c4/516c4ea219b9e70cdc52be98b71ab99a2fcc7e37" alt=""<!-- --> ] .pull-right.nogap.tiny-font[ ```r ggplot(tx_counties) + aes(x = index, y = popratio) + geom_point() + scale_y_log10( name = "population number / median", breaks = c(0.01, 1, 100), labels = c("0.01", "1", "100") ) ``` data:image/s3,"s3://crabby-images/e9c90/e9c90a6a4b6010cb5d3de9e8ff78d54f8cc0dde3" alt=""<!-- --> ] --- ## Coords define the coordinate system .nogap.tiny-font[ ```r ggplot(temperatures, aes(day_of_year, temperature, color = location)) + geom_line() + * coord_cartesian() # cartesian coords are the default ``` .center[ data:image/s3,"s3://crabby-images/527a3/527a3420a1ee1c9dfc96051d3425f33c745805ad" alt=""<!-- --> ]] --- ## Coords define the coordinate system .nogap.tiny-font[ ```r ggplot(temperatures, aes(day_of_year, temperature, color = location)) + geom_line() + * coord_polar() # polar coords ``` .center[ data:image/s3,"s3://crabby-images/a4c6a/a4c6abdadbea6e175138eb6b019f7fbd000e8cb8" alt=""<!-- --> ]] --- ## Coords define the coordinate system .nogap.tiny-font[ ```r ggplot(temperatures, aes(day_of_year, temperature, color = location)) + geom_line() + coord_polar() + * scale_y_continuous(limits = c(0, 105)) # fix up temperature limits ``` .center[ data:image/s3,"s3://crabby-images/abb21/abb2102dc1a40431b0c386d934be56a677a2f3e4" alt=""<!-- --> ]] [//]: # "segment ends here" --- ## Further reading - Fundamentals of Data Visualization: [Chapter 3: Coordinate systems and axes](https://clauswilke.com/dataviz/coordinate-systems-axes.html) - **ggplot2** reference documentation: [Scales](https://ggplot2.tidyverse.org/reference/index.html#section-scales) - **ggplot2** reference documentation: [Coordinate systems](https://ggplot2.tidyverse.org/reference/index.html#section-coordinate-systems) - **ggplot2** book: [Position scales](https://ggplot2-book.org/scale-position.html) - **ggplot2** book: [Coordinate systems](https://ggplot2-book.org/coord.html)