Homework 2

Enter your name and EID here

This homework is due on Feb. 5, 2019 at 4:00pm. Please submit as a PDF file on Canvas.

This homework uses the Cars93 data set. Each observation in the data frame contains information on passenger cars from 1993. This is a big data frame with 27 columns. We are interested in the information on manufacturer (Manufacturer), car model (Model), type of car (Type), car company origin (Origin), midrange price in $1000 (Price), city MPG (miles per US gallon, MPG.city), and fuel tank capacity in gallons (Fuel.tank.capacity).

Cars93 <- read.csv("http://wilkelab.org/classes/SDS348/data_sets/Cars93.csv")
head(Cars93)
##   Manufacturer   Model    Type Min.Price Price Max.Price MPG.city
## 1        Acura Integra   Small      12.9  15.9      18.8       25
## 2        Acura  Legend Midsize      29.2  33.9      38.7       18
## 3         Audi      90 Compact      25.9  29.1      32.3       20
## 4         Audi     100 Midsize      30.8  37.7      44.6       19
## 5          BMW    535i Midsize      23.7  30.0      36.2       22
## 6        Buick Century Midsize      14.2  15.7      17.3       22
##   MPG.highway            AirBags DriveTrain Cylinders EngineSize
## 1          31               None      Front         4        1.8
## 2          25 Driver & Passenger      Front         6        3.2
## 3          26        Driver only      Front         6        2.8
## 4          26 Driver & Passenger      Front         6        2.8
## 5          30        Driver only       Rear         4        3.5
## 6          31        Driver only      Front         4        2.2
##   Horsepower  RPM Rev.per.mile Man.trans.avail Fuel.tank.capacity
## 1        140 6300         2890             Yes               13.2
## 2        200 5500         2335             Yes               18.0
## 3        172 5500         2280             Yes               16.9
## 4        172 5500         2535             Yes               21.1
## 5        208 5700         2545             Yes               21.1
## 6        110 5200         2565              No               16.4
##   Passengers Length Wheelbase Width Turn.circle Rear.seat.room
## 1          5    177       102    68          37           26.5
## 2          5    195       115    71          38           30.0
## 3          5    180       102    67          37           28.0
## 4          6    193       106    70          37           31.0
## 5          4    186       109    69          39           27.0
## 6          6    189       105    69          41           28.0
##   Luggage.room Weight  Origin          Make
## 1           11   2705 non-USA Acura Integra
## 2           15   3560 non-USA  Acura Legend
## 3           14   3375 non-USA       Audi 90
## 4           17   3405 non-USA      Audi 100
## 5           13   3640 non-USA      BMW 535i
## 6           16   2880     USA Buick Century

Problem 1: (2 pts) Use ggplot2 to create a scatter plot of the city MPG versus the car prices. In the same plot, fit a curve to these data using geom_smooth(). In one sentence, what broad trend do you observe in city MPG for different car prices? HINT: Plot city MPG on the y-axis and price on the x-axis.

# R code goes here

Your answer goes here. 1 sentence only.

Problem 2: (4 pts) Next, create a bar plot that shows the origin of cars, stacked on top of each other, for each car type. Make two observations about the data from this plot. State each in 1 sentence.

# R code goes here

Your answer goes here. 2 sentences only.

Problem 3: (2 pts) Plot the distribution of fuel tank capacity, once using geom_histogram() and once using geom_density().

# R code goes here

Problem 4: (2 pts) What does the y-axis in your histogram represent? In your density plot, what is the total area under the curve? For the total area, please give a single number as your answer. HINT: You do not need to do any additional calculations to determine the area under the curve. Use Google to find the answer.

Your answer goes here. 2-3 sentences only.