There are five types of callouts: note, tip, warning, caution, and important.
Tip With Caption
This is an example of a callout with a caption.
Meet Quarto
Quarto enables you to weave together content and executable code into a finished document. To learn more about Quarto see
Meet the penguins
The penguins data from the palmerpenguins package contains size measurements for 344 penguins from three species observed on three islands in the Palmer Archipelago, Antarctica.
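The counts quoted above can be checked directly against the data. The following is a minimal sketch, assuming the palmerpenguins package is installed; the variable names `n_penguins`, `n_species`, and `n_islands` are illustrative and not part of the original document.

```r
# Assumes the palmerpenguins package is installed.
penguins <- palmerpenguins::penguins

n_penguins <- nrow(penguins)                     # number of penguins measured
n_species  <- length(unique(penguins$species))   # distinct species
n_islands  <- length(unique(penguins$island))    # distinct islands

# Collect the three counts in one named vector for display
c(penguins = n_penguins, species = n_species, islands = n_islands)
```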
The plot below shows the relationship between flipper and bill lengths of these penguins.
library(ggplot2)
library(palmerpenguins)  # provides the penguins data

# Scatter plot of bill length against flipper length, by species
ggplot(penguins, aes(x = flipper_length_mm, y = bill_length_mm)) +
  geom_point(aes(color = species, shape = species)) +
  scale_color_manual(values = c("darkorange", "purple", "cyan4")) +
  labs(
    title = "Flipper and bill length",
    subtitle = "Dimensions for penguins at Palmer Station LTER",
    x = "Flipper length (mm)", y = "Bill length (mm)",
    color = "Penguin species", shape = "Penguin species"
  ) +
  theme_minimal()
A basic scatter plot of flipper length versus bill length
Part II: Computations
This is the code from the computations
This dataset contains a subset of the fuel economy data from the EPA. Specifically, we use the mpg dataset from the ggplot2 package.
The visualization below shows a positive, strong, and linear relationship between the city and highway mileage of these cars. Additionally, mileage is higher for cars with fewer cylinders.
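library(ggplot2)  # provides ggplot() and the mpg dataset

# City versus highway mileage, colored by number of cylinders
ggplot(mpg, aes(x = hwy, y = cty, color = cyl)) +
  geom_point(alpha = 0.5, size = 2) +
  scale_color_viridis_c() +
  theme_minimal()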
There are 234 observations in our data.
The average city mileage of the cars in our data is 16.86 and the average highway mileage is 23.44.
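The numbers in the two sentences above can be computed from the data rather than typed by hand. The following is a minimal sketch, assuming ggplot2 is installed; the names `n_obs`, `mean_cty`, and `mean_hwy` are illustrative. In a Quarto source file, values like these would typically be embedded in the prose with inline R expressions.

```r
library(ggplot2)  # provides the mpg dataset

n_obs    <- nrow(mpg)                 # number of observations
mean_cty <- round(mean(mpg$cty), 2)   # average city mileage
mean_hwy <- round(mean(mpg$hwy), 2)   # average highway mileage

c(observations = n_obs, city = mean_cty, highway = mean_hwy)
```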
The plots in Figure 1 show the relationship between city and highway mileage for 38 popular models of cars. In Figure 1 (a) the points are colored by the number of cylinders, while in Figure 1 (b) they are colored by engine displacement.
# (a) Color by number of cylinders
ggplot(mpg, aes(x = hwy, y = cty, color = cyl)) +
  geom_point(alpha = 0.5, size = 2) +
  scale_color_viridis_c() +
  theme_minimal()

# (b) Color by engine displacement
ggplot(mpg, aes(x = hwy, y = cty, color = displ)) +
  geom_point(alpha = 0.5, size = 2) +
  scale_color_viridis_c(option = "E") +
  theme_minimal()
(a) Color by number of cylinders
(b) Color by engine displacement, in liters
Figure 1: City and highway mileage for 38 popular models of cars.
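The caption's count of 38 models, and the choice of a continuous color scale in both panels, can be checked against the data. This is a minimal sketch, assuming ggplot2 is installed; `n_models` and `color_vars_numeric` are illustrative names.

```r
library(ggplot2)  # provides the mpg dataset

# Number of distinct car models in the data
n_models <- length(unique(mpg$model))

# Both color variables are numeric, which is why the continuous
# scale scale_color_viridis_c() applies in panels (a) and (b)
color_vars_numeric <- c(cyl = is.numeric(mpg$cyl), displ = is.numeric(mpg$displ))

n_models
color_vars_numeric
```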
Part III: Authoring
In this analysis, we build a model predicting sale prices of houses using data on houses sold in the Duke Forest neighborhood of Durham, NC around November 2020. Let’s start by loading the packages we’ll use for the analysis.
We present the results of exploratory data analysis in Section 3.2 and the regression model in Section 3.3.
We’re going to do this analysis using literate programming [@knuth1984].
Exploratory data analysis
The data contains 98 houses. As part of the exploratory analysis let’s visualize and summarize the relationship between areas and prices of these houses.
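One way to summarize and visualize the relationship between area and price is sketched below. It assumes the data are the `duke_forest` dataset from the openintro package, which the text does not state explicitly, and that its `area` and `price` columns hold the values discussed here. The names `n_houses`, `price_area_cor`, and `p_scatter` are illustrative.

```r
library(ggplot2)

# Assumption: the housing data are openintro's duke_forest dataset
duke_forest <- openintro::duke_forest

n_houses <- nrow(duke_forest)  # number of houses in the data

# Correlation between area and sale price, ignoring any missing values
price_area_cor <- cor(duke_forest$area, duke_forest$price, use = "complete.obs")

# Scatter plot of sale price against area
p_scatter <- ggplot(duke_forest, aes(x = area, y = price)) +
  geom_point(alpha = 0.7) +
  labs(x = "Area", y = "Sale price") +
  theme_minimal()

p_scatter
```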
Data visualization
Figure 2 shows two histograms displaying the distributions of price and area individually.
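A pair of histograms like the ones described could be drawn as follows. This is a sketch, not the figure's actual source: it assumes the data are the `duke_forest` dataset from the openintro package, and the bin count of 20 and the object names `p_price` and `p_area` are arbitrary choices made for illustration.

```r
library(ggplot2)

# Assumption: the housing data are openintro's duke_forest dataset
duke_forest <- openintro::duke_forest

# Distribution of sale prices
p_price <- ggplot(duke_forest, aes(x = price)) +
  geom_histogram(bins = 20) +
  labs(x = "Sale price", y = "Count") +
  theme_minimal()

# Distribution of house areas
p_area <- ggplot(duke_forest, aes(x = area)) +
  geom_histogram(bins = 20) +
  labs(x = "Area", y = "Count") +
  theme_minimal()

p_price
p_area
```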
