Think Summer: Project 3 — 2023
Submission
Students need to submit the following file by 10:00PM EST through Gradescope inside Brightspace.
-
A Jupyter notebook (a
.ipynb
file).
We’ve provided you with a template notebook for you to use. Please carefully read this section to get started.
When you are finished with the project, please make sure to run every cell in the notebook prior to submitting. To do this click |
Questions
Question 1
How many flights departed from O’Hare (ORD
) and landed in Denver (DEN
)?
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)
Question 2
What is the Distance (in miles) of any individual flight from O’Hare to Denver?
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)
Question 3
Consider only the flights that arrive to Honolulu (airport code HNL
), i.e., for which Honolulu is the destination. What are the 10 most popular origin airports?
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)
Question 4
Each airplane has a unique TailNum
. Which airplane flew the most flights in 2005? Hint: If you strictly look at the TailNum
values, you will need to ignore the top two results, because they are missing data.
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)
Question 5
Which airplane flew the largest number of times from O’Hare to Denver? Hint: Again, if you strictly look at the TailNum
values, you will need to ignore the top result, because it has missing data.
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)
Question 6
What were the 10 most popular days of the year to fly in 2005 (where "popularity" is evaluated by the number of flights on that day)? Hint: You might paste together the year, month, and day of the flights.
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)
Question 7
In which month are the average departure delays the worst? Hint: You might use a tapply function.
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)
Question 8
Make a dotchart
that illustrates the data from the previous question (about the average flight delays in each month).
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)
Question 9
The UniqueCarrier
is the airline carrier for the flight. If you add all of the miles flown (in the Distance
) for each airline carrier, which carrier flew the most miles altogether in 2005? Hint: You might use a tapply function.
-
R used to solve this problem. (2 pts)
-
Output from running R. (1 pt)