Report_NYC_Taxi_Operations_Starter23
Report_NYC_Taxi_Operations_Starter23
Operations
Include your visualisations, analysis, results, insights, and outcomes. Explain your methodology
and approach to the tasks. Add your conclusions to the sections.
1. Data Preparation
1.1. Loading the dataset
2. Data Cleaning
2.1. Fixing Columns
2.1.1. Fix the index
3.1.2. Analyse the distribution of taxi pickups by hours, days of the week,
and months
3.1.3. Filter out the zero/negative values in fares, distance and tips
3.1.5. Find the proportion of each quarter’s revenue in the yearly revenue
3.1.6. Analyse and visualise the relationship between distance and fare
amount
3.1.12. Add the number of trips for each zone to the zones dataframe
3.2.2. Calculate the hourly number of trips and identify the busy hours
3.2.3. Scale up the number of trips from above to find the actual number of
trips
3.2.5. Identify the top 10 zones with high hourly pickups and drops
3.2.8. Find the revenue share for nighttime and daytime hours
3.2.9. For the different passenger counts, find the average fare per mile
per passenger
3.2.10. Find the average fare per mile by hours of the day and by days of
the week
3.2.11. Analyse the average fare per mile for the different vendors
3.2.16. Analyse the pickup/dropoff zones or times when extra charges are
applied more frequently.
4. Conclusions
4.1. Final Insights and Recommendations
4.1.1. Recommendations to optimize routing and dispatching based on
demand patterns and operational inefficiencies.