Skip to content

Latest commit

 

History

History
55 lines (33 loc) · 1.67 KB

TASKS.md

File metadata and controls

55 lines (33 loc) · 1.67 KB

CURRENT SPRINT AND TODOS:

END GOAL: PARSE GTFS DATA WITH CAPABILITIES WE NEED TO EXTRACT:

SMALL GOALS: FILTER DATA BASED ON CONDITIONS E.G ROUTE

# Filter routes with a certain route_type
filtered_routes = df[df['route_type'] == 3]

SORT, AGG, ADD/MODIFY COLS

# Sort routes by route_short_name in ascending order
sorted_routes = df.sort_values('route_short_name')


# Calculate the number of routes for each agency_id
route_counts = df['agency_id'].value_counts()


# Add a new column that combines route_short_name and route_long_name
df['route_name'] = df['route_short_name'] + ' - ' + df['route_long_name']

Common tasks you can perform on a DataFrame include:

  • Accessing and retrieving data from specific rows and columns
  • Filtering and subsetting data based on specific conditions
  • Aggregating and summarizing data
  • Applying mathematical and statistical operations to columns
  • Merging, joining, and reshaping data from multiple DataFrames
  • Handling missing data and data cleaning
  • Visualizing data through various plotting functions