• Oct 18, 2021
  • 10 min read

Google Data Analytics Capstone Project

Updated: Jul 5, 2023

I worked on the Google Data Analytics Capstone Project, Track 1, Case Study 1. I will be diving into the background, my full process of cleaning, analyzing and visualizing the data, along with my final suggestions and summary of the data.

Quick Links :

Tableau Dashboard | Github R Code for Analysis | Github R Code for Tableau Visualization | LinkedIn Post

Below is a table of contents in case you want to go to a specific section.

Table of Contents:

Microsoft excel.

Finished Project

Summary of Data

Business Suggestions

What I Learned

Cyclistic is a bike sharing program which features more than 5,800 bikes and 600 docking stations. It offers reclining bikes, hand tricycles, and cargo bikes, making it more inclusive to people with disabilities and riders who can't use a standard two-wheeled bike. It was founded in 2016 and has grown tremendously into a fleet of bicycles that are geotracked and locked into a network of 692 stations across Chicago. The bikes can be unlocked from one station and returned to any other station in the system anytime.

Previously, Cyclistic's marketing strategy tried to build the general awareness and appeal to broad consumers. It has flexible pricing plans: single-ride passes, full-day passes, and annual memberships. Those who purchase single-ride or full-day passes are referred to as casual riders while those who purchase annual memberships are Cyclistic members .

My Role : In this scenario I am a junior data analyst at Cyclistic and my team has been tasked with the overall goal (see below) of designing marketing strategies

Overall Goal : Design marketing strategies aimed at converting casual riders into annual members.

Business Question : "How do annual members and casual riders use Cyclistic bikes differently?"

Below I will describe step-by-step the process I used to for this project. If you want to skip ahead to the business suggestions move onto the section "Insights".

Overview : I first analyzed the data separately (each month) in Excel, then used R to analyze the data as a whole (one year). Finally I created a dashboard in Tableau and used Figma to support the design elements.

I initially wanted to gather and analyze my data in Excel because it was the tool I was most familiar with and I could get a general understanding of the data quicker. I did not combine all of the spreadsheets into one because that would've taken more processing power than my computer had.

I began downloading the data from divvy-tripdata , and turning the .csv files into excel spreadsheets. I downloaded the most recent year of data which was at the time of starting my project:

August 2020

September 2020

October 2020

November 2020

December 2020

January 2021

February 2021

Added two columns to all of the months:

ride_length calculated the total ride length for each trip using the start_at column which was: ending time minus starting time.

day_of_week calculated the day of the week for each trip using the start_at column date.

Went over the business task and the information I had at hand and how that could be used to figure out how members and casual riders use the bike service differently

Came up with metrics to look at such as :

total number of rides per hour, per day of the month, per season, per day of the week, and for different bike types

Average ride length between members and casual

For every month in Excel created pivot tables and charts to go with the analysis on (this took the longest):

Total Rides per Weekday - calculated the total rides for members and casual and separated it by day of the week; used a cluster column chart

Average Ride Length - calculated the average ride length for members and casual and separated it by day of the week; used a cluster column chart

Total Rides per Hour - calculated the total rides for members and casual separated by the time of the day (24hr); used a line comparison chart

Total Rides per Day - calculated the total rides for members and casual separated by the day of the month; used a line comparison chart

Total Rides per Bike Type - calculated the total rides for members and casual separated by Bike type; used stacked column chart

I also created a Google docs Notes list where I wrote down the exact steps for each month (had a checklist) and included my insights for each month

Time Spent:

535 minutes or just under 9 hours to complete.

I originally wanted to use SQL but the files were too big to upload and I couldn't figure out how to utilize Google Cloud Platform. Instead I used R to analyze the data because it could handle all of the information quicker than Excel, and I wanted to work on my R skills. Below is my general process in R, I didn't include my mistakes/missteps or errors for the sake of brevity.

View my full code on my Github for this capstone project here .

Load all of the libraries I used: tidyverse, lubridate, hms, data.table

Uploaded all of the original data from the data source divytrip into R using read_csv function to upload all individual csv files and save them in separate data frames. For august 2020 data I saved it into aug08_df, september 2020 to sep09_df and so on.

Merged the 12 months of data together using rbind to create a one year view

Created a new data frame called cyclistic_date that would contain all of my new columns

Created new columns for:

Ride Length - did this by subtracting end_at time from start_at time

Day of the Week

Time - convert the time to HH:MM:SS format

Season - Spring, Summer, Winter or Fall

Time of Day - Night, Morning, Afternoon or Evening

Cleaned the data by:

Removing duplicate rows

Remove rows with NA values (blank rows)

Remove where ride_length is 0 or negative (ride_length should be a positive number)

Remove unnecessary columns: ride_id, start_station_id, end_station_id, start_lat, start_long, end_lat, end_lng

Calculated Total Rides for:

Total number of rides which was just the row count = 4,152,139

Member type - casual riders vs. annual members

Type of Bike - classic vs docked vs electric; separated by member type and total rides for each bike type

Hour - separated by member type and total rides for each hour in a day

Time of Day - separated by member type and total rides for each time of day (morning, afternoon, evening, night)

Day of the Week - separated by member type and total rides for each day of the week

Day of the Month - separated by member type and total rides for each day of the month

Month - separated by member type and total rides for each month

Season - separated by member type and total rides for each season (spring, summer, fall, winter)

Calculated Average Ride Length for:

Total average ride length

Type of Bike - separated by member type and average ride length for each bike type

Hour - separated by member type and average ride length for each hour in a day

Time of Day - separated by member type and average ride length for each time of day (morning, afternoon, evening, night)

Day of the Week - separated by member type and average ride length for each day of the week

Day of the Month - separated by member type and average ride length for each day of the month

Month - separated by member type and average ride length for each month

Season - separated by member type and average ride lengths for each season (spring, summer, fall, winter)

Then using all of this data I created my own summary in my case notes and took note of the: total rides for each variable, average ride lengths for each variable, and the difference between members versus casual riders. I originally wanted to create a report using R Markdown as well but for the sake of time (I had already spent over 20 hours on the project so far), I decided to skip this step, and write this article instead.

1045 minutes or about 17 and a half hours to complete.

While I learned the basics of Tableau in the Google Course I wanted more practice with visualizing data and creating dashboards.

To view my completed dashboard click here .

I created a separate R code (you can view it here on Github) that made some changes for specifically the Tableau portion.

For ride length I rounded the digits by 1, meaning my numbers were 29.8 or 12.5.

Revised how I created my "month" column. I used mutate() to create a column that had the month in ___ format and not number format. So instead of 01 it would say "January"

Cleaned the data: removed rows with NA values, removed duplicate rows, removed where ride_length was 0 or negative and removed unnecessary columns like: ride_id, start_station_id, end_station_id, start_lat, start_long, end_lat, end_lng

Created a new dataframe with this information so I could test the difference between the original data frame (cyclistic_date) that I used for my analysis and the data frame I would use for Tableau (cyclistic_tableau).

In this new data frame I removed more columns to make calculations quicker in Tableau. I removed: start_station_name, end_station_name, time, started_at, ended_at

Downloaded this data frame into a .csv file which I uploaded to Tableau

Created graphs similar to those I created in Excel but added a few:

Total Rides by Bike Type

Ride Length by Weekday

Total Rides by Weekday

Total rides by hour, total rides by month.

Then I created a basic dashboard with all of that information, a prototype for me to view while I was creating the final dashboard ( Figure 1 below).

Created a prototype mockup in Figma

Created a final version of the mockup in Figma

Edited Dashboard in Tableau to reflect design in Figma

Edited graphs in Tableau

Made bar graphs round

Added annotations

Highlights to specific important notes

Got rid of labels for visual purposes

Combined Figma and Tableau (used dashboard created in Figma as the background for my Tableau Dashboard) to create a final prototype ( Figure 2 below)

Made minor edits to design elements and created final dashboard ( Figure 3 - Cyclistic Dashboard V1 )

On April 24, 2023 I decided to update my dashboard (See Finished Project , image Final Dashboard - Cyclistic Dashboard V2 ). All of the analysis is the same. The only changes have been to the dashboard. Which include:

Adding horizontal grid lines to a few of the charts

Updating the tool tips.

Making all of the top metric values (e.g. Total Rides, Average Ride Length, etc.) interactive in Tableau instead of in Figma.

765 minutes or almost 13 hours to complete.

Tableau Prototype

Below was my first draft of the dashboard only using Tableau.

Prototype of my dashboard for my google capstone project

Prototype using Figma Background

Combined Figma and Tableau (used dashboard created in Figma as the background for my Tableau Dashboard) to create a final prototype.

Dashboard Prototype with Figma background

Final Dashboard V1

Made minor edits to design elements and created final dashboard. This was the original final dashboard.

capstone project walkthrough

I am including the other tools I used.

Figma to create my background and help develop the dashboard aesthetics.

Google Docs helped me keep track of all of my documents for this project like:

Date Log - I wrote down what I did that day related to my project

Resources - A list of resources I frequently used

Case Notes - Notes for the case study including the final insights, what I was looking for, and anything else having to do with the case

Evernote to draft this article before I uploaded it here.

FINISHED PROJECT

Here is my finished project: Google Capstone Project (V2) . You can view the links to my R code on Github used for analysis here and the code for Tableau here .

Note: This is V2 with a few minor changes to the dashboard. Including:

Final dashboard for capstone project

SUMMARY OF DATA

Those who purchase single-ride or full-day passes are referred to as casual riders while those who purchase annual memberships are Cyclistic members .

Total Rides by User Type

Average Ride Length per User Type

Average Ride per Weekday

Members had more rides with 2,328,763 total rides or 56% and casual riders had 1,823,376 total rides or 43%.

Total Rides by Rider Type Pie chart

Total Rides per Bike Type

Both casual riders and members used the classic bike the most with 1,777,593 rides or 43% of total rides, followed by docked bikes with 1,545,936 rides or 37% of total rides, and lastly with electric bikes at 828,610 rides or 20% of total rides.

Total Rides per Bike Type - bar chart

Average Ride Length by User Type

The total average ride length was 24 minutes. For casual riders it was longer at 27 minutes while members was 14 minutes.

Average ride length by rider type

Average Ride Length per Weekday

For the average ride length per weekday both casual riders and members had an increase in the average ride length on the weekends. For both Sunday was the longest at 31 minutes.

average ride length per weekday - bar chart

Saturday was the most popular weekday combining casual riders and member rides with 784,239 rides or 19% of total rides. But for member rides only Wednesday was the most popular day with 356,060 rides, 5,407 rides more than Saturday.

Total rides by weekday - bar chart

5PM or 17:00 was the busiest hour for both members and casual riders with 426,685 rides or 10% of the total rides. Typically rides began increasing in the morning at 6AM and rose until 5PM then dropped afterwards. The afternoon was the busiest for both rider types with 1,905,797 rides or 45% of total rides. 4AM was the least popular hour.

Total rides by hour

July was the busiest month combining casual riders and member rides at 691,476 rides or 16% of total rides. While summer was the most popular season for both at 1,903,446 rides or 46% of total rides. Looking at just members August is actually the busiest month with 323,140 rides, 816 rides more than July. Winter is the least popular season and February is the least popular month.

Total bike rides per month - bar chart

Final Summary

The most popular bike among with riders was the classic.

Busiest time was afternoon and the peak time was at 5PM for both casual riders and members.

Busiest weekday was Saturday, casual riders used the service the most on the weekends.

Busiest season was Summer for both types of riders.

Most rides by User Type was members but casual riders weren't far behind.

The average ride length was 24 minutes but casual riders on average rode 23 minutes longer than members.

BUSINESS SUGGESTIONS

This was the hardest part for me for the whole project. I have never provided suggestions for a business nor worked in marketing. Any feedback here would be appreciated.

These are my suggestions for the marketing team to convert casual riders to annual members:

Personalize discounts and show perks in the membership program based on their preferences and riding habits.

Emphasize the benefits of memberships, including discounts during busy times of the year like during Summer, or on the weekends.

Have existing members to share their stories about how using Cyclistic's system has changed their life, to create a sense of community, offer a discount if they do so this will help encourage new riders to join the program.

WHAT I LEARNED

Below is what I learned/practiced from over 40 hours spent on this project:

Pivot Tables in Microsoft Excel

Practice using R for data analysis and cleaning specifically using the tidyverse package for data analysis

Graphs in Tableau, edited visual elements along with creating different charts and filters.

Design elements of an effective dashboard

Combining the design feature of Figma with the functionality of Tableau

R portion of my project I found Itamar's case study on Kaggle using R as well, a helpful resource.

Tableau portion I used Navneet Singh's Tableau Dashboard as inspiration.

  • Data Analytics
  • Portfolio Projects

Recent Posts

The Power of Collaborative Projects in Data Analytics

Deep Work Dashboard 2023

Behind the Scenes: Creating a SQL Course

Wow Kelly. This really is impressive. Glad to see you are really into this process. You have definitely found your niche!!

View SAM Project Answer Keys

View the correct answers for SAM projects.

SAM

  • Click SAM assignments .

preview

  • Click the file under Solution File to download it to your computer.
  • Open the solution file.

Brandon A. Griffin

Mastering the capsim simulation: a comprehensive guide..

TLDR: A link to tons of guides, videos, and templates are here  https://psychologyofbusiness.beehiiv.com/p/capsim-materials

When I was a student I had to do  CapSim . The simulation was frustrating to learn and I didn’t quite understand it. It was basically a more boring version of Roller-Coaster Tycoon. Fast-forward to the end of that semester, and I found that the simulation was actually fun. I could make business decisions and battle against the other students in my course. Following the success I had as a student, I have gone on the be a TA and instructor teaching this simulation.

Despite the resources available to students in learning the simulation, many often find it frustrating to learn the “rules of the game”. The best way to look at this simulation is like a game. Just when you learn a card game or board game, it takes time and practice to truly understand the ins and outs of it.

The CapSim Business Simulation is a “game” that incorporates all aspects of business under one roof. As the owner of the company, you are in charge of research and development (R&D), marketing and sales, production, human resources (HR), total quality management (TQM), finance, and more. This simulation is often employed as a capstone project in Strategic Management courses near completion of a business degree. The game is effective in allowing students to make strategic decisions while incorporating the aspects from the other various departments.

I have put together a comprehensive guide for LEARNING the CapSim Simulation. I will not do it for you. This guide is not a cheat sheet (in the cheating sense). It will help you learn and give you the tools needed to thrive in your class. The resources included in the guide are:

  • Capsim Round 1 Walkthrough in Excel
  • Capsim 4 Round Walkthrough in Excel
  • Capsim Formula Sheet in Excel
  • Capsim Marketing and Forecasting Sheet in Excel
  • Capsim Marketing Resources
  • Capsim Production Resources
  • Capsim R&D Resources
  • Capsim Ideal Spots
  • Capsim sample process to complete a round
  • Capsim situation Analysis Walkthrough
  • Capsim TQM Tips

Access the CapSim Simulation Materials here:  https://psychologyofbusiness.beehiiv.com/p/capsim-materials

If you found this helpful, consider subscribing to my YouTube.

A WordPress.com Website .

' src=

  • Copy shortlink
  • Report this content
  • Manage subscriptions

Navigation Menu

Search code, repositories, users, issues, pull requests..., provide feedback.

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly.

To see all available qualifiers, see our documentation .

  • Notifications

A repository for my Google Data Anlaytics Certificate Capstone Project.

akorez/Google-Data-Analytics-CapStone-Project

Folders and files, repository files navigation, google-data-analytics-capstone-project.

bcyclist

Case Study: How Does a Bike-Share Navigate Speedy Success?

The purpose of this repository is to consolidate downloaded Divvy data into a single dataframe and then conduct simple analysis to help answer the key question: what ways do members and casual riders use Divvy bikes differently.

Introduction

This exploratory analysis case study is towards Capstome project requirement for Google Data Analytics Professional Certificate . The case study involves a bikeshare company's data of its customer's trip details over a 12 month period (November 2020 - October 2021). The data has been made available by Motivate International Inc. under this license .

The analysis will follow the 6 phases of the Data Analysis process: Ask, Prepare, Process, Analyze, and Act. A brief explanation of these processes:

  • Ask effective questions
  • Define the scope of the analysis
  • Define what success looks like
  • Verify data integrity
  • Check data credibility and reliability
  • Check data types
  • Merge datasets
  • Clean, Remove and Transform data
  • Document cleaning processes and results
  • Identify patterns
  • Draw conclusions
  • Make predictions
  • Create effective visuals
  • Create a story for data
  • Share insights to stakeholders
  • Give recommendations based on insights
  • Solve problems
  • Create something new
  • HTML 100.0%

IMAGES

  1. Capstone Project PowerPoint Presentation Slides

    capstone project walkthrough

  2. Capstone Project Walkthrough

    capstone project walkthrough

  3. Capstone Project

    capstone project walkthrough

  4. 201810 Concepts: 1st Capstone project walkthrough

    capstone project walkthrough

  5. Effective Capstone Project Examples for You to Follow

    capstone project walkthrough

  6. 50+ Best Capstone Project Ideas for Students in 2022

    capstone project walkthrough

VIDEO

  1. TIP-QC Capstone Design 10: New Caloocan City Jail

  2. Capstone Project Part 3

  3. [Diablo 4] Fallen Temple Capstone Dungeon

  4. Capstone Walkthrough

  5. Capstone Project Part 2

  6. Hudsons Capstone Walkthrough

COMMENTS

  1. Hands-On Data Engineering: A Comprehensive Walkthrough of the ...

    The capstone project offers several key learning outcomes. You will gain proficiency in data engineering frameworks like ETL, and understand the differences between transactional and analytical ...

  2. AWS Solutions Architect Associate-Module 15

    Mod15 - AWS Solutions Architect - Capstone project - Simplified - Anand KProject overviewThis project provides you with an opportunity to demonstrate the sol...

  3. This capstone project is a "structured walkthrough ...

    This capstone project is a "structured walkthrough" penetration test of a fictional company, Artemis, Incorporated ("Artemis"). A structured walkthrough is an organized procedure for a group of peers to review and discuss the technical aspects of various IT, IT Security, and IT Audit work products. The major objectives of a structured walkthrough are to find errors and to improve the ...

  4. Full Tutorial of Modules 5-8: SAM Capstone Project 1a

    #cengage #solutions1. Benicio Cortez is a financial consultant with NewSight Consulting in Denver, Colorado..Go to the U.S. worksheet...2. Go to the All Loca...

  5. A friendly walk-through of a Data Science Capstone Project

    Figure 3: Feature conversion on 'Geography' & 'Gender' columns Exploratory Data Analysis. Next, we explore the data to gain insights into our dataset and what it contains.

  6. Google Data Analytics Capstone Project

    Google Data Analytics Capstone Project. Updated: Jul 5, 2023. I worked on the Google Data Analytics Capstone Project, Track 1, Case Study 1. I will be diving into the background, my full process of cleaning, analyzing and visualizing the data, along with my final suggestions and summary of the data. Quick Links:

  7. Google Data Analytics Capstone: Complete a Case Study

    Module 1 • 2 hours to complete. A capstone is a crowning achievement. In this part of the course, you'll be introduced to capstone projects, case studies, and portfolios, and will learn how they help employers better understand your skills and capabilities. You'll also have an opportunity to explore the online portfolios of real data ...

  8. Google Data Analytics Capstone: Cyclistic Case study

    This case study represents course 8 "Capstone project" of the Google Data Analytics Professional Certificate on Coursera. The Scenario: Cyclistic is a fictional bike-share company in Chicago ...

  9. Final capstone project solution walkthrough

    Final capstone project solution walkthrough - [Instructor] Welcome to the Capstone Solution video. Throughout this video, I'll guide you through various steps of building a linear regression model.

  10. Google Data Analytics Course Capstone Project: Case Study 1 ...

    Below is my walkthrough of Case Study 1 within the course using various tools and methods. Scenario: You are a data analyst working at Cyclistic, a bike-share company in Chicago. Cyclistic allows ...

  11. PDF Capstone Handbook

    The Capstone Project is an academic study that offers an opportunity to explore a particular issue in much greater depth than is feasible in a class paper, and to argue your own perspective on that issue. LIS 4901 Capstone (4 credit hours) is an alternative to the Practicum. Either the Capstone OR the

  12. View SAM Project Answer Keys

    Click . Click SAM assignments. Locate a project assignment and click . Click the file under Solution File to download it to your computer. Open the solution file. View the correct answers for SAM projects.

  13. Google Data Analytics Certificate Course 8 of 8

    Interested in a career in Analytics? Take the first module of my Analytics Career Access program - Analyst Career Foundations - for FREE! Sign up today at ww...

  14. Google Capstone Project: How Can Bellabeat, A Wellness ...

    This is an optional capstone project from the Google Data Analytics Course no: Capstone Project which is posted on GitHub and Kaggle. The analysis follows the 6 steps of Data Analysis taught in ...

  15. How to Create a UX Capstone Project: Step-by-Step Guide

    3. Write a clear problem statement. Write a concise description of the problem that needs to be solved. Explain the background of the problem, whom it affects, and how it impacts the organization. Include the 4Ws using the information you gathered in the 'Empathize' phase (the first phase of the design thinking process).

  16. AP0824/Penetration-Test-and-Reconnaissance-Capstone-Project

    This capstone project is a "structured walkthrough" penetration test of a fictional company, Artemis, Incorporated ("Artemis"). In this "structured walkthrough", a group of peers reviews and discusses the technical aspects of various IT, IT Security, and IT Audit work products to find errors and to improve the quality of the final product ...

  17. Mastering The CapSim Simulation: A Comprehensive Guide

    This simulation is often employed as a capstone project in Strategic Management courses near completion of a business degree. The game is effective in allowing students to make strategic decisions while incorporating the aspects from the other various departments. ... Capsim situation Analysis Walkthrough; Capsim TQM Tips; Access the CapSim ...

  18. 201810 Concepts: 1st Capstone project walkthrough

    Simnet capstone project walkthrough

  19. Google Data Analytics Capstone Project: Cyclistic Case Study

    In this case study, I am assuming the position of 'Jr. Data Analyst' at Cyclistic, a bike-share company based in Chicago. Cyclistic offers over 6000 bikes at 800+ docking stations spread ...

  20. Solved IntroductionThe capstone project is a "structured

    The capstone project is a "structured walkthrough" penetration test of a fictional. company, Artemis, Incorporated ("Artemis"). A structured walkthrough is an. organized procedure for a group of peers to review and discuss the technical. aspects of various IT, IT Security, and IT Audit work products.

  21. Capstone Project Walkthrough

    Dohn 22+ Capstone Project Walkthrough and Tutorial.

  22. Solved Capstone Project Guidelines Updated August 30,

    Capstone Project Guidelines. Updated August 30, 2021. Introduction. The capstone project is a "structured walkthrough" penetration test of a fictional company, Artemis, Incorporated ("Artemis"). A structured walkthrough is an organized procedure for a group of peers to review and discuss the technical aspects of various IT, IT Security ...

  23. akorez/Google-Data-Analytics-CapStone-Project

    A repository for my Google Data Anlaytics Certificate Capstone Project. Case Study: How Does a Bike-Share Navigate Speedy Success? The purpose of this repository is to consolidate downloaded Divvy data into a single dataframe and then conduct simple analysis to help answer the key question: what ways do members and casual riders use Divvy bikes ...