Your Privacy

This site uses cookies to enhance your browsing experience and deliver personalized content. By continuing to use this site, you consent to our use of cookies.
COOKIE POLICY

Skip to main content

Patterns in Natural Language Data | A Paw Patrol Analysis

Patterns in Natural Language Data | A Paw Patrol Analysis
Back to insights

After watching too much Paw Patrol with my two kids, I identified a rigid episode structure that, with some work, can answer some deeper questions about the team dynamics of a children’s television show. These questions about a popular children’s television show are, at heart, nonsensical, as it is a silly show. However, the exercise of finding structure in natural language and using that to extract a more profound meaning can be used to gather insight about your business as well. We will review the Paw Patrol formula through this blog post and determine how we can apply that over unstructured episode transcripts to identify richer team dynamics and demographics within the show. 

Background

We first must start by describing some Paw Patrol lore. The show is about a team of search and rescue dogs that work together to help solve problems in their town. These dogs, referred to as “pups” for the remainder of this article, have a specialty loosely tied to popular first responder professions (police, firefighter, Coast Guard, etc.). The pups have a pup pack with a collection of tools associated with their profession that they can use to solve specific scenarios. For example, Marshal, the firefighter pup, has a water cannon to put out a fire. The pups use these tools to solve missions around the town. 

Secondly, the show takes place in 3 acts: 

  • Act 1: Some shenanigans cause some problems in the town. 
  • Act 2: Ryder, the human leader of the paw patrol, assigns tasks to individual pups with instructions on which tool they should use to solve a problem. For example: “Marshal, use your water cannon to put out the fire.” 
  • Act 3: The pups use their tools and sometimes improvise to solve the mission. 

Purpose 

With a better understanding of the show, let’s introduce some questions that this format can help us answer: 

  • Who is the most and least useful pup? 
  • Who goes outside the plan and uses their tools without being assigned a task? 
  • Are there any pups assigned a task, but do not use their tools during Act 3? 
  • Who is the best at sticking with the plan and regularly does their assigned task? 

These are questions nobody should ask, but I felt the need to answer them as someone who enjoys team dynamics and cares about what my children watch. With some additional metadata around the transcripts, we can also answer a few other important questions: 

What is the gender representation across the various seasons of the show? 

What is the species (dog, human, chicken, etc.) representation within the show? 

What are the most popular tools, and how frequently are they used? 

Solution

The first step to solving this problem was to find a data source with the following minimum criteria: 

  1. Must list the quote and character that said the line. 
  2. Must maintain the correct order of the quotes as they are spoken. 
  3. Must separate the data into episodes. 

To my surprise, a data source from the Paw Patrol fan wiki exists. Since this humble site does not have an API that I’m aware of, I resorted to web page scraping the data. Which provided me with the following data points: season, episode name, actor, and quote. From there, we can start to cleanse the unstructured data and parse it into a format that allows for richer analysis. This involves loading it into PySpark data frames, casting data types, and removing unnecessary characters. 

With a working data structure, the most manual step began. We needed a business user with expert knowledge of the pups, their tools, and the universe in general. Since, to my knowledge, this person doesn’t exist, I took on that challenge. This process required a manual review of all the pups’ prospective tools in their pup packs, identifying the main and supporting characters in the show, assigning the characters perceived gender and species, and cataloging all the character’s catchphrases. The catchphrases are critical since they signify quite a bit of the formula: 

  • Task assignment from Ryder 
  • Acceptance of the task from Ryder 
  • Tool usage 
  • Identifies and distinguishes the three acts of the episode 

Findings 

General

After aggregating all the data, we came up with the following: 

General Data

A specific call out is that this analysis is based on the early seasons of Paw Patrol. This is due to only having a valid dataset for the first seasons of Paw Patrol, and our findings would likely change with more data.  

Gender

Based on the perceived gender of the main characters, we can quickly identify the number of quotes for each gender: 

Gender Data

Tool Usage

From our manual identification of tools available to the pups in their pup packs, we can aggregate the sum to identify what is the most popular tool in the data we have: 

Tool Usage

One thing unexpected here is that there are some of the same tools available to multiple pups, for example, both Chase and Skye have goggles in their pup-pack. Also, this is only about half of the tools available to the pups, with another 39 less frequently used tools not shown in the diagram. 

Role Assignments

During the show’s second act, Ryder assigns each pup a task to perform during the third act. The following highlights how frequently Ryder assigns a task to each pup: 

Role Assignments

In defense of Everest and Tracker, they are “guest stars” in the show and only appear in a limited number of episodes. Below is a count of the number of lines for each character highlighting that. 

Character Lines 

Team Contribution

Lastly, we wanted to identify how each pup contributed to the overall team. This was the most important and challenging thing to identify. We categorized contribution into four categories:  

Team Contributions

Based on this we have found the following: 

Contributions

Note on Skye: Her pup pack contains tools to fly which is reported on the least based on anecdotal data. However, given the current data set, I cannot objectively corroborate that without a lot of manual effort. 

Data is everywhere and can allow us to make better decisions for our business. This nonsensical example is just one way a company can identify how a team is performing based on how well the team reports on their usage. Creative use of any existing data from a business can provide interesting insights, and I hope this silly example is, at minimum, a highlight of this point. 

 

Digging In

  • Data & Analytics

    Legacy Data Modernization: A Comprehensive Guide to Upgrading Your Data Platform

    Though they may have been more than functional in the past, legacy data platforms can become a burden to your organization and prevent it from realizing its full potential. That’s why legacy data modernization can effectively transform your organization’s obsolete data systems into modern platforms that are scalable, efficient, and better equipped to handle today’s […]

  • Data & Analytics

    Masking Data 101: Safeguarding PII in Your Organization

    In today’s digital age, data security and privacy are paramount. As organizations increasingly collect, store, and process personal data, protecting Personally Identifiable Information (PII) has never been more critical. One essential practice that organizations can implement at the database level to secure this sensitive information is to obfuscate it through the usage of data masking […]

  • Data & Analytics

    Unlocking the Full Potential of a Customer 360: A Comprehensive Guide

    In today’s fast-paced digital economy, understanding your customer has never been more critical. The concept of a customer 360 view has emerged as a revolutionary approach to gaining a comprehensive understanding of consumers by integrating data from different touchpoints to offer a holistic view. A customer 360 view is about taking an overarching approach to […]

  • Data & Analytics

    Microsoft Fabric: A New Unified Data Platform

    MicroPopular data services and tools often specialize in specific aspects of the data analytics pipeline, serving teams in the data lifecycle. For instance, Snowflake addresses large-scale data warehousing challenges, while Databricks focuses on data engineering and science. Power BI and Tableau have become standard tools for business intelligence tasks. So, where does Microsoft Fabric create […]

  • Data & Analytics

    Improve Member Experience: Maximize Engagement & Value for Associations

    As you know, member engagement is key to providing value and retaining members over time. However, you must also recognize that member needs and preferences are evolving rapidly, especially as they desire more seamless digital experiences. Additionally, member expectations for personalized, omnichannel interactions have risen in recent years, and this means that associations must strategically […]

  • Data & Analytics

    A Guide to Data Strategy Success in Your Association

    While countless organizations aim to harness the potential of data, few possess a clear strategy to transform raw information into actionable insights that fuel their operations and marketing efforts. Don’t fall into the trap of investing in limited, tactical solutions.