March 28th, 2025

This week, I have been looking at the tags column in the American dataset. This column seems to contain additional data about each demonstration in the dataset. This data includes whether or not the protesters were armed, which is interesting because I found that some demonstrations that were labelled as peaceful protests still had armed protesters which was interesting.

The tags column also contained whether or not the protest was a counter demonstration or not and the size of the crowd. I used the str_detect() function in R’s stringr library to extract the information and put them into extra columns.

The crowd size will need additional formatting because the data was represented in a very inconsistent format

Leave a Reply

Your email address will not be published. Required fields are marked *