BLM Convos & Clusters– Do They Exist? Are They United?

Gephi is network analysis tool, which visualizes people and their interactions using weighted lines (edges) and circles (nodes). With the use of math algorithms and by formatting attributes, conversation groups (clusters) are visualized using distinct colors for each cluster. To begin learning how to use Gephi, I performed a network analysis on Les Miserables.  The width of edges connecting two nodes (characters) corresponds to the number of interactions between the characters. There are 77 nodes and 254 edges. When the Les Mis file was opened in Gephi, I chose to use an undirected analysis, because I was more concerned with the number of interactions and less concerned with the direction of communication between characters. Below is the initial network analysis created by Gephi, when the data (to be analyzed) is opened with Gephi.

les mis 2

Looking at the list of nodes, we see that Valjean (ID = 11) and Fantine (ID = 23) are interacting with some of the same characters, and are interacting with almost as many different characters. Below is the network analysis that is generated after running algorithms and increasing node repulsion strength.

Screen Shot 2017-03-30 at 11.28.09 AM
Les Mis Network Analysis: Repulsion Strength 1000. Repulsion strength correlates to the distance between each node.

The average path length, or average degrees of separation between characters, is 2.641, and the network diameter, or maximum degrees of separation between any two characters, is 5. From this data, we can guess that many of the characters are closely connected. However, we do not know if the characters are interacting in cluster. Also, with the current format, it is difficult to determine which character is interacting the most with other characters. The following network analysis correlates node diameter with the the total number of interactions a character has.

Screen Shot 2017-03-30 at 11.41.51 AM

Les Mis Network Analysis: Betweenness Centrality: min = 10 max = 200

Node size was set to correspond to Betweenness Centrality. This attribute results in the nodes with the most network connections to have the largest diameter, and the nodes with the fewest network connections to have the smallest diameter. As seen in the figure above, Valjean, Myriel, and Gavroche have the most interactions with other characters. However, this network analysis makes seeing clusters and each individual node difficult.

LesMis_NetworkAnalysis
Les Mis Network Analysis: Final Analysis

The final network analysis, shown above, reveals that while Gavroche, Marius, Fantine, and Myriel have the most interactions with other characters, they each belong to relatively distinct conversation clusters.  Valjean, the main character, is an exception and interacts with most clusters.

——————————————————————————————————————————————

Next, I applied the same methods to generate a network analysis, using  Gephi, for 3,000 #BlackLivesMatter tweets that were posted on February 26, 2017, the anniversary of Trayvon Martin’s death. When I performed a text analysis (see word cloud below) for the same set of tweets, I found that “trayvonmartin” was one of the most commonly used words. Therefore, I will look to see if each cluster was tweeting about Trayvon Martin on February 26, 2017.

BLM-Word-Cloud-45
#BlackLivesMatter Twitter Word Cloud– Feb. 22-28, 2017

For this network analysis, I chose not to differentiate between retweets and non-retweets and to include singletons. Consequently, I expect to see larger clusters. had I chosen to differentiate between tweet type, the clusters might also be separated by tweet type. For example, suppose only two users are posting new tweets, and 200 users are retweeting the pair of users’ tweets. The authors of the original tweets may appear in a separate cluster from those who are retweeting. Please note that this is a guess, not an observation.

 

Screen Shot 2017-03-29 at 10.54.42 AM

From the start, we can see (figure above) eight main clusters circled around the following users: texasinafrica, crystal1johnson, blklivesmatter, acluofga, khaledgeydoun, nmaahc, delo_taylor, and robertglasper. In the text analysis (see word cloud above), I filtered out “texasinafrica”, due to its frequent use. After a quick lookup in Twitter, I was able to determine that @texasinafrica is maintained by Laura Seay, a white woman from Texas, who lived and studied communities’ responses to “state weakness” in the Democratic Republic of Congo. Seay lived in the Democratic Republic of Congo while she performed her studies. Since Seay is a white person from Texas, I expect to see that users in her cluster also use #woke or other language related to white participation in the Black Lives Matter movement. At a first glance, I am unsure as to why @acluofga, “American Civil Liberties Union of GA”, appears to be the user through which @texasinafrica and @blklivesmatter interact. Perhaps @texasinafrica and @blklivesmatter interact directly, but this cannot be seen in the image below. For reference, @nmaahc stands for National Museum of African American History and Culture, and @crystal1johnson is maintained by a young black woman. Both @nmaahc and @crystal1johnson appear in clusters that are separate from the other clusters.

Screen Shot 2017-03-29 at 10.56.50 AM

zooming in…

Screen Shot 2017-03-29 at 10.59.28 AM

Searching through the tweets that were analyzed, I found that only two different tweets included @texasinafrica:

  1. @ceasarishome @realJerryFlavor @texasinafrica Wrong. Go support your racist man in the White House. #TrayvonMartin #BlackLivesMatter
  2. RT @texasinafrica: Trayvon Martin would have been 22 years old & graduating from college this spring. #BlackLivesMatter https://t.co/FW34Ec…

Moreover, neither of these tweets were created by @texasinafrica. While the first tweet seems to infer that @texasinafrica does not support the Black Lives Matter Movement, this is likely not the case since the second tweet only occurred in the data (3000 tweets) once. On the other hand, the second tweet was occurred 235 times in the data set. Thus my hypothesis that @texasinafrica would be connected to mention of Trayvon Martin was correct. The most common tweets to include @blklivesmatter also mention Trayvon Martin:

  1. RT @Blklivesmatter: #DearTrayvonsMom: We will honor your son’s memory each and every day. #RememberingTrayvon #BlackLivesMatter https://t.c…
  2. RT @Blklivesmatter: #RememberingTrayvon #BlackLivesMatter https://t.co/Yx76Z0mpx5
  3. RT @BLMChi: #TrayvonTaughtMe #JusticeForTrayvon #TrayvonMartin #RememberingTrayvon #BlackLivesMatter @Blklivesmatter https://t.co/ll4QRUZIkq

Again, the most common tweet to include @ACLUofGA references the memory of Trayvon Martin:

  1. RT @ACLUofGA: Today we honor #TrayvonMartin by continuing to fight for racial justice. #BlackLivesMatter

So while there are distinct cluster of users using #BlackLivesMatter, on February 26, 2017 all of the clusters were tweeting about Trayvon Martin. I believe that a network analysis of a different day might have revealed more about why the different clusters exist.

 

2 thoughts on “BLM Convos & Clusters– Do They Exist? Are They United?

  1. I’m surprised that your data had so few singletons, and that most mentions involved a few key nodes. Do you think that level of connectedness is related to the conversation about Trayvon Martin—would it have been less centralized on another day?
    For my own data, it would be interesting to go back and see which of the most connected units are being retweeted, and which are being mentioned frequently by other users.

  2. I really like the different directions you took to interpret multiple sides of your plot. I am surprised of how few nodes there are even considering you looked at tweets surrounded Treyvon Martin’s death. Do you think the blacklivesmatter conversation thrives the most around specific events? Or do you believe the conversation has continued had a steady pace over the course of the past year? I think it is really interesting that your hashtag actually has an account (@blklivesmatter) to specifically tweet about the cause. It clearly created a lot of other conversations that were able to stem from some of this accounts tweets. The nodes I found on my plot were not very interconnected, maybe by one or two users for each, so I is cool to see your three major nodes have a decent amount of connectivity. It says a lot about the strength about the blicklivesmatter conversation.

Leave a Reply

Your email address will not be published. Required fields are marked *