{"id":2919,"date":"2017-03-31T10:51:46","date_gmt":"2017-03-31T15:51:46","guid":{"rendered":"http:\/\/commons.trincoll.edu\/amst-data-driven\/?p=2919"},"modified":"2017-03-31T10:51:46","modified_gmt":"2017-03-31T15:51:46","slug":"communication-flows-social-network-analysis-of-nodapl","status":"publish","type":"post","link":"http:\/\/commons.trincoll.edu\/amst-data-driven\/2017\/03\/31\/communication-flows-social-network-analysis-of-nodapl\/","title":{"rendered":"Communication Flows: Social Network Analysis of #NoDAPL"},"content":{"rendered":"<p>With the social network analysis, I hope to gain a better understanding of what connections exist between Twitter users discussing #NoDAPL. For this analysis, I chose to use tweets from February 21-23, yielding a total of approximately 1,800 tweets. This time period includes the days leading up to the forced clearing of the protestors from the Standing Rock camp.<\/p>\n<p>The social network analysis was performed using Gephi. To familiarize myself with the software, I used one of the sample networks provided\u2014<em>Les Miserables<\/em>. The first choice available upon opening the file is either a directed or undirected graph. For the purposes of the exercise, I chose the undirected graph. However, a directed graph may be useful for my analysis of #NoDAPL tweets, as it will better display the flows of the conversation\u2014whether individuals are directing messages at (or retweeting) more connected users, or whether the connected users are speaking to a large group of followers. In the\u00a0<em>Les Mis<\/em> file, there are 77 nodes (characters), and 254 edges (interactions).<\/p>\n<figure id=\"attachment_3035\" aria-describedby=\"caption-attachment-3035\" style=\"width: 290px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" class=\"size-medium wp-image-3035\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-8.59.01-PM-290x300.png\" alt=\"Unsorted Graph of Les Mis\" width=\"290\" height=\"300\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-8.59.01-PM-290x300.png 290w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-8.59.01-PM.png 656w\" sizes=\"(max-width: 290px) 100vw, 290px\" \/><figcaption id=\"caption-attachment-3035\" class=\"wp-caption-text\">Unsorted Graph of Les Mis<\/figcaption><\/figure>\n<p>The Data Laboratory shows the connections between sources and targets\u2014essentially, who is speaking, and to whom. Valjean, the main character (ID no. 11) only speaks to four other characters, but he is spoken about by considerably more. Fantine (ID no. 23), on the other hand, speaks to Valjean and nine others, but is spoken about by fewer. Valjean speaks to Myriel, Labarre, Mademoiselle Baptiste, and Madame Magliore. Fantine speaks to Valjean, Marguerite, Tholomyes, Fameuil, Blacheville, Dahlia, and Zephine. Curiously, the only apparent overlap between the two is when Fantine speaks directly to Valjean. It will be interesting to see how Fantine is connected to Valjean such that she can speak to him directly, but is never spoken to in return (at least by name).<\/p>\n<p>Running the statistical analyses yielded some additional information. The average path length between nodes was 2.64, and the diameter, or most &#8220;degrees of separation,&#8221; between nodes was 5.<\/p>\n<figure id=\"attachment_3059\" aria-describedby=\"caption-attachment-3059\" style=\"width: 229px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" class=\"wp-image-3059 size-medium\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-27-at-10.46.17-AM-229x300.png\" width=\"229\" height=\"300\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-27-at-10.46.17-AM-229x300.png 229w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-27-at-10.46.17-AM.png 342w\" sizes=\"(max-width: 229px) 100vw, 229px\" \/><figcaption id=\"caption-attachment-3059\" class=\"wp-caption-text\">Les Mis network with repulsion strength increased<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_3058\" aria-describedby=\"caption-attachment-3058\" style=\"width: 300px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" class=\"size-medium wp-image-3058\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/screenshot_094838-300x225.png\" alt=\"Les Mis social network colorized by modularity\" width=\"300\" height=\"225\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/screenshot_094838-300x225.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/screenshot_094838-768x576.png 768w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/screenshot_094838.png 1024w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><figcaption id=\"caption-attachment-3058\" class=\"wp-caption-text\">Les Mis social network colorized by modularity<\/figcaption><\/figure>\n<p>Several adjustments were necessary to make the network analysis readable. Increasing the repulsion strength separated the nodes from one another, lengthening the edges between them. The relative scale of the connections did not change, but having a stronger &#8220;push&#8221; between nodes begins to separate them into distinct groups. Changing the parameters for node size to a range from 10-200 magnified the more connected nodes, better showing their level of connectedness. The nodes were ranked by number of connections, and assigned a size within the given range based on that rank.<\/p>\n<figure id=\"attachment_3060\" aria-describedby=\"caption-attachment-3060\" style=\"width: 300px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" class=\"size-medium wp-image-3060\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Untitled-4-300x300.png\" alt=\"Les Mis network analysis, visualized and sorted by modularity\" width=\"300\" height=\"300\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Untitled-4-300x300.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Untitled-4-150x150.png 150w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Untitled-4-768x768.png 768w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Untitled-4-380x380.png 380w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Untitled-4.png 1024w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><figcaption id=\"caption-attachment-3060\" class=\"wp-caption-text\">Les Mis network analysis, visualized and sorted by modularity<\/figcaption><\/figure>\n<p>Running Gephi with my own data, I was able to be more flexible with the settings in order to create a better visualization. I chose a repulsion strength of 6,000, a compromise between the default value and the value used in the\u00a0<em>Les Mis<\/em> analysis. Using a repulsion strength of 10,000 moved the nodes too far apart, increasing the overall size of the plot to the point that large nodes and groups were difficult to see clearly. The node size range was kept the same, between 10 and 200.<\/p>\n<p>The statistical analyses of #NoDAPL produced very different results than those from\u00a0<em>Les Mis.\u00a0<\/em>The average path length between users was 5.23, and the diameter was 13. Obviously, a real-world sample of Twitter users will not be as tightly connected as the characters of a play, explaining why both metrics are higher, by a factor of about 2 and 2.5, respectively. With 1,778 nodes and 1,994 edges, I am surprised that the distance between nodes is still relatively short. The fact that the average path length (5.23) is less than half of the maximum path length (13) indicates a tendency of users of the #NoDAPL hashtag to be connected to one another. A larger diameter indicates a larger gap between two users or their groups; this may have many causes, but one likely explanation is that these disconnected users are invested in different issues\u2014perhaps the environment and indigenous rights\u2014under the same banner.<\/p>\n<figure id=\"attachment_3062\" aria-describedby=\"caption-attachment-3062\" style=\"width: 300px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" class=\"size-medium wp-image-3062\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/NoDAPL2-300x300.png\" alt=\"#NoDAPL network, visualized and grouped by modularity\" width=\"300\" height=\"300\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/NoDAPL2-300x300.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/NoDAPL2-150x150.png 150w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/NoDAPL2-768x768.png 768w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/NoDAPL2-380x380.png 380w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/NoDAPL2.png 1024w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><figcaption id=\"caption-attachment-3062\" class=\"wp-caption-text\">#NoDAPL network, visualized and grouped by modularity<\/figcaption><\/figure>\n<p>The final visualization reveals several small clusters, centered around a single, larger node. These large nodes appear to be mostly individual activists or advocacy groups, each with thousands or tens of thousands of tweets. User @<em>ruthhhopkins<\/em> is the largest node, but is not disproportionately larger than other central nodes. Many users are not strongly connected to any others, positioned in small clusters with a degree range of one or two. I chose to filter out singletons, with a degree of 0. I was conflicted about filtering past zero degrees\u2014it created a less cluttered, more understandable plot, but eliminated many\u00a0of the clusters around larger nodes. These single connections suggest that more connected users are simply being retweeted, rather than really being engaged in conversation. The directional edges support this assumption, with most being pointed at the large nodes, indicating that they were mentioned in other commons, as is practiced when retweeting.<\/p>\n<p>As I mentioned, most of the major nodes are activists involved in the protests of the Dakota Access pipeline\u2014either individuals or organizations. Most of these accounts are highly active. Even those claiming to belong to a single person are most likely managed\u2014the largest node, @<em>ruthhhopkins<\/em> has almost 65 thousand tweets, and over 44 thousand followers. The other notable group of major nodes is online news organizations. Users\u00a0<em>@fusion<\/em> and<em>@UR_Ninja<\/em>, as well as some others, are general news sites or media-specific branches. Most of the users surrounding the main nodes are unremarkable\u2014regular users with a smaller following and fewer tweets. A couple of interesting groupings did appear, however.\u00a0<em>@Wmn4Srvl\u00a0<\/em>(Women for Survival) was linked with several major news organizations (BBC, ABC, CNN, and others), but not strongly linked to any other nodes. In some\u00a0cluster, there were two major nodes (<em>@joshfoxfilm\u00a0<\/em>and\u00a0<em>@npr, \u00a0<\/em><em>@potus\u00a0<\/em>and\u00a0<em>@americanindian8<\/em>,\u00a0and\u00a0<em>@fusion\u00a0<\/em>and\u00a0<em>@markruffalo<\/em>), with a group of shared connections. Most of these pairings had a large, public account (<em>@npr<\/em>,\u00a0<em>@potus<\/em>, and\u00a0<em>@markruffalo<\/em>) and a smaller, non-mainstream account. Perhaps these are instances of one account sharing a message directed at another, which is then retweeted by a group of followers.<\/p>\n<p>Although it is interesting to see the small clusters that did form, it appears that the majority of the hashtag users did not have particularly strong connections to other users. Also, while there are several large nodes focused on similar topics, there are few connections between them. I was surprised to see this fragmentation within the community. Instead, I would have expected users with similar concerns to group together, creating clusters based on interest in particular social or environmental issues, rather than simply choosing a particular figure and following them.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>With the social network analysis, I hope to gain a better understanding of what connections exist between Twitter users discussing #NoDAPL. For this analysis, I chose to use tweets from February 21-23, yielding a total of approximately 1,800 tweets. This time period includes the days leading up to the forced clearing of the protestors from&#8230;<\/p>\n","protected":false},"author":1967,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[6],"tags":[],"_links":{"self":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2919"}],"collection":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/users\/1967"}],"replies":[{"embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/comments?post=2919"}],"version-history":[{"count":3,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2919\/revisions"}],"predecessor-version":[{"id":3067,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2919\/revisions\/3067"}],"wp:attachment":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/media?parent=2919"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/categories?post=2919"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/tags?post=2919"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}