{"id":2808,"date":"2017-03-27T09:37:24","date_gmt":"2017-03-27T14:37:24","guid":{"rendered":"http:\/\/commons.trincoll.edu\/amst-data-driven\/?p=2808"},"modified":"2017-03-30T19:46:26","modified_gmt":"2017-03-31T00:46:26","slug":"lab-4-obamacare-network-analyses-with-gephi","status":"publish","type":"post","link":"http:\/\/commons.trincoll.edu\/amst-data-driven\/2017\/03\/27\/lab-4-obamacare-network-analyses-with-gephi\/","title":{"rendered":"Lab 4: Obamacare Network Analyses with Gephi"},"content":{"rendered":"<p>I chose to have 2500 tweets in my data set with all those tweets coming from March 8th &#8211; March 9th. I chose 2500 tweets because we were recommended to have a subset data set of tweets between 2000-3000 tweets and I thought that 2500 tweets would be a good sample size. The time period I chose really had no significance; my data set ended on March 9th and to make things run smoother, I just used the first 2500 tweets I had. I have a total of 3524 tweets in my new file. I believe that some columns on the right are blank because some tweets didn&#8217;t mention anybody in them or on the other hand, mentioned more than one person.<\/p>\n<p>There are a total of 77 nodes and 254 edges in the file.<\/p>\n<p>I chose undirected because it shows all the different mutual relationships in the data file. Undirected graph is able to display connections between people in a better way when compared to directed graphs.<\/p>\n<p><img loading=\"lazy\" class=\"alignnone size-medium wp-image-2844\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-27-at-11.08.37-AM-300x297.png\" alt=\"Screen Shot 2017-03-27 at 11.08.37 AM\" width=\"300\" height=\"297\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-27-at-11.08.37-AM-300x297.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-27-at-11.08.37-AM-150x150.png 150w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-27-at-11.08.37-AM.png 456w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>Valjean has ID number 11. Fantine has ID number 23. Valjean is connected to four targets with ID numbers 0, 2, 3 and 10 while Fantine is connected to nine targets with ID numbers 11, 12, 16, 17, 18, 19, 20, 21 and 22. Although Valjean is the main character, Fantine had just over double the amount of target connections that Valjean had. Based on this data, I believe that Fantine is the main source.<\/p>\n<p><img loading=\"lazy\" class=\"alignnone size-medium wp-image-3030\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.04.47-PM-300x247.png\" alt=\"Screen Shot 2017-03-30 at 7.04.47 PM\" width=\"300\" height=\"247\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.04.47-PM-300x247.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.04.47-PM.png 466w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>For some reason, when I increased the repulsion strength, my graph became colorful, but not as messy and disorganized. When I increased the repulsion strength, I think that my graph narrowed down all the connections to the connections that are seen most frequently on Twitter. The most frequent interactions are now shown in the graph above, whereas they were not shown in the first graph.<\/p>\n<p>Average path length = \u00a02.6411483253588517 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 Diameter = 5<\/p>\n<p>When I picked Betweenness Centrality and set the minimum size to 10 and maximum size to 200, the size of each node changed. What I think happened is that the larger the node, the more connections that node has whereas the smaller the node, the less connections that node has (I could be 100% wrong).<\/p>\n<p><img loading=\"lazy\" class=\"alignnone size-medium wp-image-3031\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.44.20-PM-300x198.png\" alt=\"Screen Shot 2017-03-30 at 7.44.20 PM\" width=\"300\" height=\"198\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.44.20-PM-300x198.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.44.20-PM-768x508.png 768w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.44.20-PM-1024x677.png 1024w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.44.20-PM.png 1085w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>PART III:<\/p>\n<p>The repulsion strength I chose was 10,000 because I was advised to do so by Professor Gieseking. My average path length was 5.016561091911934\u00a0and diameter was 16. This means that most relationships are 19 connections apart and separated by 6.732 degrees of diameter.<\/p>\n<p><img loading=\"lazy\" class=\"alignnone size-medium wp-image-3032\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.46.59-PM-300x286.png\" alt=\"Screen Shot 2017-03-30 at 7.46.59 PM\" width=\"300\" height=\"286\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.46.59-PM-300x286.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-30-at-7.46.59-PM.png 764w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>The largest nodes in my graph were &#8220;POTUS&#8221; and &#8220;speakerryan&#8221;. What I find interesting about this was that POTUS and speakerryan were both the largest words in my word cloud we created in lab three. This makes sense as many people are tweeting at the president of the United States, Donald Trump, as his goal is to repeal Obamacare; this would negatively impact a majority of people in America as they depend heavily on Obamacare for health insurance and medical expenses. Speakerryan was a large node too which makes sense as he&#8217;s heavily involved in this whole process of repealing Obamacare and coming up with a new plan. Interestingly enough, POTUS and speakerryan were separated whereas I thought that they would be more connected.<\/p>\n<p>I expected many of the big government figures to be connected, but like I said above, they don&#8217;t seem to be very connected, meaning they don&#8217;t communicate too often on Twitter. I feel like POTUS was the largest node as so many people are against President Trump and his new theories\/policies. The dates of the tweets I used in my dataset were from March 8th- March 9th, but many huge things have happened involving Obamacare in the last week and a half, which kind of makes the time period of this data not very helpful. In the last week, it was decided that Obamacare was not going to be repealed, but unfortunately, I stopped collecting tweets on Obamacare. I think that the singletons reveal that people may be talking about Obamacare, but not talking in the same conversations that the people in color are talking in (different context).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I chose to have 2500 tweets in my data set with all those tweets coming from March 8th &#8211; March 9th. I chose 2500 tweets because we were recommended to have a subset data set of tweets between 2000-3000 tweets and I thought that 2500 tweets would be a good sample size. The time period&#8230;<\/p>\n","protected":false},"author":872,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2808"}],"collection":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/users\/872"}],"replies":[{"embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/comments?post=2808"}],"version-history":[{"count":5,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2808\/revisions"}],"predecessor-version":[{"id":3033,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2808\/revisions\/3033"}],"wp:attachment":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/media?parent=2808"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/categories?post=2808"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/tags?post=2808"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}