{"id":2453,"date":"2017-03-01T11:13:11","date_gmt":"2017-03-01T16:13:11","guid":{"rendered":"http:\/\/commons.trincoll.edu\/amst-data-driven\/?p=2453"},"modified":"2017-03-01T22:07:17","modified_gmt":"2017-03-02T03:07:17","slug":"analyzing-data-lab-3","status":"publish","type":"post","link":"http:\/\/commons.trincoll.edu\/amst-data-driven\/2017\/03\/01\/analyzing-data-lab-3\/","title":{"rendered":"Analyzing cloud data lab 3"},"content":{"rendered":"<p>In my text analysis I hope to see tweets covering President Trump. I think that there will be a lot of recent tweets regarding his first address to congress.<\/p>\n<p>My number of words is 396,197 and 42,955 unique words. Interestingly, the most common word, or phrase in this instance, is &#8220;02&#8221; and is tweeted 19,265 times. alternativefacts is only used 18,739 times. Many of the phrases that stand out and I expected to be there were &#8220;realdonaldtrump&#8221;, &#8220;trump&#8221;, &#8220;fake news&#8221;, &#8220;theresistance&#8221; and &#8220;nobannowall&#8221;. Some of the weird ones that are common are &#8220;02&#8221;, &#8220;_u&#8221;, &#8220;u_&#8221; and &#8220;killerbee805&#8221;. I don&#8217;t see the how any of these are useful in tweets regarding alternative facts or Trumps administration.<\/p>\n<p>02<\/p>\n<p>rt<\/p>\n<p>alternativefacts<\/p>\n<p>https<\/p>\n<p>t.co<\/p>\n<p>I included 95 words<\/p>\n<p>It revels much about the controversy revolving around #alternativefacts. I was expecting to see much about Trump&#8217;s decisions and actions. Some of the key words are; &#8220;nobannowall&#8221;, &#8220;potus&#8221;, &#8220;noraids&#8221;, &#8220;resist&#8221;, &#8220;believe&#8221;, and &#8220;seanspicer&#8221;. Many of these make sense as they all have to do with Trump&#8217;s administration. This confirms what I thought most of my data was going to be about. Five of my most common terms are &#8220;nobannowall&#8221;, &#8220;potus&#8221;, &#8220;noraids&#8221;, &#8220;resist&#8221;, and &#8220;seanspicer&#8221;. All of these were not surprising as most of these are very common things talked about when talking about Trump. Also, Sean Spicer tweets a lot about politics so it makes sense that he would be a top word. Another interesting one was &#8220;impeachtrump&#8221; which appeared 186 times.<\/p>\n<p><img loading=\"lazy\" class=\"alignnone size-medium wp-image-2512\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.27.59-PM-300x250.png\" alt=\"Screen Shot 2017-03-01 at 9.27.59 PM\" width=\"300\" height=\"250\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.27.59-PM-300x250.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.27.59-PM.png 399w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>I chose to do &#8220;nobannowall&#8221; &#8220;impeachtrump&#8221; and &#8220;resist&#8221;. I am choosing to do these because I think they will come up with the most interesting and controversial data. I am interested in impeach Trump the most because I want to see what everyone is saying about it and why since there are currently no grounds for impeachment.<\/p>\n<p><img loading=\"lazy\" class=\"alignnone size-medium wp-image-2514\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.37.19-PM-300x188.png\" alt=\"Screen Shot 2017-03-01 at 9.37.19 PM\" width=\"300\" height=\"188\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.37.19-PM-300x188.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.37.19-PM.png 414w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>http:\/\/voyant-tools.org\/?corpus=c1a41e49aa06b78c7eba9e39aa0b4df7&#038;stopList=keywords-702dc271899e3005adb694ee462c1f0b&#038;panels=cirrus,reader,trends,summary,contexts<\/p>\n<p>I chose to do an article by The Hill titled, &#8220;The House should start impeachment against Trump now&#8221;. I chose to do this because I think the controversy surrounding him is intriguing. I also would like to read about valid reasons for people arguing impeachment, not just angry liberals ranting on social media.<\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" class=\"alignnone size-medium wp-image-2517\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.56.00-PM-300x247.png\" alt=\"Screen Shot 2017-03-01 at 9.56.00 PM\" width=\"300\" height=\"247\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.56.00-PM-300x247.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2017\/03\/Screen-Shot-2017-03-01-at-9.56.00-PM.png 406w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Everything is related to the American aspect because almost all of the data is relevant to Trump and he is the President of the United States. Most of the new key words about things Trump is either trying to change or already changing. Some of these include healthcare, energy and business. I think this really bring a more American aspect to the dat because it is talking about American policies that are being affected and changed in some way or another. Although most of the data from twitter talks about similar things, this is a more formal way of viewing it.<\/p>\n<p>I think that Tufte&#8217;s approach is best. He says that superior methods are better, making them more precise, thus making them more credible. I think the last part of the lab demonstrates this best. The data scraped from twitter was not as serious or credible as the data scraped from The Hill article. This was easy to see when comparing the two word clouds. The one from twitter was filled with bizarre users and irrelevant words. The word cloud from the article was filled with all words connected right to Trump and his administration. It proves that more data is not always better data. Yau seems to believe that the more data the better but this proved that the more superior methods of collecting data find a better depiction of the actual data.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In my text analysis I hope to see tweets covering President Trump. I think that there will be a lot of recent tweets regarding his first address to congress. My number of words is 396,197 and 42,955 unique words. Interestingly, the most common word, or phrase in this instance, is &#8220;02&#8221; and is tweeted 19,265&#8230;<\/p>\n","protected":false},"author":1776,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2453"}],"collection":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/users\/1776"}],"replies":[{"embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/comments?post=2453"}],"version-history":[{"count":4,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2453\/revisions"}],"predecessor-version":[{"id":2519,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/2453\/revisions\/2519"}],"wp:attachment":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/media?parent=2453"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/categories?post=2453"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/tags?post=2453"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}