{"id":751,"date":"2016-03-02T23:14:23","date_gmt":"2016-03-03T04:14:23","guid":{"rendered":"http:\/\/commons.trincoll.edu\/amst-data-driven\/?p=751"},"modified":"2018-05-30T16:44:51","modified_gmt":"2018-05-30T21:44:51","slug":"text-analysis-trumptrain","status":"publish","type":"post","link":"http:\/\/commons.trincoll.edu\/amst-data-driven\/2016\/03\/02\/text-analysis-trumptrain\/","title":{"rendered":"Text Analysis #TrumpTrain"},"content":{"rendered":"<p><u><\/u>When reading the specific tweets with reference to their geolocated spaces it became apparent how support (or lack thereof) is regional. \u00a0The tweets from the southern region of the United States displayed complete support while the northeast and northwest showed unanimous lack of support. \u00a0I truly wish that there were more geolocated tweets because I believe it would have been fascinating to cross reference them with the text analysis. \u00a0I think that if we could create a text analysis based on the regional location it would prove to be vitally important and would undoubtedly allow for a deeper understanding. \u00a0I hope to see many interesting topics such as references to different primaries, politically driven hashtags other than #TrumpTrain, and finally I hope to find some keywords which express disgust with Donald Trump&#8217;s campaign.<\/p>\n<p>494,121 Words. \u00a021,974 Unique Words. \u00a0There were two data points which I found interesting. \u00a0They are both terms which are labeled as &#8220;distinctive to the corpus.&#8221; \u00a0These are <a class=\"document-type keyword\" href=\"https:\/\/twitter.com\/search?q=%23nhprimary&amp;src=typd&amp;lang=en\" target=\"_blank\" rel=\"noopener\">#nhprimary<\/a>\u00a0and\u00a0<a href=\"https:\/\/twitter.com\/DanScavino?lang=en\" target=\"_blank\" rel=\"noopener\">@danscavino<\/a>. \u00a0The hashtag #nhprimary was of clear relevance the past few weeks as it has a grip on the entire political scene as it is the\u00a0first of the\u00a0primary elections in the nation. \u00a0The second term: @danscavino, is extremely important. \u00a0This is the Twitter handle of Dan Scavino who is Donald Trump&#8217;s senior adviser and director of social media. \u00a0I have previously discussed his role in previous posts; however, it is important to reiterate that his tweets consistently receive a multitude of retweets and favorites which displays a large support for Trump&#8217;s campaign. \u00a0The first word cloud produced can be seen in Figure 1.<\/p>\n<figure id=\"attachment_790\" aria-describedby=\"caption-attachment-790\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.03.51-AM.png\" rel=\"attachment wp-att-790\"><img loading=\"lazy\" class=\"wp-image-790 size-medium\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.03.51-AM-300x219.png\" alt=\"\" width=\"300\" height=\"219\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.03.51-AM-300x219.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.03.51-AM.png 750w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-790\" class=\"wp-caption-text\">Figure 1.<\/figcaption><\/figure>\n<p>I found that there were many words in this first cloud which needed to be removed. \u00a0These words contained no useful and legitimate information as they were simply the most common numbers and auxiliary verbs. \u00a0I found this to be very interesting but not very valuable. \u00a0After adding stop words to filter these words, the word cloud made much more sense. \u00a0The updated word cloud can be found in Figure 2. \u00a0The stop words\u00a0I edited included the following: t.co, 23, 20, boyddhhx, pls, #t\u00e9, could, can, would, has, just, almost, \u00e9, rt (and many many more). \u00a0Although I negated the terms \u00e9, t\u00e9, and #t\u00e9, I think they are an extremely valuable piece of information. \u00a0The letter \u00e9 is used in Spanish and therefore it is a definitive possibility that there is a large hispanic population expressing views on Trump. \u00a0This is a valuable piece of information because the hispanic population is extremely important to the 2016 presidential race. \u00a0The more hispanic support, the better chance of winning.<\/p>\n<p>Figure 2 reveals a vast amount of useful information pertaining to the hashtag, #TrumpTrain. \u00a0Quite obviously, #TrumpTrain is the most common term used as it was included in every single tweet analyzed. \u00a0The largest words in the cloud made complete sense as they are Donald Trump&#8217;s campaign slogan (<a href=\"https:\/\/twitter.com\/search?q=%23makeamericagreatagain&amp;src=typd&amp;lang=en\">#makeamericagreatagain<\/a>), his own Twitter handle (<a href=\"https:\/\/twitter.com\/realDonaldTrump?lang=en\">@realdonaldtrump<\/a>), his campaign hashtag (<a href=\"https:\/\/twitter.com\/search?q=%23trump2016&amp;src=typd&amp;lang=en\">#trump2016<\/a>), and a popular tweet in support of Trump in the South Carolina primary (<a href=\"https:\/\/twitter.com\/search?q=%23votetrumpsc&amp;src=typd&amp;lang=en\">#votetrumpsc<\/a>). \u00a0Due to the fact that I have never made a word cloud before I was surprised with Voayant&#8217;s\u00a0ease of use as well as the final product. \u00a0I did not know what to expect and I was blown away by the filtered data. \u00a0Further, I could not believe that there were so many different terms, all important, which related to #TrumpTrain. \u00a0It made it easier to understand my data as a whole as it extracted the most popular points without having to read through each and every tweet. \u00a0I think that this word cloud suggests that there are topics which are subtle. \u00a0Indeed, the terms present may be the most popular, however, Voyant allows for the visualization of nearly ALL of these terms which we may not have been able to conclude simply through reading the tweets. \u00a0Moving forward, I will certainly look at my data differently as well as my analysis. \u00a0This software allowed me to realize that there is so much data in which we can not find just by reading. \u00a0This technology has broken it down into workable components which allows for a greater understanding of the entire data set.<\/p>\n<figure id=\"attachment_819\" aria-describedby=\"caption-attachment-819\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.13.57-AM.png\" rel=\"attachment wp-att-819\"><img loading=\"lazy\" class=\"wp-image-819 size-medium\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.13.57-AM-300x198.png\" alt=\"\" width=\"300\" height=\"198\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.13.57-AM-300x198.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.13.57-AM-768x507.png 768w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-02-at-11.13.57-AM.png 858w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-819\" class=\"wp-caption-text\">Figure 2.<\/figcaption><\/figure>\n<p>The five most popular terms were as follows:\u00a0#trumptrain,\u00a0#trump2016,\u00a0trump, #makeamericagreatagain, #votetrumpsc. \u00a0I briefly mentioned these words in the previous paragraph; however, a more in depth analysis is needed. \u00a0I was not at all surprised by any of these terms. \u00a0They were unquestionably on my radar before I began to analyze the data through Voyant Tools. \u00a0I have noticed from the very first data collection that they are ever-present and will remain that way. \u00a0Nearly every single tweet contained these five terms and I conject that they will continue to do so. \u00a0I think that #votetrumpsc will alter slightly depending on the state currently voting. \u00a0The terms\u00a0<em>landslide<\/em>,<em> #trumpleadstheway<\/em>,\u00a0and<em> #yuge<\/em> speak to me in high frequencies. \u00a0I chose these words because they are all important terms (and comical) and apply directly to the support of Donald Trump. \u00a0I question the use of these terms historically because I doubt that they have been around for every long on Twitter. \u00a0I think that each of these terms has gained extreme popularity in the past few months as Trump gains political support.<\/p>\n<figure id=\"attachment_821\" aria-describedby=\"caption-attachment-821\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-12.36.58-AM.png\" rel=\"attachment wp-att-821\"><img loading=\"lazy\" class=\"wp-image-821 size-medium\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-12.36.58-AM-300x176.png\" alt=\"Screen Shot 2016-03-04 at 12.36.58 AM\" width=\"300\" height=\"176\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-12.36.58-AM-300x176.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-12.36.58-AM-768x452.png 768w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-12.36.58-AM-1024x602.png 1024w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-12.36.58-AM.png 1996w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-821\" class=\"wp-caption-text\">Figure 3.<\/figcaption><\/figure>\n<p>The most fascinating piece of information regarding Figure 3. is how each of the terms has the same amount of frequency, yet over time they dip and spike at varying times. \u00a0Unfortunately, due to my large data set, I only imported 40 hours of data. \u00a0This cumulated to roughly 56,000 tweets. \u00a0Yes, this is a large number, but I find it difficult to analyze the use based simply on this small window of time. \u00a0With that being said, I did notice that between the hours of 11:59 and 8 am on February 10th, there were close to zero tweets using any of these terms. \u00a0After this time, there was a clear increase throughout the day. \u00a0The highest spike was the hashtag, #yuge, which reached its max between the hours of 8:00pm and 11:59pm on February 10th. \u00a0This may be due to the most activity on Twitter at this time. \u00a0People may be catching up on the news this day and feel the urge to tweet. \u00a0After looking back to this day there were no major events that I could find to\u00a0correlate with the peak.<\/p>\n<p><a href=\"http:\/\/voyant-tools.org\/tool\/TypeFrequenciesChart\/?corpus=1456929307261.4105&amp;type=%23trumpleadstheway&amp;type=%23yuge&amp;type=landslide&amp;mode=corpus\">http:\/\/voyant-tools.org\/tool\/TypeFrequenciesChart\/?corpus=1456929307261.4105&amp;type=%23trumpleadstheway&amp;type=%23yuge&amp;type=landslide&amp;mode=corpus<\/a><\/p>\n<p>I chose an article title &#8220;Immigration Reform that will Make America Great Again.&#8221; [1] \u00a0It was found on Donald Tump&#8217;s campaign website within his &#8220;Positions&#8221; page. \u00a0I thought that this was important because this topic has likely been the most controversial and widely discussed topic during this presidential race. \u00a0I thought that through an analysis of this document a greater knowledge of his stance may be found.<\/p>\n<figure id=\"attachment_856\" aria-describedby=\"caption-attachment-856\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-1.01.02-PM.png\" rel=\"attachment wp-att-856\"><img loading=\"lazy\" class=\"wp-image-856 size-medium\" src=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-1.01.02-PM-300x206.png\" alt=\"Screen Shot 2016-03-04 at 1.01.02 PM\" width=\"300\" height=\"206\" srcset=\"http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-1.01.02-PM-300x206.png 300w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-1.01.02-PM-768x526.png 768w, http:\/\/commons.trincoll.edu\/amst-data-driven\/files\/2016\/03\/Screen-Shot-2016-03-04-at-1.01.02-PM.png 852w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-856\" class=\"wp-caption-text\">Figure 4.<\/figcaption><\/figure>\n<p>I thought that this was fascinating. \u00a0I really was not expecting to see such important words used so often. \u00a0The first word cloud created using this document had a very small amount of stop words. \u00a0I was blown away after they were removed. \u00a0I think that this word cloud displays the words Donald Trump has been focusing on the most surrounding this issue. \u00a0He tends to discuss the terms illegal, wall, visa, Mexico, border, gang, etc. to a great extent. \u00a0I find that these words have instilled a fear into the citizens of the United States. \u00a0The other candidates speak more kindly regarding Immigration Reform while Donald Trump uses these harsh words to attack those who wish to come to the United States. \u00a0This word cloud clearly expresses this. \u00a0When comparing this document&#8217;s data to the Twitter data it is clear that it a far narrower topic and thus the terms surround one specific topic rather than the entirety of his campaign. \u00a0This is important and I think it would be fascinating to compare a word cloud on a few different issues and see if there are any comparisons or themes which he discusses across the board. \u00a0I think that this data it definitely assists in the reading of Twitter data because it proves how wide the data used by Twitter actually is. \u00a0Rarely did I see specific examples regarding his stance on important issues.<\/p>\n<p>Both Yau and Tufte&#8217;s approaches to data visualization are important and credible. \u00a0Tufte writes, \u201cSuperior methods are more likely to produce truthful, credible, and precise findings\u201d [2] while Yau writes, \u201cData is an abstraction of real life, and real life can be complicated, but if you gather enough context, you can at least put forth a solid effort to make sense of it.&#8221; [3]. \u00a0I personally find that Yau&#8217;s statement to his approach is best. \u00a0I think that the term &#8216;real life&#8217; is the most important piece. \u00a0People change, opinions change, and life changes. \u00a0Analysts need to realize this and understand its intense complexity. \u00a0With enough context, sense can be found; however, this sense may be wrong. \u00a0Realizing this is the first step. \u00a0I think that this is especially clear when comparing to Donald Trump and his campaign. \u00a0 He is an extremely sporadic and scattered individual who makes it difficult to make sense of. \u00a0In this context, I find it difficult to use Tufte&#8217;s belief as &#8216;precise findings&#8217; are nearly impossible when it comes to human interactions and beliefs. \u00a0This is especially true when discussing his Immigration Reform. \u00a0Trump has stated countless times that the Mexican people &#8220;love him&#8221; yet, in his Immigration Reform he consistently discusses how illegal immigrants from Mexico should not be allowed and a wall should be built to keep the Mexican people out. \u00a0Yau&#8217;s approch is clearly more valuable when looking at this because this data truly is an abstraction of real life and is truly complicated. \u00a0With the right amount of context, we can being and attempt to understand it.<\/p>\n<p>&nbsp;<\/p>\n<p>SOURCES:<\/p>\n<ol>\n<li>Tufte, Edward R. 2011. \u201cVisual &amp; Statistical Thinking: Displays of Evidence for Making Decisions.\u201d In<em>Envisioning Information<\/em>, 27. Cheshire, CT.: Graphics Press.<\/li>\n<li>Yau, Nathan. 2013. \u201cRepresenting Data.\u201d In <em>Data Points<\/em>, 41. Hoboken: Wiley.<\/li>\n<li>Trump, Donald J. &#8220;Immigration Reform.&#8221; Immigration Reform. Accessed March 04, 2016. <a href=\"https:\/\/www.donaldjtrump.com\/positions\/immigration-reform\">https:\/\/www.donaldjtrump.com\/positions\/immigration-reform<\/a>.<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>When reading the specific tweets with reference to their geolocated spaces it became apparent how support (or lack thereof) is regional. \u00a0The tweets from the southern region of the United States displayed complete support while the northeast and northwest showed unanimous lack of support. \u00a0I truly wish that there were more geolocated tweets because I&#8230;<\/p>\n","protected":false},"author":1630,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[4],"tags":[],"_links":{"self":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/751"}],"collection":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/users\/1630"}],"replies":[{"embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/comments?post=751"}],"version-history":[{"count":10,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/751\/revisions"}],"predecessor-version":[{"id":3822,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/posts\/751\/revisions\/3822"}],"wp:attachment":[{"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/media?parent=751"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/categories?post=751"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/commons.trincoll.edu\/amst-data-driven\/wp-json\/wp\/v2\/tags?post=751"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}