{"id":11787,"date":"2016-10-12T03:26:24","date_gmt":"2016-10-12T03:26:24","guid":{"rendered":"https:\/\/socialmedialab.ca\/web\/?p=11787"},"modified":"2024-12-11T01:01:56","modified_gmt":"2024-12-11T01:01:56","slug":"examining-posting-vs-commenting-behaviour-on-reddit","status":"publish","type":"post","link":"https:\/\/socialmedialab.ca\/web\/2016\/10\/12\/examining-posting-vs-commenting-behaviour-on-reddit\/","title":{"rendered":"Examining Posting vs Commenting Behaviour on @reddit with @Tableau"},"content":{"rendered":"<p>As part of the Social Media Lab\u2019s ongoing efforts to study the changing landscape of social media websites and their communities, in this post we share some of our preliminary data analysis of a popular social news aggregation and discussion site called <strong><a href=\"https:\/\/www.reddit.com\/\">reddit<\/a><\/strong>. On <em>reddit<\/em>,<em>\u00a0<\/em>users can submit content such as text or links, and can also comment or\u00a0vote on the content posted by others. The site\u00a0is\u00a0organized by areas of interest called <em>subreddits <\/em>(think of them as separate online groups).<\/p>\n<figure id=\"attachment_11817\" aria-describedby=\"caption-attachment-11817\" style=\"width: 600px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" class=\"wp-image-11817 size-large\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/reddit_screen-600x279.png\" alt=\"reddit\" width=\"600\" height=\"279\" srcset=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/reddit_screen-600x279.png 600w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/reddit_screen-300x140.png 300w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/reddit_screen-768x358.png 768w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/reddit_screen-696x324.png 696w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/reddit_screen-1068x497.png 1068w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/reddit_screen-902x420.png 902w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><figcaption id=\"caption-attachment-11817\" class=\"wp-caption-text\">A front page of reddit, a social news aggregation, rating and discussion site.<\/figcaption><\/figure>\n<p>Inspired by some of the previous work on <a href=\"https:\/\/redditstuff.github.io\/sna\/vizit\/#\">mapping\u00a0linkages\u00a0between <em>reddit<\/em>&#8216;s various groups<\/a>,\u00a0we want to better understand the rapidly growing community of over <a href=\"https:\/\/en.wikipedia.org\/wiki\/Reddit\">200M\u00a0<em>reddit<\/em> users<\/a>\u00a0and their posting practices.\u00a0As an initial step,\u00a0we are developing\u00a0a typology of users based on their engagement with the site and others. In particular, we ask\u00a0whether\u00a0there are different types of users on <em>reddit<\/em> and across different <em>subreddits<\/em>.<\/p>\n<p>For our exploratory analysis we used <a href=\"https:\/\/www.tableau.com\/\">Tableau<\/a>, a visualization engine\u00a0that helps to summarize and discover interesting patterns\u00a0in structured\u00a0data. The\u00a0<em>subreddits <\/em>that we chose for this analysis were\u00a0\u201cask&#8221; subreddits, where users ask and answer questions about various topics from history (<a href=\"https:\/\/www.reddit.com\/r\/AskHistorians\/\">r\/AskHistorians\/<\/a>) to astronomy (<a href=\"https:\/\/www.reddit.com\/r\/askastronomy\/\">r\/AskAstronomy\/<\/a>). In total, using\u00a0<em>reddit<\/em>&#8216;s API,\u00a0we have\u00a0collected ~250,000\u00a0publicly available posts and comments submitted to the 13 &#8220;ask&#8221;\u00a0<em>subreddits <\/em>during the period of one year\u00a0in 2015.<\/p>\n<p>Below is an interactive visualization\u00a0showing\u00a0 the relationship between posting and commenting behaviour. Each data point\/shape represents a <em>reddit<\/em> user who either asked a question (submitted a post) or answered\/commented on other user&#8217;s\u00a0question, or both. The X axis represents the number of posts (~questions) and the\u00a0Y axis represents the number of comments (~answers). Different\u00a0shapes relate to different\u00a0<em>subreddits <\/em>(see the legend to\u00a0the right) and the size of the shape represents the number of likes the user received. We wanted to know if there are groups of\u00a0users who tend to either ask or answer questions\u00a0and why, and\u00a0whether it is different for different\u00a0<em>subreddits<\/em>.<\/p>\n<p>Note:\u00a0Since we used a logarithmic function to\u00a0reduce the effect of outliers, users who submitted only\u00a0posts or only comments are not visible in this chart.<\/p>\n<div id=\"viz1476283887521\" class=\"tableauPlaceholder\" style=\"position: relative;\"><noscript>&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;span class=&#8221;mceItemHidden&#8221; data-mce-bogus=&#8221;1&#8243;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;span&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;\/span&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;a href=&#8217;#&#8217;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;span class=&#8221;mceItemHidden&#8221; data-mce-bogus=&#8221;1&#8243;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;span class=&#8221;hiddenSpellError&#8221; pre=&#8221;&#8221; data-mce-bogus=&#8221;1&#8243;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;img&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;\/span&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;\/span&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt; alt=&#8217;Posting vs Commenting Behaviour on &amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;span class=&#8221;hiddenSpellError&#8221; pre=&#8221;on &#8221; data-mce-bogus=&#8221;1&#8243;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;reddit&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;\/span&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt; &#8216; src=&#8217;https:&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#47;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#47;public.tableau.com&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#47;static&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#47;images&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#47;Po&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#47;PostingvsCommentingbehaviouronreddit&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#47;Dashboard1&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;#47;1_rss.png&#8217; style=&#8217;border: none&#8217; \/&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;\/a&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;lt;\/span&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;gt;<\/noscript><object class=\"tableauViz\" style=\"display: none;\" width=\"900\" height=\"700\"><param name=\"host_url\" value=\"https%3A%2F%2Fpublic.tableau.com%2F\" \/><param name=\"site_root\" value=\"\" \/><param name=\"name\" value=\"PostingvsCommentingbehaviouronreddit\/Dashboard1\" \/><param name=\"tabs\" value=\"no\" \/><param name=\"toolbar\" value=\"yes\" \/><param name=\"static_image\" value=\"https:\/\/public.tableau.com\/static\/images\/Po\/PostingvsCommentingbehaviouronreddit\/Dashboard1\/1.png\" \/><param name=\"animate_transition\" value=\"yes\" \/><param name=\"display_static_image\" value=\"yes\" \/><param name=\"display_spinner\" value=\"yes\" \/><param name=\"display_overlay\" value=\"yes\" \/><param name=\"display_count\" value=\"yes\" \/><\/object><\/div>\n<p><script type=\"text\/javascript\">                    var divElement = document.getElementById('viz1476283887521');                    var vizElement = divElement.getElementsByTagName('object')[0];                    vizElement.style.width='804px';vizElement.style.height='669px';                    var scriptElement = document.createElement('script');                    scriptElement.src = 'https:\/\/public.tableau.com\/javascripts\/api\/viz_v1.js';                    vizElement.parentNode.insertBefore(scriptElement, vizElement);                <\/script><\/p>\n<p>The three different colors in the visualization represent different types of users, detected automatically by Tableau using a <em>k-means<\/em> clustering algorithm that\u00a0takes into account:\u00a0# of\u00a0posts, # of comments, and #\u00a0of likes. The\u00a0clustering algorithm allows to group users with similar posting\u00a0behaviour. Each\u00a0line shown in the visualization\u00a0represents a linear relationship (regression)\u00a0between # of\u00a0posts and comments for each cluster.<\/p>\n<p>Overall,\u00a0we found\u00a0that <strong>active users contribute both posts\/questions and\u00a0comments\/answers with a slight preference towards commenting\/answering<\/strong>.\u00a0This suggests the presence of a generally attentive community of users who are willing to help and contribute to the group by answering and commenting on other people&#8217;s posts and questions, and are not just there to get their own questions answered.\u00a0This trend is especially visible among users grouped in the largest (orange) cluster, labelled\u00a0as<img decoding=\"async\" class=\"alignnone size-full wp-image-11809\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/orange-cluster.png\" alt=\"orange-cluster\" width=\"81\" height=\"20\" srcset=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/orange-cluster.png 81w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/orange-cluster-80x20.png 80w\" sizes=\"(max-width: 81px) 100vw, 81px\" \/><\/p>\n<p>Based on the clustering analysis,\u00a0we also found two\u00a0extremes:<\/p>\n<p style=\"padding-left: 30px;\"><img decoding=\"async\" class=\"alignnone size-full wp-image-11806\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/red-cluster-4.png\" alt=\"red-cluster-4\" width=\"81\" height=\"20\" srcset=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/red-cluster-4.png 81w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/red-cluster-4-80x20.png 80w\" sizes=\"(max-width: 81px) 100vw, 81px\" \/>\u00a0users who contributed 10 or more posts\/questions (log&gt;=1); these are the users who presumably found group&#8217;s answers helpful in the past and came back to ask more questions;<\/p>\n<p style=\"padding-left: 30px;\"><img decoding=\"async\" class=\"alignnone size-full wp-image-11808\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/blue-cluster.png\" alt=\"blue-cluster\" width=\"86\" height=\"20\" srcset=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/blue-cluster.png 86w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2016\/10\/blue-cluster-80x20.png 80w\" sizes=\"(max-width: 86px) 100vw, 86px\" \/>users who contributed 100 or more comments\/answers (log&gt;=2); these users are especially active communicators who engage others in discussion by answering and\u00a0contributing to other people&#8217;s questions.<\/p>\n<p>One of the most interesting features\u00a0of this interactive visualization is its ability to view the\u00a0prevalence of the two extremes\u00a0across different <em>subreddits<\/em> by using the <strong>highlight subreddit<\/strong> feature (in the bottom left side of the visualization). Using this feature, for example, we can see that\u00a0<em>AskPhysics<\/em>\u00a0has some extreme posters\/questioners (red cluster) but\u00a0none of the extreme commentators\/answerers (blue cluster); while <em>AskLiteraryStudies <\/em>has no extreme posters\/questioners (red cluster) and only three extreme commentators\/answerers (blue cluster). This suggests that there may be slight variations in posting behaviour among members of different <em>subreddits<\/em>.<\/p>\n<p>Our future work will examine\u00a0why some users like to publish more posts than comments and vise versa.\u00a0And\u00a0why do some <em>subreddits<\/em>\u00a0encourage different posting behaviour than others? We also plan to use Social Network Analysis to discover and compare posting practices\u00a0at the group level across different\u00a0<em>subreddits.<\/em><\/p>\n<blockquote><p>Note: the analysis is done by\u00a0Bradly Dahdaly, a data science intern at the Social Media Lab with contributions by Anatoliy Gruzd, Philip Mai and other members of the Lab.<\/p><\/blockquote>\n","protected":false},"excerpt":{"rendered":"<p>As part of the Social Media Lab\u2019s ongoing efforts to study the changing landscape of social media websites and their communities, in this post we share some of our preliminary data analysis of a popular social news aggregation and discussion site called reddit. On reddit,\u00a0users can submit content such as text or links, and can [&hellip;]<\/p>\n","protected":false},"author":49,"featured_media":11804,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[489,265,264],"tags":[335,416,419],"class_list":["post-11787","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-org-soc-med-use","category-research","category-web-apps","tag-plasma","tag-reddit","tag-tableau"],"_links":{"self":[{"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/posts\/11787","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/users\/49"}],"replies":[{"embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/comments?post=11787"}],"version-history":[{"count":23,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/posts\/11787\/revisions"}],"predecessor-version":[{"id":20120,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/posts\/11787\/revisions\/20120"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/media\/11804"}],"wp:attachment":[{"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/media?parent=11787"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/categories?post=11787"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/tags?post=11787"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}