{"id":22366,"date":"2024-03-12T00:47:17","date_gmt":"2024-03-12T00:47:17","guid":{"rendered":"https:\/\/socialmedialab.ca\/web\/?page_id=22366"},"modified":"2024-03-19T21:05:09","modified_gmt":"2024-03-19T21:05:09","slug":"2024-icwsm-tutorial-interactive-topic-analysis","status":"publish","type":"page","link":"https:\/\/socialmedialab.ca\/web\/events\/2024-icwsm-tutorial-interactive-topic-analysis\/","title":{"rendered":"Interactive Topic Analysis with Multi-Lingual Embeddings in Communalytic @ #ICWSM2024"},"content":{"rendered":"\n<div class=\"wp-block-group alignfull has-text-color has-background\" style=\"color:#000000;background-color:#ffffff\"><div class=\"wp-block-group__inner-container is-layout-flow wp-block-group-is-layout-flow\">\n<div style=\"height:15px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n<\/div><\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-8f761849 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column .column { padding: 20px; } is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:33.33%\"><div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"1600\" height=\"900\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-1600x900.jpg\" alt=\"\" class=\"wp-image-22499\" srcset=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-1600x900.jpg 1600w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-300x169.jpg 300w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-768x432.jpg 768w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-1024x576.jpg 1024w, 
https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-1536x864.jpg 1536w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-696x392.jpg 696w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-1068x601.jpg 1068w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/SML-CDMRN-Computational-Social-Science-Summer-School-1-747x420.jpg 747w\" sizes=\"(max-width: 1600px) 100vw, 1600px\" \/><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\"><strong>The workshop is part of the <a href=\"https:\/\/www.icwsm.org\/2024\/index.html\/\" target=\"_blank\" rel=\"noreferrer noopener\">2024 AAAI ICWSM<\/a> <\/strong><\/p>\n\n\n\n<p class=\"has-text-align-left wp-block-paragraph\"><strong>Where: <\/strong>Buffalo, NY<\/p>\n\n\n\n<p class=\"has-text-align-left wp-block-paragraph\"><strong>When: <\/strong>June 3, 2024<\/p>\n\n\n\n<p class=\"has-text-align-left wp-block-paragraph\">The event is organized by the Social Media Lab at Toronto Metropolitan University and hosted as part of the 18th International AAAI Conference on Web and Social Media (ICWSM).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Conference registration is required to participate in the event.<\/p>\n\n\n\n<h5 class=\"wp-block-heading\"><strong>Contact Info<\/strong><\/h5>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"mailto:info@socialmedialab.ca\" target=\"_blank\" rel=\"noreferrer noopener\">info@socialmedialab.ca<\/a><br>X <a href=\"https:\/\/twitter.com\/smlabto\" target=\"_blank\" rel=\"noreferrer noopener\">@SMLabTO<\/a><\/p>\n\n\n\n<div style=\"height:28px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n\n\n\n<h3 class=\"wp-block-heading\">Agenda<\/h3>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Introduction to 
Communalytic and Data Collection from Social Media (20 min)<\/li>\n\n\n\n<li>Representing Posts as Embeddings (20 min)<\/li>\n\n\n\n<li>Projecting and Visualizing Embeddings (20 min)<\/li>\n\n\n\n<li>Hands-on Part (60 min)<\/li>\n<\/ol>\n\n\n\n<p class=\"wp-block-paragraph\">Participants need a laptop with internet access and a modern web browser. The primary tool used during the tutorial is Communalytic, which runs in a web browser and requires no additional software.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Upon completion of the tutorial, participants should be able to: 1) collect publicly available social media data from platforms such as Reddit, Telegram and Mastodon using Communalytic, and 2) conduct a topic analysis of the collected data.<\/p>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:66.66%\">\n<h3 class=\"wp-block-heading\">Objectives<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">This hands-on tutorial at <a href=\"https:\/\/www.icwsm.org\/2024\/index.html\/index.html\">#ICWSM2024<\/a> will introduce participants to <a href=\"https:\/\/communalytic.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">Communalytic<\/a>, a research tool developed by the Social Media Lab for studying online communities and discourse. The session will include an overview of Communalytic&#8217;s features and a step-by-step guide to using its built-in topic analysis module.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">By the end of the tutorial, participants will know how to use a large language model (LLM) to transform social media data into vectors of numbers known as embeddings. 
The tutorial will also show attendees how to visualize the resulting vectors in Nomic Atlas, a third-party tool for representing and exploring embeddings as an interactive map, with labels assigned automatically based on the semantic similarity of the posts&#8217; content.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Considering the interdisciplinary nature of this area, we welcome participants from a wide range of disciplines, including (but not limited to) Information Science, Communication, Education, Journalism, Management, Political Science, Psychology and Sociology.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Background<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Current topic modelling techniques such as Latent Dirichlet Allocation (LDA) and BERTopic often identify abstract, non-descriptive topics that are challenging for human analysts to interpret. This is caused in part by the fact that topics in LDA and BERTopic are typically represented as a set of tokens and their probabilities (Fig 1). To overcome this limitation, this tutorial introduces an alternative approach using embeddings and clustering. 
<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This method has a distinct advantage: it lets researchers view a high-level map of posts clustered by semantic similarity and then zoom in on specific clusters to examine the underlying posts (Fig 2).<\/p>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-8f761849 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><strong>Fig 1<\/strong>: Example of Topic Modelling Visualization based on LDA.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full is-style-default\"><img decoding=\"async\" width=\"493\" height=\"286\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/image-1.png\" alt=\"\" class=\"wp-image-22370\" srcset=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/image-1.png 493w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/image-1-300x174.png 300w\" sizes=\"(max-width: 493px) 100vw, 493px\" \/><\/figure>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<p class=\"wp-block-paragraph\"><strong>Fig 2<\/strong>: Example of Visualization of Social Media Posts based on Embeddings.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"552\" height=\"309\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/image-2.png\" alt=\"\" class=\"wp-image-22371\" srcset=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/image-2.png 552w, https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2024\/03\/image-2-300x168.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/figure>\n<\/div>\n<\/div>\n\n\n\n<div style=\"height:25px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">Organizers<\/h3>\n\n\n<style type=\"text\/css\"><\/style>\r\n\t\t<div 
 class=\"team-container\" style=\"background-image:url();text-align:left;\">\r\n\t\t<ul  id=\"team-22375\" class=\"team-items team-rounded\"><li style=\"width:;text-align:left;margin:\" class=\"team-item even\" ><div class=\"team-post\"><div style=\"height:;\" class=\"team-thumb\"><img decoding=\"async\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2015\/02\/Gruzd_profile_photo_network.jpg\" \/><\/div><div class=\"team-title\" style=\"color:;font-size:14px\">Anatoliy Gruzd, PhD\r\n\t\t\t<\/div><div class=\"team-social\" ><span class=\"Twitter\">\r\n\t\t\t\t\t\t<a target=\"_blank\" href=\"https:\/\/twitter.com\/gruzd\"> <\/a>\r\n\t\t\t\t\t<\/span><span class=\"website\">\r\n\t\t\t\t\t\t<a target=\"_blank\" href=\"http:\/\/anatoliygruzd.ca\/\"><\/a>\r\n\t\t\t\t\t<\/span><\/div><div class=\"team-content\" style=\"color:;font-size:13px;\">Canada Research Chair | Co-Director, Social Media Lab | Professor, Information Technology Management, Toronto Metropolitan University, Canada<\/div><\/div>\r\n\r\n\t\t<\/li><li style=\"width:;text-align:left;margin:\" class=\"team-item odd\" ><div class=\"team-post\"><div style=\"height:;\" class=\"team-thumb\"><img decoding=\"async\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2015\/02\/Philip_Headshot-4.jpg\" \/><\/div><div class=\"team-title\" style=\"color:;font-size:14px\">Philip Mai, MA, JD\r\n\t\t\t<\/div><div class=\"team-social\" ><span class=\"Twitter\">\r\n\t\t\t\t\t\t<a target=\"_blank\" href=\"https:\/\/twitter.com\/PhMai\"> <\/a>\r\n\t\t\t\t\t<\/span><span class=\"website\">\r\n\t\t\t\t\t\t<a target=\"_blank\" href=\"https:\/\/philipmai.com\/\"><\/a>\r\n\t\t\t\t\t<\/span><\/div><div class=\"team-content\" style=\"color:;font-size:13px;\">Co-Director, Social Media Lab, Ted Rogers School of Management, Toronto Metropolitan University, Canada<\/div><\/div>\r\n\r\n\t\t<\/li><li style=\"width:;text-align:left;margin:\" class=\"team-item even\" ><div class=\"team-post\"><div style=\"height:;\" 
class=\"team-thumb\"><img decoding=\"async\" src=\"https:\/\/socialmedialab.ca\/web\/wp-content\/uploads\/2023\/05\/AmiraGhenai-1-1200x1200.jpg\" \/><\/div><div class=\"team-title\" style=\"color:;font-size:14px\">Amira Ghenai, PhD\r\n\t\t\t<\/div><div class=\"team-social\" ><span class=\"website\">\r\n\t\t\t\t\t\t<a target=\"_blank\" href=\"https:\/\/aghenai.github.io\/\"><\/a>\r\n\t\t\t\t\t<\/span><\/div><div class=\"team-content\" style=\"color:;font-size:13px;\">Assistant Professor, Information Technology Management, Toronto Metropolitan University, Canada<\/div><\/div>\r\n\r\n\t\t<\/li><\/ul><\/div>\n<\/div>\n<\/div>\n\n\n\n<div style=\"height:26px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n","protected":false},"excerpt":{"rendered":"<p>The workshop is part of the 2024 AAAI ICWSM Where: Buffalo, NY When: June 3, 2024 The event is organized by the Social Media Lab at Toronto Metropolitan University and hosted as part of the 18th International AAAI Conference on Web and Social Media (ICWSM). Conference registration is required to participate in the event. 
Contact [&hellip;]<\/p>\n","protected":false},"author":49,"featured_media":22371,"parent":6246,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-22366","page","type-page","status-publish","has-post-thumbnail","hentry"],"_links":{"self":[{"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/pages\/22366","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/users\/49"}],"replies":[{"embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/comments?post=22366"}],"version-history":[{"count":39,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/pages\/22366\/revisions"}],"predecessor-version":[{"id":22548,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/pages\/22366\/revisions\/22548"}],"up":[{"embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/pages\/6246"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/media\/22371"}],"wp:attachment":[{"href":"https:\/\/socialmedialab.ca\/web\/wp-json\/wp\/v2\/media?parent=22366"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}