<img height="1" width="1" style="display:none" src="https://www.facebook.com/tr?id=648880075207035&amp;ev=PageView&amp;noscript=1">

Webinar Recording

Analyzing the 2016 General Social Survey with BayesiaLab's Unsupervised Learning Algorithms

Recorded on February 16, 2018


Webinar Materials


In this webinar, we explore the high-dimensional relationships between hundreds of variables from the 2016 General Social Survey.[1] Our objective is to employ one of BayesiaLab's Unsupervised Learning algorithms to generate a Bayesian network model that approximates the joint probability distribution of the underlying survey responses. 

Once we have this representation, we can address research questions that specifically require computing the joint probability. In this context, a prototypical—albeit theoretical—issue would be how to best approximate the diversity of many voters through a small number of elected representatives. This translates directly into a variable and data clustering task, which will be at the core of our presentation. Our objective will be to trade off the faithfulness of voter representation with the number of variable/data clusters to be generated.

Workflow to be presented in the webinar:

  • Transform original GSS data in SPSS format with Stat/Transfer 14.
  • Import data, variables labels, and value labels into BayesiaLab.
  • Define variable classes, e.g., demographics, voting behavior, etc.
  • Perform Unsupervised Learning.
  • Optimize Bayesian network model with Structural Coefficient Analysis.
  • Identify Latent Factors through BayesiaLab's Variable Clustering.
  • Cluster voters into segments with Data Clustering (Multi-Net).

[1] Smith, Tom W, Peter Marsden, Michael Hout, and Jibum Kim. General Social Surveys, 1972-2014 [machine-readable data file] /Principal Investigator, Tom W. Smith; Co-Principal Investigator, Peter V. Marsden; Co-Principal Investigator, Michael Hout; Sponsored by National Science Foundation. --NORC ed.-- Chicago: NORC at the University of Chicago [producer]; Storrs, CT: The Roper Center for Public Opinion Research, University of Connecticut [distributor], 2015.

1 data file (57,061 logical records) + 1 codebook (3,567p.).  (National Data Program for the Social Sciences, No. 22).

BayesiaLab Courses

May 8–10, 2019 Singapore Introductory Course (3 Days)
May 13–15, 2019 Sydney, Australia Introductory Course (3 Days)
May 21–23, 2019 Paris, France Advanced Course (3 Days, in French) 
June 5, 2019 Washington, D.C. BayesiaLab 101 Short Course (1 Day)
June 12–14, 2019 Seattle, WA Introductory Course (3 Days)
June 17–19, 2019 Seattle, WA Advanced Course (3 Days)

Upcoming Seminars, Webinars, and Conferences

Live Webinar May 16, 2019 11:00 – 12:00 (CDT, UTC-5) Human-Machine Teaming
Live Webinar May 30, 2019 11:00 – 12:00 (CDT, UTC-5) Causal Counterfactuals for Contribution Analysis — Explaining a Misunderstood Concept with Bayesian Networks
Live Webinar June 13, 2019 11:00 – 12:00 (CDT, UTC-5) Black Swans & Bayesian Networks — Jointly Representing Common and Rare Events
Please check out our archive of recordings of previous events.

7th Annual BayesiaLab Conference

October 7–9, 2019 Durham, NC 3-Day Introductory Course
October 10–11, 2019 Durham, NC 7th Annual BayesiaLab Conference
October 14–16, 2019 Durham, NC 3-Day Advanced Course