SOCI 269
An Introduction to Quantitative Sociology—Culture and Power

Amherst College

Key Information

Instructor

  Sakeef M. Karim
  skarim@amherst.edu

Location

  Seeley Mudd Room 014  

Time

Spring 2025—Tuesdays and Thursdays—2:30 to 3:50 PM

Office Hours

Fridays, 2:30-5:00 PM in Morgan Hall (Room 203 A) or during a Zoom Open Slot.

Sakeef M. Karim’s Appointment Policy

All meetings, even during office hours, must be scheduled in advance via Google Calendar.

Course Description

How do sociologists define, model, and visualize social phenomena using quantitative tools and statistical software? This seminar will provide a technical, theoretical and practical overview. During the semester, students will learn how to use and Python to clean, analyze and visualize data that are suitable for sociological analysis. At the same time, the course will interrogate how social inequality can be masked—and deeply pernicious ideas can be reproduced—if quantitative data analyses are not informed by, or sensitized to, social theory and the hierarchies of power and privilege that structure the social world. To this end, we will engage with recent work in cultural sociology that draws attention to variation within and across social groups (defined in terms of race, gender, class and so on) to understand how social inequalities emerge and endure. Throughout the course, we will scrutinize policy-relevant social issues while discussing topics like race, ethnicity, religion, class, gender and sexuality.

Caveat Emptor

Prior knowledge of statistics or programming is not required but may be an asset.

Figure 2 from Karim (2024b)

Structure

The course consists of four distinct modules:

  • Module I will provide a comprehensive introduction to the programming language for statistical computing and visualization, with significant attention paid to data visualization using ggplot2. Classes will largely be hands-on and interactive. That is, students will regularly work on simple coding exercises during lecture sessions. Collaboration will be encouraged.

  • Module II will shift focus to the substantive realm—i.e., by spotlighting applied quantitative research published in many of sociology’s flagship journals. In Module II, I will begin with a basic lecture informed by the week’s readings. Then, I will toss the baton over to all of you. Working in small groups, you will respond to the questions or prompts I provide. Each synchronous session will conclude with a plenary discussion, where we will explore the themes that emerged during your small group conversations.

  • Returning to statistical computing, Module III introduces the powerful Python programming language. During the module, we will manipulate tabular data using pandas and polars, visualize data with seaborn, and build basic machine learning algorithms using scikit-learn.

  • Module IV will feature a series of in-class presentations related to your term papers.

Readings

Data Visualization: A Practical Introduction
(Healy 2019)

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and Jupyter
(McKinney 2022)

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data
(Wickham, Çetinkaya-Rundel, and Grolemund 2023)

ggplot2: Elegant Graphics for Data Analysis
(Wickham, Navarro, and Pedersen 2025)

Supplemental readings are available through the mystifying power of Moodle. As you plan for the semester, it may be useful to bookmark the eReserves page on our course website. New readings may be introduced as the world around us evolves, whether via the incremental march of science or in response to social, economic and political shocks that warrant further reflection or empirical scrutiny.

Evaluations

A Bird’s Eye View

Task Description Weight Deadline or Evaluative Time Horizon
Participation Students are expected to actively participate in class discussions by raising their hands to share ideas, asking clarifying questions, assisting peers when needed, and providing constructive feedback during final presentations. 10% All Semester
Coding Assignment Students are required to submit a short coding assignment in early March. For this assignment, they will clean a dataset in R, report basic descriptive statistics, and create simple data visualizations. Students must also include their script file (i.e., a .R document) as part of their submission. Additional assignment instructions can be accessed here. 10% Monday, March 10th at 8:00 PM.
Midterm Assignment For their midterm assignment, students must—either individually or in groups of 2-3—submit a relatively complex data visualization; an annotated script file or Quarto/RMarkdown document featuring their underlying code; and a 5–10-page reflection memo (double-spaced) where they interpret their results and establish connections between their visualization and recent social scientific scholarship. Additional assignment instructions can be found here. 30% Friday, April 4th at 8:00 PM.
Final Presentations In Module IV of the class, students will deliver a 10–15-minute presentation based on, or informed by, their term paper. A rubric detailing my basic expectations will be included in this syllabus by early April. 15% During Module IV
Term Paper Drawing on the applied examples featured in Module II, students must submit a term paper on a topic related to (i) race, ethnicity and nation; (ii) gender and sexuality; or (iii) culture. To earn an A, students must also submit a companion data visualization using a truncated version of the General Social Survey, which will be made available on Moodle. Students are free to create this visualization in either R or Python. Additional assignment instructions will be posted online by early April. 35% Friday, May 9th at 8:00 PM.

Guidelines for Key Deliverables

Guidelines for Major Assignments

Guidelines for the three key deliverables—i.e., the midterm assignment; the final presentation; and the term paper—will be gradually rolled out (or uploaded online) as deadlines come into focus.

Coding Assignment

You can access the assignment instructions by clicking here or expanding the tab below.

“Midterm” Assignment

To access assignment instructions, click here or expand the tab below.

Norms, Rules & Regulations

Please review the Amherst College Honor Code, which can be accessed in its entirety here.

Violations of the Honor Code will be promptly reported to the Dean of Students. As Section 1.1 of the Honor Code indicates, plagiarism is a serious offense. In most cases, students who plagiarize the work of others will fail this class and may face additional disciplinary penalties. Moreover, as detailed in Sections 1.2 to 1.4 of the Honor Code, students must respect others in the classroom, including those whose views deviate from their own. Failure to do so will prompt disciplinary action.

There is no reason to pretend like generative artificial intelligence (GAI) does not exist in the world out there. These systems have arrived, and they may revolutionize how higher education “works.” With this in mind, you are free to use ChatGPT and its analogues for class assignments—but you have to cite the GAI you are using. Failure to do so amounts to plagiarism.

To reiterate:

Generative AI Policy

If you use a GAI tool (like ChatGPT) and do not cite it, it is a form of plagiarism.

You are expected to attend each and every class. If you do not, you will lose points for participation. That said, I am aware that you are all human beings whose lives are often fraught with uncertainty. If something comes up, please let me know and I will do my best to be as accommodating as possible. Extended absences may, however, require additional documentation (e.g., note from a physician).

Provisionally, I have decided to allow students to use laptops and tablets in class. This is, however, highly conditional. If I observe students using their electronic devices for non-academic pursuits (e.g., shopping, consuming social media and so on), I will institute a sweeping ban on electronics. Do not be the one to contravene our social contract.

On weekdays and non-holidays, I will respond to e-mails within 48 hours. If I fail to meet this standard, please send me a follow-up message with a gentle reminder. On weekends1 and breaks, I will not respond to e-mails unless you have an emergency. If you do, please include EMERGENCY in the subject line.

Assignments must be submitted on time. A late submission will result in a penalty of 5% for each day beyond the deadline.2 However, as noted, I am well aware that life can present unexpected challenges. If you anticipate missing a deadline or have an emergency, please let me know soon as you can. Extensions may be granted on a case-by-case basis.

Accessibility and Accommodations

If you require accommodations, please contact Student Accessibility Services as soon as possible and submit an application through the new AIM Portal. More generally, if you have any suggestions about how this class can be more accessible and inclusive, please let me know via e-mail or during office hours.

Weekly Schedule

Course Readings

As noted, non-textbook readings can be accessed via the eReserves page on our course website.

Readings underlined below are recommended but optional.

Module I: An Introduction to

Week 1: Introduction to the Course and Setting Up

January 28th & January 30th

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data
(Wickham et al. 2023)

Transcending General Linear Reality
(Abbott 1988)

Sequence Analysis: New Methods for Old Ideas
(Abbott 1995)

Endogenous Selection Bias: The Problem of Conditioning on a Collider Variable
(Elwert and Winship 2014)

What Is Your Estimand? Defining the Target Quantity Connects Statistical Evidence to Theory
(Lundberg, Johnson, and Stewart 2021)

Inequality without Groups: Contemporary Theories of Categories, Intersectional Typicality, and the Disaggregation of Difference
(Monk 2022)

Computational Grounded Theory: A Methodological Framework
(Nelson 2020)

Week 2: Cleaning and Data Analysis in

February 4th & February 6th

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data
(Wickham et al. 2023)

Week 3: Cleaning and Data Analysis in II

February 11th & February 13th

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data
(Wickham et al. 2023)

Week 4: Data Visualization with ggplot2

February 18th & February 20th

Data Visualization: A Practical Introduction
(Healy 2019)

ggplot2: Elegant Graphics for Data Analysis
(Wickham et al. 2025)

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data
(Wickham et al. 2023)

Week 5: Data Visualization with ggplot2 II

February 25th & February 27th

R for Data Science: Import, Tidy, Transform, Visualize, and Model Data
(Wickham et al. 2023)

Data Visualization: A Practical Introduction
(Healy 2019)

ggplot2: Elegant Graphics for Data Analysis
(Wickham et al. 2025)

Module II: Applied Examples

Week 6: Race, Ethnicity and Nation

March 4th & March 6th

Contraction as a Response to Group Threat: Demographic Decline and Whites’ Classification of People Who Are Ambiguously White
(Abascal 2020)

The Partisan Sorting of “America”: How Nationalist Cleavages Shaped the 2016
U.S. Presidential Election

(Bonikowski, Feinstein, and Bock 2021)

The Politics of Police
(Donahue 2023)

The Organization of Ethnocultural Attachments Among Second-Generation Germans
(Karim 2024b)

Separate from Class? Toward a Theory of Race as Resource Signal
(Torres 2024)

Reclaiming the Past to Transcend the Present: Nostalgic Appeals in U.S. Presidential Elections
(Bonikowski and Stuhler 2022)

The Curious Case of Black ‘Conservatives’: Assessing the Validity of the Liberal-Conservative Scale among Black Americans
(Jefferson 2024)

Administrative Records Mask Racially Biased Policing
(Knox, Lowe, and Mummolo 2020)

Coding Assignment Deadline

Your coding assignments are due by 8:00 PM on Monday, March 10th.

Week 7: Gender and Sexuality

March 11th & March 13th

School, Studying, and Smarts: Gender Stereotypes and Education Across 80 Years
of American Print Media

(Boutyline, Arseniev-Koehler, and Cornell 2023)

Abortion and Women’s Future Socioeconomic Attainment
(Everett and Taylor 2024)

Has There Been a Transgender Tipping Point? Gender Identification Differences in U.S. Cohorts Born between 1935 and 2001
(Lagos 2022)

Money, Birth, Gender: Explaining Unequal Earnings Trajectories Following Parenthood
(Machado and Jaspers 2023)

Intersecting the Academic Gender Gap: The Education of Lesbian, Gay, and Bisexual America
(Mittleman 2022)

Sexual Orientation Identity Mobility in the United Kingdom: A Research Note
(Hu and Denier 2023)

Marriage, Cohabitation, and Institutional Context: Household Specialization among Same-Sex and Different-Sex Couples
(Yang 2025)

Week 8: Spring Break—

Week 9: Culture

March 25th & March 27th

Pluralistic Collapse: The ‘Oil Spill’ Model of Mass Opinion Polarization
(DellaPosta 2020)

Mapping Shared Understandings Using Relational Class Analysis: The Case of the Cultural Omnivore Reexamined
(Goldberg 2011)

Islam and the Transmission of Cultural Identity in Four European Countries
(Karim 2024a)

Life-Course Transitions and Political Orientations
(Keskintürk 2024)

Change in Personal Culture over the Life Course
(Lersch 2023)

Theoretical Foundations and Limits of Word Embeddings: What Types of Meaning Can
They Capture?

(Arseniev-Koehler 2024)

Cultural Schemas: What They Are, How to Find Them, and What to Do Once You’ve Caught One
(Boutyline and Soter 2021)

Party, Race, and Neutrality: Investigating the Interdependence of Attitudes toward Social Groups
(Brensinger and Sotoudeh 2022)

Module III: An Introduction to Python

Week 10: Cleaning and Data Analysis in Python

April 1st & 3rd

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and Jupyter
(McKinney 2022)

Midterm Assignment Deadline

Your midterm assignments are due by 8:00 PM on Friday, April 4th.

Week 11: A Very Gentle Introduction to Machine Learning in Python

April 8th

No required readings.

Researcher Reasoning Meets Computational Capacity: Machine Learning for Social Science
(Lundberg, Brand, and Jeon 2022)

Machine Learning for Sociology
(Molina and Garip 2019)

Leveraging the Alignment between Machine Learning and Intersectionality: Using Word Embeddings to Measure Intersectional Experiences of the Nineteenth Century U.S. South
(Nelson 2021)

Predictability Hypotheses: A Meta-Theoretical and Methodological Introduction
(Loon 2022)

Week 12: Data Visualization with seaborn

April 15th & 17th

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and Jupyter
(McKinney 2022)

Module IV: The Final Stretch

Week 13: Final Presentations I

April 24th

Week 14—

Week 15: Final Presentations II

May 6th
Term Paper Deadline

Your term papers are due by 8:00 PM on Friday, May 9th.

Slide Decks

Slides should be up within 24 hours of a class session.

Module I: An Introduction to

Week 1

Introduction to the Course and Setting Up

Week 2

Cleaning and Data Analysis in

Week 3

Cleaning and Data Analysis in II

Week 4

Introduction to ggplot2

Week 5

Introduction to ggplot2 II

Module II: Applied Examples

Week 6

Race, Ethnicity and Nation

Week 7

Gender and Sexuality

Week 9

Culture

References

Abascal, Maria. 2020. “Contraction as a Response to Group Threat: Demographic Decline and WhitesClassification of People Who Are Ambiguously White.” American Sociological Review 85(2):298–322. doi: 10.1177/0003122420905127.
Abbott, Andrew. 1988. “Transcending General Linear Reality.” Sociological Theory 6(2):169–86. doi: 10.2307/202114.
Abbott, Andrew. 1995. Sequence Analysis: New Methods for Old Ideas.” Annual Review of Sociology 21:93–113.
Arseniev-Koehler, Alina. 2024. “Theoretical Foundations and Limits of Word Embeddings: What Types of Meaning Can They Capture?” Sociological Methods & Research 53(4):1753–93. doi: 10.1177/00491241221140142.
Bonikowski, Bart, Yuval Feinstein, and Sean Bock. 2021. “The Partisan Sorting of America: How Nationalist Cleavages Shaped the 2016 U.S. Presidential Election.” American Journal of Sociology 127(2):492–561. doi: 10.1086/717103.
Bonikowski, Bart, and Oscar Stuhler. 2022. “Reclaiming the Past to Transcend the Present: Nostalgic Appeals in U.S. Presidential Elections.” Sociological Forum 37(S1):1263–93. doi: 10.1111/socf.12838.
Boutyline, Andrei, Alina Arseniev-Koehler, and Devin J. Cornell. 2023. “School, Studying, and Smarts: Gender Stereotypes and Education Across 80 Years of American Print Media, 1930–2009†.” Social Forces 102(1):263–86. doi: 10.1093/sf/soac148.
Boutyline, Andrei, and Laura K. Soter. 2021. “Cultural Schemas: What They Are, How to Find Them, and What to Do Once You’ve Caught One.” American Sociological Review 86(4):728–58. doi: 10.1177/00031224211024525.
Brensinger, Jordan, and Ramina Sotoudeh. 2022. “Party, Race, and Neutrality: Investigating the Interdependence of Attitudes Toward Social Groups.” American Sociological Review 87(6):1049–93. doi: 10.1177/00031224221135797.
DellaPosta, Daniel. 2020. “Pluralistic Collapse: The Oil Spill Model of Mass Opinion Polarization.” American Sociological Review 85(3):507–36. doi: 10.1177/0003122420922989.
Donahue, Samuel Thomas. 2023. “The Politics of Police.” American Sociological Review 88(4):656–80. doi: 10.1177/00031224231173070.
Elwert, Felix, and Christopher Winship. 2014. “Endogenous Selection Bias: The Problem of Conditioning on a Collider Variable.” Annual Review of Sociology 40(Volume 40, 2014):31–53. doi: 10.1146/annurev-soc-071913-043455.
Everett, Bethany G., and Catherine J. Taylor. 2024. “Abortion and Women’s Future Socioeconomic Attainment.” American Sociological Review 89(6):1044–74. doi: 10.1177/00031224241292058.
Goldberg, Amir. 2011. “Mapping Shared Understandings Using Relational Class Analysis: The Case of the Cultural Omnivore Reexamined.” American Journal of Sociology 116(5):1397–1436. doi: 10.1086/657976.
Healy, Kieran Joseph. 2019. Data Visualization: A Practical Introduction. Princeton, NJ: Princeton University Press.
Hu, Yang, and Nicole Denier. 2023. “Sexual Orientation Identity Mobility in the United Kingdom: A Research Note.” Demography 60(3):659–73. doi: 10.1215/00703370-10769825.
Jefferson, Hakeem. 2024. “The Curious Case of Black Conservatives: Assessing the Validity of the Liberal-Conservative Scale Among Black Americans.” Public Opinion Quarterly 88(3):909–32. doi: 10.1093/poq/nfae037.
Karim, Sakeef M. 2024a. “Islam and the Transmission of Cultural Identity in Four European Countries.” Social Forces 103(2):756–79. doi: 10.1093/sf/soae076.
Karim, Sakeef M. 2024b. “The Organization of Ethnocultural Attachments Among Second- Generation Germans.” Social Science Research 118:102959. doi: 10.1016/j.ssresearch.2023.102959.
Keskintürk, Turgut. 2024. “Life-Course Transitions and Political Orientations.” Sociological Science 11:907–33. doi: 10.15195/v11.a33.
Knox, Dean, Will Lowe, and Jonathan Mummolo. 2020. “Administrative Records Mask Racially Biased Policing.” American Political Science Review 114(3):619–37. doi: 10.1017/S0003055420000039.
Lagos, Danya. 2022. “Has There Been a Transgender Tipping Point? Gender Identification Differences in U.S. Cohorts Born Between 1935 and 2001.” American Journal of Sociology 128(1):94–143. doi: 10.1086/719714.
Lersch, Philipp M. 2023. “Change in Personal Culture over the Life Course.” American Sociological Review 88(2):220–51. doi: 10.1177/00031224231156456.
Loon, Austin van. 2022. Predictability Hypotheses: A Meta-Theoretical and Methodological Introduction.”
Lundberg, Ian, Jennie E. Brand, and Nanum Jeon. 2022. “Researcher Reasoning Meets Computational Capacity: Machine Learning for Social Science.” Social Science Research 108:102807. doi: 10.1016/j.ssresearch.2022.102807.
Lundberg, Ian, Rebecca Johnson, and Brandon M. Stewart. 2021. “What Is Your Estimand? Defining the Target Quantity Connects Statistical Evidence to Theory.” American Sociological Review 86(3):532–65. doi: 10.1177/00031224211004187.
Machado, Weverthon, and Eva Jaspers. 2023. “Money, Birth, Gender: Explaining Unequal Earnings Trajectories Following Parenthood.” Sociological Science 10:429–53. doi: 10.15195/v10.a14.
McKinney, Wes. 2022. Python for Data Analysis: Data Wrangling with Pandas, NumPy, and Jupyter. 3rd Edition. Sebastopol, CA: O’Reilly.
Mittleman, Joel. 2022. “Intersecting the Academic Gender Gap: The Education of Lesbian, Gay, and Bisexual America.” American Sociological Review 87(2):303–35. doi: 10.1177/00031224221075776.
Molina, Mario, and Filiz Garip. 2019. “Machine Learning for Sociology.” Annual Review of Sociology 45(Volume 45, 2019):27–45. doi: 10.1146/annurev-soc-073117-041106.
Monk, Ellis P. 2022. “Inequality Without Groups: Contemporary Theories of Categories, Intersectional Typicality, and the Disaggregation of Difference.” Sociological Theory 40(1):3–27. doi: 10.1177/07352751221076863.
Nelson, Laura K. 2020. “Computational Grounded Theory: A Methodological Framework.” Sociological Methods & Research 49(1):3–42. doi: 10.1177/0049124117729703.
Nelson, Laura K. 2021. “Leveraging the Alignment Between Machine Learning and Intersectionality: Using Word Embeddings to Measure Intersectional Experiences of the Nineteenth Century U.S. South.” Poetics 88:101539. doi: 10.1016/j.poetic.2021.101539.
Torres, Mo. 2024. “Separate from Class? Toward a Theory of Race as Resource Signal.” Social Problems spae044. doi: 10.1093/socpro/spae044.
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. 2023. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. 2nd edition. Sebastopol, CA: O’Reilly.
Wickham, Hadley, Danielle Navarro, and Thomas Lin Pedersen. 2025. ggplot2: Elegant Graphics for Data Analysis. 3rd Edition. New York: Springer.
Yang, Chih-lan Winnie. 2025. “Marriage, Cohabitation, and Institutional Context: Household Specialization Among Same-Sex and Different-Sex Couples.” Journal of Marriage and Family 87(1):300–321. doi: 10.1111/jomf.13002.

Footnotes

  1. Concretely, Friday at 5:00 PM to Monday at 8:00 AM.↩︎

  2. If an assignment is due at 8:00 PM and you submit it at 8:03 PM, you will be considered a day late.↩︎