Department of Psychology: Dr Tessa Charlesworth, "A word is characterized by the company it keeps": Using word embeddings to uncover social beliefs in child and adult language

January 13, 2020, 2:45 pm to 4:30 pm

Suedfeld Lounge, Kenny Room 2510, 2136 West Mall

Social beliefs are known to be reflected in the language of children and adults. But how can they be quantitatively studied to understand the relative strength of social beliefs across language from different sources (e.g., books vs. speech), different speakers (e.g., children vs. adults), and even different time periods (e.g., 1900 vs. present-day)?

Advances in machine learning (word embeddings) can transform large text corpora into vectors, offering a new way to quantify social beliefs in real-world natural language. In this project, we use word embeddings derived from 7 corpora (65+ million words) to provide the first comprehensive test of beliefs about gender in children’s and adults’ language, including speech, TV/movies, and books.

Gender beliefs associating male/female with well-studied concepts (e.g., work/home, science/arts) were consistently present across corpora. Moreover, gender beliefs associating male/female with 600+ traits and 300+ professions were pervasive, with 71% of traits and 79% of professions showing medium-to-large associations with gender. Descriptive differences by language source, speaker age, and time period emerged as well.
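
The abstract does not say exactly how association strength is computed. One common way to measure it with word embeddings is a WEAT-style effect size built on cosine similarity. The sketch below assumes that approach. The function names (`cosine`, `association`, `effect_size`) and the toy 2-dimensional vectors are hypothetical illustrations, not the speaker's code or data.

```python
import math
from statistics import mean, stdev

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def association(word_vec, male_vecs, female_vecs):
    """Mean similarity to male words minus mean similarity to female words."""
    to_male = mean(cosine(word_vec, m) for m in male_vecs)
    to_female = mean(cosine(word_vec, f) for f in female_vecs)
    return to_male - to_female

def effect_size(targets_x, targets_y, male_vecs, female_vecs):
    """WEAT-style standardized difference between two target word sets.

    Assumption: the difference in mean association is divided by the
    sample standard deviation over all target words.
    """
    sx = [association(w, male_vecs, female_vecs) for w in targets_x]
    sy = [association(w, male_vecs, female_vecs) for w in targets_y]
    return (mean(sx) - mean(sy)) / stdev(sx + sy)

# Toy vectors invented for illustration only (not taken from any corpus).
male = [[1.0, 0.1], [0.9, 0.2]]
female = [[0.1, 1.0], [0.2, 0.9]]
work = [[0.8, 0.3], [0.9, 0.1]]
home = [[0.2, 0.8], [0.3, 0.9]]

d = effect_size(work, home, male, female)
print(f"work/home vs. male/female effect size: {d:.2f}")
```

A positive value here means the "work" vectors sit closer to the male vectors than the "home" vectors do. Labels such as "medium" or "large" would then come from conventional effect-size thresholds, which the abstract does not specify.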

Together, these results illustrate a novel methodological approach that can promote new theories of whether, when, and to what extent consequential social beliefs emerge in children’s and adults’ real-world linguistic environments.


First Nations land acknowledgement

We acknowledge that UBC’s campuses are situated within the traditional territories of the Musqueam, Squamish and Tsleil-Waututh, and in the traditional, ancestral, unceded territory of the Syilx Okanagan Nation and their peoples.

