Demszky Dorottya előadást tart Content Analysis of Textbooks via Natural Language Processing címmel

Demszky Dorottya (Stanford University, Linguistics Department) 2020. november 24-én 10:00-tól előadást tart Content Analysis of Textbooks via Natural Language Processing: Findings on Gender, Race, and Ethnicity in Texas U.S. History Textbooks címmel

Az előadás absztraktja:

Cutting-edge data science techniques can shed new light on fundamental questions in educational research. We apply techniques from natural language processing (lexicons, word embeddings, topic models) to 15 U.S. history textbooks widely used in Texas between 2015 and 2017, studying their depiction of historically marginalized groups. We find that Latinx people are rarely discussed, and the most common famous figures are nearly all White men. Lexicon-based approaches show that Black people are described as performing actions associated with low agency and power. Word embeddings reveal that women tend to be discussed in the contexts of work and the home. Topic modeling highlights the higher prominence of political topics compared with social ones. We also find that more conservative counties tend to purchase textbooks with less representation of women and Black people. Building on a rich tradition of textbook analysis, we release our computational toolkit to support new research directions.

Az előadás magyar nyelvű. Az előadó az előadás során ki fog térni az említett módszerek magyar szövegekre vonatkozó lehetséges alkalmazásaira.

A rendezvény a járványügyi szabályok betartása mellett online formában kerül megrendezésre. A részvétel regisztációhoz kötött. Regisztrálni 2020. november 20-ig az alábbi linken lehet.