INTEGRATOR

Jan 27, 2020 demographic

Dirk Hovy, scientific director of DMI and Professor of Computer Science, has won an ERC Starting Grant of €1.5 million. His project INTEGRATOR, funded under grant agreement 949944, introduces demographic factors into language processing systems, with the goal of improving algorithmic performance, avoiding racism, sexism, and ageism, and opening up new applications.

What if I wrote that, “winning an ERC Grant, Dirk Hovy got a sick result”? Those familiar with the use of “sick” as a synonym for “great” or “awesome” among teenagers would think that Bocconi Knowledge had hired a very young writer (or someone posing as one). The rest would think I had gone crazy. Current artificial intelligence-based language systems wouldn’t have a clue.

“Natural language processing (NLP) technologies,” Prof. Hovy says, “fail to account for demographics both in understanding language and in generating it. And this failure prevents us from reaching human-like performance. It limits possible future applications and it introduces systematic bias against underrepresented demographic groups.”

🗞️🗞️ Related articles featured in Corriere Innovazione and Bocconi News.

demographic NLP

Publications

Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models

Emotions play important epistemological and cognitive roles in our lives, revealing our values and guiding our actions. Previous work …

Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Susanna Paoli, Alba Curry, Dirk Hovy

Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models

As diverse linguistic communities and users adopt large language models (LLMs), assessing their safety across languages becomes …

Fabio Pernisi, Dirk Hovy, Paul Röttger

My Answer is C: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models

The open-ended nature of language generation makes the evaluation of autoregressive large language models (LLMs) challenging. One …

Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank

XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models

Without proper safeguards, large language models will readily follow malicious instructions and generate toxic content. This risk …

Paul Röttger, Hannah Rose Kirk, Bertie Vidgen, Giuseppe Attanasio, Federico Bianchi, Dirk Hovy

Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts

Using large language models (LLMs) for educational applications like dialogue-based teaching is a hot topic. Effective teaching, …

Donya Rooein, Paul Röttger, Anastassia Shaitarova, Dirk Hovy

SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety

The last two years have seen a rapid growth in concerns around the safety of large language models (LLMs). Researchers and …

Paul Röttger, Fabio Pernisi, Bertie Vidgen, Dirk Hovy

Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution

Large language models (LLMs) reflect societal norms and biases, especially about gender. While societal biases and stereotypes have …

Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Alba Curry, Gavin Abercrombie, Dirk Hovy

Conversations as a Source for Teaching Scientific Concepts at Different Education Levels

Open conversations are one of the most engaging forms of teaching. However, creating those conversations in educational software is a …

Donya Rooein, Dirk Hovy

Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions

Emotions are a central aspect of communication. Consequently, emotion analysis (EA) is a rapidly growing field in natural language …

Flor Miriam Plaza-del-Arco, Alba Curry, Amanda Cercas Curry, Dirk Hovy

Classist Tools: Social Class Correlates with Performance in NLP

Since the foundational work of William Labov on the social stratification of language (Labov, 1964), linguistics has made concentrated …

Amanda Cercas Curry, Giuseppe Attanasio, Zeerak Talat, Dirk Hovy

Impoverished Language Technology: The Lack of (Social) Class in NLP

Since Labov’s (1964) foundational work on the social stratification of language, linguistics has dedicated concerted efforts …

Amanda Cercas Curry, Zeerak Talat, Dirk Hovy

Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation

Large Language Models (LLMs) exhibit remarkable text classification capabilities, excelling in zero- and few-shot learning (ZSL and …

Flor Miriam Plaza-del-Arco, Debora Nozza, Dirk Hovy

MilaNLP at SemEval-2023 Task 10: Ensembling Domain-Adapted and Regularized Pretrained Language Models for Robust Sexism Detection

We present the system proposed by the MilaNLP team for the Explainable Detection of Online Sexism (EDOS) shared task. We propose an …

Amanda Cercas Curry, Giuseppe Attanasio, Debora Nozza, Dirk Hovy

Respectful or Toxic? Using Zero-Shot Learning with Language Models to Detect Hate Speech

Hate speech detection faces two significant challenges: 1) the limited availability of labeled data and 2) the high variability of hate …

Flor Miriam Plaza-del-Arco, Debora Nozza, Dirk Hovy

Temporal and Second Language Influence on Intra-Annotator Agreement and Stability in Hate Speech Labelling

Much work in natural language processing (NLP) relies on human annotation. The majority of this implicitly assumes that annotator’s …

Gavin Abercrombie, Dirk Hovy, Vinodkumar Prabhakaran

The Ecological Fallacy in Annotation: Modeling Human Label Variation goes beyond Sociodemographics

Many NLP tasks exhibit human label variation, where different annotators give different labels to the same texts. This variation is …

Matthias Orlikowski, Paul Röttger, Philipp Cimiano, Dirk Hovy

The State of Profanity Obfuscation in Natural Language Processing Scientific Publications

Work on hate speech has made considering rude and harmful examples in scientific publications inevitable. This situation raises various …

Debora Nozza, Dirk Hovy

What about “em”? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns

As 3rd-person pronoun usage shifts to include novel forms, e.g., neopronouns, we need more research on identity-inclusive NLP. …

Anne Lauscher, Debora Nozza, Ehm Miltersen, Archie Crowley, Dirk Hovy

Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers

Demographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating demographic factors can …

Chia-chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto, Goran Glavaš

Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?

Large language models (LLMs) offer a range of new possibilities, including adapting the text to different audiences and their reading …

Donya Rooein, Amanda Cercas Curry, Dirk Hovy

Viewpoint: Artificial Intelligence Accidents Waiting to Happen?

Artificial Intelligence (AI) is at a crucial point in its development: stable enough to be used in production systems, and increasingly …

Federico Bianchi, Amanda Cercas Curry, Dirk Hovy

Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data

Twitter data have become essential to Natural Language Processing (NLP) and social science research, driving various scientific …

Federico Bianchi, Vincenzo Cutrona, Dirk Hovy

Bridging Fairness and Environmental Sustainability in Natural Language Processing

Fairness and environmental impact are important research directions for the sustainable development of artificial intelligence. …

Marius Hessenthaler, Emma Strubell, Dirk Hovy, Anne Lauscher

SocioProbe: What, When, and Where Language Models Learn about Sociodemographics

Pre-trained language models (PLMs) have outperformed other NLP models on a wide range of tasks. Opting for a more thorough …

Anne Lauscher, Federico Bianchi, Samuel R. Bowman, Dirk Hovy

Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages

Hate speech is a global phenomenon, but most hate speech datasets so far focus on English-language content. This hinders the …

Paul Röttger, Debora Nozza, Federico Bianchi, Dirk Hovy

Is It Worth the (Environmental) Cost? Limited Evidence for the Benefits of Diachronic Continuous Training

Language is constantly changing and evolving, leaving language models to quickly become outdated, both factually and linguistically. …

Giuseppe Attanasio, Debora Nozza, Federico Bianchi, Dirk Hovy

Welcome to the Modern World of Pronouns: Identity-Inclusive Natural Language Processing beyond Gender

The world of pronouns is changing – from a closed word class with few members to an open set of terms to reflect identities. However, …

Anne Lauscher, Archie Crowley, Dirk Hovy

Guiding the Release of Safer E2E Conversational AI through Value Sensitive Design

Over the last several years, end-to-end neural conversational agents have vastly improved their ability to carry unrestricted, …

A. Stevie Bergman, Gavin Abercrombie, Shannon Spruit, Dirk Hovy, Emily Dinan, Y-Lan Boureau, Verena Rieser

Hard and Soft Evaluation of NLP models with BOOtSTrap SAmpling - BooStSa

Natural Language Processing (NLP)’s applied nature makes it necessary to select the most effective and robust models. Producing …

Tommaso Fornaciari, Alexandra Uma, Massimo Poesio, Dirk Hovy

Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection

Transformer-based Natural Language Processing models have become the standard for hate speech detection. However, the unconscious use …

Giuseppe Attanasio, Debora Nozza, Eliana Pastor, Dirk Hovy

Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals

Current language technology is ubiquitous and directly influences individuals’ lives worldwide. Given the recent trend in AI on …

Debora Nozza, Federico Bianchi, Anne Lauscher, Dirk Hovy

Pipelines for Social Bias Testing of Large Language Models

The maturity level of language models is now at a stage in which many companies rely on them to solve various tasks. However, while …

Debora Nozza, Federico Bianchi, Dirk Hovy

Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks

Labelled data is the foundation of most natural language processing tasks. However, labelling data is difficult and there often are …

Paul Röttger, Bertie Vidgen, Dirk Hovy, Janet B. Pierrehumbert

XLM-EMO: Multilingual Emotion Prediction in Social Media Text

Detecting emotion in text allows social and computational scientists to study how people behave and react to online events. However, …

Federico Bianchi, Debora Nozza, Dirk Hovy

Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists

Natural Language Processing (NLP) models risk overfitting to specific terms in the training data, thereby reducing their performance, …

Giuseppe Attanasio, Debora Nozza, Dirk Hovy, Elena Baralis

SAFETYKIT: First Aid for Measuring Safety in Open-domain Conversational Systems

The social impact of natural language processing and its applications has received increasing attention. In this position paper, we …

Emily Dinan, Gavin Abercrombie, A. Stevie Bergman, Shannon Spruit, Dirk Hovy, Y-Lan Boureau, Verena Rieser

Five sources of bias in natural language processing

Recently, there has been an increased interest in demographically grounded bias in natural language processing (NLP) applications. Much …

Dirk Hovy, Shrimai Prabhumoye

HONEST: Measuring Hurtful Sentence Completion in Language Models

Language models have revolutionized the field of NLP. However, language models capture and proliferate hurtful stereotypes, especially …

Debora Nozza, Federico Bianchi, Dirk Hovy

The Importance of Modeling Social Factors of Language: Theory and Practice

Natural language processing (NLP) applications are now more powerful and ubiquitous than ever before. With rapidly developing (neural) …

Dirk Hovy, Diyi Yang