Dirk Hovy

Latest

How to professor
Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models
My Answer is C: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods
Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
Conversations as a Source for Teaching Scientific Concepts at Different Education Levels
Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions
Classist Tools: Social Class Correlates with Performance in NLP
Impoverished Language Technology: The Lack of (Social) Class in NLP
Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation
MilaNLP at SemEval-2023 Task 10: Ensembling Domain-Adapted and Regularized Pretrained Language Models for Robust Sexism Detection
Respectful or Toxic? Using Zero-Shot Learning with Language Models to Detect Hate Speech
Temporal and Second Language Influence on Intra-Annotator Agreement and Stability in Hate Speech Labelling
The Ecological Fallacy in Annotation: Modeling Human Label Variation goes beyond Sociodemographics
The State of Profanity Obfuscation in Natural Language Processing Scientific Publications
What about “em”? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns
Leveraging Social Interactions to Detect Misinformation on Social Media
Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP)
Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers
Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?
Beyond Digital 'Echo Chambers': The Role of Viewpoint Diversity in Political Discussion
Viewpoint: Artificial Intelligence Accidents Waiting to Happen?
It's Not Just Hate: A Multi-Dimensional Perspective on Detecting Harmful Speech Online
Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data
Bridging Fairness and Environmental Sustainability in Natural Language Processing
SocioProbe: What, When, and Where Language Models Learn about Sociodemographics
Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages
Is It Worth the (Environmental) Cost? Limited Evidence for the Benefits of Diachronic Continuous Training
Welcome to the Modern World of Pronouns: Identity-Inclusive Natural Language Processing beyond Gender
Guiding the Release of Safer E2E Conversational AI through Value Sensitive Design
Hard and Soft Evaluation of NLP models with BOOtSTrap SAmpling - BooStSa
Language Invariant Properties in Natural Language Processing
Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection
Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals
Pipelines for Social Bias Testing of Large Language Models
Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks
XLM-EMO: Multilingual Emotion Prediction in Social Media Text
Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists
SAFETYKIT: First Aid for Measuring Safety in Open-domain Conversational Systems
Text Analysis in Python for Social Scientists – Prediction and Classification
Learning from Disagreement: A Survey
Five sources of bias in natural language processing
On the Gap between Adoption and Understanding in NLP
Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence
'We will Reduce Taxes' - Identifying Election Pledges with Language Models
HONEST: Measuring Hurtful Sentence Completion in Language Models
The Importance of Modeling Social Factors of Language: Theory and Practice
FEEL-IT: Emotion and Sentiment Classification for the Italian Language
MilaNLP @ WASSA: Does BERT Feel Sad When You Cry?
Universal Joy: A Data Set and Results for Classifying Emotions Across Languages
BERTective: Language Models and Contextual Information for Deception Detection
Cross-lingual Contextualized Topic Models with Zero-shot Learning
Text Analysis in Python for Social Scientists – Discovery and Exploration
“You Sound Just Like Your Father” Commercial Machine Translation Systems Include Stylistic Biases
Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview
Visualizing Regional Language Variation Across Europe on Twitter
Helpful or Hierarchical? Predicting the Communicative Strategies of Chat Participants, and their Impact on Success
What the [MASK]? Making Sense of Language-Specific BERT Models
A Case for Soft Loss Functions
Dense Node Representation for Geolocation
Geolocation with Attention-Based Multitask Learning Models
Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers
Identifying Linguistic Areas for Geolocation
Women’s Syntactic Resilience and Men’s Grammatical Luck: Gender-Bias in Part-of-Speech Tagging and Dependency Parsing
Peer networks and entrepreneurship: A Pan-African RCT
Increasing In-Class Similarity by Retrofitting Embeddings with Demographic Information
Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting
Comparing Bayesian Models of Annotation
Predicting News Headline Popularity with Syntactic and Semantic Knowledge Using Multi-Task Learning
The Social and the Neural Network: How to Make Natural Language Processing about People again
Some Thoughts on the Future of NLP Conferences
It’s All About the He Said, She Said―A Quantitative Analysis of the Three Presidential Debates
Science’s genius complex
Fun with Movie Titles
How usable is sentiment analysis?
Wow: Such Meme, Much NLP, Very Generate!
Are our models ageist?
What I do: Learning whom to trust
What I do: Significance Testing
How to be a Good Grad Student
MACE available for download
Fake social network names won’t protect your privacy
Trimming Papers
The Art of Good Presentations
Orange Chicken
In Other Words
Science and Showmanship
One wish
Language change
Remembering the Dead, 1
New York I love you, but you’re bringing me down
Belated Birthday Wishes
Food Nerd
Summertime
Science vs Engineering
Bratwurst mit Sauerkraut
Movie World
Insights of a travelling salesman
Picture this…
After rain comes sunshine
Have a nice Vorurteil!
See Europe in two weeks…
Speak in tongues
Shake it, baby
News from Behind the Mirror’s Glass
It’s over
A classic
Persistency
At least…
The town that wasn’t there
The Art of Self-Contradiction
My house, my car, my…
If a bit under the weather…
Babel
Arrived
Packed