Dirk Hovy
Latest
- Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
- Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models
- My Answer is C: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
- Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
- XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
- DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods
- Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts
- SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
- Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
- Conversations as a Source for Teaching Scientific Concepts at Different Education Levels
- Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions
- Classist Tools: Social Class Correlates with Performance in NLP
- Impoverished Language Technology: The Lack of (Social) Class in NLP
- Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation
- MilaNLP at SemEval-2023 Task 10: Ensembling Domain-Adapted and Regularized Pretrained Language Models for Robust Sexism Detection
- Respectful or Toxic? Using Zero-Shot Learning with Language Models to Detect Hate Speech
- Temporal and Second Language Influence on Intra-Annotator Agreement and Stability in Hate Speech Labelling
- The Ecological Fallacy in Annotation: Modeling Human Label Variation goes beyond Sociodemographics
- The State of Profanity Obfuscation in Natural Language Processing Scientific Publications
- What about ''em''? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns
- What about ''em''? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns
- Leveraging Social Interactions to Detect Misinformation on Social Media
- Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP)
- Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers
- Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?
- Beyond Digital 'Echo Chambers': The Role of Viewpoint Diversity in Political Discussion
- Viewpoint: Artificial Intelligence Accidents Waiting to Happen?
- It's Not Just Hate: A Multi-Dimensional Perspective on Detecting Harmful Speech Online
- Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data
- Bridging Fairness and Environmental Sustainability in Natural Language Processing
- SocioProbe: What, When, and Where Language Models Learn about Sociodemographics
- Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages
- Is It Worth the (Environmental) Cost? Limited Evidence for the Benefits of Diachronic Continuous Training
- Welcome to the Modern World of Pronouns: Identity-Inclusive Natural Language Processing beyond Gender
- Guiding the Release of Safer E2E Conversational AI through Value Sensitive Design
- Hard and Soft Evaluation of NLP models with BOOtSTrap SAmpling - BooStSa
- Language Invariant Properties in Natural Language Processing
- Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection
- Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals
- Pipelines for Social Bias Testing of Large Language Models
- Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks
- XLM-EMO: Multilingual Emotion Prediction in Social Media Text
- Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists
- SAFETYKIT: First Aid for Measuring Safety in Open-domain Conversational Systems
- Text Analysis in Python for Social Scientists – Prediction and Classification
- Learning from Disagreement: A Survey
- Five sources of bias in natural language processing
- On the Gap between Adoption and Understanding in NLP
- Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence
- 'We will Reduce Taxes' - Identifying Election Pledges with Language Models
- HONEST: Measuring Hurtful Sentence Completion in Language Models
- The Importance of Modeling Social Factors of Language: Theory and Practice
- FEEL-IT: Emotion and Sentiment Classification for the Italian Language
- MilaNLP @ WASSA: Does BERT Feel Sad When You Cry?
- Universal Joy A Data Set and Results for Classifying Emotions Across Languages
- BERTective: Language Models and Contextual Information for Deception Detection
- Cross-lingual Contextualized Topic Models with Zero-shot Learning
- Text Analysis in Python for Social Scientists – Discovery and Exploration
- “You Sound Just Like Your Father” Commercial Machine Translation Systems Include Stylistic Biases
- Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview
- Visualizing Regional Language Variation Across Europe on Twitter
- Helpful or Hierarchical? Predicting the Communicative Strategies of Chat Participants, and their Impact on Success
- What the [MASK]? Making Sense of Language-Specific BERT Models
- A Case for Soft Loss Functions
- Dense Node Representation for Geolocation
- Geolocation with Attention-Based Multitask Learning Models
- Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers
- Identifying Linguistic Areas for Geolocation
- Women’s Syntactic Resilience and Men’s Grammatical Luck: Gender-Bias in Part-of-Speech Tagging and Dependency Parsing
- Peer networks and entrepreneurship: A Pan-African RCT
- Increasing In-Class Similarity by Retrofitting Embeddings with Demographic Information
- Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting
- Comparing Bayesian Models of Annotation
- Predicting News Headline Popularity with Syntactic and Semantic Knowledge Using Multi-Task Learning
- The Social and the Neural Network: How to Make Natural Language Processing about People again
- Some Thoughts on the Future of NLP Conferences
- It’s All About the He Said, She Said―A Quantitative Analysis of the Three Presidential Debates
- Science’s genius complex
- Fun with Movie Titles
- How usable is sentiment analysis?
- Wow: Such Meme, Much NLP, Very Generate!
- Are our models ageist?
- What I do: Learning whom to trust
- What I do: Significance Testing
- How to be a Good Grad Student
- MACE available for download
- Fake social network names won’t protect your privacy
- Trimming Papers
- The Art of Good Presentations
- Orange Chicken
- In Other Words
- Science and Showmanship
- One wish
- Language change
- Remembering the Dead, 1
- New York I love you, but you’re bringing me down
- Belated Birthday Wishes
- Food Nerd
- Summertime
- Science vs Engineering
- Bratwurst mit Sauerkraut
- Movie World
- Insights of a travelling salesman
- Picture this…
- After rain comes sunshine
- Have a nice Vorurteil!
- See Europe in two weeks…
- Speak in tongues
- Shake it, baby
- News from Behind the Mirror’s Glass
- It’s over
- A classic
- Persistency
- At least…
- The town that wasn’t there
- The Art of Self-Contradiction
- My house, my car, my…
- If a bit under the weather…
- Babel
- Arrived
- Packed