A study conducted by researchers at CCC, which is based at the MIT Media Lab, found that state-of-the-art AI chatbots — including OpenAI’s GPT-4, Anthropic’s Claude 3 Opus, and Meta’s Llama 3 — sometimes provide less-accurate and less-truthful responses to users who have lower English proficiency, less formal education, or who originate from outside the United States. The models also refuse to answer questions at higher rates for these users, and in some cases, respond with condescending or patronizing language.


This study is a joke. The bios listed at the end of the paper are the cause of their issues.
It goes on like that! This is their input!! The LLM is just mirroring their style.
The point is that mirroring the prompt style puts the LLM in a context space where it performs badly. This is because it doesn’t try to give correct answers, but likely ones. Incorrect answers are more likely to follow a prompt that is written with poor grammar and spelling.