| Language | Accuracy |
|---|---|
| English | 82% |
| Spanish | 78% |
| Mandarin | 75% |

Table: LLM accuracy in detecting toxicity across languages, based on RTP-LX.

The RTP-LX project introduces a corpus designed to test how well LLMs detect culturally specific toxic language across 28 languages. Here’s a breakdown of its approach and findings: