Directions In Abusive Language Training Data: Garbage In, Garbage Out | Awesome LLM Papers Add your paper to Awesome LLM Papers

Directions In Abusive Language Training Data: Garbage In, Garbage Out

Bertie Vidgen, Leon Derczynski . PLOS ONE 2020 – 132 citations

[Paper]   Search on Google Scholar   Search on Semantic Scholar
Datasets Survey Paper

Data-driven analysis and detection of abusive online content covers many different tasks, phenomena, contexts, and methodologies. This paper systematically reviews abusive language dataset creation and content in conjunction with an open website for cataloguing abusive language data. This collection of knowledge leads to a synthesis providing evidence-based recommendations for practitioners working with this complex and highly diverse data.

Similar Work