As with most harmful speech classifiers (even classic models) this most likely w...

junon on March 25, 2022 | parent | context | favorite | on: Building a no-code toxicity classifier by talking ...

As with most harmful speech classifiers (even classic models) this most likely won't catch the more passive aggressive remarks. Those worded innocently but imply something terrible. I've had a 100% success rate getting these sorts of models to tell me asking someone to "kindly end their own life" is not rude, toxic or harmful.