Baturinsky t1_j9y8qxh wrote on February 25, 2023 at 1:05 PM

Reply to comment by luffreezer in Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'. by grungabunga

You mean OpenAI was trained on texts that had way more anti-disabled hate than anti-Republican hate? Where did they find them?

luffreezer t1_j9yo44f wrote on February 25, 2023 at 3:17 PM

It is the whole internet that is like that. As I said, it is a reflection of our society:

You will never find people insulting "normal weighted people" or "people without a disability". So it is not surprising that the model does not perform well in those areas.

In the US, calling something "socialism" can even be meant as a criticism, so I am not surprised it flags more left-wing things than right-wing ones.

Spire_Citron t1_ja06ja2 wrote on February 25, 2023 at 9:23 PM

It's not necessarily just the amount but also the type of hate.

Moist-Question t1_ja0fp45 wrote on February 25, 2023 at 10:28 PM

Likely because there is a larger volume of hate content targeting people with disabilities than targeting Republicans.

LightVelox t1_j9ynlb6 wrote on February 25, 2023 at 3:13 PM

4chan is the only place I can think of where you wouldn't get instabanned for anti-disabled hate, but considering most models are trained on Reddit, it would make sense for it to be extremely biased to the left.