Baturinsky t1_j9y8qxh wrote
Reply to comment by luffreezer in Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'. by grungabunga
You mean, OpenAI was taught on the texts that had way more anti-disabled hate than anti-republican hate? Where have they found them?
luffreezer t1_j9yo44f wrote
It is the whole internet that is like that. As a said, it is a reflexion of our society:
You will never find people insulting "normal weighted people" or "people without a disability". So it is not surprising that the model does not perform well in those areas.
In the US, saying something is "socialism" can even be interpreted as a criticism, so I am not surprised it flags more left-winged things than right-winged.
Spire_Citron t1_ja06ja2 wrote
It's not necessarily just the amount but also the type of hate.
Moist-Question t1_ja0fp45 wrote
Likely because there is a larger volume of hate content for disabilities than for republicans.
LightVelox t1_j9ynlb6 wrote
4Chan is the only place i can think of where you wouldn't get instabanned for anti-disabled hate, but considering most models are trained on Reddit it would make sense for it to be extremely biased to the left
Viewing a single comment thread. View all comments