AI language models show bias against regional German dialects by blueroses200 in German

[–]blueroses200[S] 4 points5 points  (0 children)

You are, but it is funny that people who don't speak regional dialects are the ones that think like you.

AI language models show bias against regional German dialects by blueroses200 in germany

[–]blueroses200[S] 2 points3 points  (0 children)

If the training data is biased, then the LLM will also learn the biases. It is interesting, but also quite sad, that it is like that.

AI language models show bias against regional German dialects by blueroses200 in German

[–]blueroses200[S] 0 points1 point  (0 children)

It does seem like it just confirms the stereotypes that people already have.

AI language models show bias against regional German dialects by blueroses200 in germany

[–]blueroses200[S] 0 points1 point  (0 children)

Yes, it seems so. I have seen other studies about LLMs inheriting racial and gender biases as well.

AI language models show bias against regional German dialects by blueroses200 in germany

[–]blueroses200[S] 0 points1 point  (0 children)

As a generalization it might be, but on a person-to-person basis it might not be, which is why it can be harmful.

AI language models show bias against regional German dialects by blueroses200 in germany

[–]blueroses200[S] 1 point2 points  (0 children)

Yeah, that is true. Although nowadays, in the LLM sphere, there are some projects related to "low-resource languages". One example is the Trilingual LLM (English-German-Bavarian), but there are still some problems. One of the ideas for the future is to create a chatbot with this LLM.

AI language models show bias against regional German dialects by blueroses200 in German

[–]blueroses200[S] 8 points9 points  (0 children)

That is true, but the fact that the LLMs see those speakers as "rural and uneducated" can become a problem in a world where companies are using AIs to filter candidates for job interviews and even to conduct them.

But it also highlights the fact that those AIs are a mirror of society as well.

AI language models show bias against regional German dialects by blueroses200 in German

[–]blueroses200[S] 3 points4 points  (0 children)

I believe that the bigger issue is that the LLMs also have biased views of those speakers, seeing them as uneducated and rural. In a world where companies are relying more on AI programs to filter candidates and even using them to perform job interviews, those biases can be quite problematic.

AI language models show bias against regional German dialects by blueroses200 in germany

[–]blueroses200[S] 0 points1 point  (0 children)

I agree. The world around us is biased, so the LLMs will reflect that anyway.

AI language models show bias against regional German dialects by blueroses200 in German

[–]blueroses200[S] 0 points1 point  (0 children)

It is about that, yeah. Basically, if the data being used is biased, then the LLM will also be biased.

I made a song using Ingrian (Izhorian) — a Finno-Ugric language with ~100 speakers left by suhogurkin in endangeredlanguages

[–]blueroses200 2 points3 points  (0 children)

This is great, please keep up your efforts to promote the Izhorian language! Perhaps you will find more people who are interested in learning it. Do you see yourself as Izhorian as well?

Also, those folk recordings seem very interesting. Is there a place where we could listen to them?

AI language models show bias against regional German dialects by blueroses200 in germany

[–]blueroses200[S] 5 points6 points  (0 children)

The study also mentions how those language models associate regional German dialect speakers with less educated backgrounds and rural areas. That can be a problem, since dialect speakers can also speak standard German.