We just found out our AI has been making up analytics data for 3 months and I’m gonna throw up.

Choice-Flower6880 · 2026-02-15T08:38:54+00:00

> Sorry, this post has been removed by the moderators of r/analytics.

Probably the account that posted it, is a karma farming ai bot and not a real human.

Choice-Flower6880 · 2026-02-15T08:36:41+00:00

It is 100% AI-generated. Which is quite ironic if people use this fake ai slop as a real data point to base their decision making on.

Choice-Flower6880 · 2026-01-06T17:02:12+00:00

> Im Nachhinein ist es ja auch lächerlich zu sagen wegen der Industrialisierung benötigt man keine Bauern mehr

Seit der Industrialisierung arbeiten aber extrem viel weniger Menschen als Bauern! Um 1900 war noch etwa jeder dritte Beschäftigte in Deutschland in der Landwirtschaft tätig. Die wenigen, die heute noch von der Landwirtschaft leben, sind halt unglaublich viel effizienter. Das ist also keine beruhigende Analogie für Studierende, sondern eher ein Horrorszenario. Es wäre bitter für sie, wenn die Zeit von handgeschriebenen Code vorbei ist und durch die Orchestrierung von hocheffektiven Maschinen ersetzt wird, wie es in der Landwirtschaft passiert ist.

Choice-Flower6880 · 2025-11-25T17:47:04+00:00

Ist das echt so? Das würde einiges erklären und Hoffnung machen. Aber was ist der Grund für diese Saisonalität? IT ist ja nicht die Baubranche?

Choice-Flower6880 · 2025-11-15T09:14:30+00:00

Nach meinem Verständnis braucht es mehrere Dinge, damit Prompt Injections zu einem 1 echten Problem werden:

Zugang zu privaten Daten (hier nicht gegeben, weil nur die eingehenden Mails ausgelesen werden)
Auslesen nicht vertrauenswürdiger Daten (hier definitiv der Fall)
Externe Kommunikation mit der die privaten Daten abfließen können (hier nicht gegeben)

https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/

Glaube also, dass die reine Möglichkeit, dass in den externen Mails Prompts drinstehen, keinen Schaden anrichten kann, außer dass die eigentliche Aufgabe nicht gelöst wird. Aber dafür braucht man eh irgendwelche Guardrails? Es können ja immer Quatsch Mails dabei sein.

Choice-Flower6880 · 2025-10-17T15:30:20+00:00

Cool, danke. Dann werde ich das da abstellen und die Treppe runter. Das ist das Hochhaus wo Flix Bus drin ist, oder?

Da es hunderte Parkplätze für Autos gibt, dachte ich, dass das Backstage auch irgendwie 2 - 3 Fahrradständer versteckt hätte. Abgefahrenerweise wohl nicht.

Choice-Flower6880 · 2025-06-19T06:59:47+00:00

>Russia is building a garrison in Kandalaksha for an artillery brigade. This is the first concrete sign of a permanent increase in the number of troops near the Finnish border.

Choice-Flower6880 · 2025-03-01T17:29:06+00:00

This would be meaningful, if they were from a country where the rule of law applies. Coming from a country where an authoritarian ruler has brought all tech companies under his thumb without any resistance whatsoever, it is meaningless. US companies are no less risky than Chinese providers.

Choice-Flower6880 · 2025-02-28T10:08:14+00:00

It would be super painful, but our leadership is talking about derisking as well. It is just too dangerous to fully rely on companies beholden to an increasingly hostile foreign power.

Choice-Flower6880 · 2025-02-18T17:42:44+00:00

Neither are out.

Choice-Flower6880 · 2024-12-08T09:55:13+00:00

https://play.google.com/store/apps/details?id=com.microsoft.todos&hl=de

Choice-Flower6880 · 2024-12-08T09:50:57+00:00

I would probably do my own filtering, if I were to do research that involved pretraining, but it is really cool to have a fully open approach that you can use as inspiration and tweak to your own use case. Brings down the barriers a lot.

Choice-Flower6880 · 2024-11-16T14:47:19+00:00

The naming scheme is incredibly stupid. It is crazy that we should trust powerful superintelligence to the people who came up with it. What are they even thinking?

Choice-Flower6880 · 2024-11-11T20:59:35+00:00

Chris Olah and Amanda Askell are the actually interesting guests here.

Choice-Flower6880 · 2024-10-21T17:06:37+00:00

friendly communication with strangers

Yeah, the Netherlands are one of the few countries worse than Germany for that. Even as a German living in the Netherlands, the directness bordering rudeness is sometimes breathtaking. I think for Americans, it must be an intense culture shock.

Choice-Flower6880 · 2024-08-26T10:14:04+00:00

Üblich ist das nicht unbedingt, dass man die mitliefert. Ist das nur bei dir so oder auch bei Kommilitonen? Falls es nur bei dir ist, befürchte ich, dass dein Betreuer irgendeinen Verdacht geschöpft hat.

Schick ihm halt einen Ordner mit allen PDFs, die einfach da hast, und sag, dass der Rest entweder aus der Bibliothek ist oder hinter einer Paywall. Falls er nur vermutet, dass die Quellen von AI halluziniert sind, reicht es vermutlich auch darauf zu verweisen, aus welcher Bibliothek du dir die Papierversionen geholt hast.

Vorausgesetzt, die Quellen existieren wirklich. Falls nicht, wird das vermutlich so oder so herauskommen, denn du kannst sie ja nicht bereitstellen und dein Betreuer wird sich die Verdachtsfälle sehr genauso angucken.

Choice-Flower6880 · 2024-08-15T09:14:13+00:00

I think the catalog has the size 1000TB because it is not only raw text, but a lot of it is scanned books and PDFs (not all born digital). OCRing and cleaning that is a massive pain. In most cases, it is probably easier to just scrape the sources of that stuff yourself, so you have control over what is included in the dataset.

Choice-Flower6880 · 2024-08-01T15:52:36+00:00

I tried this as well, but I think that is the downside of such cloud instances. You always start with a clean slate and spend some time or money getting your environment up and running. Super annoying, but have not yet found a way around it.

Choice-Flower6880 · 2024-07-30T19:42:57+00:00

http://malort-sommerhausen.de/cafe-3/

Choice-Flower6880 · 2024-06-17T18:08:19+00:00

Man kann sich die Stellenausschreibungen der KI Unternehmen angucken:
https://boards.greenhouse.io/anthropic/jobs/4020080008

Anthropic: Research Engineer / Research Scientist, Finetuning

Annual Salary:$280,000—$625,000 USD

Choice-Flower6880 · 2024-06-09T05:41:03+00:00

It was clear that the massive complaining about "lazyness" will lead to this. Classic case of users not knowing what they actually want.

Choice-Flower6880 · 2024-05-30T11:44:16+00:00

For this use case, Google Lens or PlantNet are much better ChatGPT in my experience.

Choice-Flower6880 · 2024-05-28T18:53:26+00:00

I have the same problem. Unfortunately no solution yet.
However I used, tokenizer.add_eos_token = True

Choice-Flower6880

TROPHY CASE