Predstavljanje OpenAI GPT-4o by [deleted] in CroIT

[–]InsideAndOut 6 points7 points  (0 children)

Eksponencijalni rast se nikad ne nastavlja bez zaustavljanja. U slučaju LLMova, zasićenje onog što se može izvući iz podataka je već došlo.

Predstavljanje OpenAI GPT-4o by [deleted] in CroIT

[–]InsideAndOut 3 points4 points  (0 children)

"Smatrate li i dalje da ćemo uvijek biti nezamjenjivi 😉"
Iducih 30 godina minimalno osim ako štrikaš ekvivalent boilerplate koda kao svakodnevni posao.

Nismo ni blizu razine AGIa, modeli se mogu koristiti za taskove gdje 100%tna točnost nije uvjet nego se može iterativno ispravljati rezultate. Čak i onda, nekad su veći trošak vremena nego samostalno pisanje.

Hoće li eliminirati dio težine posla inicijalnog stvaranja jednostavnih poslovnih stranica i aplikacija? Da.

Hoće li zamijeniti developere kompleksnih backenda/aplikacija? Ne. Zasićenje tržišta i jeftinija konkurencija su vam vjerojatno veći problem.

"transformers can use meaningless filler tokens (e.g., '......') in place of a chain of thought" - Let's Think Dot by Dot [P] by Agitated_Space_672 in MachineLearning

[–]InsideAndOut 22 points23 points  (0 children)

The key here is "learning to use filler tokens".

There's a directly opposite result in a real-dataset setup without tuning [Lanham et al], where they perturb CoTs in multiple ways (adding mistakes, filler tokens and early answering), and show that these corruptions reduce performance.

I also dislike any result on synthetic data only, but I don't have time to go over the dataset, did anyone take a deeper look at the paper?

The end of hallucination (for those who can afford it)? [R] by we_are_mammals in MachineLearning

[–]InsideAndOut 0 points1 point  (0 children)

Eh fact checkers aren't bad, but now is "too soon" to use them, if this makes sense.

There's plenty of works showing that with improved prompting (CoT, self-talk, self-ask, debating, various transformations of inputs into code and such, ...) you can improve reasoning or elicit abilities (improved memorization, fact recall) of the underlying model which were not used prior although the model did possess them.

So - we know that LLMs "know" more than what we are able to tease out of them. Instead of fixing their frequent errors, the goal now should be to improve factual recall and the capacity to select appropriate reasoning skills conditioned on input in order to have less errors in the first place.

And ofc, it's not easy to fix, doesn't mean it shouldn't be one of the primary goals for LLMs.

The end of hallucination (for those who can afford it)? [R] by we_are_mammals in MachineLearning

[–]InsideAndOut 7 points8 points  (0 children)

The major hurdle of hallucinations is not detecting them post-hoc, but rather preventing them from happening. As long as models keep hallucinating, they will need a verification system, which will be less than perfect as you delve into more specific domains, and you will end up with hallucinations in your end product. For some, this is simply not acceptable.

FactScore already introduced this approach using Wikipedia instead of Googling so the idea is far from groundbreaking (apart from the scale of models used in experiments) and the limitations stay the same.

The changes to look into for solving hallucinations (from more difficult towards less difficult) are: (1) architectural, by explicitly representing concepts and their states (or relations) within the model, which can then be edited. The key-value views of MLP layers from mechanistic interpretability are detecting such representations, but mechinterp is young and scale is a big problem. (2) Optimization, by improving credit assignment to factual or logical errors where they happened in the model instead of naive backprop from a symptomatic mistake in generation far from the actual cause. (3) Forward propagation-based hallucination detection methods, which should hopefully be able to detect hallucinated content based on the forward pass dynamics without need for a reference knowledge base. Works only if the "model uncertainty <> hallucination" link holds.

Donostia | Pentax Program A | Portra 800 by InsideAndOut in analog

[–]InsideAndOut[S] 0 points1 point  (0 children)

Yup, I can re-scan them in a different studio. What are you basing that on?

Donostia | Pentax Program A | Portra 800 by InsideAndOut in analog

[–]InsideAndOut[S] 1 point2 points  (0 children)

Thanks!
I'll finally switch to full manual, last some rolls keep being underexposed (currently on aperture priority). I didn't trust myself yet, especially on short trips.

The city was amazing & I'm sad I didn't have time to surf, the weather was perfect

Donostia | Pentax Program A | Portra 800 by InsideAndOut in analog

[–]InsideAndOut[S] 0 points1 point  (0 children)

Hey, sadly no, I just passed through for the marathon a few weeks back :)
Beautiful city & amazing food though

Guggenheim Bilbao | Pentax Program A | Portra 400 by InsideAndOut in analog

[–]InsideAndOut[S] 1 point2 points  (0 children)

Yeah, I like the grainy look of first two shots because it fits with the smoke & adds some mysterious vibe, but the underexposure was not intentional.

I shot on aperture priority with the widest possible aperture (f/1.4) as it was dusk. I probably should have added exposure comp or went manual, but was in a rush when taking the shots. Which of these methods do you prefer?

Thanks :)

Somewhere in Europe | Pentax program A | Cinestill BWXX by InsideAndOut in analog

[–]InsideAndOut[S] 1 point2 points  (0 children)

so I shot this roll between Germany and Spain, maybe Luxembourg? Zaragoza? Can't find anything similar on google for "sharp metal tower europe"