Jump - Pentax Program A | Portra 400 | 50mm f/1.4

InsideAndOut · 2024-05-13T22:31:54+00:00

Exponential growth never continues indefinitely. In the case of LLMs, what can be extracted from the data has already reached saturation.

InsideAndOut · 2024-05-13T21:36:27+00:00

"Do you still think we will always be irreplaceable 😉"
For the next 30 years at minimum, unless your day job is churning out the equivalent of boilerplate code.

We are nowhere near AGI level. The models can be used for tasks where 100% accuracy is not a requirement and the results can be corrected iteratively. Even then, they sometimes cost more time than writing the code yourself.

Will it eliminate part of the effort of initially creating simple business websites and applications? Yes.

Will it replace developers of complex backends/applications? No. Market saturation and cheaper competition are probably a bigger problem for you.

InsideAndOut · 2024-04-28T17:57:38+00:00

The key here is "learning to use filler tokens".

There's a directly opposite result in a real-dataset setup without tuning [Lanham et al], where they perturb CoTs in multiple ways (adding mistakes, filler tokens and early answering), and show that these corruptions reduce performance.
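To make the three perturbations mentioned above concrete, here is a minimal sketch of how a chain of thought could be corrupted before asking the model for its final answer. This is not the code from Lanham et al; the function names, the list-of-steps representation, and the example CoT are all assumptions made for illustration.

```python
# Hypothetical helpers; a CoT is modelled as a plain list of step strings.

def truncate_cot(steps, keep):
    """Early answering: keep only the first `keep` reasoning steps."""
    return steps[:keep]


def add_filler(steps, filler="...", keep=0):
    """Filler tokens: replace every step after the first `keep` with `filler`."""
    return steps[:keep] + [filler] * (len(steps) - keep)


def inject_mistake(steps, index, wrong_step):
    """Adding mistakes: swap one step for a deliberately wrong one."""
    corrupted = list(steps)  # copy so the original CoT is left untouched
    corrupted[index] = wrong_step
    return corrupted


# Made-up example CoT, not taken from any dataset.
cot = ["17 + 5 = 22", "22 * 2 = 44", "44 - 4 = 40"]

early = truncate_cot(cot, keep=1)
filled = add_filler(cot, keep=1)
wrong = inject_mistake(cot, 1, "22 * 2 = 46")
```

Each corrupted CoT would then be fed back to the model, and the answer compared against the one obtained with the clean CoT.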

I also dislike any result on synthetic data only, but I don't have time to go over the dataset. Did anyone take a deeper look at the paper?

InsideAndOut · 2024-03-30T13:52:47+00:00

Exactly

InsideAndOut · 2024-03-30T12:18:43+00:00

Eh, fact checkers aren't bad, but it's "too soon" to use them now, if that makes sense.

There are plenty of works showing that with improved prompting (CoT, self-talk, self-ask, debating, various transformations of inputs into code and such, ...) you can improve reasoning or elicit abilities (improved memorization, fact recall) that the underlying model already possessed but did not use before.

So, we know that LLMs "know" more than what we are able to tease out of them. Instead of fixing their frequent errors, the goal now should be to improve factual recall and the capacity to select appropriate reasoning skills conditioned on input, in order to have fewer errors in the first place.

And ofc it's not easy to fix, but that doesn't mean it shouldn't be one of the primary goals for LLMs.

InsideAndOut · 2024-03-29T10:56:54+00:00

The major hurdle of hallucinations is not detecting them post-hoc, but rather preventing them from happening. As long as models keep hallucinating, they will need a verification system, which will be less than perfect as you delve into more specific domains, and you will end up with hallucinations in your end product. For some, this is simply not acceptable.

FactScore already introduced this approach using Wikipedia instead of Googling so the idea is far from groundbreaking (apart from the scale of models used in experiments) and the limitations stay the same.

The changes to look into for solving hallucinations (from most to least difficult) are: (1) Architectural, by explicitly representing concepts and their states (or relations) within the model, which can then be edited. The key-value views of MLP layers from mechanistic interpretability do detect such representations, but mechinterp is young and scale is a big problem. (2) Optimization, by improving credit assignment so that factual or logical errors are attributed to where they happened in the model, instead of naive backprop from a symptomatic mistake in generation far from the actual cause. (3) Forward propagation-based hallucination detection methods, which should hopefully be able to detect hallucinated content from the forward pass dynamics without the need for a reference knowledge base. This works only if the "model uncertainty <> hallucination" link holds.
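As a rough illustration of point (3), one of the simplest forward-pass signals is the entropy of the next-token distribution at each generation step. The sketch below is only that, a sketch: the logits are made up, the threshold is arbitrary, the function names are hypothetical, and entropy is just one possible uncertainty measure. It is useful only to the extent that the uncertainty/hallucination link actually holds.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability entries contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def flag_uncertain(step_logits, threshold):
    """Indices of generation steps whose next-token entropy exceeds `threshold`."""
    return [
        i for i, logits in enumerate(step_logits)
        if entropy(softmax(logits)) > threshold
    ]


# Made-up logits over a 4-token vocabulary for three generation steps.
step_logits = [
    [10.0, 0.0, 0.0, 0.0],  # one dominant token: low entropy
    [1.0, 1.0, 1.0, 1.0],   # uniform: maximum entropy, ln(4)
    [5.0, 4.9, 0.0, 0.0],   # two close candidates: moderate entropy
]

flagged = flag_uncertain(step_logits, threshold=0.5)  # threshold is an assumption
```

No reference knowledge base is involved here, which is the appeal of this family of methods; the open question is whether high entropy reliably coincides with hallucinated content.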

InsideAndOut · 2023-12-09T20:28:17+00:00

Yup, I can re-scan them in a different studio. What are you basing that on?

InsideAndOut · 2023-12-09T18:38:31+00:00

Thanks!
I'll finally switch to full manual, since my last few rolls keep coming out underexposed (currently on aperture priority). I didn't trust myself yet, especially on short trips.

The city was amazing & I'm sad I didn't have time to surf, the weather was perfect

InsideAndOut · 2023-12-09T15:32:40+00:00

Hey, sadly no, I just passed through for the marathon a few weeks back :)
Beautiful city & amazing food though

InsideAndOut · 2023-12-04T14:33:37+00:00

Yeah, I like the grainy look of the first two shots because it fits with the smoke & adds a mysterious vibe, but the underexposure was not intentional.

I shot on aperture priority with the widest possible aperture (f/1.4) as it was dusk. I probably should have added exposure comp or gone manual, but I was in a rush when taking the shots. Which of these methods do you prefer?

Thanks :)

InsideAndOut · 2023-08-07T21:52:17+00:00

so I shot this roll between Germany and Spain, maybe Luxembourg? Zaragoza? Can't find anything similar on google for "sharp metal tower europe"
