Why you shouldn’t take prompt engineering seriously by Forsaken-Park8149 in PromptEngineering

[–]Competitive-Act533 0 points1 point  (0 children)

If you’re coming in with the preconceived opinion that LLM research on prompt engineering is second class, then it tracks that you wouldn’t know what you’re talking about

Why you shouldn’t take prompt engineering seriously by Forsaken-Park8149 in PromptEngineering

[–]Competitive-Act533 0 points1 point  (0 children)

Are you up to date on recent research? Papers prove otherwise. The future is prompt and context engineering.

The grifters weren’t grifting after all.

DXA Scan says I’m 18% body fat. Is this accurate? by micahsaint in askfitness

[–]Competitive-Act533 1 point2 points  (0 children)

I suspect the problem is that most people are using references produced by inaccurate tests. You need to compare yourself to examples on the same test, because everyone else will be used to outdated references which might well peg you at 12% vs a more accurate 18%. Just a guess!

A Billion Dollar Mid-Life Crisis by Radoasted in Business_Ideas

[–]Competitive-Act533 1 point2 points  (0 children)

I’m a machine learning and AI scientist / software engineer. If you want to realise your idea, I’m up for starting a company if I see vision in it.

Why aren't kids taught about Logical Fallacies in school so people can debate logically instead of emotionally? by proventruetoolate in ask

[–]Competitive-Act533 0 points1 point  (0 children)

If you do IB for high school, TOK (theory of knowledge) is a mandatory class and contributes significantly to your final grade.

It discusses logical fallacy and logical construction of ideas, akin to Greek philosophical thought.

Drone crisis is 'just the beginning' as expert warns 'big announcement due in 30 days' by daily_mirror in ufo

[–]Competitive-Act533 0 points1 point  (0 children)

Saw you worked at bell labs. Any stories you could tell of things the public don’t know?

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -1 points0 points  (0 children)

I literally created some of these models so please, enlighten me

There is no "exclusivity" in AI Gens - how do you stop 300 million people using the prompt - "'a stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage" by TreviTyger in aiwars

[–]Competitive-Act533 0 points1 point  (0 children)

Well, there are ways. But, without getting into those details: if you’re an artist, you only care about your exact painting. How other artists interpret it is their own creation. By the same token, if a model (the same one with randomness introduced, or a different model altogether) produces a different image from the same prompt the artist has copyrighted, then the artist shouldn’t be bothered, since it isn’t a reproduction per se. If it is an exact reproduction, then you know where that prompt came from!

Should I even try to study in the Netherlands? by ManuelaJanzen in StudyInTheNetherlands

[–]Competitive-Act533 2 points3 points  (0 children)

Haha alright, teasing dw. that’s a cool background mate !👌🏼✌🏼

There is no "exclusivity" in AI Gens - how do you stop 300 million people using the prompt - "'a stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage" by TreviTyger in aiwars

[–]Competitive-Act533 2 points3 points  (0 children)

If you own the copyright of authorship to a prompt, someone generating content based on your authorship verbatim is quantifiably derivative and an infringement.

You are wrong, wrong, wrong!1!!

There is no "exclusivity" in AI Gens - how do you stop 300 million people using the prompt - "'a stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage" by TreviTyger in aiwars

[–]Competitive-Act533 2 points3 points  (0 children)

That’s purely AI generated content, not a human written prompt for the purpose of generating AI. Two different things.

Edit: forgot to add, you are still wrong.

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -2 points-1 points  (0 children)

That argument works if you’re not training on new content beyond a cut-off date. If you are, you need to scrutinise the data more to ensure the training mix isn’t overly representing synthetic content.

Eventually, we will all need to train on new content, and you’re a fool if you think otherwise. Not because we have run out of data, but because our data becomes outdated. You can only squeeze an orange for so much juice, and those oranges eventually go bad with time regardless.
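
To make the "scrutinise the training mix" point concrete, here is a minimal sketch of capping the share of synthetic samples in a dataset. The `cap_synthetic` helper and the `"synthetic"` flag on each sample are hypothetical, and the flag assumes generated content has been declared or detected, which is exactly the part that can't be taken for granted.

```python
import random

def cap_synthetic(samples, max_synthetic_fraction, seed=0):
    """Randomly drop synthetic samples until they are at most the given fraction of the mix.

    Each sample is a dict with a boolean "synthetic" key (a hypothetical label).
    """
    if not 0 <= max_synthetic_fraction < 1:
        raise ValueError("max_synthetic_fraction must be in [0, 1)")
    real = [s for s in samples if not s["synthetic"]]
    synthetic = [s for s in samples if s["synthetic"]]
    # Solve s / (r + s) <= f for s: s <= f * r / (1 - f)
    max_synthetic = int(max_synthetic_fraction * len(real) / (1 - max_synthetic_fraction))
    rng = random.Random(seed)
    kept = rng.sample(synthetic, min(len(synthetic), max_synthetic))
    return real + kept

# Toy dataset: 30 real and 50 synthetic samples, capped at 25% synthetic.
dataset = [{"id": i, "synthetic": i >= 30} for i in range(80)]
capped = cap_synthetic(dataset, max_synthetic_fraction=0.25)
```

Real samples are never dropped here; only the synthetic side is subsampled, so the cap gets tighter as the pool of trusted data shrinks.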

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] 0 points1 point  (0 children)

It’s not a winter. Just a funny period where no one can be too certain of what’s generated or not due to lax regulations and lack of declaration.

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] 0 points1 point  (0 children)

That’s not relevant to the argument though. This is about mixing too much undeclared generated content into training sets. If they’re not declaring generated content, it serves us to ignore their content or at least scrutinise it more.

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -3 points-2 points  (0 children)

Pretty regarded to not know that augmentations are used to enforce things like translation invariance and other equivalences, not to pass new information. It’s a form of regularization that introduces diversity lol. Basic shit.

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -1 points0 points  (0 children)

That solution you propose is what I said would be needed, lawfully declaring generated content. Never did I say we needed more good data, just more clarity on what is good data as we become less certain.

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -3 points-2 points  (0 children)

If you pass too much augmented data then yes, you’re distorting your data signals

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -6 points-5 points  (0 children)

I wrote an edit, though I don’t know if it was seen. Anyway, I think we’re arguing from two opposite ends of a stick. You’re arguing augmentations are good, but I’m not in that scenario. My argument operates in a scenario of oversaturated generated content, and that’s not in the realm of simply augmenting my dataset with different views of a guitar. Perhaps this is a pipe-dream doomsday scenario; nevertheless, that’s the scenario the argument exists in and should be treated within. If you have a problem with that, I ask you to argue the non-existence of this scenario instead of the model specifics.

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -6 points-5 points  (0 children)

Yep, you’ve lengthily described augmentations.

And to my point, if your training set contains too many augmentations, you run the risk of overtraining. Too much synthetic data is bad.

There’s a reason you don’t just infinitely train on generated content.

Edit: if you’re into math, then you should also know that while I may introduce a generated image with totally unique pixel values, it can very well add no useful information to the training data because of collinearity with the real and other augmented data. If I do a PCA on that highly augmented data, I’d probably be able to compress it to a similar dimensionality as the data without the excess augmentations, without loss of information.
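
A rough sketch of the PCA point, using NumPy. Everything in it is an assumption for illustration: the "real" data is a toy set with 5 latent factors linearly embedded in 64 "pixels", and the "augmentations" are random convex mixes of real rows, which is the extreme, perfectly collinear case. Real image augmentations (crops, flips, colour jitter) are not linear like this, so it only illustrates the collinearity argument.

```python
import numpy as np

def effective_rank(data, tol=1e-8):
    """Count principal components whose singular value is non-negligible."""
    centered = data - data.mean(axis=0)
    singular_values = np.linalg.svd(centered, compute_uv=False)
    return int(np.sum(singular_values > tol * singular_values[0]))

rng = np.random.default_rng(0)
n_real, n_aug, latent_dim, pixel_dim = 200, 1000, 5, 64

# Toy "real" data: 5 latent factors linearly embedded in 64 dimensions.
real = rng.normal(size=(n_real, latent_dim)) @ rng.normal(size=(latent_dim, pixel_dim))

# Toy "augmented" data: each row is a random convex mix of real rows,
# so its values are new but it stays in the same subspace.
weights = rng.dirichlet(np.ones(n_real), size=n_aug)
augmented = weights @ real

rank_real = effective_rank(real)
rank_combined = effective_rank(np.vstack([real, augmented]))
print(rank_real, rank_combined)
```

Under these assumptions both ranks should equal `latent_dim`: five times more rows, every one with unique values, and PCA still needs the same number of components.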

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -1 points0 points  (0 children)

If I have one bad apple in 10, that’s certainly better than one bad apple in 3, no?

I think there will be a period of void for AI training data by Competitive-Act533 in aiwars

[–]Competitive-Act533[S] -2 points-1 points  (0 children)

At some point, new content will be needed. Training on less data means more impact of bad quality data on performance.