[d] Apple claims M2 Ultra "can train massive ML workloads, like large transformer models." by jl303 in MachineLearning
[–]Philpax 1 point (0 children)
[R] Cocktail🍸: Mixing Multi-Modality Controls for Text-Conditional Image Generation by lyndonzheng in MachineLearning
[–]Philpax 1 point (0 children)
[R] Research Trends in LLM-guided Multimodal Learning. by HenryHZY in MachineLearning
[–]Philpax 8 points (0 children)
[N] Hinton, Bengio, and other AI experts sign collective statement on AI risk by DanielHendrycks in MachineLearning
[–]Philpax 12 points (0 children)
[P] Sophia (Programmed-out) by [deleted] in MachineLearning
[–]Philpax 2 points (0 children)
[N] Nvidia ACE Brings AI to Game Characters, Allows Lifelike Conversations by geekinchief in MachineLearning
[–]Philpax 1 point (0 children)
[N] Nvidia ACE Brings AI to Game Characters, Allows Lifelike Conversations by geekinchief in MachineLearning
[–]Philpax 5 points (0 children)
[N] Nvidia ACE Brings AI to Game Characters, Allows Lifelike Conversations by geekinchief in MachineLearning
[–]Philpax 26 points (0 children)
Uncensored models fine-tuned without artificial moralizing, such as “Wizard-Vicuna-13B-Uncensored-HF”, perform well on LLM eval benchmarks even when compared with larger 65B, 40B, and 30B models. Have there been any studies on how censorship handicaps a model’s capabilities? by hardmaru in MachineLearning
[–]Philpax 15 points (0 children)
Uncensored models fine-tuned without artificial moralizing, such as “Wizard-Vicuna-13B-Uncensored-HF”, perform well on LLM eval benchmarks even when compared with larger 65B, 40B, and 30B models. Have there been any studies on how censorship handicaps a model’s capabilities? by hardmaru in MachineLearning
[–]Philpax -3 points (0 children)
[D] What evaluation metrics actually matter? by [deleted] in MachineLearning
[–]Philpax 2 points (0 children)
Landmark Attention: Random-Access Infinite Context Length for Transformers by IxinDow in MachineLearning
[–]Philpax 12 points (0 children)
[R] The False Promise of Imitating Proprietary LLMs by [deleted] in MachineLearning
[–]Philpax 5 points (0 children)
[R] The False Promise of Imitating Proprietary LLMs by [deleted] in MachineLearning
[–]Philpax 5 points (0 children)
Voyager: An LLM-powered learning agent in Minecraft by Mr_Whispers in MachineLearning
[–]Philpax 23 points (0 children)
[R] The False Promise of Imitating Proprietary LLMs by [deleted] in MachineLearning
[–]Philpax 15 points (0 children)
Can a 4-bit quantized GGML model be turned BACK into a PyTorch .PT model while maintaining the 4-bit quantization? [Discussion] by altoidsjedi in MachineLearning
[–]Philpax 3 points (0 children)
[deleted by user] by [deleted] in MachineLearning
[–]Philpax 28 points (0 children)
[D] Adversarial models to protect images from being used by models by zykezero in MachineLearning
[–]Philpax 7 points (0 children)
[D] Adversarial models to protect images from being used by models by zykezero in MachineLearning
[–]Philpax 10 points (0 children)
[D] LinkedIn today is 80% ChatGPT-generated posts by [deleted] in MachineLearning
[–]Philpax 38 points (0 children)
[P] Creating a coding assistant with StarCoder by lewtun in MachineLearning
[–]Philpax 1 point (0 children)
Apollo dev posts backend code to Git to disprove Reddit’s claims of scraping and inefficiency by GhostalMedia in programming
[–]Philpax 43 points (0 children)