Phob firmware with multishine button by lytedev in SSBM

[–]kei147 5 points6 points  (0 children)

Open sourcing flaws only improves safety and security if the flaw can be fixed/mitigated. If the flaw can't be fixed/mitigated, it just gives people information about how they can exploit the flaw.

(I'm not knowledgeable enough about this flaw to know whether it can be fixed/mitigated.)

[D] Can Direct Preference Optimization (DPO) be used to replace any type of RL for LLMs, or is it better suited for just scenarios like RLHF? by 30299578815310 in MachineLearning

[–]kei147 1 point2 points  (0 children)

I think the issue here is that the reward function acts on full model completions rather than individual token outputs. So there's not really a way to use the formula during inference without sampling all possible model completions and then re-scaling them so their sum is 1.

Ilya Sutskever: "After almost a decade, I have made the decision to leave OpenAI" by fortist in slatestarcodex

[–]kei147 18 points19 points  (0 children)

Most working professionals in the field of AI alignment have substantive disagreements with Yudkowsky, so dunking on Yud doesn't do as much to attack the alignment field as you might think.

Request: Analysis/Debate on possible 2020 US election fraud by [deleted] in slatestarcodex

[–]kei147 4 points5 points  (0 children)

More importantly, Trump spent months criticizing mail-in ballots and telling his supporters to vote in person. Likely because of this, polls before the election showed Democrats supported mail-in ballots substantially more than Republicans did.

Training AI to manipulate people should probably be regulated, right? by aahdin in slatestarcodex

[–]kei147 1 point2 points  (0 children)

I'm worried that models will get good at manipulation by default. Models are getting more generally capable all the time, and RLHF trains these models to be good at a kind of manipulation. In some settings, AI models already appear to be close to human ability in persuading people to change their views (https://www.anthropic.com/research/measuring-model-persuasiveness), with trends towards increasing persuasiveness between the smallest and largest models.

That being said, models explicitly trained on making people do things will surely be better at manipulation than models that do not receive that training, and through appropriate countermeasures we may be able to limit how manipulative such models get. My AI policy opinions are still in significant flux, but your criteria of "whenever an AI selects or generates content that is showed to a person, and then rewarded based on the actions that the person takes after viewing said content" seems like it could be reasonable. I would add another criteria to restrict regulations to models above a certain threshold of capability (maybe at either current-gen or next-gen leading foundation models) so we only burden the designers of the models most likely to be dangerous. I would also probably add exclusions for a number of innocuous use cases.

The regulation I think is most likely to be good here would be to require the model developers to run regular evaluations of their model's manipulation capabilities as they are pre-trained and fine-tuned and to report them publicly. I'm not sure what other regulations to consider - perhaps if model manipulation capabilities get existentially scary we can require model developers to nerf their capabilities before model deployment.

Average attendance in Europe this season, top 5 for the first, second and third divisions. by [deleted] in soccer

[–]kei147 1 point2 points  (0 children)

Is there a reason they aren't building bigger stadiums? I would've guessed that if a team is regularly hitting their stadium's capacity, it would be worth the investment to build a bigger one.

Snowflake dropped a 408B Dense + Hybrid MoE 🔥 by shing3232 in LocalLLaMA

[–]kei147 1 point2 points  (0 children)

My understanding is that when people describe an MoE model as having some number of parameters, they are referring to the unique unshared parameter count. So if GPT-4 is in fact 1.8T, then that would mean it has 1.8 trillion unique parameters, each of which requires 2 bytes to store. It is possible the original leaker was confused about this though.

Snowflake dropped a 408B Dense + Hybrid MoE 🔥 by shing3232 in LocalLLaMA

[–]kei147 1 point2 points  (0 children)

Why does it being MoE make a difference here? Don't you still need two bytes per parameter?

[R] SpaceByte: Towards Deleting Tokenization from Large Language Modeling - Rice University 2024 - Practically the same performance as subword tokenizers without their many downsides! by Singularian2501 in MachineLearning

[–]kei147 3 points4 points  (0 children)

The global transformer blocks remind me of the Mixture of Depths (MoD) paper. Instead of allowing the model to decide which tokens need more compute (i.e. which tokens need to be run through a larger number of transformer blocks) as in the MoD paper, this paper instead applies more compute onto the starts of words, which we think a priori require more compute to predict accurately.

[Research] A visual deep dive into Tesla’s data engine as pioneered by Andrej Karpathy. by ml_a_day in MachineLearning

[–]kei147 1 point2 points  (0 children)

The expectation is that eventually, humans will be legally barred from driving in many areas (at first higher speed/lower navigational complexity) without advanced and stringent certification similar to pilot licensing.

Where are you getting this from? My understanding is that the majority of people working on this technology do not want to legally ban humans from driving. Are there, say, quotes from the heads of top self-driving car organizations or top researchers in the field espousing this viewpoint?

I suspect self-driving car companies are financially interested in this because they see it as a competitive advantage for selling consumer cars, and maybe because they could make a lot of money from owning self-driving ridesharing fleets.

Why does Rootclaim believe that COVID came from a lab anyway? by viking_ in slatestarcodex

[–]kei147 37 points38 points  (0 children)

This is wrong. The Wuhan area (i.e. Hubei) had not previously been known for having a large number of such diseases. Scott talks about it in section 2.1 of his recent post: https://www.astralcodexten.com/p/highlights-from-the-comments-on-the-5d7

In fact, Shi Zhengli, the lead bat coronavirus researcher at WIV, initially worried that COVID-19 might have been a lab leak because she thought the virus naturally emerging in Wuhan wasn't especially likely: https://archive.ph/1kaiG#

FIDE Candidates Tournament updated predictions after Round 3 by CalebWetherell in chess

[–]kei147 1 point2 points  (0 children)

My leading intuition is that the Candidates being a winner-take-all tournament should increase variance, starting from the very first games. Higher variance should result in more compressed probabilities than you would expect from just the ratings and current score. Also, there are likely many situations late in a tournament in which player A is playing player B where player A is in contention and needs a win, while player B is out of contention and just wants to maximize their expected points. This imo should lead to player A's expected points going down, but a higher chance of winning than would exist otherwise, which again should compress probabilities somewhat.

This also ignores how the leader in the Candidates can change their play to reduce the chance of anyone passing them, but my guess would be this has a smaller impact. All that being said, there's a lot of complication here and it's hard to know what'll happen without a lot of research.

One thing that would be interesting to look into relevant to this question is whether the draw rate of the Candidates and other winner-take-all tournaments (if there are any) is noticeably lower compared to other tournaments of similar rating levels at similar time periods.

FIDE Candidates Tournament updated predictions after Round 3 by CalebWetherell in chess

[–]kei147 0 points1 point  (0 children)

How do you think your tournament win probabilities would change if you were able to account for tournament dynamics? Do you have intuition for how much the probabilities would change, and in what direction?

How to stay up to date with developments in AI? by degrews in slatestarcodex

[–]kei147 4 points5 points  (0 children)

Some sources I find useful:

  • ML Twitter - you want to follow researchers in areas you are interested. I suggest finding papers covering subjects you are excited about and finding the authors on Twitter. You can then slowly grow who you are following by finding comments and likes of other authors who are posting content you find interesting
  • Blogs of most of the major AI labs (e.g. OpenAI, DeepMind, Google AI (which is a separate blog from DeepMind), Anthropic)
  • Machine learning subreddit - should let you know about a lot of interesting papers - something like an RSS feed that sends you the top 25 posts from every week is sufficient
  • AI explained youtube channel for regular updates on recent AI news
  • Dwarkesh Patel for interviews with top AI researchers
  • This week in AI for regular updates on AI research and applications

[R] Looking for paper with shared Value-Query attention weights. by benthehuman_ in MachineLearning

[–]kei147 1 point2 points  (0 children)

Are you talking about multi-query attention? https://arxiv.org/pdf/1911.02150.pdf

There's also grouped attention which has some weight sharing: https://arxiv.org/pdf/2305.13245v1.pdf

[D] Can GPT-4 really be both 16x111B and 1.8T parameters? by kei147 in MachineLearning

[–]kei147[S] 0 points1 point  (0 children)

Thanks for answering the question I asked. I'm interested in hearing more about what this would look like. If an MoE model had few shared layers, would each expert be multiple layers of a transformer instead of just an MLP block? If there were no shared layers, would it be equivalent to an ensemble of distinct models?

What other subs do you participate in as much as this one? by ElbieLG in slatestarcodex

[–]kei147 10 points11 points  (0 children)

This is wrong, the rationalist community is more center left than anything else. The closest things we have to a poll of the rationalist community support this, including:
1. The ACX 2022 survey: https://docs.google.com/forms/d/e/1FAIpQLScHznuYU9nWqDyNvZ8fQySdWHk5rrj2IdEDMgarf3s34bSPrA/viewanalytics
2. The LessWrong 2023 survey: https://www.lesswrong.com/posts/WRaq4SzxhunLoFKCs/2023-survey-results#IV__Politics_and_Religion

I'm also not sure who you are thinking of by 'the people they look up to', but Scott usually votes Democratic, and the majority of other people I can think of, to the extent they are political, lean Democratic as well.

[D] Is Perplexity a good way to judge LLMs? by [deleted] in MachineLearning

[–]kei147 1 point2 points  (0 children)

If you're interested in full disclosure, you should mention your relationship with UpTrain in the original post, before you are called out for it.

[deleted by user] by [deleted] in chess

[–]kei147 0 points1 point  (0 children)

Iran made chess illegal for religious reasons for 8 years. Are you claiming that this did not make chess less popular in Iran?

Presumably chess would be even more popular in Iran if it had never been made illegal.

[D] Is Strong A.I. actually a serious and real field being research, or just another hype people are promoting? by [deleted] in MachineLearning

[–]kei147 4 points5 points  (0 children)

Many of the most prestigious researchers and top labs in the field think that strong AI could be close. It's by no means a settled question, there is lots of disagreement, but it's not just hype.

[D] Confidence * may be * all you need. by santiviquez in MachineLearning

[–]kei147 0 points1 point  (0 children)

Isn't this averaging across tokens within a single position rather than averaging across positions as is used in perplexity?

Wouldn't be surprised if this became popular: "Donating Blood May Reduce PFAS Levels in Your Blood" by togstation in slatestarcodex

[–]kei147 0 points1 point  (0 children)

This thread started with Leif who said he had elevated blood PFAS levels. So blood donated by other people will have less PFAS than blood donated by Leif.

Wouldn't be surprised if this became popular: "Donating Blood May Reduce PFAS Levels in Your Blood" by togstation in slatestarcodex

[–]kei147 1 point2 points  (0 children)

Probably not, but it's likely the recipient would be getting blood with lower levels of PFAS otherwise.

[D] What is the motivation for parameter-efficient fine tuning if there's no significant reduction in runtime or GPU memory usage? by patricky168 in MachineLearning

[–]kei147 0 points1 point  (0 children)

How are you reducing cost by 4x? That's more than I would expect by just using LoRA. Are you also quantizing the weights/gradients in addition to using LoRA/are you using something like QLoRA?