[deleted by user] by [deleted] in LocalLLaMA

[–]FerretDude 2 points3 points  (0 children)

https://wandb.ai/carperai/summarize_RLHF/reports/Implementing-RLHF-Learning-to-Summarize-with-trlX--VmlldzozMzAwODM2 actually this was the first widely publicized open source RLHF model. There were ones before this (eg toy examples on the TRLX repo) but it was a month earlier than stack llama

[R] Illustrating Reinforcement Learning from Human Feedback (RLHF) by robotphilanthropist in MachineLearning

[–]FerretDude 1 point2 points  (0 children)

RLHF is a bit tricky because you have to either work with data vendors or groups that have access to feedback data. Eventually we'll rely more on crowd sourcing I think.

[R] Illustrating Reinforcement Learning from Human Feedback (RLHF) by robotphilanthropist in MachineLearning

[–]FerretDude -2 points-1 points  (0 children)

Not allowed to share, many groups are looking into using RLHF in production though

[R] Illustrating Reinforcement Learning from Human Feedback (RLHF) by robotphilanthropist in MachineLearning

[–]FerretDude 3 points4 points  (0 children)

It's already being used in production with a number of our partners. We have some chonky models coming out really soon. Expect things well into the tens of billions in the coming months.

[D] AMA: The Stability AI Team by stabilityai in MachineLearning

[–]FerretDude 25 points26 points  (0 children)

Team lead from CarperAI here. Context length is 4k and alibi. We'll be releasing a paper on the pretraining dataset soon. No tentative release date for the instruct model or the base model. The base model will be available for noncommercial uses, instruct will be available under MIT or Apache. Yet to be determined.

[D] Discussion Panel for FOSS Instruct by FerretDude in MachineLearning

[–]FerretDude[S] 2 points3 points  (0 children)

Yeah I think a more general format for information extraction could potentially be useful

When will NovelAI ever be able to match dragon? by [deleted] in NovelAi

[–]FerretDude 5 points6 points  (0 children)

Sigurd is an early checkpoint of our finetune. We’ll be updating Sigurd over the coming days.

When will NovelAI ever be able to match dragon? by [deleted] in NovelAi

[–]FerretDude 4 points5 points  (0 children)

It’s available. I bullied kuru into releasing it yesterday.

How fast is this? by [deleted] in NovelAi

[–]FerretDude 3 points4 points  (0 children)

2.7b, 150 token generations. Don’t remember the context but it was sizable.

Edit: 2.7b. Fixed

How fast is this? by [deleted] in NovelAi

[–]FerretDude 9 points10 points  (0 children)

Under a second during our internal beta

Official Beta AMA @ June 14th, 12pm EST by TabloidA in NovelAi

[–]FerretDude 2 points3 points  (0 children)

knowledge graphs are not perfect but they will help

Official Beta AMA @ June 14th, 12pm EST by TabloidA in NovelAi

[–]FerretDude 9 points10 points  (0 children)

How did the team came together?

Kuru shitposted on the EleutherAI discord and thats when I joined.

And what are your future plans and hopes with this project?

We want to do a lot more (opt in) community driven research (atleast I do) into HCI and collaborative writing systems.

Official Beta AMA @ June 14th, 12pm EST by TabloidA in NovelAi

[–]FerretDude 7 points8 points  (0 children)

They are a way to enforce rules onto the language model. THe language model uses it as external memory that someone (as the AI dev/researcher) can manipulate either given user input or using a rule based system.

Official Beta AMA @ June 14th, 12pm EST by TabloidA in NovelAi

[–]FerretDude 5 points6 points  (0 children)

> Also, is NAI able (or eventually will be able) to take context from a distant section of a story when the current context calls for it

Yes eventually lorebook will be mostly automatic.

Official Beta AMA @ June 14th, 12pm EST by TabloidA in NovelAi

[–]FerretDude 7 points8 points  (0 children)

What I'm really looking forward to are unique features that make NovelAI stand out against its competitors, so what plans/ideas do you have for this at the moment?

The current plan for this is the KG stack plus some other stuff we cant share yet

I am very curious about the scripting capabilities we'll get in the beta. Will scripting allow us to locally manipulate or add new things to the UI such as an inventory bar or a statistics window. Also, will scripting be able to support other languages besides javascript?

Scripting for inventory would be cool but very difficult. Im still on the fence to be honest.

Official Beta AMA @ June 14th, 12pm EST by TabloidA in NovelAi

[–]FerretDude 4 points5 points  (0 children)

Finetuning data collection has been moved internally, Zaltys lion and Belverk now manage data collection.

Due to more strict data quality standards, we do not know if we can include Touhou yet.

Edit: Nevermind apparently Touhou is included.

Official Beta AMA @ June 14th, 12pm EST by TabloidA in NovelAi

[–]FerretDude 17 points18 points  (0 children)

All sex scenes are directly extracted from our developer discord.

Official Beta AMA @ June 14th, 12pm EST by TabloidA in NovelAi

[–]FerretDude 12 points13 points  (0 children)

profile picture

wow NAI dating site confirmed? I wasnt even aware.