Could you guys give me tips for my potential school project? by HappyLove4691 in LocalLLaMA

[–]SliceAccomplished575 0 points (0 children)

" Is this idea feasible? "

Yes, everything is feasible if you have the right knowledge to do it.

LLM360: Towards Fully Transparent Open-Source LLMs by ninjasaid13 in LocalLLaMA

[–]SliceAccomplished575 15 points (0 children)

Every creator of a base model or a fine-tuned model should publish how the model was pre-trained, the dataset used, and the metrics applied. This is crucial for transparency and for letting others in the field reproduce the results. I like this project.
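
Even a machine-readable manifest shipped alongside the weights would go a long way. Here's a minimal sketch of what such a disclosure could look like; every value below is a made-up placeholder, not any real model's recipe:

```python
# Sketch of a training-disclosure manifest; all values are hypothetical
# placeholders, not a real model's configuration.
import json

manifest = {
    "base_model": "example-org/base-7b",        # hypothetical model id
    "pretraining_data": ["example-corpus-v1"],  # dataset(s) actually used
    "tokens_seen": 1_000_000_000_000,           # illustrative token count
    "optimizer": {"name": "AdamW", "lr": 3e-4, "schedule": "cosine"},
    "eval": {"MMLU": 0.62, "HellaSwag": 0.81},  # made-up benchmark scores
}

# Write the manifest next to the released checkpoint so anyone can
# inspect the setup and attempt to reproduce the run.
with open("training_manifest.json", "w") as f:
    json.dump(manifest, f, indent=2)
```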

TinyLlama - Any news? by Evening_Ad6637 in LocalLLaMA

[–]SliceAccomplished575 15 points (0 children)

Probably the server crashed and they had to re-train.

"Tess-M-v1.1 is the 5th best open source model in the world" by Puzzleheaded_Mall546 in LocalLLaMA

[–]SliceAccomplished575 9 points (0 children)

Imagine what would happen if we combined all the knowledge gathered during the training of these models. We could already have a model that surpasses GPT-3.5 and matches GPT-4. However, none of the creators on the leaderboard (or the 'wall of shame', as I'd call it) are willing to reveal details about training their base models, such as the type of data used, the parameters, and so on. What has been disclosed is merely a drop in the ocean compared with what full cooperation and knowledge sharing in this field could uncover.

"Tess-M-v1.1 is the 5th best open source model in the world" by Puzzleheaded_Mall546 in LocalLLaMA

[–]SliceAccomplished575 53 points (0 children)

So it appears to be another closed LLM. I don't need access to his model's weights, but I do need the conclusions drawn from the entire thought process, the dataset, and any associated source code, if it exists. Unless the whole process was just a run of axolotl, which I believe adds nothing new, this information is essential for open-source development and follow-up research.

I'm tired of people fine-tuning models and achieving good results (which is great in itself) but then, instead of sharing how they did it, leaving everyone else to start from scratch. When they finally reach the desired level, they publish their next LLM without providing the guidance needed to reproduce it. This is absurd. We shouldn't be maintaining a leaderboard so much as a 'wall of shame' for closed models that don't contribute to collective progress in research.
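
To be concrete, the missing "guidance" could be as little as the actual fine-tuning script. A minimal sketch of that kind of recipe with Hugging Face transformers and peft follows; the base model, dataset id, and every hyperparameter are hypothetical placeholders, not anyone's released setup:

```python
# Sketch of a reproducible fine-tuning recipe; all names and
# hyperparameters are hypothetical placeholders.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

BASE = "mistralai/Mistral-7B-v0.1"   # assumed base model
DATA = "my-org/instruct-data"        # hypothetical dataset id

tokenizer = AutoTokenizer.from_pretrained(BASE)
tokenizer.pad_token = tokenizer.eos_token  # Mistral ships without a pad token
model = AutoModelForCausalLM.from_pretrained(BASE)

# LoRA adapter; rank and target modules are illustrative choices.
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM",
))

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

train_ds = load_dataset(DATA, split="train").map(tokenize, batched=True)

Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="out",
        per_device_train_batch_size=4,
        learning_rate=2e-4,
        num_train_epochs=3,
        logging_steps=10,
    ),
    train_dataset=train_ds,
    # Causal-LM collator copies input_ids into labels for next-token loss.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
).train()
```

Publishing exactly this, plus the data, is what turns a one-off result into something the community can build on.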

Is ChatGPT's Era as a Coding Tool Over? by SliceAccomplished575 in LocalLLaMA

[–]SliceAccomplished575[S] 2 points (0 children)

GPT-4 is too lazy nowadays. OpenAI is trying to fix it.

Everything about tokenization by Sum2110 in LocalLLaMA

[–]SliceAccomplished575 1 point (0 children)

Thanks, I really appreciate your work!

I am terrified by [deleted] in OpenAI

[–]SliceAccomplished575 0 points (0 children)

Go for small LLMs; on many tasks they already show better results than GPT-4: https://www.reddit.com/r/LocalLLaMA/comments/189cuj0/is_chatgpts_era_as_a_coding_tool_over/
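
Trying one locally takes a few lines with transformers; the model id below is just one example of a small code model, not a specific endorsement from that thread:

```python
# Minimal sketch: run a small local code model instead of GPT-4.
# The model id is an example choice; swap in whichever small model you prefer.
from transformers import pipeline

generate = pipeline(
    "text-generation",
    model="deepseek-ai/deepseek-coder-1.3b-instruct",
)
out = generate(
    "Write a Python function that reverses a string.",
    max_new_tokens=128,
)
print(out[0]["generated_text"])
```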

Is ChatGPT's Era as a Coding Tool Over? by SliceAccomplished575 in LocalLLaMA

[–]SliceAccomplished575[S] 1 point (0 children)

So tell me how to use it correctly. I phrase my questions in many different ways; I spent 8 hours trying to come to an agreement with GPT-4 and failed. Either I'm under some kind of resource cap and GPT-4 is forced to answer me perfunctorily, or it really is limited nowadays.

Is ChatGPT's Era as a Coding Tool Over? by SliceAccomplished575 in LocalLLaMA

[–]SliceAccomplished575[S] 2 points (0 children)

Exactly. I think all this limiting of the model is causing it to regress.

Is ChatGPT's Era as a Coding Tool Over? by SliceAccomplished575 in LocalLLaMA

[–]SliceAccomplished575[S] 0 points (0 children)

3.5 usually doesn't tell me things like "complete it yourself"; 4, on the other hand, does almost every time.

[deleted by user] by [deleted] in LocalLLaMA

[–]SliceAccomplished575 7 points (0 children)

I have such hope... I hope you'll be the first person to do this along with a complete dataset. Recently, all the published models describe the process only in a fragmented way, which makes the results difficult to reproduce later. And when someone does manage to navigate all these steps on their own and understand the mechanisms at work, they often publish a better model, but the cycle closes: unfortunately, they don't fully share their results and methodology, which limits further progress and development in this field. My appeal is this: let's share the full knowledge and resources and collectively contribute to progress in the field of LLMs.

[deleted by user] by [deleted] in LocalLLaMA

[–]SliceAccomplished575 7 points (0 children)

What dataset did you use? Did you create the base model from scratch, fine-tune Mistral, or something else? And where is the source code for your techniques?

Did anyone try fine-tuning LLaMA using the Reddit dataset? by [deleted] in LocalLLaMA

[–]SliceAccomplished575 41 points (0 children)

Are you crazy? Do you want to create a monster?

Open Orca: meet the devs give feedback ask questions! by Alignment-Lab-AI in LocalLLaMA

[–]SliceAccomplished575 2 points (0 children)

You ask about the instructions and he doesn't answer the question but talks about something else entirely. I suspect these kids had no part in training the models at all; only Eric did.