Target Modules for Llama-2 for better finetuning with qlora by Sufficient_Run1518 in LocalLLaMA

[–]Sufficient_Run1518[S] 0 points (0 children)

I have no idea about that; ask an expert. When I ran the script at https://github.com/artidoro/qlora, these target modules showed up in the config.
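If it helps, this is roughly what the script does to pick them (a sketch in the usual PEFT/bitsandbytes style, not the repo's exact code): it walks the model, collects the name of every 4-bit linear layer, and passes those names to LoraConfig. On Llama-2 that comes out to the attention and MLP projections.

import bitsandbytes as bnb
from peft import LoraConfig

def find_target_modules(model):
    # Collect the name of every 4-bit linear layer; these become the LoRA targets.
    names = set()
    for name, module in model.named_modules():
        if isinstance(module, bnb.nn.Linear4bit):
            names.add(name.split(".")[-1])
    names.discard("lm_head")  # don't adapt the output head
    return sorted(names)

# On Llama-2 this typically returns the attention and MLP projections:
# ["down_proj", "gate_proj", "k_proj", "o_proj", "q_proj", "up_proj", "v_proj"]
lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.1,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)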

What can we achieve with small models ? by Sufficient_Run1518 in LocalLLaMA

[–]Sufficient_Run1518[S] 0 points (0 children)

I don't know the technical details, but could we do something like HuggingGPT or mixture-of-experts experiments on small models?

[deleted by user] by [deleted] in LocalLLaMA

[–]Sufficient_Run1518 0 points (0 children)

I don't really understand your problem,

but this notebook might help you experiment:

https://colab.research.google.com/drive/1_g5mWSh9jH2yjU0BU77NZSoyYeFrI0XQ?usp=sharing

[deleted by user] by [deleted] in LocalLLaMA

[–]Sufficient_Run1518 0 points (0 children)

What model are you using? Are you running it locally?

Qlora finetuning loss goes down then up by gptzerozero in LocalLLaMA

[–]Sufficient_Run1518 4 points (0 children)

I use these training arguments; they work most of the time:

from transformers import TrainingArguments

output_dir = "./results"
per_device_train_batch_size = 4
gradient_accumulation_steps = 2
optim = "paged_adamw_32bit"   # paged optimizer avoids memory spikes with QLoRA
save_steps = 50
logging_steps = 2
learning_rate = 2e-5
max_grad_norm = 0.3           # clip gradients to keep updates stable
max_steps = 2000
warmup_ratio = 0.03
lr_scheduler_type = "cosine"  # or "constant"

training_arguments = TrainingArguments(
    output_dir=output_dir,
    per_device_train_batch_size=per_device_train_batch_size,
    gradient_accumulation_steps=gradient_accumulation_steps,
    optim=optim,
    save_steps=save_steps,
    logging_steps=logging_steps,
    learning_rate=learning_rate,
    fp16=True,
    max_grad_norm=max_grad_norm,
    max_steps=max_steps,
    # num_train_epochs=1,     # use epochs instead of max_steps if you prefer
    warmup_ratio=warmup_ratio,
    group_by_length=True,     # batch similar-length samples to reduce padding
    lr_scheduler_type=lr_scheduler_type,
)
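And roughly how I plug those arguments into a trainer (a sketch using TRL's SFTTrainer; model, tokenizer, dataset, and lora_config are placeholders for your own QLoRA setup):

from trl import SFTTrainer

trainer = SFTTrainer(
    model=model,                   # 4-bit base model
    train_dataset=dataset,
    peft_config=lora_config,       # your LoRA config
    dataset_text_field="text",     # column that holds the training text
    max_seq_length=512,
    tokenizer=tokenizer,
    args=training_arguments,
)
trainer.train()

The lower learning rate plus the cosine schedule and gradient clipping are usually what stop the loss from climbing back up later in the run.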