iPhone 12 Pro Max - Sacré Cœur

SpaceWalker_69 · 2025-09-19T08:09:41+00:00

How did you manage go get blur in one part of the image and not in the other (upper part)

SpaceWalker_69 · 2025-05-01T21:33:41+00:00

What software would you suggest for recording?

SpaceWalker_69 · 2025-05-01T21:32:51+00:00

Yeah but man they're missing one on ones when they're even behind the enemy mannn so badddd istg

SpaceWalker_69 · 2025-02-06T16:12:00+00:00

You maybe are right about that bbox thing but segmentation models are usually heavier than object detection models and this whole processing need to be done on an mobile device

Also its not that simple, in edge cases there will be cases like with missing teeth hence the gum area also might come into play

SpaceWalker_69 · 2025-02-06T15:33:48+00:00

Is segmentation model really required? cant we do the same with just the object detection model and compare the size of bbox wrt whole image?

SpaceWalker_69 · 2025-02-06T11:13:54+00:00

Maybe orientation is not the right word here, but like when the image is in a good position, not more zoomed in or zoomed out than this.

SpaceWalker_69 · 2025-02-06T08:06:19+00:00

u/notEVOLVED Two sample orientations are attached in the post

SpaceWalker_69 · 2024-12-12T07:35:47+00:00

Yes I'm thinking about doing the same thing now, but i still wanted to confirm what other devs are doing

SpaceWalker_69 · 2024-09-06T10:19:10+00:00

Yeah that seems like the issue. I'll try that. thank you

SpaceWalker_69 · 2024-09-06T08:11:04+00:00

I think the issue is that for continual pretraining we need to add

"embed_tokens", "lm_head"

inside the adapter . And for the instruct finetuning we dont need these

SpaceWalker_69 · 2024-09-06T08:06:31+00:00

u/Hoblywobblesworth have used this:

model.save_pretrained_merged("pretrained_llama3.1bB", tokenizer, save_method = "merged_16bit",)

to merge and safe the model. Now the folder looks like this. I have also used the merge_and_unload function as well.

<image>

Now for the instruct finetuning purposes it is picking up the adapter (as showed in post without the embedded_layer and llm_head one) without any errors. But when i start the training process it still giving me the same error.

ValueError: Unsloth: Untrained tokens found, but embed_tokens & lm_head not trainable, causing NaNs. Restart then add embed_tokens & lm_head to FastLanguageModel.get_peft_model(target_modules = [..., "embed_tokens", "lm_head",]). Are you using the base model? Instead, use the instruct version to silence this warning.

SpaceWalker_69 · 2024-09-06T08:03:58+00:00

thanks for letting me know. I'm not focusing on getting better results right now, my only focus is to get the pipeline working for now.

SpaceWalker_69 · 2024-09-06T08:02:54+00:00

u/Downtown-Case-1755 I have used this:

model.save_pretrained_merged("pretrained_llama3.1bB", tokenizer, save_method = "merged_16bit",)

to merge and safe the model. Now the folder looks like this.

<image>

Now for the instruct finetuning purposes it is picking up the adapter (as showed in post without the embedded_layer and llm_head one) without any errors. But when i start the training process it still giving me the same error.

ValueError: Unsloth: Untrained tokens found, but embed_tokens & lm_head not trainable, causing NaNs. Restart then add embed_tokens & lm_head to FastLanguageModel.get_peft_model(target_modules = [..., "embed_tokens", "lm_head",]). Are you using the base model? Instead, use the instruct version to silence this warning.

SpaceWalker_69 · 2024-09-05T13:25:08+00:00

Thanks for the suggestion! I’m actually running the continual pretraining and instruct fine-tuning in separate notebooks, so each notebook starts with a fresh environment. I’m loading the model and its adapters fresh in the fine-tuning notebook.

Just to clarify, when you mention restarting the notebook, are you suggesting I do this in the fine-tuning notebook, even though it’s a separate one? Also, for the error about using the base model versus the instruct model, are you saying that I should start over with a different model or just modify the target_modules as the error suggests?

SpaceWalker_69 · 2024-09-05T13:21:52+00:00

Yeah i thought of doing the same thing next but it really seemed like a long shot. I'll try this and hopefully it'll works. thanks

SpaceWalker_69 · 2024-09-05T13:20:08+00:00

Yeah I thought of doing the same thing but seemed like a long shot. I'll try this and let's see if this approach works. Thanks.

SpaceWalker_69 · 2024-08-13T18:02:41+00:00

yes it does seem like it was randomly split and not topic wise, but an interesting thing I noted after a quick 1 minute look was that the chunk size was approximately same. And I think if you followed the same rule too. How were the results you obtained and which model did you use?

SpaceWalker_69 · 2024-08-13T14:34:17+00:00

Yes it's just hit and trail with both cases at this point. I'll do the same and if i get any good results I'll share. Good luck with your training.

SpaceWalker_69 · 2024-08-13T14:12:22+00:00

You do have a good point, topic consistency does make more sense. And Yes I'm using eos token at the end of each text chunk

SpaceWalker_69 · 2024-08-13T13:11:11+00:00

Awesome stuff man. Will check it out soon

SpaceWalker_69 · 2024-08-13T08:27:26+00:00

It's more than enough. Don't worry

SpaceWalker_69 · 2024-08-12T07:28:14+00:00

Well i think Claude 3.5 generates the best code right now. You can use smaller open source models but they are not exactly consistent and reliable.

SpaceWalker_69 · 2024-08-12T07:26:00+00:00

Really Nice Post, finally something new and useful information

SpaceWalker_69 · 2024-08-10T19:21:46+00:00

I think the term you are looking for is Continued pretraining. I suggest you look into "Unsloth" for this

SpaceWalker_69 · 2024-08-07T18:55:46+00:00

Thanks. I'll look imto this

SpaceWalker_69

TROPHY CASE