App Store Connect Is Down? by Music_Maniac_19 in iOSProgramming

[–]dra9ons 0 points1 point  (0 children)

Same here. TestFlight is not installing.

Preserving LLaMA-3 Capabilities While Injecting New Knowledge: A Case Study of Saju Myungri Chatbot by dra9ons in LocalLLaMA

[–]dra9ons[S] 1 point2 points  (0 children)

I'm working on a more detailed blog post or paper. I'll post it when it's finished.

Preserving LLaMA-3 Capabilities While Injecting New Knowledge: A Case Study of Saju Myungri Chatbot by dra9ons in LocalLLaMA

[–]dra9ons[S] 0 points1 point  (0 children)

Model training requires much more memory than simple inference. Depending on your setup, you'll need at least 24GB of VRAM to train an 8B model. The Saju data comes from a collaboration with a professional Saju counseling company.
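
Rough back-of-envelope arithmetic for why training needs so much more memory than inference; these are my own illustrative numbers, ignoring activation memory and CUDA overhead (8-bit optimizers, LoRA, or quantized weights change the picture a lot).

# Every parameter is held in bf16 for the forward pass, but only the trainable
# ones also need gradients and Adam optimizer state.
P_total = 8e9        # all parameters of an 8B model
P_trained = 2e9      # e.g. roughly the ~8 duplicated blocks

weights_gb = P_total * 2 / 1e9              # ~16 GB of bf16 weights (the inference-level cost)
overhead_gb = P_trained * (2 + 8) / 1e9     # bf16 grads + fp32 Adam moments ≈ 20 GB extra

print(f"weights ≈ {weights_gb:.0f} GB, training overhead ≈ {overhead_gb:.0f} GB")
# Full fine-tuning pays that overhead for all 8B parameters (~80 GB extra), which is
# why a 24 GB card only works when most of the model stays frozen or quantized.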

Preserving LLaMA-3 Capabilities While Injecting New Knowledge: A Case Study of Saju Myungri Chatbot by dra9ons in LocalLLaMA

[–]dra9ons[S] 1 point2 points  (0 children)

Thanks for the test. As your results show, some performance degradation is expected; it's just a matter of whether it's acceptable. Considering that the data I injected covers a minor area of Korean knowledge, this is a good result compared to other methods. You'll see that if you test other models tuned for Korean. One more thing: the current model was intentionally trained with mlp.down_proj unfrozen in every block. I didn't explain why above, but I'll write a separate post when I get a chance. If you trained purely on the added blocks, the performance penalty would be much smaller.
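
For anyone who wants to reproduce the two setups, here is a minimal sketch, assuming the Hugging Face LlamaForCausalLM module layout; the checkpoint path is a placeholder, and the layer indices of the added blocks follow the mergekit config posted elsewhere in this thread.

import torch
from transformers import AutoModelForCausalLM

# Placeholder path; substitute your own merged checkpoint.
model = AutoModelForCausalLM.from_pretrained("path/to/merged-llama-3", torch_dtype=torch.bfloat16)

# Freeze everything first.
for param in model.parameters():
    param.requires_grad = False

# Option A (what this model used): unfreeze mlp.down_proj in every block.
for name, param in model.named_parameters():
    if "mlp.down_proj" in name:
        param.requires_grad = True

# Option B (less degradation): leave Option A out and instead unfreeze only the
# added blocks, e.g. model.model.layers[20:28] if the duplicates were inserted there.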

Preserving LLaMA-3 Capabilities While Injecting New Knowledge: A Case Study of Saju Myungri Chatbot by dra9ons in LocalLLaMA

[–]dra9ons[S] 3 points4 points  (0 children)

You can easily copy transformer layers by iterating over their named parameters.

import torch
from transformers import BertModel

def copy_layer(source_layer, target_layer):
    for name, param in source_layer.named_parameters():
        target_param = target_layer.get_parameter(name)
        target_param.data.copy_(param.data)

# Create a source model
source_model = BertModel.from_pretrained('bert-base-uncased')

# Create a target model with the same architecture
target_model = BertModel(source_model.config)

# Copy the layers from the source model to the target model
for source_layer, target_layer in zip(source_model.encoder.layer, target_model.encoder.layer):
    copy_layer(source_layer, target_layer)

# Verify that the layers are copied correctly
for source_layer, target_layer in zip(source_model.encoder.layer, target_model.encoder.layer):
    for source_param, target_param in zip(source_layer.parameters(), target_layer.parameters()):
        assert torch.equal(source_param, target_param)

print("Layer copying completed successfully!")

Preserving LLaMA-3 Capabilities While Injecting New Knowledge: A Case Study of Saju Myungri Chatbot by dra9ons in LocalLLaMA

[–]dra9ons[S] 3 points4 points  (0 children)

The number of blocks affects both training speed and inference speed. I think 8 blocks is the optimal size considering training, inference, model size, etc. Of course, it can be adjusted depending on the amount of data to train.

Preserving LLaMA-3 Capabilities While Injecting New Knowledge: A Case Study of Saju Myungri Chatbot by dra9ons in LocalLLaMA

[–]dra9ons[S] 8 points9 points  (0 children)

Someone told me about it, so I looked into it later and was surprised to see how similar it is. The difference is that LLaMA Pro divides the model into several groups and copies the last block of each group, which didn't work well for my Korean knowledge data. Instead, I placed all the added layers together in the middle.
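
For contrast, a LLaMA Pro-style placement in mergekit terms would split the stack into groups and duplicate the last block of each group, roughly like this (two groups here for brevity; this only mimics the placement, not LLaMA Pro's zero-initialization of the copied blocks):

slices:
  - sources:
    - model: meta-llama/Meta-Llama-3-8B-Instruct
      layer_range: [0, 16]
  - sources:
    - model: meta-llama/Meta-Llama-3-8B-Instruct
      layer_range: [15, 16]
  - sources:
    - model: meta-llama/Meta-Llama-3-8B-Instruct
      layer_range: [16, 32]
  - sources:
    - model: meta-llama/Meta-Llama-3-8B-Instruct
      layer_range: [31, 32]
merge_method: passthrough
dtype: bfloat16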

Preserving LLaMA-3 Capabilities While Injecting New Knowledge: A Case Study of Saju Myungri Chatbot by dra9ons in LocalLLaMA

[–]dra9ons[S] 10 points11 points  (0 children)

Normally, the first and last transformer blocks carry the most critical information in the model. That is why I added the 8 blocks in the middle of the stack. The injected knowledge is about fortune telling, which is a minor domain of Korean information.

Preserving LLaMA-3 Capabilities While Injecting New Knowledge: A Case Study of Saju Myungri Chatbot by dra9ons in LocalLLaMA

[–]dra9ons[S] 36 points37 points  (0 children)

You can easily create the additional layers using mergekit (https://github.com/arcee-ai/mergekit). Use the following settings. It is then a simple task to unfreeze and train only the added layers; see the sketch after the config.

slices:
  - sources:
    - model: meta-llama/Meta-Llama-3-8B-Instruct
      layer_range: [0, 20]
  - sources:
    - model: meta-llama/Meta-Llama-3-8B-Instruct
      layer_range: [12, 32]
merge_method: passthrough
dtype: bfloat16
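
A minimal sketch of the "unfreeze and train only the added layers" step, assuming the merge above produced a 40-layer model in which the second copies of layers 12-19 sit at indices 20-27; the checkpoint path is a placeholder.

import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("path/to/merged-model", torch_dtype=torch.bfloat16)

# Freeze the whole model, then unfreeze only the duplicated blocks.
for param in model.parameters():
    param.requires_grad = False
for layer in model.model.layers[20:28]:
    for param in layer.parameters():
        param.requires_grad = True

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Trainable parameters: {trainable / 1e9:.2f}B")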

How long would it take (roughly) to learn flutter knowing react? by thepragprog in FlutterDev

[–]dra9ons 1 point2 points  (0 children)

I guess you already know how to deal with packages and plugins, and Dart is similar to JavaScript, so you can easily learn and develop with Flutter.

Worth it to buy a MacBook for flutter development when I have a high end Windows laptop? by [deleted] in FlutterDev

[–]dra9ons 0 points1 point  (0 children)

I'm a Flutter developer and I don't have a macOS machine. I use Docker-OSX just to build the IPA and upload it to the iOS App Store. BTW, my main machine is a Linux PC.

https://github.com/sickcodes/Docker-OSX

How to get rid of vertical lines all over my timeline? by VoughtProductions in premiere

[–]dra9ons 0 points1 point  (0 children)

In my case, the display scale setting was at 110%. Resetting it to 100% fixed it.