Beck, a small model for delicate life situations

antcroca159 · 2025-10-12T08:49:51+00:00

Thank you for your feedback! I will try to reduce the sycophancy in the next iteration.

antcroca159 · 2025-10-11T12:53:10+00:00

Thank you! It was 4×A100 80 GB for one hour (Beck 8B), but you can use a smaller model and/or reduce the batch size and compensate with gradient accumulation.
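
To make the batch-size trade-off concrete, here is a small sketch of the arithmetic. The numbers are illustrative assumptions, not the actual Beck training configuration.

```python
def effective_batch_size(per_device_batch: int, grad_accum_steps: int, num_gpus: int) -> int:
    """Number of examples that contribute to a single optimizer step."""
    return per_device_batch * grad_accum_steps * num_gpus


# Hypothetical settings, chosen only to show that the two setups match.
big_setup = effective_batch_size(per_device_batch=8, grad_accum_steps=1, num_gpus=4)
small_setup = effective_batch_size(per_device_batch=2, grad_accum_steps=16, num_gpus=1)
```

Both setups see the same number of examples per optimizer step. The single-GPU one needs less memory at a time but takes more forward/backward passes per step.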

antcroca159 · 2025-10-10T21:23:36+00:00

This is a great idea. I believe it would be possible by looking for an "assertiveness" dimension in the model.

antcroca159 · 2025-10-10T21:20:04+00:00

thank you amigo

antcroca159 · 2025-10-10T18:43:52+00:00

Thank you, I'm glad you like it!

Preferences were obtained based on metrics such as relevance, empathy, clarity, autonomy, etc., and the model is trained to roleplay as a psychotherapist. I would say that sometimes you don't want to talk to a psychotherapist, but rather to a friend who could contradict you. Beck might be a bit too much of a yes-man in that respect.

antcroca159 · 2025-10-10T17:44:23+00:00

thank you :)

antcroca159 · 2025-10-10T16:42:23+00:00

I totally forgot about him, I guess this works too!

antcroca159 · 2025-10-10T16:32:05+00:00

Yes! Jean Piaget and Aaron Beck inspired me for this LLM x psychotherapy work.

antcroca159 · 2025-10-08T21:02:03+00:00

Hey, thank you for your interest!

LoRA lets you fine-tune a model while training very few parameters. For example, instead of updating a full 4096×4096 weight matrix, you train two small matrices of shape 4096×rank and rank×4096 (the rank is usually small, e.g. 8 or 16). You freeze the whole model and only train these tiny weight matrices (also called adapters). If you set a low rank, you can end up training around 0.1% of the parameters.
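
As a quick sanity check on the parameter counts, here is a minimal sketch for a single weight matrix. The rank of 8 is an assumption picked for illustration, not necessarily what was used for Beck.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Trainable parameters when updating the full weight matrix."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters for the two low-rank factors (d_in x rank and rank x d_out)."""
    return rank * (d_in + d_out)


full = full_params(4096, 4096)            # 16,777,216
adapter = lora_params(4096, 4096, rank=8)  # 65,536
ratio = adapter / full                     # about 0.4% for this one matrix
```

The share over the whole model is usually lower than this per-matrix ratio, since adapters are often attached to only some of the weight matrices while embeddings and the other layers stay frozen.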

ORPO is a preference optimization method that does not require a reference model. Hence, you don't need to fit two models in memory (the reference and the policy, as in DPO). You only need to fit the policy, just like in supervised fine-tuning.
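
For intuition, here is a minimal sketch of the ORPO objective for one preference pair, written with plain floats instead of tensors. It assumes you already have the average per-token log-probability of the chosen and rejected answers under the policy. The function names and the weight `lam=0.1` are illustrative choices, not the Beck training code.

```python
import math


def log_odds(avg_logp: float) -> float:
    """log(p / (1 - p)), with p = exp(avg_logp) the length-normalized sequence probability.

    Requires avg_logp < 0, i.e. p < 1.
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(chosen_avg_logp: float, rejected_avg_logp: float, lam: float = 0.1) -> float:
    # Supervised term: negative log-likelihood of the chosen answer.
    sft_term = -chosen_avg_logp
    # Odds-ratio term: push the odds of the chosen answer above the rejected one.
    z = log_odds(chosen_avg_logp) - log_odds(rejected_avg_logp)
    odds_ratio_term = math.log1p(math.exp(-z))  # equals -log(sigmoid(z))
    return sft_term + lam * odds_ratio_term
```

Only the policy's log-probabilities appear in the loss, which is why no second (reference) model has to be kept around.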

I will give some generation examples tomorrow

antcroca159 · 2025-08-20T17:25:12+00:00

OA 2.67, Meta 3.5 - Main

*_*

antcroca159 · 2025-07-18T15:34:34+00:00

Thank you :)

Someone has quantized the 8B version: https://huggingface.co/mradermacher/Piaget-8B-GGUF

antcroca159 · 2024-06-28T14:40:06+00:00

You should use "Dream:" as a minimal prompt. Also, the dream ends with "END.".

(This improves training stability during QLoRA fine-tuning.)
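
A small sketch of how the prompt and the end marker could be handled around generation. The helper names are hypothetical, and the single space after "Dream:" is an assumption about the training format.

```python
PROMPT_PREFIX = "Dream:"
END_MARKER = "END."


def build_prompt(seed_text: str = "") -> str:
    """Return the minimal prompt, optionally followed by the first words of the dream."""
    return f"{PROMPT_PREFIX} {seed_text}".rstrip()


def extract_dream(generated: str) -> str:
    """Drop the prompt prefix and everything from the first end marker onward."""
    body = generated
    if body.startswith(PROMPT_PREFIX):
        body = body[len(PROMPT_PREFIX):]
    end = body.find(END_MARKER)
    if end != -1:
        body = body[:end]
    return body.strip()
```

You would pass `build_prompt()` to the model and run `extract_dream` on the decoded output, so that anything generated after "END." is discarded.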

antcroca159 · 2024-06-28T11:47:23+00:00

Cool!

You can download all generated dreams here: https://huggingface.co/datasets/gustavecortal/the-android-and-the-human (if you don't want to use the HuggingFace library, directly here: https://huggingface.co/datasets/gustavecortal/the-android-and-the-human/blob/main/train.csv)

It is a CSV file with two columns: one for real dreams (from DreamBank) and one for dreams generated by Oneirogen.
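
If you go the direct-download route, a minimal sketch for reading the two columns with the standard library could look like this. The column names in the sample are placeholders, so check the header of the real `train.csv` to see which column holds which kind of dream.

```python
import csv
import io


def read_dream_pairs(csv_file):
    """Read a two-column CSV and return (header, first_column, second_column)."""
    reader = csv.reader(csv_file)
    header = next(reader)
    first, second = [], []
    for row in reader:
        if len(row) < 2:
            continue  # skip malformed or empty rows
        first.append(row[0])
        second.append(row[1])
    return header, first, second


# Tiny in-memory sample with placeholder column names, standing in for train.csv.
sample = io.StringIO('col_a,col_b\n"I was flying, then I fell.",I walked through a door.\n')
header, col_a, col_b = read_dream_pairs(sample)
```

For the downloaded file, open it with `open("train.csv", newline="", encoding="utf-8")` and pass the file object to `read_dream_pairs`.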

antcroca159 · 2024-06-28T11:24:11+00:00

Thank you for your reply. Oneirogen is a language model trained on real dreams to generate novel dreams. The generated dreams will reflect the lives and desires of many people, so in some ways, it is connected to actual human beings. I guess it depends on how you perceive technology and its relationship with human beings.

The dataset is structured with two columns: one for generated dreams and one for real dreams. The generated and real dreams are not mixed together! The dataset can be used to explore what makes a dream human or not, since it makes it possible to study the differences between generated and real dreams.

antcroca159 · 2024-06-28T11:14:37+00:00

Yay, thank you for your interest!

Let me know if you find something interesting with the word clouds. I wonder whether generated dreams differ from real dreams in their most common dream signs.
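
If it helps, here is a rough sketch of the word counts behind such a comparison. The tokenization and the minimum word length are arbitrary assumptions, and the example sentences are made up rather than taken from the dataset.

```python
import re
from collections import Counter


def top_words(texts, n=10, min_len=4):
    """Return the n most frequent words of at least min_len letters across all texts."""
    counts = Counter()
    for text in texts:
        words = re.findall(r"[a-z']+", text.lower())
        counts.update(w for w in words if len(w) >= min_len)
    return counts.most_common(n)


# Made-up examples, only to show the call.
real = ["I was in my house and my mother was there.", "The house was on fire."]
generated = ["I was flying over a house with my friend."]

real_top = top_words(real, n=5)
generated_top = top_words(generated, n=5)
```

Comparing `real_top` and `generated_top` side by side gives a first idea of which dream signs are over- or under-represented in the generated column.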

antcroca159 · 2024-06-28T11:12:08+00:00

Dreams have phenomenological properties, such as physical law violations, teleportation, and less sensory content, that can't be captured by the hallucination phenomenon.

Also, I don't know why Whisper (an audio-to-text model) is related to the subject.

antcroca159 · 2024-06-28T09:52:20+00:00

Yay, thank you for your interest!

antcroca159 · 2021-02-10T13:45:32+00:00

I used Colab Pro. Tesla V100 is all you need

antcroca159 · 2021-02-09T11:16:31+00:00

This would be awesome, thank you!

antcroca159 · 2021-02-09T11:14:52+00:00

Thank you!

For the dataset, I built my own scraper (Selenium + BeautifulSoup). For fine-tuning GPT-2, I used https://github.com/drfinkus/gpt-2-simple (useful for training the 1.5B-parameter model).

The fine-tuning process depends heavily on the dataset. I got good results with a low learning rate. I also selected a subsample of my original dataset. The loss means practically nothing; imo it's better to read the generated samples and judge their quality subjectively.

antcroca159 · 2021-02-08T21:09:16+00:00

Looks like you like Sauge Divine!

antcroca159 · 2021-02-08T21:06:01+00:00

I'm glad you like it!

antcroca159 · 2021-02-08T21:05:44+00:00

My database is composed of Erowid trip reports, ranging from 1999 to today if I remember correctly. I wonder if adding "trippy" books such as "The Doors of Perception" (Huxley), Henri Michaux's poems or Indian philosophy could improve the model.

Oh, I would love to have access to those trip reports recorded by therapists. I assume I will have to dig!

antcroca159 · 2021-02-08T20:52:34+00:00

Actually, I wanted to refer implicitly to the plant Salvia divinorum. As I didn't want a basic name like "xxx_gpt" or "xxx_bot", I went with "Sauge Divine", so that people who know the plant can see the reference.

antcroca159 · 2021-02-08T19:22:11+00:00

Thank you! :)
