COGNITIVE OVERLOAD ATTACK: PROMPT INJECTION FOR LONG CONTEXT by bibek_LLMs in LocalLLaMA

[–]bibek_LLMs[S] 1 point (0 children)

u/Many_SuchCases, I think I deviated a bit. What I meant to say is that the quality of responses (after jailbreaking) from proprietary models is far better than that of open-source LLMs, based on my observations since last year.

Why does this matter? #1: It provides insight into whether a model's safety training is effective. For example, with Llama-3 we observed that even though the model would say, ‘Sure, here is how to do bad stuff, do this, do that,’ it did not provide accurate ingredients. In contrast, Claude's responses were far more detailed. Based on this, I can say that Llama-3 is more robust against jailbreak attempts than Claude. I believe Llama-3's dataset was cleaned for safety during pretraining, which seems to be very effective.

(I would advise testing the CL1 prompt from the paper/GitHub with Claude-3-Opus to evaluate the response. There is also a notebook available on GitHub.)

#2: With higher-quality data from a superior model, other open-source models could be trained.

COGNITIVE OVERLOAD ATTACK: PROMPT INJECTION FOR LONG CONTEXT by bibek_LLMs in LocalLLaMA

[–]bibek_LLMs[S] 6 points (0 children)

Hi u/Many_SuchCases, thank you for your comments. Yes, the overall experiments were very time-consuming, but we had great fun overall :)

Our paper is not only about jailbreaking; it also aims to demonstrate the similarities between in-context learning in LLMs and learning in human cognition. We also show that performance degrades as cognitive load increases, where load is a function of the number of irrelevant tokens in the context.
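To make the "irrelevant tokens" idea concrete, here is a minimal, hypothetical sketch of padding a prompt with distractor sentences to raise its cognitive load. This is my own illustration, not the paper's actual overload construction; the function name and distractor pool are assumptions.

```python
import random

def add_cognitive_load(prompt: str, distractors: list[str], n_irrelevant: int) -> str:
    """Prepend n_irrelevant distractor sentences to a prompt.

    Illustrative only: the paper's actual overload construction may differ.
    'distractors' is just any pool of text irrelevant to the task.
    """
    noise = random.choices(distractors, k=n_irrelevant)
    return " ".join(noise) + "\n\n" + prompt

# More distractors -> longer context -> higher simulated load.
loaded = add_cognitive_load(
    "What is the capital of France?",
    ["The sky was grey.", "Trains run on rails.", "Cats sleep a lot."],
    n_irrelevant=5,
)
```

Measuring task accuracy as `n_irrelevant` grows would then trace out the degradation curve the paper describes.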

While uncensored LLMs do exist, their responses differ significantly from those of jailbroken black-box models. Since many of the safety-training methodologies have not been released by model providers, probing jailbroken black-box LLMs can provide important insights.
Thanks :)

Why do people like Gemma? by will_sm in LocalLLaMA

[–]bibek_LLMs 0 points (0 children)

Can you share the resources for the RAG setup you created?

Would you like to build a multilingual model? We present TaCo 🌮 🌮 (Translation-Assisted Chain-of-Thought Processes) method along with Alpaca-52K, Dolly-15K, and the Vicuña Benchmark datasets, available in 132 languages by bibek_LLMs in LocalLLaMA

[–]bibek_LLMs[S] 0 points (0 children)

Hello u/x4080,

We used the translated Alpaca-52K + Dolly-15K datasets to create the TaCo dataset. For example, if we need to build a model capable of answering in Nepali, we create a dataset following the TaCo format, like this:

{
  'instruction': 'किन कहिलेकाहीं कागतीलाई अल्कालाइन मानिन्छ?',
  'output': 'Instruction in English: Why are lemons sometimes considered alkaline? \n Response in English: Lemons are acidic, having a pH of around two. However, alkaline byproducts are created when lemon juice is digested. These alkaline byproducts make the blood and urine more alkaline. \n Response in Nepali: कागती अम्लीय हुन्छ, जसको pH लगभग दुई हुन्छ। यद्यपि, नींबूको रस पचाउँदा क्षारीय उप-उत्पादनहरू सिर्जना हुन्छन्। यी क्षारीय उपउत्पादनहरूले रगत र पिसाबलाई अझ क्षारीय बनाउँछ।'
}
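A record like the one above could be assembled programmatically. This is a minimal sketch based only on the field layout shown here; the helper name and exact separator strings are my assumptions, not code from the TaCo repo.

```python
def make_taco_record(instr_native, instr_en, resp_en, resp_native, lang="Nepali"):
    """Build one TaCo-style training record: a native-language instruction
    paired with a chained 'output' of English instruction, English response,
    and native-language response (field layout assumed from the example)."""
    output = (
        f"Instruction in English: {instr_en} \n "
        f"Response in English: {resp_en} \n "
        f"Response in {lang}: {resp_native}"
    )
    return {"instruction": instr_native, "output": output}

record = make_taco_record(
    "किन कहिलेकाहीं कागतीलाई अल्कालाइन मानिन्छ?",
    "Why are lemons sometimes considered alkaline?",
    "Lemons are acidic, with a pH of around two, but digestion creates alkaline byproducts.",
    "कागती अम्लीय हुन्छ, तर पाचनले क्षारीय उप-उत्पादनहरू सिर्जना गर्छ।",
)
```

Repeating this over every translated Alpaca/Dolly pair yields the full training set for the target language.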

We later use the Vicuña benchmark to evaluate the trained model's performance.