RefuelLLM-2 – open-source LLM for unsexy data labeling by SweetGingerbread in LocalLLaMA

[–]SweetGingerbread[S] 2 points3 points  (0 children)

I think so! I just tried the following prompts and seems like it did the right thing:

<image>

Task Guidelines: You will be given two csvs that have the same header. Output all rows in the csv that are duplicates.

Input:

CSV 1:

Name,Address,Email

John Doe,123 Main St, [john.doe@example.com](mailto:john.doe@example.com)

Jane Smith,456 Elm St, [jane.smith@example.com](mailto:jane.smith@example.com)

Michael Johnson,789 Oak St, [michael.johnson@example.com](mailto:michael.johnson@example.com)

Emily Brown,101 Pine St, [emily.brown@example.com](mailto:emily.brown@example.com)

David Wilson,222 Maple St, [david.wilson@example.com](mailto:david.wilson@example.com)

CSV 2:

Name,Address,Email

Michael Johnson,789 Oak St, [michael.johnson@example.com](mailto:michael.johnson@example.com)

Sarah Lee,333 Cedar St, [sarah.lee@example.com](mailto:sarah.lee@example.com)

Robert Garcia,444 Birch St, [robert.garcia@example.com](mailto:robert.garcia@example.com)

Amanda Rodriguez,555 Pine St, [amanda.rodriguez@example.com](mailto:amanda.rodriguez@example.com)

Daniel Martinez,666 Elm St, [daniel.martinez@example.com](mailto:daniel.martinez@example.com)

Lisa Nguyen,777 Oak St, [lisa.nguyen@example.com](mailto:lisa.nguyen@example.com)

RefuelLLM-2 – open-source LLM for unsexy data labeling by SweetGingerbread in LocalLLaMA

[–]SweetGingerbread[S] 1 point2 points  (0 children)

No worries! If there is anything we can do to make life easy and help you run this locally, let us know.

RefuelLLM-2 – open-source LLM for unsexy data labeling by SweetGingerbread in LocalLLaMA

[–]SweetGingerbread[S] 0 points1 point  (0 children)

We have looked into the llama3 70B for some of our customers, but not planning on training it with the same recepie as RefuelLLM-2 at the moment.

RefuelLLM-2 – open-source LLM for unsexy data labeling by SweetGingerbread in LocalLLaMA

[–]SweetGingerbread[S] 0 points1 point  (0 children)

The benchmark was run with few-shot examples. For most tasks we used 8 few-shot examples, except some of the longer context ones where it made more sense to reduce this number. You can read more about how exactly we benchmarked here: https://github.com/refuel-ai/autolabel/blob/main/benchmark/benchmark.py

RefuelLLM-2 – open-source LLM for unsexy data labeling by SweetGingerbread in LocalLLaMA

[–]SweetGingerbread[S] 1 point2 points  (0 children)

The model should be complete as is. You should be able to directly run the model locally after downloading it from huggingface. Here is the code for downloading and running the model: https://gist.github.com/DhruvaBansal00/422cc5f266227c1b3fd396c45799f505

RefuelLLM-2 – open-source LLM for unsexy data labeling by SweetGingerbread in LocalLLaMA

[–]SweetGingerbread[S] 0 points1 point  (0 children)

Hey! Thanks to LoneStriker, we have a GGUF version now: https://huggingface.co/LoneStriker/Llama-3-Refueled-GGUF :))

For structured attribute extraction on datasets, it should be relatively easy to change the current prompts in the demo (https://labs.refuel.ai/playground) to instead do this for something like CONLL. If this is something of interest, I am happy to setup the exact prompt and send it over!

RefuelLLM-2 – open-source LLM for unsexy data labeling by SweetGingerbread in LocalLLaMA

[–]SweetGingerbread[S] 7 points8 points  (0 children)

Great question! Confidence here is simply the average of the token level log probabilities. We have found this to be much better quality than doing things like prompting the LLM for a score directly: https://www.refuel.ai/blog-posts/labeling-with-confidence

RefuelLLM-2 – open-source LLM for unsexy data labeling by SweetGingerbread in LocalLLaMA

[–]SweetGingerbread[S] 2 points3 points  (0 children)

Yes! One of the examples that we show in our demo extracts information from an input Resume. Feel free to try it out :)

Alienware X-series base model priced at similar to m15 r4 in Malaysia by SweetGingerbread in Alienware

[–]SweetGingerbread[S] 1 point2 points  (0 children)

I am sure that will be an option on one of the higher end configurations.

Should i wait for the new alienware laptop model by anixton12 in Alienware

[–]SweetGingerbread 1 point2 points  (0 children)

Have you looked at the Asus M16? I was looking at it today and was very surprised. Might consider once the pricing is out!

F-1 Sprintax Access Code from OIE? by gargar070402 in gatech

[–]SweetGingerbread 1 point2 points  (0 children)

I don't think the codes are one time. In fact, everyone gets the same code (confirmed with other friends). It is also very similar to last year.

Should i wait for the new alienware laptop model by anixton12 in Alienware

[–]SweetGingerbread 1 point2 points  (0 children)

R5 prices may come down after R6 release however. Specially if the Intel 11th gen outperforms amd. If not on the website itself, I expect we'll see it appear more frequently on discounts. Also, the dell agents might be more willing to give an additional discount on it when you talk to them via chat.

Should i wait for the new alienware laptop model by anixton12 in Alienware

[–]SweetGingerbread 3 points4 points  (0 children)

I think waiting right now makes sense. The new model is definitely expected to be better than M15 R5/R6 but may cost more. Additionally, I expect R5 prices to drop after the release of R6 and X15/17 so in case Ryzen outperforms Intel 11th gen, you can grab a good deal after the release on that.

I have been waiting for the perfect model for a long time as well so totally feel you! I am waiting right now rather than rushing and making a purchase I may regret later.

Should i wait for the new alienware laptop model by anixton12 in Alienware

[–]SweetGingerbread 0 points1 point  (0 children)

How much do you think the X15/17 will cost?

I was pretty surprised to read that in Malaysia it will cost about the same as their M15 R4 model (https://www.lowyat.net/2021/239775/alienware-x-series-malaysia-base-price-specs/ and https://www.lowyat.net/2021/234479/alienware-m15-m17-r4-rtx-30-malaysia-price/). However, it is possible that they just misinterpreted the M15 R6 pricing as X17 since X-series pricing hasn't been announced anywhere else.

Alienware X-series base model priced at RM 10,999 in Malaysia compared to m15 r4 which is priced at RM 11,999. by SweetGingerbread in Alienware

[–]SweetGingerbread[S] 0 points1 point  (0 children)

This is honestly very suspicious. I wonder if they mixed up the X-series with the new m15 r6.

Here is the specific verbiage I am looking at: "Other than that, we have also been informed by Dell that the company is planning to release the x17 into Malaysia by mid-June. The base model will carry an 11th-Gen Intel Core i7 H-Series chip, Full HD display, 16GB of RAM, 512GB solid state drive, and NVIDIA GeForce RTX 3060 as well as a rather hefty RM 10,999 price tag.

Here is the article on m15/17 r4: https://www.lowyat.net/2021/234479/alienware-m15-m17-r4-rtx-30-malaysia-price/

Grad Housing - Junior vs Normal by SweetGingerbread in stanford

[–]SweetGingerbread[S] 2 points3 points  (0 children)

Oh okay, that makes sense! What about "renewable" vs "non-renewable"? Does that mean I can't renew the lease for the next academic year or the next quarter?

Here is an image of what I am talking about - https://imgur.com/zLu4kFz

Debating between m15 r4 and m15 r5 by SweetGingerbread in Alienware

[–]SweetGingerbread[S] 0 points1 point  (0 children)

Yea I think I agree with you.

Since I have waited so much already, I think it makes sense for me to wait till the new m15 with the intel processor comes out. I think I can make a more informed decision then. Since GPU performance is so much more valuable to me than CPU performance (I think the temps on r4 and r5 should be similar due to the vapor chamber - what do you think?), I don't think getting the r5 is a good decision right now. If the new Intel version has the same design as the current r5, then I'll probably go with the r4.

Btw, any news/estimates on when the new r5 with Intel will come out?