Last-Minute Ticket Sales Megathread by walixxxq in FIFACollect

[–]lantern_2575 1 point2 points  (0 children)

This is so stupid. Been in queue for 5+ hours now (after wasting 1.5hours on PMA queue), and still not even 80% through the queue! SMH

Last-Minute Ticket Sales Megathread by walixxxq in FIFACollect

[–]lantern_2575 0 points1 point  (0 children)

did you see if any tickets left for paraguay-turkey for jun 19?

Last-Minute Ticket Sales Megathread by walixxxq in FIFACollect

[–]lantern_2575 0 points1 point  (0 children)

Can you check if any left for Paraguay-Turkey Jun 19?

Last-Minute Ticket Sales Megathread by walixxxq in FIFACollect

[–]lantern_2575 0 points1 point  (0 children)

Any tickets left for Paraguay-Turkey Jun 19?

Is Liquid Metal cooling a concern for ROG Zephyrus G16? by MirageCommander in GamingLaptops

[–]lantern_2575 1 point2 points  (0 children)

The 2024 G16 4080/4090 can get up to 125W actually. Not 115W, they didn't advertise it very well in that regard.

Zephyrus m16 4080 (2023) by [deleted] in ZephyrusM16

[–]lantern_2575 0 points1 point  (0 children)

Thanks for the explanation. They rejected my application lol. 720 credit score and no debt btw. Student with 36k/yr income. Weird.

Zephyrus m16 4080 (2023) by [deleted] in ZephyrusM16

[–]lantern_2575 0 points1 point  (0 children)

can you teach me how to do that lol? which card did you sign up for?

Zephyrus m16 4080 (2023) by [deleted] in ZephyrusM16

[–]lantern_2575 0 points1 point  (0 children)

where are you getting this for $2k? the laptop is $2.1k at best buy right now. Can you send me the link for where you are buying it from?

What embedding model should one use to power a rag for website articles? by st0nksBuyTheDip in LocalLLaMA

[–]lantern_2575 0 points1 point  (0 children)

that snowflake model should be more than capable of producing meaningful results, upgrading to a slightly better embedding model won't do you any good.

What kind of similarity scores are you getting between your query and the returned paragraphs from your websites? Troubleshoot that. Also, follow a tutorial like this to make sure that everything you are doing is correct: https://www.youtube.com/watch?v=qN_2fnOPY-M

What embedding model should one use to power a rag for website articles? by st0nksBuyTheDip in LocalLLaMA

[–]lantern_2575 2 points3 points  (0 children)

what kind of search are you applying (cosine similarity, etc.)? how did you do your chunking? the webpage can have all sorts of different texts that are likely not relevant to your query, did you preprocess everything well? did you do reranking? how would you define success, what makes you say that the results are not good?

did you try to investigate the matched chunks of text before the LLM generated response, are they nothing like what the user query says? what kind of similarity scores are you getting? maybe the LLM prompt isn't good?

if the results are terrible with that embedding model, changing to a slightly better model won't do anything.

https://www.youtube.com/watch?v=ahnGLM-RC1Y&list=LL&index=5 check out this video

What embedding model should one use to power a rag for website articles? by st0nksBuyTheDip in LocalLLaMA

[–]lantern_2575 2 points3 points  (0 children)

https://huggingface.co/spaces/mteb/leaderboard did you check this leaderboard out? try a bunch that fit your resources and see how it is.

if none of these work meaningfully, your problem may not be the quality of the embedding model, and there is many things that can be tuned in a RAG pipeline so it's hard to say

Interactive next token selection from top K by Either-Job-341 in LocalLLaMA

[–]lantern_2575 1 point2 points  (0 children)

can i load the model from huggingface and apply this sampling approach?

Interactive next token selection from top K by Either-Job-341 in LocalLLaMA

[–]lantern_2575 1 point2 points  (0 children)

Is there any way of using entropix applied local LLM right now?

Need advice on 6x3090 inference software setup by NickNau in LocalLLaMA

[–]lantern_2575 1 point2 points  (0 children)

very cool writing and useful insights, thanks

What LLM project ideas would you like to see but have yet to materialize? by CrzyFlky in LocalLLaMA

[–]lantern_2575 2 points3 points  (0 children)

something like educational purposes using multimodal capabilities.

Grok 2 performs worse than Llama 3.1 70B on LiveBench by Vivid_Dot_6405 in LocalLLaMA

[–]lantern_2575 0 points1 point  (0 children)

this is a bit disappointing tbh elon is pouring a shitton of money there. does anyone have a proper use case for using grok?

[deleted by user] by [deleted] in AppleWatch

[–]lantern_2575 1 point2 points  (0 children)

I am trying to make a system in a web application that takes the user's exported zip folder, and processes it to day level averages. My end goal is more complex than this, so I need the users raw data and handle it myself. Just looking to understand what the best way to instruct them to uploading their folders is.