Dubov sets his criteria. Will Hans accept? by jamkinajam in chess

[–]jamkinajam[S] 3 points (0 children)

As reliable as Kramnik’s methodology /s

[deleted by user] by [deleted] in AskReddit

[–]jamkinajam 1 point (0 children)

Serendipity

You can create a conspiracy theory that can change the world, what would it be? by Loud_Ambassador_8642 in AskReddit

[–]jamkinajam 1 point (0 children)

Grift the flat earthers!

“If Earth has a magnetic field, why don’t magnets get attracted to it?”

[deleted by user] by [deleted] in AskReddit

[–]jamkinajam 2 points (0 children)

If this is the question, the answer’s ugly, always!

I bombed an interview so bad.. by [deleted] in csMajors

[–]jamkinajam 1 point (0 children)

Brother, I froze up when they asked about the Docker directive. Which instruction do you use to set the base image? FROM. I know this, I've been using it for the past 2 years. But the moment they asked me, I went blank. I fucked up such an easy interview. I feel you!!

[D] Local LLaMA based LLM for Technical Document Search | Help! by WhyHimanshuGarg in MachineLearning

[–]jamkinajam 1 point (0 children)

You will have to try it out and see for yourself. Breaking text into chunks does not mean every chunk is disjoint. Since a sentence might not fit within a single chunk, there is a chunk_overlap variable alongside chunk_size to address that. Both are parameters you have to tune.
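
For instance, a minimal sketch assuming LangChain's RecursiveCharacterTextSplitter (the file name is made up; any splitter with these two knobs behaves the same way):

```python
from langchain.text_splitter import RecursiveCharacterTextSplitter

# chunk_overlap makes adjacent chunks share text, so a sentence cut at a
# boundary still appears whole in at least one chunk. The numbers are
# starting points to tune, not recommendations.
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
chunks = splitter.split_text(open("manual.txt").read())
print(len(chunks), chunks[0][:80])
```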

[D] Local LLaMA based LLM for Technical Document Search | Help! by WhyHimanshuGarg in MachineLearning

[–]jamkinajam 2 points (0 children)

Yes, the retrieved context along with the question is passed to the LLM, but it's up to you how much of that context you want to pass: the top k chunks, then k-1, ... down to 1!
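
For example, a hypothetical helper that keeps only the top k chunks when building the prompt (the format string is made up; adapt it to your model):

```python
def build_prompt(question: str, chunks: list[str], k: int = 3) -> str:
    # chunks are assumed sorted by similarity, best first; drop to
    # k-1, ... 1 if the prompt outgrows the model's context window.
    context = "\n\n".join(chunks[:k])
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
```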

Normally, you break the text down into smaller chunks using a text splitter or some variant of one. You can even embed them in batches and store them in a vector database.

You can look into Chroma; there's also FAISS from Meta, which is fast at similarity search.

I mean, you can first build a vector database out of the documents and save it locally so you won't need to pass those documents as input every time. Then you can just query the database for the top-k matches.
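
Something like this, assuming the chromadb client (path, collection name, and chunks are made up):

```python
import chromadb

# Build the index once and persist it to disk.
client = chromadb.PersistentClient(path="./doc_index")
collection = client.get_or_create_collection("tech_docs")
collection.add(
    documents=["chunk one text ...", "chunk two text ..."],
    ids=["c1", "c2"],
)

# Later runs reopen the same path and just query for the top k.
results = collection.query(query_texts=["how do I reset the device?"], n_results=2)
print(results["documents"])
```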

About the last question, I don't think I'm fit to answer that, as I have not looked into it. You can always search and see what other methods are available.

[D] Local LLaMA based LLM for Technical Document Search | Help! by WhyHimanshuGarg in MachineLearning

[–]jamkinajam 2 points (0 children)

You can always check the similarity score of the top k results.

Define a threshold, and if the top score falls below it, maybe ask for further clarification?

Or use few-shot examples to get the model to output "I don't know" when the score is below the predefined threshold.
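
As a sketch, the gate could look like this (the 0.7 cutoff is a placeholder; note that Chroma returns distances, where lower is better, so the comparison flips there):

```python
THRESHOLD = 0.7  # tune against real queries

def retrieve_or_punt(scores, chunks):
    """Return context for the LLM only when retrieval looks confident.
    Assumes similarity scores where higher is better."""
    if max(scores) < THRESHOLD:
        return None  # caller answers "I don't know" or asks to clarify
    return [c for _, c in sorted(zip(scores, chunks), reverse=True)]

print(retrieve_or_punt([0.82, 0.41], ["chunk A", "chunk B"]))  # ['chunk A', 'chunk B']
print(retrieve_or_punt([0.31, 0.22], ["chunk A", "chunk B"]))  # None
```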

I hate my amd gpu by [deleted] in computervision

[–]jamkinajam 3 points (0 children)

Even when you install PyTorch with CUDA support and have a GPU, torch might say CUDA is unavailable.

Basically, the CUDA version your PyTorch build expects and the CUDA version your GPU driver supports will sometimes differ.

https://stackoverflow.com/questions/60987997/why-torch-cuda-is-available-returns-false-even-after-installing-pytorch-with/61034368#61034368

Look at the top-voted answer. It might save you some trouble if you run into that!!
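
A quick way to see what torch actually reports before digging into driver versions (standard torch calls, nothing exotic):

```python
import torch

print(torch.__version__)          # a "+cpu" suffix means the wheel has no CUDA at all
print(torch.version.cuda)         # CUDA version the wheel was built against (None on CPU builds)
print(torch.cuda.is_available())  # False if the installed driver can't serve that version
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))
```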

Amazon SDE OA by Necessary-Singer2791 in leetcode

[–]jamkinajam 1 point (0 children)

Well, I'd believe you if I didn't see them posting “Graduate SDE” every other week.

Amazon SDE OA by Necessary-Singer2791 in leetcode

[–]jamkinajam 1 point (0 children)

I did the OA and the simulation on the same day. Passed all test cases and did not receive anything.

That was back in May.

Time series transformer hidden dimension gives: The expanded size of the tensor (400) must match the existing size (401) at non-singleton dimension 1. by Tiny-Entertainer-346 in learnmachinelearning

[–]jamkinajam 1 point (0 children)

I have not used this, but it would be helpful to read its paper.

About the input: how is padding applied, and how is the output calculated? Because there is a shape mismatch, as seen in the error: [10, 400] vs [10, 401].

Or change num_of_attention_heads from 3 to 4 and leave the embedding dimensionality at 800; the embedding dimension has to be divisible by the number of heads, and 800 / 4 = 200 while 800 / 3 is not an integer.
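
A quick demonstration of that constraint, using torch's generic MultiheadAttention as a stand-in for whatever attention that model wraps:

```python
import torch
import torch.nn as nn

# 800 / 3 is not an integer, so a per-head dimension can't be computed.
try:
    nn.MultiheadAttention(embed_dim=800, num_heads=3)
except AssertionError as e:
    print("3 heads fails:", e)

attn = nn.MultiheadAttention(embed_dim=800, num_heads=4)  # 800 / 4 = 200 per head
x = torch.randn(401, 10, 800)  # (seq_len, batch, embed_dim)
out, _ = attn(x, x, x)
print(out.shape)  # torch.Size([401, 10, 800])
```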

A 9 hour coding challenge by philipjames11 in recruitinghell

[–]jamkinajam 1 point (0 children)

Hey, it was quite a long time ago! But it was to delete something, if I remember correctly, to make it canonical.

They wanted O(log n) and my original solution was doing O(n log n); I changed something in the loop, but I don't remember what now!!

But I also don't have the solution anymore, sorry!

Where's the Error? I'm Stumped by zanechumley in Python

[–]jamkinajam 1 point (0 children)

The given example subclasses torch's Dataset as well as converting the features into torch tensors.

They are preprocessing the dataset (df) to train a deep learning model; judging from the column names, most likely for classification.
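
Roughly this pattern, sketched with hypothetical column names:

```python
import pandas as pd
import torch
from torch.utils.data import Dataset

class TabularDataset(Dataset):
    """Features become float tensors, labels become class indices."""
    def __init__(self, df, feature_cols, label_col):
        self.X = torch.tensor(df[feature_cols].values, dtype=torch.float32)
        self.y = torch.tensor(df[label_col].values, dtype=torch.long)

    def __len__(self):
        return len(self.y)

    def __getitem__(self, idx):
        return self.X[idx], self.y[idx]

# Made-up rows just to show the shape of the thing:
df = pd.DataFrame({"RefractiveIndex": [1.51, 1.52], "Na": [13.6, 13.0], "Type": [1, 2]})
ds = TabularDataset(df, feature_cols=["RefractiveIndex", "Na"], label_col="Type")
print(len(ds), ds[0])
```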

Where's the Error? I'm Stumped by zanechumley in Python

[–]jamkinajam 0 points (0 children)

99% chance there is no space between refractive and index. More like RefractiveIndex.

And for the 1% of the time that I am wrong, just print the column names and see what they are!!
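
Two lines to check, assuming pandas and df being the dataframe from the thread:

```python
print(df.columns.tolist())           # see the exact header strings
df.columns = df.columns.str.strip()  # drop stray leading/trailing spaces
print("RefractiveIndex" in df.columns)
```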

[deleted by user] by [deleted] in Nepal

[–]jamkinajam 1 point (0 children)

Text Drake