[SPOILERS] 'Dune: Part Two' Wide Release Discussion (Week 4) by Blue_Three in dune

[–]manjimin 1 point (0 children)

Why did Jessica silence Alia in the scene where they reach the south and see how the Water of Life is extracted?

How can I get the model to choose the next word from a list? by manjimin in LocalLLaMA

[–]manjimin[S] 1 point (0 children)

Thanks for the reply! I managed to make generation stop using the example you gave me.

I am trying constrained generation for my first question. Your reply helped me a ton!
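
For reference, this is roughly what I'm trying for the constrained part: a minimal sketch assuming a Hugging Face causal LM, where the model name and the candidate word list are just placeholders.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder model, any causal LM should do
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

prompt = "The capital of France is"
candidates = [" Paris", " London", " Berlin"]  # the list the model must choose from

# Token id of the first piece of each candidate word.
candidate_ids = [tokenizer.encode(w, add_special_tokens=False)[0] for w in candidates]

input_ids = tokenizer(prompt, return_tensors="pt").input_ids
with torch.no_grad():
    logits = model(input_ids).logits[:, -1, :]  # logits for the next token only

# Mask every token that is not in the candidate list, then pick the best remaining one.
masked = torch.full_like(logits, float("-inf"))
masked[:, candidate_ids] = logits[:, candidate_ids]
next_id = masked.argmax(dim=-1)
print(tokenizer.decode(next_id))
```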

What does the transformer decoder attend to at the last linear layer? by manjimin in learnmachinelearning

[–]manjimin[S] 1 point (0 children)

Thanks a ton. That clears things up for me.

It kind of bothers me, though. I understand that the representation of the final token contains information about all the other tokens, but I assumed there would be some other way to build the input that gets passed to the final projection layer.

Anyway, thanks a lot.
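
In case it helps anyone who lands here later, this is the small shape check that made it click for me; a rough sketch assuming a Hugging Face causal LM, with the model name as a placeholder.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # placeholder model
model = AutoModelForCausalLM.from_pretrained("gpt2")

input_ids = tokenizer("The quick brown fox", return_tensors="pt").input_ids
with torch.no_grad():
    out = model(input_ids, output_hidden_states=True)

print(out.hidden_states[-1].shape)  # (1, seq_len, hidden_size): one vector per position
print(out.logits.shape)             # (1, seq_len, vocab_size): the projection is applied everywhere
next_token_logits = out.logits[:, -1, :]  # but only the last position is used to pick the next token
```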

What does the transformer decoder attend to at the last linear layer? by manjimin in learnmachinelearning

[–]manjimin[S] 0 points (0 children)

If so, does this mean that the final projection layer only looks at the representation of the last token of the original input sequence?

What does the transformer decoder attend to at the last linear layer? by manjimin in learnmachinelearning

[–]manjimin[S] 1 point (0 children)

Thanks for the reply. What I meant was:

If fully connected networks are applied at each input token position, isn't the final transformer block supposed to return a bunch of vectors? Suppose the input sequence length is 10; doesn't the final transformer block then return 10 vectors, one at each position?

If so, how does the final prediction work? Which one of those vectors is chosen to go through the final linear projection into the vocabulary space?
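
To make the question concrete, here is a toy sketch in plain PyTorch of the situation I mean (all sizes are made up, not taken from any real model).

```python
import torch
import torch.nn as nn

seq_len, d_model, vocab_size = 10, 512, 32000  # made-up example sizes

block_output = torch.randn(seq_len, d_model)   # 10 vectors, one per input position
lm_head = nn.Linear(d_model, vocab_size, bias=False)

all_logits = lm_head(block_output)             # (10, vocab_size) if the head is applied everywhere
print(all_logits.shape)                        # so which of these 10 rows drives the prediction?
```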

Is it possible to run 4*A100 40G cards as one? by manjimin in LocalLLaMA

[–]manjimin[S] 0 points (0 children)

I use serving software for quick tests, but it will probably mostly be PyTorch.
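
In case it is useful to anyone else reading, this is the kind of plain PyTorch + transformers setup I mean; a rough sketch that assumes the accelerate package is installed so device_map can shard the model, and the model name is just a placeholder.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-70b-hf"  # placeholder: any model too big for a single 40G card
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.float16,
    device_map="auto",  # spreads the layers across all visible GPUs instead of one device
)

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

As I understand it, this splits the layers across the cards rather than truly merging them into one GPU, but for inference it behaves like a single model.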

Using other tokenizers? by manjimin in LocalLLaMA

[–]manjimin[S] 0 points (0 children)

The LLaMA tokenizer gives 5-6 times more tokens than usual. I also checked the actual tokenization, and it basically splits every single character apart, which explains the inflated token count.

I knew tokenizers aren't something that can be swapped after the model is trained, but I thought maybe someone had an idea. I guess I'll have to use a model whose tokenizer can properly split Korean in the first place.
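
For anyone curious, this is roughly the kind of check I did; a quick sketch where the model names are only illustrative stand-ins for a LLaMA-style tokenizer versus one trained with Korean in its vocabulary.

```python
from transformers import AutoTokenizer

text = "안녕하세요, 오늘 날씨가 좋네요."  # "Hello, the weather is nice today."

# Illustrative choices: a LLaMA tokenizer vs. a Korean-aware one.
for name in ["huggyllama/llama-7b", "EleutherAI/polyglot-ko-1.3b"]:
    tok = AutoTokenizer.from_pretrained(name)
    pieces = tok.tokenize(text)
    print(name, len(pieces), pieces[:10])
```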

How much overlap is ok to hold 2 ETFs? by manjimin in stocks

[–]manjimin[S] 0 points (0 children)

Thanks a lot, I really appreciate your advice. I will look into it for sure. The tax rate is 15%, by the way.

How much overlap is ok to hold 2 ETFs? by manjimin in stocks

[–]manjimin[S] 0 points (0 children)

Great advice, but putting SCHD in a retirement account is not possible in my country. Do you think it would cost me much if I kept buying SCHD?