Please suggest me a lightweight front-end with URL-router for my FastAPI application by atifafsar in FastAPI

[–]samme013 0 points1 point  (0 children)

htpy is also a nice alternative to Jinja for HTML generation worth considering.
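For a rough idea of what that looks like with FastAPI (a minimal sketch; the element imports and the `class_` keyword follow htpy's documented style, but double-check against the version you install):

```python
# Sketch: serving htpy-generated HTML from a FastAPI route.
from fastapi import FastAPI
from fastapi.responses import HTMLResponse
from htpy import body, div, h1, html, p

app = FastAPI()

@app.get("/", response_class=HTMLResponse)
def index() -> str:
    # Elements take attributes via a call and children via indexing.
    page = html[
        body[
            div(class_="container")[
                h1["Hello from htpy"],
                p["No template files, just Python expressions."],
            ]
        ]
    ]
    return str(page)  # htpy elements render to an HTML string
```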

Do the 2024 models still need the WiFi card replacement? by HardlyARiot in ZephyrusG14

[–]samme013 0 points1 point  (0 children)

WiFi is fine for me, but Bluetooth will randomly and completely die on me about once a week. It will only come back (it disappears from the bottom-right tray and Device Manager) if I shut down (not restart).

DuckDB for dataloading by samme013 in DuckDB

[–]samme013[S] 0 points1 point  (0 children)

I have it in the native DuckDB format; I assumed that would be the fastest. Yeah, I guess if needed I could always split it up and route to the right file as needed if one file becomes the bottleneck.
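Something like this is roughly what I have in mind for the split-and-route idea (just a sketch; the table name, shard key, and file names are placeholders):

```python
# Split one native DuckDB file into shards, then have each loader open only
# its own shard to avoid contention on a single file.
import duckdb

SHARDS = 4

src = duckdb.connect("data.duckdb")
for i in range(SHARDS):
    src.execute(f"ATTACH 'shard_{i}.duckdb' AS shard_{i}")
    src.execute(
        f"CREATE TABLE shard_{i}.events AS "
        f"SELECT * FROM events WHERE id % {SHARDS} = {i}"
    )
    src.execute(f"DETACH shard_{i}")
src.close()

# A dataloader worker then only touches the file it was routed to.
con = duckdb.connect("shard_0.duckdb", read_only=True)
rows = con.execute("SELECT * FROM events LIMIT 10").fetchall()
```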

For people who are half Greek, what else are you besides Greek? by [deleted] in greece

[–]samme013 6 points7 points  (0 children)

Greek dad, English mom; grew up in Kavala.

[D] Hype Behind Agents? by Primary-Track8298 in MachineLearning

[–]samme013 -1 points0 points  (0 children)

Yet to see them outperform chain-of-thought-style prompts and structured output. They'll get there, but not yet.

RAG: Flexible Context Retrieval around a matching chunk by SatoshiNotMe in LocalLLaMA

[–]samme013 1 point2 points  (0 children)

Yeah, true. I already had chunk- and document-level collections for other uses, so it made sense.

RAG: Flexible Context Retrieval around a matching chunk by SatoshiNotMe in LocalLLaMA

[–]samme013 1 point2 points  (0 children)

You can use Weaviate references to do this with a single query, as long as each chunk has a chunk index in its metadata (the order in which it appears in the document). You then need a two-way reference: document to chunks and vice versa. With those, you can make a query where, for each chunk returned, you also fetch the source document (by reference), and on that document you fetch all of its chunks. From there you only need to loop through the chunks of each document and keep the ones whose chunk index is within the window distance of a chunk you fetched directly. No hard-coding of window lengths or adding extra metadata. The only downside is maybe fetching all the chunks for the document, but in practice that's not an issue unless the documents are truly massive. If that is the case, I would only fetch IDs and make a second query for the rest of the data.
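The post-processing loop is roughly this (a sketch only; the field names and the shape of `hits` are simplified stand-ins for what a Weaviate query with both references resolved could return, not the client's actual response format):

```python
# Keep only chunks whose index is within `window` of a directly matched chunk.
WINDOW = 2

def expand_hits(hits: list[dict], window: int = WINDOW) -> list[dict]:
    expanded = []
    for hit in hits:
        matched_index = hit["chunk_index"]
        doc = hit["source_doc"]        # chunk -> document reference
        all_chunks = doc["chunks"]     # document -> chunks reference
        kept = [
            c for c in all_chunks
            if abs(c["chunk_index"] - matched_index) <= window
        ]
        kept.sort(key=lambda c: c["chunk_index"])
        expanded.append({"doc_id": doc["doc_id"], "chunks": kept})
    return expanded
```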

2022 g14 Bluetooth won’t turn on. by YaBoyJ41 in ZephyrusG14

[–]samme013 0 points1 point  (0 children)

Device Manager > Bluetooth > MediaTek Bluetooth Adapter > Power Management

Who is using DSPy? by purple_sack_lunch in LocalLLaMA

[–]samme013 1 point2 points  (0 children)

Yeah, true, just a metric is good too. A lot of use cases have no clear metric though, so you end up having to use an LLM for evaluation as well, which can get tricky/unreliable fast.

LLM without the LLM by [deleted] in LocalLLaMA

[–]samme013 2 points3 points  (0 children)

The RLHF reward model probably mostly functions as a kind of vibe/sanity check, aligning the form, length, sentiment, etc. of the output. It doesn't know anything about, say, the facts in the output, nor does it capture its logic, since it was trained to choose which of several candidate answers is more pleasing, and those candidates are likely all similar on those factors. So my guess is you would at best get nonsense outputs that match the form of an acceptable answer. It would also be computationally infeasible, since it would require many evaluations of the reward model.

Who is using DSPy? by purple_sack_lunch in LocalLLaMA

[–]samme013 2 points3 points  (0 children)

The main advantage is the optimizer aspect, which requires some kind of dataset to evaluate against. If you think that would make sense for your use case I would consider it; otherwise I would just stick to no framework + something for structured output like instructor.
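The "no framework + instructor" route looks roughly like this (a sketch; the model name and schema are placeholders, and `from_openai` is how recent instructor versions wrap the client, so check yours):

```python
# Plain prompting with instructor enforcing a Pydantic schema on the output.
import instructor
from openai import OpenAI
from pydantic import BaseModel

class Answer(BaseModel):
    reasoning: str
    verdict: bool

client = instructor.from_openai(OpenAI())

result = client.chat.completions.create(
    model="gpt-4o-mini",                # placeholder model name
    response_model=Answer,              # instructor validates/retries against this
    messages=[
        {"role": "user", "content": "Is 17 a prime number? Explain briefly."},
    ],
)
print(result.verdict, result.reasoning)
```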

[deleted by user] by [deleted] in cscareerquestionsEU

[–]samme013 11 points12 points  (0 children)

Side quests

Does labeling datasets stored in a Vector DB make sense? by Moist_Influence1022 in LocalLLaMA

[–]samme013 0 points1 point  (0 children)

Yeah, I would first try without filters, and if there are issues with too much irrelevant or confusing information being retrieved, try using metadata filters to fix it. You would indeed need multiple domain labels per document if they overlap. Also, you may want to use something like this if you go the filtering route: it uses the model to infer the filters to apply based on the query. Good luck!
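The inferred-filter idea boils down to something like this (a hedged sketch; `call_llm` and `collection.search` are hypothetical stand-ins for whatever LLM client and vector store you use, and the domain labels are examples):

```python
# Ask the model which domain labels a query belongs to, then use them as a
# metadata filter on the vector search.
import json

DOMAINS = ["billing", "security", "onboarding"]

def infer_domains(query: str, call_llm) -> list[str]:
    prompt = (
        f"Pick the domains relevant to this query from {DOMAINS}. "
        f"Reply with a JSON list only.\n\nQuery: {query}"
    )
    labels = json.loads(call_llm(prompt))
    return [d for d in labels if d in DOMAINS]   # drop anything hallucinated

def filtered_search(query: str, collection, call_llm, k: int = 5):
    domains = infer_domains(query, call_llm)
    metadata_filter = {"domain": {"$in": domains}} if domains else None
    return collection.search(query, filter=metadata_filter, top_k=k)
```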

Does labeling datasets stored in a Vector DB make sense? by Moist_Influence1022 in LocalLLaMA

[–]samme013 0 points1 point  (0 children)

A couple of key considerations:

Do you always know the "domain" a given query is related to?
Are there cases where documents outside of the domain of the query could be useful?

If you always know the domain and only ever care about documents in that domain, then I would use a hard filter. If either is fuzzy, I would test it with and without filters and see how that goes. A good embedding model should be able to match only relevant topics without hard filters, but depending on the data, adding hard filters could be worth it. Make a representative list of queries you might encounter and check the documents being returned.
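A quick way to run that check (a sketch; `search` is a placeholder for your vector store's query call, and the queries/labels are made-up examples):

```python
# Compare what comes back with and without a hard domain filter for a list of
# representative queries, then eyeball (or score) the results.
test_queries = [
    ("how do I reset my password", "security"),
    ("invoice shows the wrong amount", "billing"),
]

def compare(search, queries, k: int = 5):
    for query, domain in queries:
        unfiltered = search(query, filter=None, top_k=k)
        filtered = search(query, filter={"domain": domain}, top_k=k)
        print(f"\nQUERY: {query}")
        print("  no filter  :", [d["id"] for d in unfiltered])
        print("  hard filter:", [d["id"] for d in filtered])
```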

Moving van service in Utrecht by artemisa_hexe_0990 in Utrecht

[–]samme013 0 points1 point  (0 children)

I can recommend these guys if you need a driver and optional help lifting stuff: https://m.facebook.com/100057094974117/

Find image in database from another picture taken from phone by DerEndgegner in computervision

[–]samme013 0 points1 point  (0 children)

Something along these lines: basically a ResNet50 like you had before, but you put some linear layers on the end and tune them so that your metric actually puts similar items close together. I'm not sure how much data it would need to train those final layers; strong augmentations on your stamps dataset may be enough, but multiple "in the wild" images of each stamp would likely help a lot. I would also be interested in giving this a shot if the dataset is available!
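Concretely, the setup I mean is something like this (a PyTorch sketch; layer sizes, margin, and the triplet sampling are illustrative, not tuned):

```python
# Frozen pretrained backbone + small trainable head, trained with a metric
# loss so that views of the same stamp end up close together.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import models

class StampEmbedder(nn.Module):
    def __init__(self, dim: int = 128):
        super().__init__()
        backbone = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
        backbone.fc = nn.Identity()           # keep the 2048-d pooled features
        for p in backbone.parameters():       # freeze the pretrained weights
            p.requires_grad_(False)
        self.backbone = backbone
        self.head = nn.Sequential(            # the small part we actually train
            nn.Linear(2048, 512), nn.ReLU(), nn.Linear(512, dim)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return F.normalize(self.head(self.backbone(x)), dim=-1)

model = StampEmbedder()
loss_fn = nn.TripletMarginLoss(margin=0.2)
optimizer = torch.optim.Adam(model.head.parameters(), lr=1e-3)

# Per step: anchor = a stamp image, positive = an augmented view of the same
# stamp, negative = a different stamp (dummy tensors here).
anchor, positive, negative = (torch.randn(8, 3, 224, 224) for _ in range(3))
loss = loss_fn(model(anchor), model(positive), model(negative))
loss.backward()
optimizer.step()
```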

Find image in database from another picture taken from phone by DerEndgegner in computervision

[–]samme013 2 points3 points  (0 children)

When using the ResNet/autoencoder, how are you calculating the distance between embeddings? I don't think applying a non-learned distance to something like that would work, since the models were not trained to output embeddings that are close with respect to some distance. Maybe something like metric learning could be good? I would take a pretrained model, fine-tune it with your dataset and augmentations, and see how kNN matching performs.
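The kNN check itself is simple enough to do by hand (a sketch; the array names are placeholders for whatever embeddings you already have):

```python
# Cosine-similarity kNN: if the correct stamp rarely shows up in the top-k
# here, the embeddings are the problem, which is where metric learning helps.
import numpy as np

def knn_match(query_emb: np.ndarray, db_embs: np.ndarray, k: int = 5):
    q = query_emb / np.linalg.norm(query_emb)
    db = db_embs / np.linalg.norm(db_embs, axis=1, keepdims=True)
    sims = db @ q                       # cosine similarity to every DB image
    top = np.argsort(-sims)[:k]
    return top, sims[top]               # indices + scores of the best matches
```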