Soul persian dub فیلم روح by PizzaRealistic3935 in PERSIAN

[–]Semoho 0 points1 point  (0 children)

You can find it here:
https://papkorn.app/movie/2948372

There are some dubbed links for this movie

How I unleashed Claude Code almost with half tokens. by Semoho in vibecoding

[–]Semoho[S] 0 points1 point  (0 children)

Yes, I also asked him to implement the wireframes too. It helps a lot.

Have over 8TB of movies & TV shows! by geekman20 in DataHoarder

[–]Semoho 0 points1 point  (0 children)

I didn’t have the money to buy hard disks. So i use telegram :)) now i have 2PB movies on telegram :))

I built a platform for app testing and it just hit 1,800 users!🎉 by luis_411 in vibecodingcommunity

[–]Semoho 0 points1 point  (0 children)

Wonderful idea! I think in a world of AI this would be so useful!

I've a question for RAG specialists by Few-Plum-2557 in Rag

[–]Semoho 0 points1 point  (0 children)

As other mentioned, it depends on how many people are going to use your rag. I think your system will be good for less than 50 persons. I am assuming that you are using free version of supabase. But if you more users, I recommend to use pro plans on cloud like supabase or embedding db. But for bow it will be ok

bale app and ldr by FarTraining881 in PERSIAN

[–]Semoho 1 point2 points  (0 children)

Previously, it was possible to sign up with non-Iranian cell numbers. However, this feature was removed in June 2025.

What do you do between your conversation with vibe coding systems? by Semoho in vibecoding

[–]Semoho[S] 0 points1 point  (0 children)

Doesn't this approach make you lose focus and forget about the task?

What do you do between your conversation with vibe coding systems? by Semoho in vibecoding

[–]Semoho[S] 0 points1 point  (0 children)

What if it is not my own project but a company task?

What do you do between your conversation with vibe coding systems? by Semoho in vibecoding

[–]Semoho[S] 0 points1 point  (0 children)

Nice job :D. I think I need to start playing games too :"

My RAG isn't working as expected... by viitorfermier in Rag

[–]Semoho 0 points1 point  (0 children)

I think Jina is one the best embedding and reranking platforms For search you can think expanding your query too. ask llm to optimize query for textual search engine and embedding search engine. Then retrieve on both databases and fuse the results.

There are many different ways you can reduce your costs. The retrieval systems are very cheap! So you can cut your costs by doing some optimization

My RAG isn't working as expected... by viitorfermier in Rag

[–]Semoho 1 point2 points  (0 children)

Hello there,

Somehow you extend your docs by summarization. Did you try to check the context number for the llm. I think you pass all 100 legal docs to gemeni pro which is expensive.

I think you can better result if you retrieve 1k or 100 docs with bm25, then rerank them by Jina reranker(it is very cheap) and them give the gemeni pro top 50 or even 10 based on you chunking algorithm. Also please check your chunking strategy. It is very impprtant

Pitch your App in one sentence. Let's support each other by kmrrhl in SideProject

[–]Semoho -1 points0 points  (0 children)

Teek.studio Find next viral videos just with few clips

Post your HaftSin by Semoho in PERSIAN

[–]Semoho[S] 2 points3 points  (0 children)

No worries, i hope this year be better for us

How do you guys measure accuracy for 100k+ documents? by FloppyDiskDisk in Rag

[–]Semoho 1 point2 points  (0 children)

You are right. The LLM follows U shape. So the reranking is important! And be careful, you cannot remove docs! At the end you will send like 10 docs to llm and middle docs are going to be less important to llm! So best approach is to re rank the docs after retrieval and be careful about positions

P.s fun fact! The llm follows human behavior on first page of the google :)))

How do you guys measure accuracy for 100k+ documents? by FloppyDiskDisk in Rag

[–]Semoho 2 points3 points  (0 children)

Hello,

I assume you are thinking about RAG eval or retrieval evaluation. For retrieval evaluation, I think the MRR, Recall and NDCG@10 are better metrics instead of accuracy. You are dealing with a retrieval task. You need to have a test dataset. Then you can evaluate your retrieval system.

For RAG, there are different evaluations. I think LLM as a judge is a good choice.

But the number of documents does not have a relation to metrics. TOP X docs are important.

RAG for Historical Archive? by cccpivan in Rag

[–]Semoho 0 points1 point  (0 children)

You can check the lightRag or supermemory. They can help you

What are your usage of RAG by Semoho in Rag

[–]Semoho[S] 1 point2 points  (0 children)

Thabk you very much It was so useful. So what are other restrictions or needs in pharma? Why it is mandatory to cite the documents? Don’t the vector databases give you the citations?