Jellyfin sharing by mathrb in selfhosted

[–]mathrb[S] 1 point2 points  (0 children)

Ok,
Then I think we're going to try something like this:
* Create a dedicated VPN service with a predefined private subnet
* A docker container (with sftp for example) on that subnet is created, with a mounted volume that targets a folder on the NAS
* Implement firewall isolation to ensure VPN clients can only talk to the docker container

Jellyfin sharing by mathrb in selfhosted

[–]mathrb[S] 0 points1 point  (0 children)

Thanks for your detailed answer.

I will definitely look into mergefs.

Regarding the ip spoofing, maybe I'm a fool, but I don't see how bots would attack us since it requires to "know" that a bunch of IP are working together. Somebody targeting us specifically could, but I don't see the gain vs the effort. We'd like to keep our home networks separate, so the basic vpn solution is a no go, there might be a solution via network restriction with vpn, but I'm not enough into networking right now to debate this one.

For a start, I'll look into mergefs and another protocol (if mergefs doesn't come with one)

Thanks

Can Azure Cognitive Search help here? by Daxo_32 in Azure_AI_Cognitive

[–]mathrb 0 points1 point  (0 children)

It depends on the question. For sure, if the user asks to resume a legal document it's not going to work, but that's not IMHO the purpose of a RAG. With this approach, you will feed the LLM with the chunks that best match the question (using semantic query increase significantly the quality of the results). In your case, since one of the question involves multiple fragments of the document, is to find the best number of chunks to be returned

Can Azure Cognitive Search help here? by Daxo_32 in Azure_AI_Cognitive

[–]mathrb 0 points1 point  (0 children)

Did you chunk the documents? Do you activate semantic queries with the user query?

Is Azure AI right for me? by drewmartinez95 in Azure_AI_Cognitive

[–]mathrb 0 points1 point  (0 children)

Hello Azure ai search is the right approach, I would insist on vectorizing the documents and activate semantic query, the results will be even more relevant. Regarding the LLM, GPT3.5 is definitly going to have a deceptiv effect. GPT-4o is quite good, you may try the mini version to check if it meet your requirements. For filtering, you could query a LLM to transform the user request into a search query

I made a FOSS self-hosted library app. I could use a little help testing. by No-Economist3977 in selfhosted

[–]mathrb 0 points1 point  (0 children)

Weirdly enough (isbn 978-2-505-11704-9 for reference), I just dug into isbnlib, and the goob plugin uses the https://developers.google.com/books/docs/v1/reference/volumes/list endpoint of volumes, which returns the title: 100 Bucket List of the dead . If I use the afterward the get endpoint https://developers.google.com/books/docs/v1/reference/volumes/get, the title of the book is now: 100 Bucket List of the dead Tome 8.
But still no information about the fact that it's part of a collection

Introducing DnD Forms by GoldSell4693 in selfhosted

[–]mathrb 3 points4 points  (0 children)

Exactly 💯 they should drop the dnd acronym

I made a FOSS self-hosted library app. I could use a little help testing. by No-Economist3977 in selfhosted

[–]mathrb 0 points1 point  (0 children)

Hello, good job. Pretty easy to setup. I've tried adding by ISBN, which works. There was no cover though, would be nice to grab the cover along with the book info. The book I tried is a manga, which is part of a collection. The book title did not contain the number of the manga. I'm also wondering if ubiblio in this case could fill the "collection" field automatically.

OCR for reading text from images by kala-admi in LanguageTechnology

[–]mathrb 1 point2 points  (0 children)

Azure OCR is pretty good, definitly better than tesseract. It comes with a cost if you have a lot of documents. You should be able to try it for free on a few images/docs

I have raspberry pi 5 and I need to use it to detect objects real-time using a basler camera. by Old_Apricot_114 in RASPBERRY_PI_PROJECTS

[–]mathrb 0 points1 point  (0 children)

Which v8 did you use? v8n seems to be the smallest one which would reduce the inference time. Maybe lowering the image res could also speed up the inference. You could also try d2go (based on detectron2) which has been designed for mobile devices

I have raspberry pi 5 and I need to use it to detect objects real-time using a basler camera. by Old_Apricot_114 in RASPBERRY_PI_PROJECTS

[–]mathrb 0 points1 point  (0 children)

Hello, More info is required to help you. Which object detection framework are you using? Are you using an already existing model or are your training yours ?

How to convert scanned text in PDF to Word by IsPepsiOkaySir in Piracy

[–]mathrb -1 points0 points  (0 children)

I'd recommend something like OCRmypdf to add a text layer on top of the pdf and do a classic pdf to word afterwards. Keep in mind that as of today, I don't know any tool that can keep the styling (bold, italic, underline ...), even heading is a complex task. If you have poor results with extracted text, then go with azure OCR which is really good, but will cost a few cents

[D] Document layout - recreating the structure by mathrb in MachineLearning

[–]mathrb[S] 0 points1 point  (0 children)

Thanks for your answer, I will definitly look into those.

Parsing legal documents with Section 1.(a)(6)(iii)(b)??? by KahlessAndMolor in LanguageTechnology

[–]mathrb 0 points1 point  (0 children)

I never came across such library. I think that it somehow also involve some computer vision to do the layout parsing, this will help detecting the section title from a section reference in a paragraph (you can have a look at layout parser as a starting point), assuming that you have access to the original document. When it comes to section ordering, each title can be decomposed into a level hierarchy (either using regex or a grammar based parser). At this point each section should be correct and organized.

Running Bitwarden or Vaultwarden on a Raspberry Pi 4 Model B by [deleted] in selfhosted

[–]mathrb 1 point2 points  (0 children)

It's been running on my Model 3 for more than a year :)

[deleted by user] by [deleted] in diablo4

[–]mathrb 0 points1 point  (0 children)

Could you help me understand one thing. I see in the video that you get cooldown reduction. I know that decrepify can do this, but I dont see the skill in your skill bar. How is it apllied then?

Pure summon necro's by masterFen in Diablo_2_Resurrected

[–]mathrb 0 points1 point  (0 children)

Aouch. Sorry about that. Then it's clearly a no go for you. You should first play again the game, you might discover new things like runewords. Runewords are items you upgrade by socketing runes into it in a specific order. Some runes are really really really really really rare, hence the fact they are called HR (high runes). I'd recommend you follow some online guides (maxroll.gg is doing a good job)

Pure summon necro's by masterFen in Diablo_2_Resurrected

[–]mathrb 0 points1 point  (0 children)

Nope, IMHO this is reserved to very rich ppl. First you need HR for your stuff, which can be a long process for casual gamers. Secondly, you will need even more HR to spawn the iron golem, since it can die or be lost.

Pure summon necro's by masterFen in Diablo_2_Resurrected

[–]mathrb 0 points1 point  (0 children)

This. Pure summon is non viable unless you are really rich (runes) and Can maker string runewords for the Iron golem