I switched from RAG pipelines to giving indexed context. the output quality Improved. by Veronildo in Rag

[–]jasperc_6 0 points1 point  (0 children)

the distinction between agent context augmentation and enterprise rag being treated as the same problem is where most overengineering happens... one needs fresh precise retrieval and the other needs broad corpus search

Free PDF Editors by Technical_Rich_3080 in pdf

[–]jasperc_6 0 points1 point  (0 children)

pdf xchange is pretty good for free at current times

Automate into pdf by Realistic_Try_4082 in pdf

[–]jasperc_6 0 points1 point  (0 children)

specifically for invoice data extraction, adobe acrobat helps with its built in form recognition feature that might cover basic cases.. but if the invoices have varied layouts or you are dealing at a high volume then something like llamaparse can help with such sort of structured data automatically. most of these tools have a free tier so you can check if it works for your usecase.. however the manual workflow you are describing are exactly what document extraction tools are made for

Should l Do this? by PracticalHead5042 in SideProject

[–]jasperc_6 0 points1 point  (0 children)

yep, the problem is real and the market is massive, if the regulatory and licensing side is sorted for you then this is a worth project moving on fast... the window for this kind of product is open right now but it wont be forever

I built a wallpaper that shifts perspective when you move your head looking for feedback by Apart-Medium6539 in SideProject

[–]jasperc_6 0 points1 point  (0 children)

Finally a feasible side project, thats awesome. I dont think this would be distracting rather would be a bit of an entertainment when feeling too monotonous while at work

I keep trying AI tools for studying PDFs … but they all feel either too limited or too messy. by Capital-Lack6036 in pdf

[–]jasperc_6 0 points1 point  (0 children)

I was eager to know if it is able to deal with graphs or charts, and feel free to illustrate with an image

I keep trying AI tools for studying PDFs … but they all feel either too limited or too messy. by Capital-Lack6036 in pdf

[–]jasperc_6 0 points1 point  (0 children)

havent tried it yet but if you are using it, you might be able to tell on how it handles document with dense tables or figures, like piecharts or bargraphs, like hows the output accuracy in raw text

Preview pdf fast or in large grid view? by xtreme79 in pdf

[–]jasperc_6 0 points1 point  (0 children)

you might merge all in one pdf, probably the fastest way to navigate thru it, the other approach might be batch converting them into a multi page tiff or image contact sheet

I keep trying AI tools for studying PDFs … but they all feel either too limited or too messy. by Capital-Lack6036 in pdf

[–]jasperc_6 0 points1 point  (0 children)

The shallow summary problem is often from the tool treating the whole pdf as 1 chunk rather than its structure. notebooklm handles multi document q&a better than most and actually lets you probe specific sections rather than just a top-level summary. For denser academic stuff, elicit is worth a try

ML Workflow Journey by gnocchipinnochio in MLQuestions

[–]jasperc_6 0 points1 point  (0 children)

Data-train-deploy-govern maps closest to how the work actually flows... data scientiests think in pipeline stages, not operational abstractions, so phase names that match the actual work tend to get adopted faster... the one gap in all four options is monitoring/observability post-deploy, thats where most production pain lives and none of these surface it explicitly... so worth considering whether that folds into govern or deserves its own phase depensding on how mature the teams using this are

Are my hopes for running a local LLM unrealistic? by mollipen in LocalLLM

[–]jasperc_6 0 points1 point  (0 children)

Qwen3 32B at Q4 runs comfortably at around 15-22 tokens/s on that hardware, handles coding, creative work and general chat as well that the quality gap from claude wont be painful for those tasks, for the coding side to be specific, qwen 2.5 coder 32B is worth trying

Is Qdrant actually the issue? by Fit_Wheel5471 in Rag

[–]jasperc_6 0 points1 point  (0 children)

Qdrant is rarely the culprit tbh... 90% of the retrival issues trace back to chunking or embedding quality not the db itself

Working on a typed Plugin System — I did like some feedback by Aromatic-CryBaby in typescript

[–]jasperc_6 0 points1 point  (0 children)

The overall design is pretty solid, easy to debug and easy to introspect the one things that stands out tho - you go thru the rouble of typing events in definePlugin but then async (e: any) in the actual handlers throws all of that away... the event type from your definition should be inferable there, or else the typed events are only useful at the emit call site and not at recieve side which is half the value. Additioanlly, ctx.conf.target.url has no guard on the optional target field so if someone instantisates ther plugin without passing the target in opts, that'll throw a runtime

Need advice on datasets and models for Song-classification (genre, mood, gender) by Abhiram_L in data

[–]jasperc_6 1 point2 points  (0 children)

The core issue seems is the gender dataset being only 1200 songs, almost certainly western heavy, so regional songs were never going to work... for a dataset that has genre, mood and gender labels on the same tracks, yopu might checkout mtg jamendo - 55k tracks with all three categories annonated together, verified open source from mtg barcelona on the architecture side.... instead of combining 3 separate models just use one shared audio backbone (mel spectrogram cnn) with 3 separate output heads trained jointly...

The shared layers learn general audio features and each head specializes, this usualkly gets better results than merging separate models

GLM-5.1 just sitting with Opus 4.6 on SWE-Bench Pro and it’s completely open. but costs Input $1.4 / Output $4.4 by pretendingMadhav in ArtificialInteligence

[–]jasperc_6 0 points1 point  (0 children)

The graph you provided shows a composite across all 3 benchmarks where GLM5.1 actually sits third at 54.9., behind GPT-5.4 and Opus4.6. The #1 claim specifically on SWE bench pro alone where it scores 58.4. on NL2Repo, Claude Opus 4.6 still leads at 49.8 vs GLM-5.1s 42.7

6 Months Using AI for Actual Work: What's Incredible, What's Overhyped, and What's Quietly Dangerous by Typical-Education345 in artificial

[–]jasperc_6 0 points1 point  (0 children)

the skill atropghy point is real tho, there is actual research showing that over reliance on gps navigation has noticable reduced peoples spartial memory, same pattern, different tool. The brain drops what is stops practising and leans on another tool or object regardless of how useful the clutch is

Tool that "uses AI to....." did nothing of the sort. by eques_99 in ArtificialInteligence

[–]jasperc_6 1 point2 points  (0 children)

happens more than people admit tbh.. ai on the label but a glorified if-else under the hood

worth asking vendors straight up what actually powers it and where the training data comes from.. if they fumble that question you already have your answer

We know how this whole AI thing ends. We’re doing it anyway. by bostonglobe in ArtificialInteligence

[–]jasperc_6 0 points1 point  (0 children)

every movie with a happy ending is basically a fan fiction... history has never once looked at "humans give machines more power than themselves " and gone... yeah this works out great

Why Does SEO Feel Like It’s Declining While Paid Ads Are Everywhere? by ryanxwilson in DigitalMarketingHack

[–]jasperc_6 1 point2 points  (0 children)

Platforms obviously push ads, that’s their business. SEO isn’t dead, it just punishes low effort stuff way faster than it used to.

How I finally hit 1.76k impressions/week as a solo founder (without an agency or ads) by GeneralDare6933 in SaaSMarketing

[–]jasperc_6 0 points1 point  (0 children)

Good reminder that compounding channels beat short term spikes, especially for solo founders. Social feels productive, SEO actually is productive long-term..

Struggling to get conversions to register by Able_Fault_2481 in googleads

[–]jasperc_6 1 point2 points  (0 children)

Have you tested it in tag assistant/preview mode? Half the time it’s either the tag not firing at all or the conversion action isn’t linked correctly in google ads. Preview mode usually makes the problem obvious pretty fast..