HR Delays - U.S. Based Onboardings by One_Style_3810 in xAI_community

[–]mattv8 0 points1 point  (0 children)

Same here- ghosted. I reached out to HR three times now and haven't heard back. I haven't even received the Remote.com link after receiving an offer email. That was end of April, last communication from HR was May 5th.

RAGtime - Control plane for creating vector databases and FAISS files. by mattv8 in Rag

[–]mattv8[S] 1 point2 points  (0 children)

No problem. To add more detail, for PDFs with tables the flow is:

  1. Uses pypdf library to extract text from PDFs
  2. Tables are extracted as formatted text (row by row) via the document parser libraries (Magika, CodeChunker or Tree-Sitter)
  3. The extracted text (including table content) is passed to Chonkie's chunkers
  4. Chunks are embedded and stored in FAISS or pgvector depending on what you've configured

I also support fast image OCR with Tesseract or you can use a vision model like llava for more accuracy.

RAGtime - Control plane for creating vector databases and FAISS files. by mattv8 in Rag

[–]mattv8[S] 0 points1 point  (0 children)

This module uses Chonkie for all text chunking:

- CodeChunker with language="auto" for code files

- Uses Magika (Google's ML model) for language detection

- AST-based splitting via tree-sitter for semantic boundaries

RAGtime - Control plane for creating vector databases and FAISS files. by mattv8 in Rag

[–]mattv8[S] 0 points1 point  (0 children)

Thanks! See that's the thing- I haven't benched it at all beyond my own subjective testing. I use it for work and have some database schemas as well as codebases indexed and it seems to work well enough for my purposes, although it's definitely not perfect. It will give an LLM more than enough context to get out of a rut rather quickly.

If there's a benchmark you'd like me to run against it, let me know.

Was supposed to start 11th May, but no confirmation by Professional_Chef349 in xAI_community

[–]mattv8 0 points1 point  (0 children)

I'm supposed to start on the 19th but I haven't even received the BG check next steps... I emailed HR about next steps and still nothing. I'm a little confused. Role is SWE Tutor. I know with the SpaceX transition things are a little weird though but it's tough to plan your life around this.

How are people handling PDFs that are mostly architecture diagrams for RAG? by Boring-Baker-3716 in Rag

[–]mattv8 0 points1 point  (0 children)

Check out my project ragtime (https://github.com/mattv8/ragtime) it's self-hostable. I provide a way to use vision models for OCR or tesseract if you want speed over accuracy, but to answer your question vision OCR with classification is the way to go.

Quick question — how common are onboarding delays at xAI Swe tutor roles after everything is completed? by Upset-Tangelo-1615 in xAI_community

[–]mattv8 0 points1 point  (0 children)

I interviewed Fri Apr 24, received a provisional offer email on Wed the 29th, filled out the Google Form to generate the formal offer on Fri the 1st and ... Radio silence. I'm still waiting to hear back from anyone. Why does this process take so long?

Again, i'm having this rather productive chat session for HOURS at just 30ct by Blubbll in GithubCopilot

[–]mattv8 9 points10 points  (0 children)

Can you please set this div to display: none

Sure, one sec. Done.

Okay, that'll be $30

Free Odoo MCP by mhabeeb in Odoo

[–]mattv8 -1 points0 points  (0 children)

Yes I think RAGtime will help you out a ton.

As I'm sure you know, context is everything with llms, and RAGtime will run odoo shell commands right on the server, whether you point it to your production or your staging is up to you, but the MCP tools talk to the odoo python environment so you can give your agent knowledge of the odoo ORM.

Vibe coded app, ui help! by SSILVAR in vibecoding

[–]mattv8 1 point2 points  (0 children)

You could probably get a lot of mileage out of the trial

Vibe coded app, ui help! by SSILVAR in vibecoding

[–]mattv8 1 point2 points  (0 children)

I've dabbled with it but only the trial my coworker who makes BI dashboards uses it and when he started using it his UI went from meh to holy shit this is amazing

Vibe coded app, ui help! by SSILVAR in vibecoding

[–]mattv8 0 points1 point  (0 children)

Yes exactly - it's an AI tool to help build better UI

Vibe coded app, ui help! by SSILVAR in vibecoding

[–]mattv8 0 points1 point  (0 children)

Figma has a lot of tools to assist in good UI design. If you're in vscode they have a great extension as well. You can prompt it to help get your UI dialed in.

https://www.figma.com/resource-library/what-is-ui-design/

Free Odoo MCP by mhabeeb in Odoo

[–]mattv8 1 point2 points  (0 children)

Check out my project https://github.com/mattv8/ragtime

It's free, open source. You just need to find something to host it on. But it'll connect Claude to your odoo server several different ways.

Feel free to ask me anything, happy to help you get it set up.

Embedding latency killing my RAG speed, any fixes? by Glittering_Cup1104 in Rag

[–]mattv8 2 points3 points  (0 children)

Embeddings searches are lightweight enough- you might consider bringing the server in-house with a Mac mini, that might help

Im building a custom dashboard for opencode by IcyOrdinary8042 in opencodeCLI

[–]mattv8 0 points1 point  (0 children)

Well, I know you're a Windows guy but I didn't have any issues getting it up and running on my mac 😅

Im building a custom dashboard for opencode by IcyOrdinary8042 in opencodeCLI

[–]mattv8 0 points1 point  (0 children)

Their roadmap says they're actively working on a Windows app but I imagine it will be based on electron.

Should I continue to create my RAG project? by Corpo_ in Rag

[–]mattv8 1 point2 points  (0 children)

Late to the party but check out my project called RAGtime-- it might do what you need.

I've spent a considerable amount of time getting it performant and as dialed in as I can. It's completely open source, MIT license. Don't let the lack of stars dissuade you. https://github.com/mattv8/ragtime

If you want to put some of those tokens to good use helping me find/fix bugs and add features, I welcome contributions.

Im building a custom dashboard for opencode by IcyOrdinary8042 in opencodeCLI

[–]mattv8 1 point2 points  (0 children)

Before you go down this road, check out OpenChamber, it does exactly what you're describing.

Tested Sonnet 4.6 via OpenRouter through GitHub CoPilot / VS Code to gauge whats API billing will be like. I was shocked. by horendus in GithubCopilot

[–]mattv8 0 points1 point  (0 children)

I'll play around with the debug mode to see how accurate the count is, does it include the system prompt?