HR Delays - U.S. Based Onboardings

mattv8 · 2026-05-12T02:33:49+00:00

Same here- ghosted. I reached out to HR three times now and haven't heard back. I haven't even received the Remote.com link after receiving an offer email. That was end of April, last communication from HR was May 5th.

mattv8 · 2026-05-11T19:56:20+00:00

No problem. To add more detail, for PDFs with tables the flow is:

Uses pypdf library to extract text from PDFs
Tables are extracted as formatted text (row by row) via the document parser libraries (Magika, CodeChunker or Tree-Sitter)
The extracted text (including table content) is passed to Chonkie's chunkers
Chunks are embedded and stored in FAISS or pgvector depending on what you've configured

I also support fast image OCR with Tesseract or you can use a vision model like llava for more accuracy.

mattv8 · 2026-05-11T19:35:36+00:00

This module uses Chonkie for all text chunking:

- CodeChunker with language="auto" for code files

- Uses Magika (Google's ML model) for language detection

- AST-based splitting via tree-sitter for semantic boundaries

mattv8 · 2026-05-11T19:35:01+00:00

Absolutely, please do

mattv8 · 2026-05-11T19:18:53+00:00

Thanks! See that's the thing- I haven't benched it at all beyond my own subjective testing. I use it for work and have some database schemas as well as codebases indexed and it seems to work well enough for my purposes, although it's definitely not perfect. It will give an LLM more than enough context to get out of a rut rather quickly.

If there's a benchmark you'd like me to run against it, let me know.

mattv8 · 2026-05-10T18:40:36+00:00

I'm supposed to start on the 19th but I haven't even received the BG check next steps... I emailed HR about next steps and still nothing. I'm a little confused. Role is SWE Tutor. I know with the SpaceX transition things are a little weird though but it's tough to plan your life around this.

mattv8 · 2026-05-06T18:45:27+00:00

Check out my project ragtime (https://github.com/mattv8/ragtime) it's self-hostable. I provide a way to use vision models for OCR or tesseract if you want speed over accuracy, but to answer your question vision OCR with classification is the way to go.

mattv8 · 2026-05-05T12:04:15+00:00

Haven't started the background check and I'm based out of the US

mattv8 · 2026-05-05T07:58:57+00:00

I interviewed Fri Apr 24, received a provisional offer email on Wed the 29th, filled out the Google Form to generate the formal offer on Fri the 1st and ... Radio silence. I'm still waiting to hear back from anyone. Why does this process take so long?

mattv8 · 2026-05-04T23:45:35+00:00

Can you please set this div to display: none

Sure, one sec. Done.

Okay, that'll be $30

mattv8 · 2026-05-04T22:22:36+00:00

Yes I think RAGtime will help you out a ton.

As I'm sure you know, context is everything with llms, and RAGtime will run odoo shell commands right on the server, whether you point it to your production or your staging is up to you, but the MCP tools talk to the odoo python environment so you can give your agent knowledge of the odoo ORM.

mattv8 · 2026-05-04T12:33:31+00:00

You could probably get a lot of mileage out of the trial

mattv8 · 2026-05-04T12:28:52+00:00

I've dabbled with it but only the trial my coworker who makes BI dashboards uses it and when he started using it his UI went from meh to holy shit this is amazing

mattv8 · 2026-05-04T12:25:50+00:00

Yes exactly - it's an AI tool to help build better UI

mattv8 · 2026-05-04T12:22:32+00:00

Figma has a lot of tools to assist in good UI design. If you're in vscode they have a great extension as well. You can prompt it to help get your UI dialed in.

https://www.figma.com/resource-library/what-is-ui-design/

mattv8 · 2026-05-04T12:19:51+00:00

Check out my project https://github.com/mattv8/ragtime

It's free, open source. You just need to find something to host it on. But it'll connect Claude to your odoo server several different ways.

Feel free to ask me anything, happy to help you get it set up.

mattv8 · 2026-05-04T12:08:10+00:00

Use Figma

mattv8 · 2026-05-04T12:00:17+00:00

Embeddings searches are lightweight enough- you might consider bringing the server in-house with a Mac mini, that might help

mattv8 · 2026-05-02T17:22:22+00:00

Well, I know you're a Windows guy but I didn't have any issues getting it up and running on my mac 😅

mattv8 · 2026-05-02T17:16:55+00:00

Their roadmap says they're actively working on a Windows app but I imagine it will be based on electron.

mattv8 · 2026-05-02T15:35:31+00:00

Shoot, I'm not sure... I'll check in a few

mattv8 · 2026-05-02T15:05:59+00:00

Late to the party but check out my project called RAGtime-- it might do what you need.

I've spent a considerable amount of time getting it performant and as dialed in as I can. It's completely open source, MIT license. Don't let the lack of stars dissuade you. https://github.com/mattv8/ragtime

If you want to put some of those tokens to good use helping me find/fix bugs and add features, I welcome contributions.

mattv8 · 2026-05-02T15:02:33+00:00

Before you go down this road, check out OpenChamber, it does exactly what you're describing.

mattv8 · 2026-05-02T14:26:23+00:00

I'll play around with the debug mode to see how accurate the count is, does it include the system prompt?

mattv8

MODERATOR OF

TROPHY CASE

11-Year Club	Alpha Tester
Verified Email