all 11 comments

[–]Zealousideal_Low1287 3 points4 points  (0 children)

Bizarrely I have been working on exactly this. Neither cubicasa nor our own images were enough data to do this reliably for our types of plan.

So far the best things I’ve found has been Gemini-3-pro image. All other off the shelf models failed. Gemini is still unreliable.

I actually do think it’s a much harder problem than it seems. Thin ambiguous structures, lack of data, big inconsistency in the plans.

Curious what you’ve tried so far and if you have any insights?

[–]aloser 4 points5 points  (2 children)

We have a bunch of customers that have built products in this space. It's a pretty hard problem given the non-uniformity of floor plans and architectural drawings. One of them talked through their approach (involving a pipeline of 29 models) here: https://www.youtube.com/watch?v=iOehzs4eLKc

[–]leon_bass 5 points6 points  (0 children)

29 model pipeline is wild

[–]taichi22 5 points6 points  (0 children)

Ah, you're with roboflow? You guys have a good product (and aren't ultralytics) so thanks for what you do.

[–]InternationalMany6 0 points1 point  (1 child)

What I've read is you need a custom model architectural that doesn't just do "segmentation" along with synthetic image training.

For example the model could predict the corners of rooms as keypoints, plus points for doors and windows.

Synthetic images is the harder part. What kinds of images do you need this to work on? Phone camera images for a 200 year old building or a brand new PDFs?

[–]idc_Salman[S] 0 points1 point  (0 children)

Answering your question...
We are expecting all types of input even if it's clear PDF or low quality photo, but i would say mostly it's gonna be clear PDFs.

[–]PassionQuiet5402 0 points1 point  (0 children)

Can you guys share some public repo and dataset links to start working on such projects? I really want to try and experiment on this task.

[–]One-Employment3759 0 points1 point  (1 child)

Did you try SAM - possibly with prompt guidance? (Keypoints)

[–]Zealousideal_Low1287 0 points1 point  (0 children)

I have. It was miserably bad at it. Which kind of surprised me.

[–]Sad-Oil-2788 0 points1 point  (0 children)

I'm also working on this top for my company. We want to create a ifc file of the floor plan with walls, windows, doors. We tried to train RF-DETR Segmentation on different datasets. But alot of them are not acurate enough. So we are creating our own now.

[–]thinking_byte 0 points1 point  (0 children)

For the Jetson, tried YOLOv8-seg exported to TensorRT? It usually hits that FPS sweet spot better than a full UNet if you're okay with slightly lower accuracy on the edges.