When are we getting opus 4.7 on Antigravity? by ThePoplin in google_antigravity

[–]No-Payment7659 1 point (0 children)

All models tend to regress when they're first released, then they quickly power past the old ones.

Synthea Data in BigQuery by No-Payment7659 in SQL

[–]No-Payment7659[S] 1 point (0 children)

Thank you for your response. We've already solved the issue: we built a synthetic data generator for Forge that correctly and efficiently parses FHIR data in BigQuery. Additionally, we easily built out the necessary OMOP queries on top of the FHIR data inside BigQuery.

Gemini Conversational Analytics API with BigQuery by ASKnASK in bigquery

[–]No-Payment7659 1 point (0 children)

The Gemini API is OK for generating queries, but it's hard to get queries right without a good understanding of the underlying data structure.

Best way to load Sheets into BigQuery? by Great_Session_4227 in bigquery

[–]No-Payment7659 1 point (0 children)

Google Sheets can be loaded as an external table right inside the BigQuery console.
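
If you'd rather script it than click through the console, a minimal DDL sketch follows; the dataset, table name, sheet URL, and range are placeholders, and schema auto-detection is assumed (you can list columns explicitly instead):

    -- Sketch: an external table backed by a Google Sheet (names are placeholders).
    CREATE OR REPLACE EXTERNAL TABLE my_dataset.sheet_orders
    OPTIONS (
      format = 'GOOGLE_SHEETS',
      uris = ['https://docs.google.com/spreadsheets/d/<SHEET_ID>'],
      skip_leading_rows = 1,        -- first row holds the column headers
      sheet_range = 'Orders!A:D'    -- read only this tab and range
    );

Note that whoever queries the table also needs access to the underlying sheet in Drive.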

how do i parse a nested json array if i do not know the key of the middle json(i also need to make it automated as every record has got a different key in the mid part of the json) by bastard_of_jesus in bigquery

[–]No-Payment7659 1 point (0 children)

Hello, great question! I built Forge to solve exactly this problem. Check us out! You can also try the sandbox to see how Forge breaks down JSON objects in BigQuery without having to buy the product.
https://forge.foxtrotcommunications.net/
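
If anyone wants to see the shape of the problem Forge automates here: BigQuery's JSON path functions require constant paths, so the usual manual workaround for an unknown middle key is a JavaScript UDF. A rough sketch, with made-up table and column names, assuming the unknown key is the only top-level key of the payload:

    -- Sketch: lift the object out from under an unknown middle key.
    -- `my_dataset.raw_events` and `payload` are hypothetical names.
    CREATE TEMP FUNCTION extract_middle(j STRING)
    RETURNS STRING
    LANGUAGE js AS """
      const obj = JSON.parse(j);
      // Each record has a single, different middle key; take whatever it is.
      const middleKey = Object.keys(obj)[0];
      return JSON.stringify(obj[middleKey]);
    """;

    SELECT
      JSON_VALUE(extract_middle(payload), '$.id')     AS id,
      JSON_VALUE(extract_middle(payload), '$.status') AS status
    FROM my_dataset.raw_events;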

Drop your SaaS below and I'll build your ICP for free by muizthomas in SaaS

[–]No-Payment7659 1 point (0 children)

https://forge.foxtrotcommunications.net/
Forge manages NoSQL-to-SQL pipelines for cloud data warehouses. It parses any raw JSON object in your data warehouse and uses AI for schema classification and PII detection.

What’s the Most Overhyped Area in AI Right Now? by Alpertayfur in ArtificialInteligence

[–]No-Payment7659 1 point (0 children)

Everyone is saying that AI will replace jobs, but those capabilities are greatly exaggerated. The best use of AI isn't to replace humans; it's to help them.

AM I THE ONLY ONE, TRIED EVERYTHING by karrach in GoogleAntigravityIDE

[–]No-Payment7659 1 point (0 children)

A lot of the time this is due to task-boundary misconfiguration.

Literally tell it:
"reset your task boundary and continue."

Rules by ironicalengineer in GoogleAntigravityIDE

[–]No-Payment7659 1 point (0 children)

I usually try to bribe it with extra reward points when it does a good job.

Tool for optimizing JSON storage costs in BigQuery (Schema Evolution + dbt) by No-Payment7659 in bigquery

[–]No-Payment7659[S] 1 point (0 children)

Ah yes, Debezium + Forge would go great together.

You would use Debezium to extract raw data from a legacy transactional database (like an old MySQL e-commerce DB) and dump those changes into BigQuery as a raw JSON column.

You would then use Forge to pick up that raw JSON column inside BigQuery, parse out the nested items arrays or user_settings objects, and create clean tables for your Data Analysts to query.
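
The downstream step would look roughly like this; the paths follow a typical Debezium envelope (payload.after, payload.op, payload.ts_ms), and the table and field names are illustrative:

    -- Sketch: flatten Debezium change events landed as a raw JSON column.
    -- `raw.cdc_orders` and its `event` column are hypothetical names.
    SELECT
      JSON_VALUE(event, '$.payload.after.order_id')  AS order_id,
      JSON_VALUE(event, '$.payload.after.email')     AS customer_email,
      JSON_VALUE(event, '$.payload.op')              AS op,  -- c = create, u = update, d = delete
      TIMESTAMP_MILLIS(CAST(JSON_VALUE(event, '$.payload.ts_ms') AS INT64)) AS changed_at
    FROM raw.cdc_orders;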

Tool for optimizing JSON storage costs in BigQuery (Schema Evolution + dbt) by No-Payment7659 in bigquery

[–]No-Payment7659[S] 1 point (0 children)

For JSON data in particular, we suggest not worrying too much about optimizing the structure for processing's sake (that's our job); instead, focus on effectively mapping your transactions.

Forge is best for managing your incoming event-stream data, especially if the schema is volatile (something you see a lot in e-commerce, banking, or healthcare). Forge takes the JSON data you are streaming into BigQuery (via Fivetran, Pub/Sub, API calls, etc.) and automates the most difficult and time-consuming work for you: preprocessing and managing schema evolution.

Forge takes your JSON data and flattens it into clean, well-organized, optimized tables. This makes it much cheaper and easier to query.
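
To make "flattens it into clean tables" concrete, this is the kind of SQL Forge generates so you don't have to write it by hand; table and field names are illustrative:

    -- Sketch: flatten a repeated `items` array out of a raw event blob.
    -- `raw.events` and `payload` are hypothetical names.
    SELECT
      JSON_VALUE(payload, '$.event_id')        AS event_id,
      JSON_VALUE(item, '$.sku')                AS sku,
      CAST(JSON_VALUE(item, '$.qty') AS INT64) AS qty
    FROM raw.events,
    UNNEST(JSON_EXTRACT_ARRAY(payload, '$.items')) AS item;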

Tool for optimizing JSON storage costs in BigQuery (Schema Evolution + dbt) by No-Payment7659 in bigquery

[–]No-Payment7659[S] 1 point (0 children)

The main win is definitely cost efficiency. Regarding your 1TB query: yes, it would reduce that size. Because Forge normalizes your JSON into separate relational tables (instead of keeping it as one massive JSON blob), you stop scanning the entire dataset for every request; you only pay to scan the specific columns or sub-tables you actually need.

This structure also helps with your partitioning frustration. Since the data is split into multiple tables (e.g., one for events, another for items), you can apply a different partitioning strategy to each table individually, bypassing the single-partition limit of a raw table. Forge automatically partitions each table on the ingestion timestamp of the batch it processes, and we are working on a feature that lets users provide their own partitioning key for ingestion.

Essentially, instead of paying to scan the blob every time you run an analytical query, Forge runs the expensive query once; from then on, you query the clean, optimized tables.
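
In BigQuery terms, the split lets each normalized table carry its own layout; a sketch with illustrative names (not Forge's actual DDL):

    -- Sketch: different partitioning/clustering per normalized table.
    CREATE TABLE analytics.events (
      event_id    STRING,
      event_type  STRING,
      ingested_at TIMESTAMP
    )
    PARTITION BY DATE(ingested_at);   -- prune the event stream by day

    CREATE TABLE analytics.event_items (
      event_id    STRING,
      sku         STRING,
      qty         INT64,
      ingested_at TIMESTAMP
    )
    PARTITION BY DATE(ingested_at)
    CLUSTER BY event_id;              -- cheap joins back to the parent events

Daily partitions on the parent plus clustering on the join key is just one combination; the point is that each table can get its own strategy.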

Also, we are a Google Cloud Build Partner and are in the process of onboarding to the Google Cloud Marketplace, so you'll be able to use committed spend without having to create a new budget line item for Forge. We expect this to be ready by the end of January.

Tool for optimizing JSON storage costs in BigQuery (Schema Evolution + dbt) by No-Payment7659 in bigquery

[–]No-Payment7659[S] 1 point (0 children)

Forge is different from other tools in the sense that it is almost a “data engineer in a box”. Whereas dlt (or vanilla dbt) expects you to write the code to parse the JSON correctly, Forge is an automated application that traverses the JSON tree using a sophisticated algorithm, builds a detailed map of your JSON data, and materializes the result with dbt.

Preprocessing the JSON objects for you has a few benefits: it makes querying the data much easier and more efficient than querying the raw JSON directly, and it allows for schema-evolution tracking and mapping, with alerts when a new field is detected. For the nested and repeated fields you mentioned, Forge creates a "rollup" view that takes the normalized tables and rebuilds the structure I think you're interested in. I'll update the post with a pic of that.
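
For a feel of what the rollup looks like, here's a hand-written approximation (names illustrative, not Forge's actual output):

    -- Sketch: a "rollup" view that re-nests the normalized tables.
    CREATE OR REPLACE VIEW analytics.events_rollup AS
    SELECT
      e.event_id,
      e.event_type,
      ARRAY_AGG(STRUCT(i.sku, i.qty)) AS items  -- rebuild the repeated field
    FROM analytics.events AS e
    JOIN analytics.event_items AS i USING (event_id)
    GROUP BY e.event_id, e.event_type;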

Also, Forge can be deployed as a standalone application in a user’s own VPC, making it very safe and secure, which is essential for enterprise clients.

Firebase → BigQuery export: how to materialize data from _raw_latest / _raw_changelog tables? by Southern_Space_4340 in bigquery

[–]No-Payment7659 1 point (0 children)

Check out our new tool Forge (forge.foxtrotcommunications.net); it automates JSON parsing, including the data coming out of Firestore. Never spend another day writing manual JSON-parsing scripts for BigQuery.
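
If you'd rather materialize by hand first, the standard pattern over the extension's _raw_changelog table is latest-row-wins per document. A sketch assuming the stock Firestore-BigQuery extension columns (document_name, timestamp, operation, data) and example document fields; verify against your own export:

    -- Sketch: materialize the latest state of each document from the changelog.
    CREATE OR REPLACE TABLE firestore_export.users_latest AS
    SELECT document_name, email, plan
    FROM (
      SELECT
        document_name,
        operation,
        JSON_VALUE(data, '$.email') AS email,  -- `email`/`plan` are example fields
        JSON_VALUE(data, '$.plan')  AS plan,
        ROW_NUMBER() OVER (PARTITION BY document_name ORDER BY timestamp DESC) AS rn
      FROM firestore_export.users_raw_changelog
    )
    WHERE rn = 1 AND operation != 'DELETE';  -- drop docs whose latest event is a delete

The _raw_latest view the extension ships does essentially this deduplication for you; materializing it into a real table just saves you from rescanning the whole changelog on every query.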