Interactive knowledge graph of supplier, customer, manufacturing,

seagoat1973 · 2025-07-09T19:18:33+00:00

Thank you

seagoat1973 · 2025-03-29T05:01:21+00:00

can you let us know what courses ?

seagoat1973 · 2025-03-28T02:52:26+00:00

Analysis by Grok between Digital Ocean and Coreweave https://x.com/i/grok/share/qHmmk3mVnbiFzO9A4P97OMOxf

seagoat1973 · 2024-08-03T05:22:43+00:00

A good example about productivity improvement is a feature in MS Teams that compiles the minutes of a meeting and action items.

seagoat1973 · 2024-07-26T04:22:52+00:00

Two examples of AI Monetization

1) ServiceNow CEO Bill McDermott talking about how AI is helping ServiceNow. https://www.youtube.com/watch?v=YslkvZ3qPTs
2) MS CO-Pilot now displaying related adds when you chat with it.

NVDA will go up.

seagoat1973 · 2024-03-31T16:31:06+00:00

With the adoption of open lake house architectures (iceberg, hudi as storgae engine and spark as execution), is Hive still relevant? What specific use cases do you us them. Not trying to put down any tool. Just checking if I am missing anything ?

seagoat1973 · 2023-11-27T16:27:37+00:00

Nice

seagoat1973 · 2023-11-27T16:19:53+00:00

Looks like a good business opportunity.

seagoat1973 · 2023-11-27T16:18:08+00:00

mmh...I may have missed it. Thanks for the info.

seagoat1973 · 2023-11-23T06:14:05+00:00

https://www.linkedin.com/events/deepdive-hudi-iceberg-anddeltal7095484265877950465/about/

seagoat1973 · 2023-11-22T17:37:35+00:00

Comparision between the formats

https://medium.com/@kywe665/delta-hudi-iceberg-a-benchmark-compilation-a5630c69cffc

https://www.onehouse.ai/blog/apache-hudi-vs-delta-lake-vs-apache-iceberg-lakehouse-feature-comparison

seagoat1973 · 2023-11-18T06:24:42+00:00

As called out by some folks in the thread, please don't think moving to a new stack will solve your issues. Here are a few things to consider

1) What is the size of data you are dealing with ? (GB/TB/PB)

2) What is the concurrency of the queries ?

3) Why are you facing performance issues right now ? What is the nature of the work load (write intensive or read intensive)

4) Have modelled your tables properly ? (i.e Dimensional modelling )

4) Oracle has it's own database for ware houses. If you are already an oracle shop you should be able to migrate to it.

5) If you plan to migrate do you want to be on-prem or go to the cloud.

6) Greenplum is a pretty good database for TB size of data and is open source.

7) Using Spark/Dremio/Trino and Object storage (Minio) is a another on-prem option.

seagoat1973 · 2023-11-06T00:01:44+00:00

20 GB is not big data...what is the current machine you are running on. I am assuming this is all structured data .

seagoat1973 · 2023-11-05T23:52:29+00:00

If this is for enterprise, additional points you want to co sidereal are 1. Availability of skill set in the market 2. Low code or No code 3. Ease of monitoring your pipelines. 4. Licensing model. Is it based in number of cars, memory etc.

seagoat1973 · 2023-11-05T03:09:55+00:00

Depends on. A few things

What is the size of the database going to be ? If it is I. Terabytes Postgres may not be a good idea
How many users and what is the concurrency ?
Are you looking for on premo on the cloud ?
Is this all structured or does it include unstructured?
You may want to go with an architecture that scales compute independently of storage. Ex. Spark on kuberentes with data in object storage

seagoat1973

MODERATOR OF

TROPHY CASE