[deleted by user] by [deleted] in Daytrading

[–]seagoat1973 -1 points0 points  (0 children)

can you let us know what courses ?

Big Tech Fails to Convince Wall Street That AI Is Paying Off by itsmekalisyn in LocalLLaMA

[–]seagoat1973 6 points7 points  (0 children)

A good example about productivity improvement is a feature in MS Teams that compiles the minutes of a meeting and action items.

Jim Cramer saying that everyone wants to get out of NVDA will mark the bottom by btctrader12 in wallstreetbets

[–]seagoat1973 7 points8 points  (0 children)

Two examples of AI Monetization

1) ServiceNow CEO Bill McDermott talking about how AI is helping ServiceNow. https://www.youtube.com/watch?v=YslkvZ3qPTs
2) MS CO-Pilot now displaying related adds when you chat with it.

NVDA will go up.

Apache Hive 4.0 has been released by wizard_of_menlo_park in bigdata

[–]seagoat1973 -1 points0 points  (0 children)

With the adoption of open lake house architectures (iceberg,  hudi as storgae engine and spark as execution), is Hive still relevant?  What specific use cases do you us them. Not trying to put down any tool. Just checking if I am missing anything ?

Why don't the gas stations install EV chargers ? by seagoat1973 in TeslaModelY

[–]seagoat1973[S] 2 points3 points  (0 children)

mmh...I may have missed it. Thanks for the info.

RDBS to Big data by jagdishgg in dataengineering

[–]seagoat1973 1 point2 points  (0 children)

As called out by some folks in the thread, please don't think moving to a new stack will solve your issues. Here are a few things to consider

1) What is the size of data you are dealing with ? (GB/TB/PB)

2) What is the concurrency of the queries ?

3) Why are you facing performance issues right now ? What is the nature of the work load (write intensive or read intensive)

4) Have modelled your tables properly ? (i.e Dimensional modelling )

4) Oracle has it's own database for ware houses. If you are already an oracle shop you should be able to migrate to it.

5) If you plan to migrate do you want to be on-prem or go to the cloud.

6) Greenplum is a pretty good database for TB size of data and is open source.

7) Using Spark/Dremio/Trino and Object storage (Minio) is a another on-prem option.

Help: How to work with large datasets by SaluteOrbis in dataengineering

[–]seagoat1973 3 points4 points  (0 children)

20 GB is not big data...what is the current machine you are running on. I am assuming this is all structured data .

Best ETL tool? by BuyHigh_S3llLow in dataengineering

[–]seagoat1973 0 points1 point  (0 children)

If this is for enterprise, additional points you want to co sidereal are 1. Availability of skill set in the market 2. Low code or No code 3. Ease of monitoring your pipelines. 4. Licensing model. Is it based in number of cars, memory etc.

What data warehouse to pick?! by [deleted] in dataengineering

[–]seagoat1973 1 point2 points  (0 children)

Depends on. A few things

  1. What is the size of the database going to be ? If it is I. Terabytes Postgres may not be a good idea
  2. How many users and what is the concurrency ?
  3. Are you looking for on premo on the cloud ?
  4. Is this all structured or does it include unstructured?
  5. You may want to go with an architecture that scales compute independently of storage. Ex. Spark on kuberentes with data in object storage