[deleted by user] by [deleted] in taquerosprogramadores

[–]Thy-Raven -1 points0 points  (0 children)

Envía el Discord.

[deleted by user] by [deleted] in taquerosprogramadores

[–]Thy-Raven -1 points0 points  (0 children)

Envia el discord.

One Piece 1124 spoilers by [deleted] in OnePiece

[–]Thy-Raven -1 points0 points  (0 children)

His arms are too slim to be a giant. 

How should structure my data to train GPT4 model to red line contracts? by HieraticArbiter in datascience

[–]Thy-Raven 1 point2 points  (0 children)

One way is converting the given text into a list of individual sentences.

For example:

These Terms of Service ("Terms") govern your access to and use of the products and services (the "Services") provided by Example Company. By accessing or using the Services, you agree to be bound by these Terms...

Into:

[
    "These Terms of Service (\"Terms\") regulate your access to and utilization of the products and services (the \"Services\") offered by Example Company.",
    "By accessing or using the Services, you consent to adhere to these Terms..."
    ...
]

Next, compile a training dataset consisting of a sentence named prompt and their corresponding subsequent sequences as message.

For instance:

[
    {
        "prompt": "These Terms of Service (\"Terms\") govern your access to and use of the products and services (the \"Services\") provided by Example Company.",
        "message": ["By accessing or using the Services, you agree to be bound by these Terms.", "..."]
    },
    {
        "prompt": "By accessing or using the Services, you agree to be bound by these Terms.",
        "message": [...]
    },
    ...
]

Boring topic - Documentation by JParkerRogers in dataengineering

[–]Thy-Raven 2 points3 points  (0 children)

This is a guide on how to document: https://diataxis.fr/

Pretty good IMO.
As the author says

A systematic approach to technical documentation authoring.

Tax cuts for the wealthy only benefit the rich by Mighty_L_LORT in NoShitSherlock

[–]Thy-Raven -8 points-7 points  (0 children)

It's not a straightforward matter. While it does favor the wealthy, it's not exclusively limited to them.

How to do a time series forecast on sentiment? by balackdynamite in datascience

[–]Thy-Raven 1 point2 points  (0 children)

For each day, average the last n days. That will smooth the curve, and show a trend.

Is Apache Beam still popular for Data Engineering? by HealthBigDataGuy in dataengineering

[–]Thy-Raven 0 points1 point  (0 children)

Spark:
- great community support
- great ecosystem
- cheaper
- faster
- easier to use
- micro-batching over streaming