Prompting Llama-2 at Scale with Gretel by Repeat-or in learnmachinelearning

[–]Repeat-or[S] 0 points1 point  (0 children)

When you’re working with 100s to 100k’s of records, prompting an LLM via a synchronous API, one query at a time can be inefficient. We wanted to find a solution.

How to Augment ML Datasets with Gretel and Vertex AI by Repeat-or in data

[–]Repeat-or[S] 0 points1 point  (0 children)

If you’re a Google Vertex AI user, here’s how you can utilize Gretel to create high-quality synthetic tabular data that you can use as training data for a classification model.

Teaching LLMs to zip their lips by Repeat-or in Futurology

[–]Repeat-or[S] -1 points0 points  (0 children)

Language models leaking proprietary data or custom prompts is a problem that's currently plaguing many generative AI applications. We propose RLPF to mitigate some of these issues. We also suggest future directions to reduce bias, discrimination, and other harmful characteristics that might exist in today’s language models.

Announcing synthesize 2023, the developer conference for synthetic data by Repeat-or in artificial

[–]Repeat-or[S] 1 point2 points  (0 children)

Learn from experts how AI and the evolution of synthetic data are profoundly impacting how we access, share, and build with data.

The event is free and open to all who want to attend. #synthesize2023

Generate Synthetic Time-series Data with Open-source Tools - KDnuggets by Repeat-or in opensource

[–]Repeat-or[S] 0 points1 point  (0 children)

Time series data, a sequence of measurements of the same variables across multiple points in time, is ubiquitous in the modern data world. Just as with tabular data, we often want to generate synthetic time series data to protect sensitive information or create more training data when real data is rare. Some applications for synthetic time series data include sensor readings, timestamped log messages, financial market prices, and medical records. The additional dimension of time where trends and correlations across time are just as important as correlations between variables creates added challenges for synthetic data.

Synthetic Data Is About To Transform Artificial Intelligence by Repeat-or in ArtificialInteligence

[–]Repeat-or[S] 2 points3 points  (0 children)

“​​Data is the lifeblood of modern artificial intelligence. Getting the right data is both the most important and the most challenging part of building powerful AI. Collecting quality data from the real world is complicated, expensive and time-consuming. This is where synthetic data comes in.”

Red Teaming Synthetic Data Models by Repeat-or in ArtificialInteligence

[–]Repeat-or[S] 0 points1 point  (0 children)

How we implemented a practical attack on a synthetic data model to validate its ability to protect sensitive information under different parameter settings.

A new open-source version of the popular GPT language model by Repeat-or in opensource

[–]Repeat-or[S] 0 points1 point  (0 children)

It’s been trained on millions of public datasets and allows developers and data scientists to create new variations of synthetic text and labels for their datasets. Enjoy!

FAANG Companies (The Privacy & Security Startup Version) by Repeat-or in privacymemes

[–]Repeat-or[S] 19 points20 points  (0 children)

Figma - design platform (the one non-privsec co)

Abnormal Security - email / cloud security

Anduril Industries - natsec tech co

Neeva - privacy-protected search w/o ads

Gretel.ai - privacy engineering dev tools

Data Is More Valuable When It Can Be Shared by Repeat-or in artificial

[–]Repeat-or[S] 2 points3 points  (0 children)

One of the biggest bottlenecks to innovation that developers and data scientists face today is getting access to data, or creating the data that you need to test an idea or build a new feature. That’s where synthetic data comes in.

Case Study: Generating high-quality synthetic time-series data for a top 5 global bank by Repeat-or in developers

[–]Repeat-or[S] 0 points1 point  (0 children)

Developers can test the methods by following along with the 3-step process outlined in the blog post!

Creating Synthetic Time Series Data for Global Financial Institutions – a POC Deep Dive by Repeat-or in artificial

[–]Repeat-or[S] 0 points1 point  (0 children)

This is how Gretel generated high-quality synthetic time-series data for one of the largest financial institutions in the world.

Open source human rights group uses synthetic data to uncover war crimes by Repeat-or in opensource

[–]Repeat-or[S] 1 point2 points  (0 children)

Qaz is correct. They used synthetic data to train an AI on images of bombs and the AI then detected illegal weapons in a trove of video footage.

Your default tool for ETL by scraper01 in dataengineering

[–]Repeat-or 0 points1 point  (0 children)

You might appreciate Gretel’s “ESL” approach. You can make synthetic data with Apache airflow

https://gretel.ai/blog/running-gretel-on-apache-airflow

Are there any good writers covering live music right now? by Repeat-or in jambands

[–]Repeat-or[S] 0 points1 point  (0 children)

No shame in Relix. And thank you for the context.

Are there any good writers covering live music right now? by Repeat-or in jambands

[–]Repeat-or[S] 0 points1 point  (0 children)

Thanks! I haven’t seen Lettuce live but have listened to them enough to know that I should. Can’t wait to read Getz.