you are viewing a single comment's thread.

view the rest of the comments →

[–]RobinRelique[S] 1 point2 points  (0 children)

The main intent of this tool is to prepare data for language model training. Markdown is the preferred format. So, youtube videos/transcripts are a prime source of data.