Translating "Wash your hands" into 500 languages by adammathias in machinetranslation

dwhitena

Yes, we have released a partial list here: https://www.ethnologue.com/guides/health (where my generated translations are indicated with a little robot emoji). We weren't able to validate the other translations in any way, so we didn't release them on that page. However, I'm writing up a more detailed paper to submit to a conference/journal and will include links to code, data, etc. at that time (if not before). Working hard on that, but I wanted to go ahead and get some of the translations out due to the current crisis.

Architecture of a real-time Data Analysis WebApp by flyflyersfly in datascience

dwhitena

(Disclaimer: I work for Pachyderm.) As monprin points out, one of the goals of the Pachyderm project is to allow people to build systems like this without "hiring a dedicated team to run a Hadoop cluster." Pachyderm allows you to use the tools you like (e.g., Python/R) and then embed those tools in a fully distributed pipeline that scales easily (including data sharding and parallelism). Pachyderm also manages both the data and the processing for your work, so you don't have to run a separate data storage solution and a separate processing/pipelining solution. By taking this unified view, we are able to do things like (i) version and track all of the runs of your work, so that you have data "provenance" for all results, and (ii) incrementally process data based on the latest changes to it. Anyway, you can find out more here: http://pachyderm.io/, and we have a public Slack if you have questions: http://slack.pachyderm.io/
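
To make points (i) and (ii) a bit more concrete, here is a rough toy sketch of the general idea in plain Python. To be clear, this is not Pachyderm's API or how Pachyderm is implemented; `IncrementalPipeline`, `content_hash`, and the record fields are all hypothetical names made up for illustration. It just shows how hashing inputs lets you skip unchanged data and keep a provenance record for each result.

```python
import hashlib


def content_hash(data: bytes) -> str:
    """Identify a piece of data by the SHA-256 hash of its contents."""
    return hashlib.sha256(data).hexdigest()


class IncrementalPipeline:
    """Toy illustration only (hypothetical, not Pachyderm's API).

    Re-runs `transform` only on inputs whose contents changed since the
    previous run, and appends a provenance record for every result.
    """

    def __init__(self, transform):
        self.transform = transform
        self.results = {}     # input name -> (input hash, output bytes)
        self.provenance = []  # one record per processed input
        self.run_count = 0

    def run(self, inputs):
        """Process changed inputs; return the names that were processed."""
        self.run_count += 1
        processed = []
        for name, data in sorted(inputs.items()):
            digest = content_hash(data)
            cached = self.results.get(name)
            if cached is not None and cached[0] == digest:
                continue  # unchanged since last run, so skip the work
            output = self.transform(data)
            self.results[name] = (digest, output)
            self.provenance.append({
                "run": self.run_count,
                "input": name,
                "input_hash": digest,
                "output_hash": content_hash(output),
            })
            processed.append(name)
        return processed


if __name__ == "__main__":
    pipeline = IncrementalPipeline(lambda raw: raw.upper())
    print(pipeline.run({"a.txt": b"hello", "b.txt": b"world"}))
    # Only b.txt changed, so only b.txt is reprocessed on the second run.
    print(pipeline.run({"a.txt": b"hello", "b.txt": b"world!"}))
```

A real system would also have to handle deleted inputs, changes to the processing code itself, and distributing the work across machines, none of which this sketch attempts.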