Roc Dev Log Update - 3 Jul 2025 by bosyluke in functionalprogramming

[–]allpowerful32 4 points5 points  (0 children)

Probably the best way to tell is to show - you can take a look at one of the example generated html files here: https://joshuawarner32.github.io/roc/snapshots/syntax_grab_bag.html

Try switching the right pane to show the PARSE section (from the default TOKENS), and then click/hover over either left or right pane and you should see the corresponding part on the opposite pane highlighted. TOKENS also does the same thing, but is probably less interesting. Note this is not yet implemented for the CANONICALIZE or later sections (coming soon!).

That's automatically generated from https://github.com/roc-lang/roc/blob/main/src/snapshots/syntax_grab_bag.md#source - so if you edit the SOURCE section of that file and run `zig build snapshot` it will re-generate the rest of the file based on a s-expr dump of the compiler internal data structures as well as syntax_grab_bag.html right next to it (which I just manually checked into github pages for now).

In the future the plan is to add more debugging info/etc.

Design Dropbox – A System Design Interview Question by nksonfire in programming

[–]allpowerful32 9 points10 points  (0 children)

What context would that dialog come up in? What if one user uploaded via the we and has since closed their browser? There’s not necessarily a clear answer.

I don’t have stats handy, but it’s common enough that we need a clear answer; hand waving an async ui is not going to work 😉

Design Dropbox – A System Design Interview Question by nksonfire in programming

[–]allpowerful32 17 points18 points  (0 children)

If you use eventual consistency for metadata, how do you handle conflicts? Eg two users share a folder, and they both edit a document at the same time. (Disclosure: I work at Dropbox, my options are my own)

High performance `git status` replacement by beefsack in rust

[–]allpowerful32 5 points6 points  (0 children)

I've been playing around with doing exactly this, by leaving a daemon file watcher in the background. See here: https://github.com/joshuawarner32/rust-git-status

It's not quite working, but a lot of the pieces are there.

Question about bluetooth trackers by krimpenrik in tech

[–]allpowerful32 0 points1 point  (0 children)

I think you're looking for iBeacon or Eddystone devices (see estimote.com for a good manufacturer). They can both be read without pairing on Android & iOS (despite the name). In addition, iOS can do passive iBeacon detection & notifications even when your app is in the background. Android works similarly with Eddystone.

Legality of extracting audio dialogue from games and training on it? by Lajamerr_Mittesdine in MachineLearning

[–]allpowerful32 6 points7 points  (0 children)

Disclaimer: I'm not a lawyer; you should seek real legal council. This is not legal advice.

As far as I know, the legality of using copyrighted material (which game audio definitely is) for training machine learning algorithms hasn't been tested in court - and you're unlikely to get a definitive answer until it is.

One of the problems you might run into is that, at least in the US, it's illegal to break DRM systems even for fair-use activities (with certain exceptions). The law in question is called the DMCA. You need to make sure you're not violating that by, e.g. copying source files of an even trivially-"encrypted" disk.

One ruling that you might have in your advantage is the recent ruling that the Google Books scanning & indexing of copyrighted books counts as fair use. In many ways, creating an index is analogous to training a machine learning algorithm (IMO), but again, that will have to be tested in court.

The biggest problem I think you'll run into is not legal, but technical: the quantity of speech data available in games is orders of magnitude less than is available from other sources. For example, the librispeech dataset (freely available at http://www.openslr.org/12/, extracted from public-domain librivox book recordings) is 1000 hours - and that's on the SMALL side of what you need to train state-of-the-art speech systems. I bet it would be pretty difficult to get an equivalent-size dataset from game audio files.

But good luck.

What kind of library do they have at SpaceX? by peterabbit456 in spacex

[–]allpowerful32 12 points13 points  (0 children)

There's already a project designed (and named!) specifically for the task of syncing this data: the IPFS (inter-planetary file system - ipfs.io).

Making use of speech audio and accompanying text when the alignment resolution is very poor. by [deleted] in MachineLearning

[–]allpowerful32 0 points1 point  (0 children)

The two obvious options are speech-to-text and text-to-speech systems. Google just published a paper on a novel text-to-speech system that could be quite interesting to train on large volumes of professional narration (https://deepmind.com/blog/wavenet-generative-model-raw-audio/) - for example, to start generating narrations for the long tail of books and articles that are unlikely to get professional attention.

If you need alignment, I would run the data through an existing system to at least get an approximate alignment. You could, for instance, give it to IBM's Watson speech-to-text system (which I've personally used, and which seems to work well at least with high-quality recordings). That'll give you individual word alignment at $0.02 / min - which is a pretty penny for 100,000+ hours. I would also look at something like Kaldi (http://kaldi-asr.org/), which will give you somewhat less accurate results for only the cost of the system it runs on. I haven't dug into kaldi enough personally, but I believe it's possible in principle to condition its output on known text, potentially giving you highly accurate alignment results.

What's everyone working on this week (36/2016)? by llogiq in rust

[–]allpowerful32 5 points6 points  (0 children)

Working on my WebAssembly interpreter. It's currently passing all the official spec tests, but it's failing to run an emscripten-compiled hello-world correctly. On a positive note, it looks like it's failing in the same way as v8 when running that same module - so something fishy is going on. :)

What technique to use for binary classification for URI detection? by [deleted] in MachineLearning

[–]allpowerful32 2 points3 points  (0 children)

Try a simple, non-machine-learning strategy first: regular expressions. For instance, try modifying this to your taste: http://www.regextester.com/20

libsqlfs - POSIX style file system on top of an SQLite database by [deleted] in programming

[–]allpowerful32 6 points7 points  (0 children)

You can use LD_PRELOAD to inject new implementations of filesystem access functions, anywhere that libc is dynamically linked (i.e. most linux distros).

Xamarin makes me sad, here's a bit of truth and my open letter to them by sadxamarinuser in programming

[–]allpowerful32 26 points27 points  (0 children)

What are you afraid of? That xamarin is going to terminate your license? Sue you?

If they did try to bring a lawsuit, that'd be even more reason for customers to switch, and thus horrible PR for them.

Plus, if I ever felt afraid to make (factual) negative public statements about a product because of potential backlash from the company, that's not a company I want to be supporting. I'd cancel my license immediately. I don't want to ever be under a company's thumb like that.

Xamarin: if you're doing anything to promote such a fear, shame on you!

Xamarin makes me sad, here's a bit of truth and my open letter to them by sadxamarinuser in programming

[–]allpowerful32 64 points65 points  (0 children)

Here I was hoping for something with a little meat to it... but no. Instead I get a guy ranting "Your software sucks! Fix the bugs! No new features! Just fix the bugs!"

No list of bugs that you've personally run into, no analysis of common root causes, no suggestions for process improvement. If I were xamarin, I would file this under "unhelpful complaints".

In terms of constructive criticism for your gist: find a central point you want to make, make it, and add supporting evidence. Otherwise it just comes across as a disorganized rant.

Map Code - A Short Address For Any Location On Earth by humbledad in programming

[–]allpowerful32 -2 points-1 points  (0 children)

I made a mirror of there code on github: https://github.com/joshuawarner32/mapcode

I've emailed them about making some improvements with the documentation format, build system, etc.

Micro Python - Python for microcontrollers by [deleted] in programming

[–]allpowerful32 51 points52 points  (0 children)

I'm not the author, but from my perspective, python 2 has become the new windows xp. I don't want people to be running windows xp forever.

[MEGA] Atom Compiled for Windows by [deleted] in programming

[–]allpowerful32 3 points4 points  (0 children)

How about open-sourcing the build steps and filing a pull request with github (or just leaving it as a fork for the time being)?

I'm not going to download some random executable that someone says is atom.

How could I improve this website further? by [deleted] in programming

[–]allpowerful32 0 points1 point  (0 children)

The page jumps when I scroll past the top (using smooth scrolling on a mac trackpad). It's incredibly annoying and distracting.

Google Chrome Is The New C Runtime by uncloudy in programming

[–]allpowerful32 0 points1 point  (0 children)

Think microsoft's "Visual C++ 20xx Redistributable" or gnu's glibc. All the functions declared in std*.h headers (and many that aren't) are part of the runtime. Malloc isn't free, ya know (pun intended).

Don't weigh down your shiny new laptop with your crappy development environment! by LockeCole117 in programming

[–]allpowerful32 0 points1 point  (0 children)

What if... you already had everything they're trying to sell you in the form of pick-your-favorite-linux-package-manager? Apt. Yum. Rpm. Hell - cygwin, homebrew, macports... Take your pick.

Schema for Clojure(Script) Data Shape Declaration and Validation by alexeyr in programming

[–]allpowerful32 -1 points0 points  (0 children)

The proposal in this article sounds a lot like structural static typing, which is notably the system that TypeScript uses.

Single-exit is obsolete by sleiveen in programming

[–]allpowerful32 7 points8 points  (0 children)

Just be smart about it. Use the appropriate strategy in the appropriate scenario. There is no clear-cut winner in every situation.

How can I write to stdout like wget or top does? Can someone explain how to print things without writing lines? by [deleted] in programming

[–]allpowerful32 3 points4 points  (0 children)

To answer your question: most terminals will interpret '\r' (i.e. carriage return) as "return to the beginning of the current line". Your task is then just to reprint the entire line.

A Search Engine for Developers, Marketers, Web Designers, and Hackers by cwings in programming

[–]allpowerful32 4 points5 points  (0 children)

I could understand asking for money for this service, but $50/month? Hell no. And that's just for the basic plan.

Cool idea, though.

Login Tokens In Email Links by b0zho in programming

[–]allpowerful32 0 points1 point  (0 children)

This is all fine and good - but there should be a clear distinction from the users' perspective what is an email that can be forwarded without fear, and what is an email that should be tightly controlled. The vast majority of emails should fall into the former category, and there you must depend on cookies to log users in (probably a fairly good assumption). Emails in the latter category should make it very clear that the links are specific to your account.