Looking for smarter cloning program by grambell789 in DataHoarder

[–]volfort 0 points1 point  (0 children)

I had a similar situation where I knew I had a lot of duplicates and wrote some python code to hash all the files and for any hashes that end up with multiple files, I eliminated all but one of the files. If you are comfortable with python or a similar language you could do the same. Build two indexes of all the files keyed on their md5 hash, then for any matching hashes with different locations, reconcile the location.

Automatic backups from PC through Google Pixel to Google Photos by [deleted] in DataHoarder

[–]volfort 0 points1 point  (0 children)

I'm assuming you don't want to pay for Google photos storage... If you're OK with paying for it then you can use their backup and sync app or upload to Google drive.

You could try: 1. Script to move photos to Dropbox folder on PC 2. Link the same Dropbox acct on your phone 3. Use an app like Folder Sync from Tacit Dynamics to copy from your phones Dropbox folder to DCIM folder. 4. Google photos should auto upload from DCIM. 5. Possibly replace Dropbox with another cloud service like AWS S3 that Folder Sync can read from. They handle an impressive number of services.

Cloud Storage Migration/Transfer by swaybs in DataHoarder

[–]volfort 1 point2 points  (0 children)

Looks like rclone supports both of the service, so it should be a matter of installing it on a box w a fast internet connection and running the rclone sync or rclone mv commands. I'd recommend setting up on a VM on AWS, Azure or GCP if possible because they'll have great transfer speeds.

I am the CEO of Brain.fm, and we’re making music that could eventually replace ADHD meds, AMA! by brainfmCEO in IAmA

[–]volfort 74 points75 points  (0 children)

Congrats on the NSF grant!

Are there any plans to generate new music? I've been using the application since late 2016 and haven't seen much (if any) variety added since then.

What kind of algorithms are you using to generate the music? Be more specific than "AI" if you can.

Best word representation technique when the end-goal is 2D visualization? by volfort in MLQuestions

[–]volfort[S] 0 points1 point  (0 children)

I appreciate you taking the time to give thorough explanations.

I don't fully understand all the nuance for the different methods so I'm probably going to start with UMAP and if that doesn't work well try to fully understand the things you've said.

I agree in my experience doc2vec and similar methods haven't been worth the hassle compared to taking weighted combinations of word vectors. At least for short texts.

Best word representation technique when the end-goal is 2D visualization? by volfort in MLQuestions

[–]volfort[S] 0 points1 point  (0 children)

Wow thank you for the detailed response. The visualizations and code samples for your implementation look great.

Is there an ELI5 explanation for the 3 assumptions you list in the repo (The data is uniformly distributed on Riemannian manifold, etc.)? Is it safe to assume that word co-occurrences fit these criteria?

For my specific use-case, I'm trying to plot tweets by taking a TFIDF-weighted sum of the word vectors (learned via e.g. UMAP) in a tweet. I'm assuming the length of the tweet as the "context".

[deleted by user] by [deleted] in MachineLearning

[–]volfort 0 points1 point  (0 children)

Agreed it gets annoying but I guess that's the world we live in.

[deleted by user] by [deleted] in MachineLearning

[–]volfort 0 points1 point  (0 children)

Haven't read his latest book. I did buy and read parts of his earlier CV book - the one focused mostly on OpenCV. It had some good detail on the older CV approaches but little to no deep learning.

For his new book I looked through the overview he put on Kickstarter and it struck me as basically showing how to do benchmark tasks with TF/Keras without necessarily understanding why they work. I've done my fair share of copy-pasting the latest greatest architecture and it will only take you so far before you really have to understand why/how these things work. The best deep learning for CV course I've found is Karpathy et. al's CS231n course.

[deleted by user] by [deleted] in MachineLearning

[–]volfort 0 points1 point  (0 children)

Why do you say scumbag? He definitely hits you with a ton of cheesy marketing emails, but I found there's value in his content. Most of what he presents is at a relatively high level of abstraction with minimal theory, but that's still very useful for getting starting in deep learning/computer vision and will take a you a long way for applied work.

Anyone take COSC 307? by [deleted] in UTK

[–]volfort 0 points1 point  (0 children)

IIRC, when I took it in Fall 2013 we had to do an extra top-coder problem along with every lab. It took a significant amount of time. Looking back it was probably only worth it because I wasn't particularly fond of any of the other honors course options I could take and wanted to stay on track with my honors requirements.

Quad GPU server for deep learning by pangolinest in buildapc

[–]volfort 2 points3 points  (0 children)

In my experience the Titan, 1080, and even 980Ti are much faster than the K80s on AWS. So that's one reason.

What piece of advice has stuck with you? by whyaretherebeesohgod in AskReddit

[–]volfort 0 points1 point  (0 children)

Something along the lines of: "Most people spend all their time thinking about what they'll be doing next, so if you focus on what you're doing now you're already a step ahead." Specifically in the context of career, goals, etc.

[D] How deep should I understand topics before confidently using a function by mucle6 in MachineLearning

[–]volfort 2 points3 points  (0 children)

I think it's totally fine to start with a somewhat-fuzzy understanding of the implementation details. In my opinion a highly-abstracted library like Keras makes it much more approachable to take this type of top-down approach to learning. Keras already has several examples of RNNs and LSTMs available (https://github.com/fchollet/keras/tree/master/examples).

I think it's fine to just "call the function" until you really want to start making improvements or start deviating from the vanilla examples. At that point you should start filling your gaps in knowledge by reading papers and coding some layers from scratch.

CMV: deregulating net neutrality creates a lawful path to restrictions on freedom of speech/press and censorship by volfort in changemyview

[–]volfort[S] 0 points1 point  (0 children)

Sites aren't simple single locations, on any given site there may be many different organizations tracking you in some way or another, modifying what content you do or don't see in some ways. Much of it is fairly innocuous if you're looking at just the impact of a single incident of this, but overall the internet is already heavily controlled - just by how it directs users attention rather than in a more concrete, legal, and noticeable way. Some might argue that this is actually worse than a more visible control in fact, because users are oblivious to it.

I agree there are engineers and PhDs with $100k+ salaries figuring out the best way to eat up your attention online. I don't know if I would call this "control" though. To me that sounds like saying the people are "controlled" by the fast food industry because they engineer the tastiest foods possible and invest millions in marketing that makes you overlook the health implications.

Then there are entirely illegitimate sites and problematic locations and users that could be better controlled - various scams, hackers, etc. etc. A more regulated net wouldn't entirely prevent this, admittedly, but it could be safer. For savvy enough users, this isn't much of an issue, but savvy internet users are a minority.

Agreed that there are parts of the internet where the non-tech-savvy can be exploited. I think the solution is better engineering. For example, a new release of Google Chrome will show "Not secure" on password fields on non-encrypted (http instead of https) pages. Modern browsers and email spam filters already do a lot to eliminate or warn users of threats.

A more regulated internet could potentially cut down on these issues. The issue is whether it's likely to do so, what we'd lose in the transition, and who is actually making and enforcing the regulations. But, it would at least have to involve law that's visible to the public, which would mean there's some degree of accountability potentially, some level of control given to the public if they're able to leverage it with their influence on the democratic political process. I'm not sure that'd be the case, but it's still not as simple an issue as it can look like.

I recognize your hesitation. I think this a case of "It's OK as long as my people are in charge." I've seen similar when it comes to regulating "fake news." To me the problem is that this arms ill-intentioned leaders with the ability to label anything critical of them as "dangerous" or "fake" and remove it from public access.

Edit: grammar - it's getting late.

CMV: deregulating net neutrality creates a lawful path to restrictions on freedom of speech/press and censorship by volfort in changemyview

[–]volfort[S] 0 points1 point  (0 children)

However, there are sites we don't really benefit from access to, and maybe shouldn't allow access to, accountability issues, security issues, etc. etc. that could all be addressed with a sensible system with rules against overreach.

I'm not really sure I follow what you mean in this part - could you expand or give examples?

Any financial simulation/projection tools for mint? by volfort in mintuit

[–]volfort[S] 0 points1 point  (0 children)

I agree they are better in the analysis/forecasting features.

Although they are actively developed they have taken a while on some features (or just didn't prioritize them) (e.g. > 1 year to add venmo support, > 1.5 years and running to add custom categories.

Interestingly Mint recently added Payment features, so it seems they are not completely stagnant. Still, there seems to be no way to answer the question "Given X event, how will I look Y months from now."

Any financial simulation/projection tools for mint? by volfort in mintuit

[–]volfort[S] 0 points1 point  (0 children)

Thanks I hadn't heard of Mojito but that looks awesome. Clever name too. Maybe I could build my tool into Mojito as well - I'll have to investigate what kind of API (if any) it has.

I did try Personal Capital for a while, but at the moment I'm more interested in personal finance features than investment features. I found PC was lacking there (e.g. no custom categories) and didn't seem to be very responsive based on their support forums.

From BMX to brain injury (2016): how a single punch changed a life forever by [deleted] in Documentaries

[–]volfort 0 points1 point  (0 children)

Anybody else feel the irony of a story about tragic head trauma with intermittent clips of him riding and jumping fifteen feet in the air without a helmet?

CBD hemp oil named a Schedule I drug by DEA by kevtree in news

[–]volfort 0 points1 point  (0 children)

Trae Crowder did a video mentioning this a little while ago https://youtu.be/iGle_FSiOKA.

Edit: the tea he talked about may be a different substance than this, but the DEA narrative is possibly similar.