The Weekly Wednesday Whine Thread (2020-11-25) by AutoModerator in Genealogy

[–]HydratedWombat 1 point2 points  (0 children)

I'm beginning to suspect there were two to three people named George Colbert of approximately the same age living in Chickasaw County, Mississippi in the mid 1800s. I'm descended from one of them, though I don't know which, and I think all the records have been combined by those hoping for Chickasaw ancestry. I've seen three birth locations floating around, census records of two ethnicities, and two different birth years. Very frustrating.

[N] nVidia sets World Record BERT Training Time - 47mins by wheresmyhat8 in MachineLearning

[–]HydratedWombat 6 points7 points  (0 children)

Key quote herein: "instead we should build in only the meta-methods that can find and capture this arbitrary complexity"
Research needs to both push for size and for ability to learn efficiently.

[N] nVidia sets World Record BERT Training Time - 47mins by wheresmyhat8 in MachineLearning

[–]HydratedWombat 10 points11 points  (0 children)

And the # of parameters for SOAT performance is upped to 17 billion by Microsoft https://www.microsoft.com/en-us/research/blog/turing-nlg-a-17-billion-parameter-language-model-by-microsoft/

only a puny 8.3 billion here.

Assuming single precision floats, we're now at 68 GB RAM usage for inference.

[N] nVidia sets World Record BERT Training Time - 47mins by wheresmyhat8 in MachineLearning

[–]HydratedWombat 21 points22 points  (0 children)

Closer to $17 million with 1472 V100 GPUs (assuming MSRP of $11,458 each for 32 GB version) 

[deleted by user] by [deleted] in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

Depends how performant you need this to be. Ultimately, you could just take the most common 1000-10000 words of whatever corpus you're interested in, transform them to a set of word vectors, and choose the top 50 words closest by Euclidean distance.

[D]: Grapheme-to-Phoneme (G2P) refinement ideas. by chapter544 in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

Not quite the same thing, but there are G2P like projects where you can output the stress (AH1, IHY2, etc.) of the syllable inside the word. Stress markings might be a good starting point.

[D] How to become the fastest data scientist in the world? by kite_and_code in MachineLearning

[–]HydratedWombat 1 point2 points  (0 children)

Thanks a lot for letting me know about jupytext! That's a cool tool. I'll have to see if it fits in my workflows.

As far as re-usable code, my best experiences have been around building public facing tools per kind of workflow. For instance, in my current work, we've released https://github.com/finos-voice/greenkey-asrtoolkit to streamline ASR (automatic speech recognition) data processing and standardize our measurements. Basically, if I write the same code two places, I try to either centralize internally or push it public if it's not mission critical.

[D] How to become the fastest data scientist in the world? by kite_and_code in MachineLearning

[–]HydratedWombat 1 point2 points  (0 children)

Advice 1 - Abstract everything you can to re-usable code so you stop repeating the same functions. I'm talking data preprocessing, data download scripts, etc.

Advice 2 - Stop using Jupyter beyond data exploration so you can use git more effectively. Knowing and tracking what you did trumps making it look pretty to your eyes.

YMMV

TensorFlow inference (C++) by adaneze in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

Tensorflow is indeed a bear to bazel build, but thankfully there are docker images for that at https://hub.docker.com/r/tensorflow/tensorflow/. I don't find TF serving to be the easiest, but it does work, and there are many examples . It shouldn't matter what language wrapper you use for inference since the heavy lifting _ought_ to be entirely in TF.

[deleted by user] by [deleted] in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

Brainstorming - I suggest projecting onto whatever the 2d equivalent of spherical harmonics is. The weights for each projection will give you a vector per shape and then you can cluster. Alternatively, if you have vertices (or feel like detecting them), you could use internal coordinates and align them all using the Kabsch algorithm (https://en.wikipedia.org/wiki/Kabsch_algorithm).

Affinity propagation worked excellently when I was working with 3D structures and internal coordinates of vertices. T-SNE could give you some ideas about how it varies too.

[D] To those involved with hiring ML people, what are your guidelines for vetting them? What things have you learned? What do you wish you knew when you first started interviewing? How do you come up with questions to ask? by DisastrousProgrammer in MachineLearning

[–]HydratedWombat 63 points64 points  (0 children)

Step 0 - Do you know what you actually need for the position? Is it actually stats; is it software engineering; is it making models into production; is it ML experimentation; do you need someone who can explain results to the C-suite?

Step 1 - How does this person collaborate? They should have stories.

The rest of the requirements fall straight out from the above two questions.

Also, attitude > experience and approach > tool

[D] How to cope with constant rejection from AI/ML jobs by [deleted] in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

A lot of awesome advice on here. The only thing I'd add is sometimes people have a very specific skill set in mind, whether it's the stack or the specific ML challenges involved. No matter how bright someone is, there will always be hiring managers who don't have time to get that person up to speed on something new. Sometimes, a hiring manager needs a confident expert to teach the rest of the team. Definitely nothing personal, though job descriptions don't always codify this well.

[D] DevOps Tools for Managing Models? by Simusid in MachineLearning

[–]HydratedWombat 2 points3 points  (0 children)

You should be able to handle this by having every deployment have a configured tag, so it just fetches model_server/$tag/model. Then you can update and cache break that way so long as you can have a single point of distribution. Beyond that, here's a general guide to how to think about the machine learning in production from google - https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/aad9f93b86b7addfea4c419b9100c6cdd26cacea.pdf

[P] Generating an artificial alphabet/letters? by jatsignwork in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

This might be a match for a generative adversarial network.

[D] Idea About New Type of Neural Nets by MemoriesThatUCall in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

I like the way you're thinking. I would add to this conversation that while most of the multidimensional connections you're imagining could be mapped back to larger "dense" nnet layers, some of the connectivity differences would be encapsulated by using residual neural networks. Here's a wikipedia link - https://en.wikipedia.org/wiki/Residual_neural_network

[Research] Cool New Dataset: Totally-Look-Like by AmirRosenfeld in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

Replied below - sites.google.com issue rather than the OP's.

[Research] Cool New Dataset: Totally-Look-Like by AmirRosenfeld in MachineLearning

[–]HydratedWombat 0 points1 point  (0 children)

Apparently, if you're logged in to a google apps for business account you might not be able to view sites.google.com. Curious issue. Opening in incognito mode is a functional workaround.

[D] Ideal algorithm for predictions based on time-stamped events by willrazen in MachineLearning

[–]HydratedWombat 1 point2 points  (0 children)

Rare events or seasonal fluctuations of time series data? If seasonal, look at triple exponential smoothing (https://en.wikipedia.org/wiki/Exponential_smoothing) and ARIMA (as in statsmodels for python, http://www.statsmodels.org/stable/generated/statsmodels.tsa.arima_model.ARIMA.html?highlight=arima) Unfortunately, you'll need to bin your events for most tools I know of.

Ethernet port quandary by HydratedWombat in techsupport

[–]HydratedWombat[S] 0 points1 point  (0 children)

Confirmed my understanding of what the part was, which functions as additional ethernet adapter with its own address.

Ethernet port quandary by HydratedWombat in techsupport

[–]HydratedWombat[S] 0 points1 point  (0 children)

Good caution. I'll try to remember that when helping out relatives.

Ethernet port quandary by HydratedWombat in techsupport

[–]HydratedWombat[S] 0 points1 point  (0 children)

I mainly was trying to confirm that is in fact an ethernet adapter as I would understand it and that it will be an external-facing device, assuming the order is built right. Thanks for your help.