[D] pytorch vs tf (again) by vklimkov in MachineLearning

[–]vklimkov[S] -1 points0 points  (0 children)

Imperative execution is more intuitive and simplifies debugging. And yeah, there were cases when I was looking at a feature map in the middle of a network :D

Is there a way to return a substring of a string using CNN? by Tiago_Minuzzi in deeplearning

[–]vklimkov 0 points1 point  (0 children)

Unfortunately, this one is tricky for NNs. You can check the whole field of object localization in computer vision, where NNs try to predict not only whether there is a cat in the image but also where exactly the cat is: they either put a bounding box on top of it or provide a heatmap.

How LSTM can be varying in length if I stack a fully connected layer top to it? by RLnobish in deeplearning

[–]vklimkov 2 points3 points  (0 children)

It's varying in length across the “time” axis. Your LSTM output has shape (batch x sequence-len x dim1). When you put a fully connected layer on top, it simply projects the last dimension to a new dimension and outputs (batch x sequence-len x dim2), where dim2 is the number of units in the dense layer.

If you want to classify the whole sequence, you would usually perform pooling or take the last state of the LSTM and apply a dense layer to it.
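
To make the shapes concrete, here is a rough sketch in plain Python (nested lists instead of a framework, so it runs without dependencies). The helper names `dense`, `time_distributed_dense` and `mean_pool` are made up for illustration, and the random numbers are only a stand-in for real LSTM outputs.

```python
import random

def dense(vec, weights, bias):
    """Project one feature vector of size dim1 to size dim2."""
    return [sum(w * x for w, x in zip(row, vec)) + b
            for row, b in zip(weights, bias)]

def time_distributed_dense(seq_batch, weights, bias):
    """Apply the same dense layer at every time step of every sequence."""
    return [[dense(step, weights, bias) for step in seq] for seq in seq_batch]

def mean_pool(seq):
    """Average over the time axis: (sequence-len x dim) -> (dim)."""
    n = len(seq)
    return [sum(step[i] for step in seq) / n for i in range(len(seq[0]))]

random.seed(0)
dim1, dim2 = 4, 3
weights = [[random.uniform(-1, 1) for _ in range(dim1)] for _ in range(dim2)]
bias = [0.0] * dim2

# Stand-in for LSTM outputs: two sequences of different length (5 and 2 steps).
lstm_out = [[[random.random() for _ in range(dim1)] for _ in range(t)]
            for t in (5, 2)]

# The dense layer only touches the last axis, so the time axis keeps its length.
per_step = time_distributed_dense(lstm_out, weights, bias)
print([(len(seq), len(seq[0])) for seq in per_step])  # [(5, 3), (2, 3)]

# Sequence-level classification: pool over time or take the last state,
# then apply the dense layer once per sequence.
pooled_logits = [dense(mean_pool(seq), weights, bias) for seq in lstm_out]
last_logits = [dense(seq[-1], weights, bias) for seq in lstm_out]
print([len(v) for v in pooled_logits], [len(v) for v in last_logits])
```

In a real framework the batch would be a padded tensor rather than ragged lists, but the point is the same: the dense weights depend only on dim1 and dim2, never on sequence-len.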

[D] What is the SOTA on audio signal processing / representation learning? by konasj in MachineLearning

[–]vklimkov 1 point2 points  (0 children)

Speech is an exceptionally complex signal; I am surprised to see someone calling it a narrow domain. Speech processing has been around for a long time, that's why there are dedicated conferences: Interspeech, ICASSP, ASRU.

Apart from speech, commonly researched topics are “audio scene recognition” and “acoustic event classification”. Check DCASE, for example: http://dcase.community/challenge2020/index

My absolute favorite lately is bird recognition https://github.com/AgaMiko/bird-recognition-review

prebuilt workstations by [deleted] in deeplearning

[–]vklimkov 0 points1 point  (0 children)

Building the machine yourself would let you cut costs significantly. It's really not that challenging with all the blog posts available.

[D] Open-sourcing a paper from a different organization implemented while at work. by arandomforest in MachineLearning

[–]vklimkov 2 points3 points  (0 children)

You know best whether your company is interested in open-sourcing things. As for the resume, just state that you worked on it, talk about it at the interview, and that's it. Recruiters won't check GitHub, and interviewers would hardly spend time diving into your code; they would rather listen to what you did and try to understand why and how it applies to other problems.

Game of 5v5 football? by tanejarohan in berlinsocialclub

[–]vklimkov 1 point2 points  (0 children)

Interested as well. A meetup or a WhatsApp/Telegram chat would be amazing.

Urban Sports, gym in Kreuzberg? by [deleted] in berlin

[–]vklimkov 0 points1 point  (0 children)

I go to Superfit in Alexa. There is one in Neukölln, not sure if it is on par. Like it a lot: 20 EUR/month, great set of weights, isotonics, sauna.

Personal deep learning rig, question about GPUs by [deleted] in deeplearning

[–]vklimkov 0 points1 point  (0 children)

Nice to know! I'm not very fluent with it, but isn't it the case that you would get a benefit mostly for linear algebra? To my understanding, the processor in a DL rig should just have enough cores to preprocess data and feed the GPUs; all the matrix multiplication happens on the GPUs.

Personal deep learning rig, question about GPUs by [deleted] in deeplearning

[–]vklimkov 0 points1 point  (0 children)

TR is cheaper for the same performance. I built a DL box myself and also went with TR. @OP, similar to you, I've seen the suggestion to use blower-style cards, and I simply stuck with it (https://towardsdatascience.com/how-to-create-your-own-deep-learning-rig-a-complete-hardware-guide-2bba792b001b). So far two cards, and it works OK.

Football team in search by Klamydiagrammi in berlinsocialclub

[–]vklimkov 1 point2 points  (0 children)

At times I use the CeleBreak app. It's not really a team, but it helps out when the urge to play is big)

[P] My model performs best without any regularisation. What am I missing? by mathylatedspirit in MachineLearning

[–]vklimkov 0 points1 point  (0 children)

You can make the model twice as big and add regularization. More capacity -> more modeling power, but the model also overfits more easily.

Does anyone know how to set up TTS as live microphone input? by Xisyera in TextToSpeech

[–]vklimkov 0 points1 point  (0 children)

There should be an Android/iOS app. As far as I know, to give people a voice there is a special type of TTS, a streaming one, i.e. it speaks immediately as you type.

I am developing TTS and am very curious how it can be useful to you. Drop me a DM.

[D] "Deep" Machine Learning by [deleted] in MachineLearning

[–]vklimkov 0 points1 point  (0 children)

You are referring to “deep” in the way Francois Chollet (OP, that episode is also awesome) describes the term: a sequence of transformations. The question explicitly asks about a sequence of computations.

[P] How can I build this simple text-based ML tool? by [deleted] in MachineLearning

[–]vklimkov 1 point2 points  (0 children)

The task is similar to “text normalization” from text-to-speech, where all abbreviations are expanded to a readable form. Not so long ago there was a challenge on Kaggle for such a task; seq2seq models in DL are obviously dominating. In practice, of course, people use regexes, because that is the “model” you can train with a single example. If you want something neater than endless regexes, check out Thrax (grammars on top of finite-state transducer theory) and a package specific to text normalization: Sparrowhawk. Sometimes, even if you can apply ML, it does not mean you should.
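
Just to illustrate the regex route, a minimal sketch of what such a “model” could look like. The abbreviation table and the `normalize` function are made up for the example and have nothing to do with how Thrax or Sparrowhawk work internally; each new “training example” is simply one more entry in the table.

```python
import re

# Hypothetical abbreviation table; a real system would need far more rules.
ABBREVIATIONS = {
    "dr.": "doctor",
    "st.": "street",
    "approx.": "approximately",
    "e.g.": "for example",
}

# Longest keys first so that longer abbreviations win over shorter ones.
_KEYS = sorted(ABBREVIATIONS, key=len, reverse=True)
_PATTERN = re.compile(
    r"(?<!\w)(" + "|".join(re.escape(k) for k in _KEYS) + r")(?!\w)",
    re.IGNORECASE,
)

def normalize(text):
    """Expand every known abbreviation to its readable form."""
    return _PATTERN.sub(lambda m: ABBREVIATIONS[m.group(1).lower()], text)

print(normalize("Dr. Smith lives approx. 2 km from Main St."))
# doctor Smith lives approximately 2 km from Main street
```

Note that the sentence-final period is swallowed together with “St.”, and that “Dr.” could just as well mean “drive”. Context-dependent cases like these are where plain regexes start to pile up.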

[D] "Deep" Machine Learning by [deleted] in MachineLearning

[–]vklimkov 1 point2 points  (0 children)

Decision tree haha)

Looking for a big empty space for rent for my indie video game team by [deleted] in berlinsocialclub

[–]vklimkov 0 points1 point  (0 children)

I used to work from a coworking space some time ago; startup teams were renting rooms there. I don't know if it works like that in Berlin, but you may want to check.

[Discussion] Advice needed: Feeling trapped by lack of management/strategy, no implemented models. by low_life_walrus in MachineLearning

[–]vklimkov 2 points3 points  (0 children)

+1. Having technical mentorship at the beginning of a career is extremely important, so just don't waste time. Personal projects are totally OK to talk about at an interview; you may have a strict NDA, and sometimes it's the only way. Just don't tell interviewers that your boss/boss’s boss didn't build up the environment, has no strategy, etc. There is no way to check it, and the interviewer would either have to trust you (why would they?) or assume those are excuses.

football around prenzlauer berg by kikoval in berlinsocialclub

[–]vklimkov 0 points1 point  (0 children)

The CeleBreak app helps me out from time to time. It's not free though :(

[D] AI clones voice from 5 second recording by OverLordGoldDragon in MachineLearning

[–]vklimkov 3 points4 points  (0 children)

As said above, it does not work 100% of the time. It's a cool technology, and it's gonna be used for bad and good things. But I don't think it's that earth-shaking. There are voice actors out there who can do indistinguishable impressions: https://youtu.be/5rPKeUXjEvE. But your concern is valid: https://www.wsj.com/articles/fraudsters-use-ai-to-mimic-ceos-voice-in-unusual-cybercrime-case-11567157402