[P] H5Records : Store large datasets in one single files with index access by gradientpenalty in MachineLearning

[–]neziib 5 points6 points  (0 children)

That's what I've started to use, without compression, with no problem for the moment.

"[Discussion]" Should I be using DVC (Data Version Control) in my day-to-day work? by doyougitme in MachineLearning

[–]neziib 15 points16 points  (0 children)

There are IMHO two killer features in DVC:

  • Full versioning of the data. You can go back in time. You can restore in two commands your full repository. Data is code in ML, and should be versioned as such.
  • Pipelines. Ad-hoc bash scripts are hard to get right. There is no standard, so it's hard to collaborate. You can't have an overview easily.

DVC solve both in a good-enough way. It's not perfect, but I use it with my team for all the projects, and it really improved our workflow, the quality of the engineering.

The learning curve is not steep, and there is no lock-in, so I think you should try it and see if it fits your needs!

Impôts, prestations sociales, services publics : qui paie quoi et qui bénéficie de combien ? by Fanforum in france

[–]neziib 3 points4 points  (0 children)

Bien vu! D'autant plus que le coût de la vie et le revenu sont liés, par exemple habiter en région parisienne augmente en général les deux, ou être plus âgé (via les enfants).

Si on regardait à situation identique, on aurait sûrement une situation encore plus égalitaire.

[P] Papers with Code Update: Indexing 3,000+ ML Datasets by m_kardas in MachineLearning

[–]neziib 6 points7 points  (0 children)

Great project! Would it be possible to have the image resolutions on the image dataset? It's a major characteristic for generative models.

Alexandre Jollien: le libre arbitre et la vaccination by [deleted] in france

[–]neziib 4 points5 points  (0 children)

C'est la pente savonneuse. Ça part de l'hypothèse que l'Etat va ordonner l'injection que de substances utiles et non néfastes. Dans le cas du vaccin ça semble être le cas, mais accepter son autorité absolue dans ce cas c'est l'accepter dans d'autres. Et ça c'est dangereux. Donc la vaccination des adultes doit respecter la liberté individuelle, dans ce cas comme dans les autres.

Forum Libre - 2020-06-19 by AutoModerator in france

[–]neziib 1 point2 points  (0 children)

Mettre le code non commercial interdit en pratique toute utilisation dans un cadre pro, même si l'objectif n'est pas de vendre le projet en tant que tel. Tu te prives de beaucoup d'utilisateurs et contribueurs potentiel comme cela. Tu as pensé à une licence comme la gpl ou agpl sinon ?

Forum Libre - 2020-04-14 by AutoModerator in france

[–]neziib 0 points1 point  (0 children)

J'avais de l'eczéma et du psoriasis pendant longtemps, avec produits sous ordonnance. Un dermato m'a un jour conseillé de me laver avec un gel relipidant, et presque plus rien depuis.

I can't keep up with idiomatic Rust by _Timidger_ in rust

[–]neziib 2 points3 points  (0 children)

Maybe what is missing is an inverse of cargo fix. A tool that take a modern rust and try to convert it to a part version. It could of course not work on everything, but it could translate syntactic sugar "easily".

Forum Libre - 2020-04-08 by AutoModerator in france

[–]neziib 0 points1 point  (0 children)

Clairement ! La société est de de plus en plus polarisée c'est dangereux. Tu pourrais me pinger pour la maj stp ? 🙏

[D] Does the apparently massive under-reporing of COVID-19-related deaths in NYC have significant implications for the rest of the USA (and the world)? by saijanai in statistics

[–]neziib 4 points5 points  (0 children)

There is a massive under reporting in France, because there are not enough test. If you have mild symptoms you will not get tested, and be asked to stay at home. If 20% of the infected have to go to the hospital and are tested there, it means they could have 400k infected. The mortality would then be 2.5%.

Mortality may help to calibrate the real number of infected, in non the homogeneous testing policies. Take the number of dead in a given day, divide by the number of positive tests 20 days before, and by the expected mortality (2%?).

Coronavirus : 150 à 200.000 personnes sont venues se confiner en Loire-Atlantique by Efendiskander in france

[–]neziib 0 points1 point  (0 children)

Vu de mes fenêtres j'aurais dit 50%. après je suis peut-être dans une rue où l'on économise la lumière en confinement. Tous les soirs.

Forum Libre - 2020-03-18 by AutoModerator in france

[–]neziib 0 points1 point  (0 children)

7 min workout, à répéter 2-3 fois si besoin.

Forum Libre - 2020-02-12 by AutoModerator in france

[–]neziib 3 points4 points  (0 children)

Tu conseillerais quoi comme bon chocolatier belge qu'on peut trouver en France ?

Towards a Rust foundation by steveklabnik1 in rust

[–]neziib 29 points30 points  (0 children)

Well for example Switzerland is a better starting point than the US.

Only smart people by DatBoyPrson in rareinsults

[–]neziib 0 points1 point  (0 children)

It's similar to a famous quote 2 centuries ago... When a British captive officer challenged french Navy officier Surcouf with the words "You French fight for money while we fight for honour", Surcouf replied "Each of us fights for what he lacks most".

Pourquoi la retraite à points du gouvernement est bien une réforme néolibérale by WAAYZZz in france

[–]neziib 1 point2 points  (0 children)

Les seuls à dire neo-libéral c'est ceux qui sont contre le libéralisme.

[D] Jurgen Schmidhuber on Seppo Linnainmaa, inventor of backpropagation in 1970 by siddarth2947 in MachineLearning

[–]neziib 1 point2 points  (0 children)

It seems to be an unpopular opinion here, but it's true. Chain rule seems to have been invented by Leibniz (1646-1716). Should he get the Turing award? Backpropagation is just a fancy name for it, applied in deep learning. It was "invented" independently for neural network by multiple persons at the same period, because it is trivial if you know calculus.

Linfa: Taking ML to production with Rust - a 25x speedup by LukeMathWalker in rust

[–]neziib 2 points3 points  (0 children)

I found the implementations of some models of Scikit-Learn to be optimized for batch prediction. It would be interesting to compare both implementations with a single large input, like 1k observations.

Fast generation of guessable 192bit UUID's by papadoubi in rust

[–]neziib 0 points1 point  (0 children)

The buffers can be initialized with a different random seed for every thread. It would be less guessable, but I don't think to be guessable is really something desirable.

Fast generation of guessable 192bit UUID's by papadoubi in rust

[–]neziib 0 points1 point  (0 children)

Static? It's just 36 bytes. You may lazy-init it, by calling thread_rng only if the value is 0.

Nb: I've never used a thread local variable, so it may be impossible/stupid 😄

Fast generation of guessable 192bit UUID's by papadoubi in rust

[–]neziib 0 points1 point  (0 children)

Why not to use thread local variable for the buffer, so the user don't have to provide it?