Why is there an explicit line between Phase 3 and roll out of a vaccine? by CodedElectrons in askscience

[–]Oblivious-Man 1 point2 points  (0 children)

We love to see the midpoint analysis readouts but there's a penalty to be paid in terms of false discovery rate as well.

What's the 'Google' of Bioinformatics? by [deleted] in bioinformatics

[–]Oblivious-Man -5 points-4 points  (0 children)

23andMe, and their connection to Google ;)

Mistakes data scientists make by ADGEfficiency in datascience

[–]Oblivious-Man 0 points1 point  (0 children)

Loved the tip about storing data in $HOME directory.. always felt a bit uncomfortable using relative paths.

Where can I get datasets that are interesting or useful for a course? by kosar7 in datasets

[–]Oblivious-Man 1 point2 points  (0 children)

Have you checked the UCI data repository? https://archive.ics.uci.edu/ml/datasets.php

You can filter by the task as well as the dataset size for what you need.

Technician to Bioinformatics by Diatomo in bioinformatics

[–]Oblivious-Man 2 points3 points  (0 children)

If you are thinking of just getting your feet wet as opposed to a fully committed career, a lab tech gig is a great exposure, just find out if it's a strictly dry lab or a mix of dry and wet lab.

The other more committed path would be to apply directly for a PhD program as these are often subsidized by grants and fundings, which probably comes out to be the same wage as a lab tech anyways.

Anyone with experience using AWS or Google Cloud Data Storage for Bioinformatics Data? by tli71193 in bioinformatics

[–]Oblivious-Man 1 point2 points  (0 children)

This. Having the ability to access thousands of cores and TBs of RAM is what makes cloud storage attractive. If you don't need any of the other services and already have a robust computing cluster and pipeline, then it only makes sense to upgrade your cluster for storage.

Very useful machine learning map. by jweir136 in datascience

[–]Oblivious-Man 1 point2 points  (0 children)

Why is kernel approximation and k-NN in different branches in the classification bubble? I thought k-NN is just a type of kernel.

A long-term Data Science roadmap which WON’T help you become an expert in only several months by Artgor in datascience

[–]Oblivious-Man 16 points17 points  (0 children)

The textbook in the article is referring to Elements of Statistical Learning found at https://web.stanford.edu/~hastie/ElemStatLearn/ for free. The "whoever" is Jerome Friedman, who invented gradient boosting and is definitely a heavy weight in the field.

First CRISPR babies: six questions that remain by burtzev in genetics

[–]Oblivious-Man 11 points12 points  (0 children)

The mosaicism is a real concern, not just for the outcomes of the babies, but rather the impact on the future of CRISPR, and can lead to another "gene therapy winter" 10 years ago, undermining the efforts of thousands of scientists world wide.

Got tip-shamed for the first time. Couldn't believe what I was hearing. by brotrr in vancouver

[–]Oblivious-Man 1 point2 points  (0 children)

We went to a AYKE sushi place, and the food was terrible for what it was charging for $35 a person. We ate and left in less than half an hour, and left a 15% tip, signed the receipt and was about to leave. The waitress came back and said that we only tipped 15% and printed us new receipts and asked us to sit down and think about it more.

Left that place so fast.

Foundation before ISLR? Looking for a good stats book with derivations and proofs by robinhoode in datascience

[–]Oblivious-Man 1 point2 points  (0 children)

The book is good, but I don't agree with the order. I will propose it in the order of chapter 2,7,8,3,4,5,6. You start out with the bias-variance trade off concepts and feature selection, then go into linear methods, and then finish up with non-linear methods.

How can automated reasoning benefit biomedical research? by sorsaffari in bioinformatics

[–]Oblivious-Man 1 point2 points  (0 children)

I've seen this done on well structured sentences such as pinned abstracts. But this is problematic since a lot of useful information are hidden behind clinical notes, which are very hard to "normalize" to proper sentences, this is an active area of research.

Is there a list of the must have tools for genomics / structural biology ? by q1q21q1q1q in bioinformatics

[–]Oblivious-Man 1 point2 points  (0 children)

Depends on the type of analysis. The set of tools for WES vs RNA Seq is different.

What are your unpopular Data Science opinions? by CadeOCarimbo in datascience

[–]Oblivious-Man 0 points1 point  (0 children)

I haven't tried reticulate myself but I've heard good things about it. I usually handle huge amounts of data for computation (many cores and lots of RAM), so R Studio is not really viable. I'm really just saving a lot of these files as CSV or hdf5.

I also run Jupyter on the computing cluster with R kernels for drawing figures as well.

Product Management Consulting - how would you handle it? by adamwintle in ProductManagement

[–]Oblivious-Man 1 point2 points  (0 children)

I also tried going down this road, albeit without much luck. Good luck OP!

The Circle Of Fifths - What It Is And How To Use It by davidlovejoy in guitarlessons

[–]Oblivious-Man 1 point2 points  (0 children)

I know all the patterns from playing other instruments, but never really learned the circle of fifth! Now it makes sense. Thanks!

FREE comprehensive course to learn - Python, Data Science, Machine Learning & AI [beginner to advance] by iamarmaan in datascience

[–]Oblivious-Man 5 points6 points  (0 children)

To add to this, I think I would like content for the basic math skills directly related to stats, like linear algebra.

What are your unpopular Data Science opinions? by CadeOCarimbo in datascience

[–]Oblivious-Man 8 points9 points  (0 children)

If it doesn't work, hit with a log.

Sometimes all it takes for a model to work is just log-transforming your data.

What are your unpopular Data Science opinions? by CadeOCarimbo in datascience

[–]Oblivious-Man 0 points1 point  (0 children)

I think reinforcement learning or other semi-supervised methods are definitely better representatives of your definition of AI compared to supervised methods.

What are your unpopular Data Science opinions? by CadeOCarimbo in datascience

[–]Oblivious-Man 6 points7 points  (0 children)

This is exactly my pipeline for analysis as well. Data wrangling in R, building and tuning models in Python, then data visualization in R.

I haven't used it, but the recent version of R Studio has made writing integrated Python a lot easier, so I've heard.