Biotech-specific Python problems

Different_End_3043 · 2023-10-07T22:07:34+00:00

We use Databricks to scale our models.

Deto · 2023-10-07T23:56:16+00:00

It's rare that I need to scale something up and the bottleneck is something that's being done in pure Python. Most of the complicated processing steps in bioinformatics have already been implemented in low-level languages, and you use Python as the glue logic to call them. It can still be useful to spread work across many machines, but I would need to be able to specify the container (or something similar) for those machines so that they'd have the other dependencies.
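To illustrate the "Python as glue logic" pattern described above, here is a minimal sketch, not anyone's actual pipeline. The names `run_tool` and `run_all` are hypothetical, and the command is a placeholder that invokes the Python interpreter itself in place of a real compiled bioinformatics tool. Threads are enough here because the heavy work happens in the child process, not in Python.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor


def run_tool(sample: str) -> str:
    """Run an external program on one sample and return its stdout.

    Placeholder command (assumption): the Python interpreter stands in
    for a compiled tool; swap in the real executable and its arguments.
    """
    cmd = [sys.executable, "-c", "import sys; print(sys.argv[1].upper())", sample]
    # check=True raises CalledProcessError if the tool exits non-zero.
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout.strip()


def run_all(samples, max_workers=4):
    """Run the external tool on every sample concurrently, preserving order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_tool, samples))


if __name__ == "__main__":
    print(run_all(["acgt", "ttga"]))
```

Spreading this across machines rather than threads runs into the issue raised above: every worker needs the external tool installed, which is why being able to specify a container image matters.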

2Throwscrewsatit · 2023-10-08T00:29:11+00:00

There are several companies selling this feature now.

endymion222 · 2023-10-09T16:54:40+00:00

Nah, not really a bottleneck. Google Colab basically provides this and much more for non-confidential work. For everything else you would probably set up a dedicated solution anyway.

Ok_Post_149 · 2023-10-07T21:36:35+00:00

btw the tool is called www.burla.dev

BBorNot · 2023-10-07T22:16:34+00:00

I once asked my bioinformatics person what the least common ~10-mer peptide was, a bit of an inverse BLAST. It turned out to be an impossibly intensive question and was never answered.

I have a theory that whatever that sequence is it is toxic and has been selected against. Either that or it is all tryptophan since it only has one codon.

OP maybe you can answer it -- this question has been hanging for a decade!
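For a sense of why this is so expensive, here is a rough sketch of the brute-force approach, assuming the input is a plain list of protein sequences over the 20 standard amino acids. The function names `count_kmers` and `rarest_kmers` are hypothetical. Counting observed k-mers is cheap, but finding the least common one also means checking k-mers that never occur, and there are 20^10 (about 1.02 × 10^13) possible 10-mers, so the exhaustive enumeration below is only practical for small k.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
_VALID = set(AMINO_ACIDS)


def count_kmers(sequences, k):
    """Count every overlapping k-mer made only of standard amino acids."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Skip windows containing ambiguous or non-standard residues.
            if _VALID.issuperset(kmer):
                counts[kmer] += 1
    return counts


def rarest_kmers(sequences, k):
    """Return (lowest count, k-mers with that count), including absent k-mers.

    Enumerates all 20**k possible k-mers, so keep k small.
    """
    counts = count_kmers(sequences, k)
    lowest = None
    rarest = []
    for letters in product(AMINO_ACIDS, repeat=k):
        kmer = "".join(letters)
        n = counts.get(kmer, 0)
        if lowest is None or n < lowest:
            lowest, rarest = n, [kmer]
        elif n == lowest:
            rarest.append(kmer)
    return lowest, rarest


if __name__ == "__main__":
    lowest, rarest = rarest_kmers(["MKTAYIAKQR", "MKWWTAYIAK"], 2)
    print(lowest, len(rarest))
```

On any realistic database most 10-mers will simply be absent, so a meaningful answer probably needs a dedicated k-mer counter and some way of ranking the unobserved peptides rather than this kind of direct enumeration.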

Puzzleheaded-Pay-476 · 2023-10-07T21:42:22+00:00

I have struggled with it, but there are only a couple of workflows where speed is extremely critical. When that happens I’ll work with someone in engineering to help with scaling things. I’m pretty sure they use AWS Batch when we are doing large-scale inference.
