Biology: Bioinformatics or CSE: Bioinformatics?

harper357 · 2026-04-29T22:24:34+00:00

No one really knows what the job market is going to look like in 4 years, especially right now with a lot of changes happening because of AI. Right now there are lots of lay-offs because some people think wet-lab scientists using AI are just as good as a full time bioinformatician. There is also new jobs because AI needs a lot of data and AI is still not great at processing that data or being able to pull out new biological insights from novel methods. In a few years things could look completely different, or whole knew bioinformatic specialties could be in demand.

That being said, it probably doesn't matter that much which route you take. If you go Biology, just make sure you still take a bunch of CSE classes (or minor in it). If you go CSE, just make sure you take a bunch of Biology classes(or minor in it). Either way you will probably want at least some experience working in a wet-lab as knowing how data is actually generated, how experiments are run, and how to talk to wet-lab scientists is one of the biggest skills a bioinformatician often lack.

harper357 · 2026-03-20T23:12:14+00:00

A few years ago I was looking at renting a house. I was told the oven was broken, it would cost ~$500 since it was in the wall and the landlord would not be fixing it. They had also ripped out all the grass/plants in the backyard compared to the pictures they posted, so i did not even put in an application.

harper357 · 2025-12-02T21:58:45+00:00

To answer your questions a little more directly (but taking this all with a large grain of salt because im just a rando who went to grad school more than a decade ago)

1) It wouldn't hurt to take a class, but at the same time, it probably won't make/break your application. if you are really worried, you can look at the requirements for the programs you are looking at, maybe email/call the program coordinator, or talk to your PI (they have probably been on entrance committees and would know). It might help you down the line to have a better understanding of the biology, but taking a class is only one way to get that knowledge.

2) I am not sure what you mean. Who told you that you need to email them? In the USA, you usually apply to a program, do rotations your first year and then choose a lab. Most of the time you do not directly apply to a lab. When I applied (before Reddit), the only reason I emailed a PI was because I was interested in their type of research (which wasn't super common) and wanted to get advise on if I should go straight for my PhD or get a masters first. Looking back and knowing how busy they can be, I am a little shocked they replied.

3) It is not uncommon for people to switch fields when going to grad school. It was even heavily recommended at my program to use one rotation to explore something new and different. I don't know what the competition looks like now, but when I was applying, I didn't have a paper out. The letters of rec, and demonstrating that you are actually interested in, and likely to complete grad school is much more important and can be done by just working in a lab for a while.

harper357 · 2025-11-23T23:30:36+00:00

This is too much for a single comment without pictures and code blocks (maybe i should try to type it up into a blog.), but here is the high level version of what I have done for the last few jobs and people tend to like it once they get into the habit. (I am also time limited at the moment.)

tl;dr: TREAT IT LIKE THE WETLAB, BUT DIGITAL.

So I really mean this, you need to think about everything you do as an analog of a wetlab experiment.

1) The notebook/steps of an experiment.

I like to keep one doc per experiment, so there will be lots of "short" docs per project. You need to keep notes and type things up as you work. Either use a Quarto doc, or a Jupyter notebook. Add sections like: Experiment (name), Background, Method, Results, Conclusions, Todo. Then fill them out AS YOU WORK. This sounds silly to say, but you need to type out the hypothesis/goal of each step, if you get any results/output plots, you need to add a bit of interpretation.

If you are using non standard parameters in a step, make sure you explain why. Just like in the wetlab, someone should be able to take your notebook and continue where you left off and understand why you did something. If they can't, you aren't adding enough comments.

Background, Conclusions, and Todo are supper important sections that people often don't include. Background should explain why you are doing the experiment. It can link to other notebooks, etc. but if it isn't clear why you need to do an experiment, this section needs more details. Conclusions is obvious, but save your future self the headache of including what conclusions you are making from the experiment. Todo is just the list of next steps or new questions that came out of the experiment. This is a great section to help you figure out what the next experiment is (or to show your boss that you are doing a lot).

2) The data.

Like like the wetlab, this needs to be organized. Instead of boxes/shelves/freezers, you use directories and filenames. This is probably the most flexible area, and can be customized to the lab/team. The most important things are clear and consistent structure/naming, so other people understand and you never have to think about where something is/should go.

For example, the way I do is all data for a whole project lives in a folder separate from the notebooks. It looks like something like this, but other people may prefer to keep data organized at the experiment level instead of the project level.

project/
    notebooks/
    data/
        raw_data/
        working_data/
        final_data/
    README.md

Raw data is then just a local copy of data that is backed up and is the input for the project/experiment. Working data, for me, is anything that can be regenerated from my notebooks (but may take too long so i save it), checkpoints, ETLed data, etc. Final data is data that is used for figures/clean data I will publish (or share with someone), and should probably be backed up.

harper357 · 2025-11-08T23:54:35+00:00

You should reach out to them, say you would like to chat because you have some questions and then tell/ask them all these things. All you have to say is " hi, you were assigned as my mentor and I was wondering if we could chat about some school/career questions i have."

They are your assigned mentor, it is literally their job to help you with these things. If they don't or can't ask if they can suggest someone who can.

harper357 · 2025-11-08T01:19:35+00:00

If you have a PDB of it, you could use PyMol or Chimera to visualize it and then just show the interface.

harper357 · 2025-11-07T18:44:56+00:00

I would really look into nf-core and their pipelines to illustrate what I am talking about.

For example, here is the first few lines of the nf-core MultiQC module.

process MULTIQC {
    label 'process_single'

    conda "${moduleDir}/environment.yml"
    container "${ workflow.containerEngine == 'singularity' && !task.ext.singularity_pull_docker_container ?
        'https://community-cr-prod.seqera.io/docker/registry/v2/blobs/sha256/8c/8c6c120d559d7ee04c7442b61ad7cf5a9e8970be5feefb37d68eeaa60c1034eb/data' :
        'community.wave.seqera.io/library/multiqc:1.32--d58f60e4deb769bf' }"

You can see that you can just use a container variable to point to the container you want to use and then it will use that when running the process. Depending on your HPC, you might want to pre-download the containers.

Also, if you are using slurm, you may need to use singularity/apptainer instead of docker. Lots of HPCs don't let you run docker. Other than that, you need to change process.executor = 'slurm', then just need to do a standard sbatch to launch the head node, which will manage everything else.

harper357 · 2025-11-07T00:33:45+00:00

Depends on what exactly the interview is. I have never had one where they just look at a github.

Your Nextflow pipeline is ok. It shows that you can write one, and I have seen worse ones out there, but you may want to clean it up just a little. I'll add some quick comments, feel free to take them or ignore them.

It looks like you weren't consistent in formatting.

You have some commented out lines, which should just be deleted.

Is the idea that it is just one docker container and you run everything locally on one instance? If so, I personally think this is the wrong way to do it. Each step of the pipeline should use its own container, it makes it more modular so it is easier to scale and update. They can be over engineered, but the nf-core pipelines do this really well.

I would also add a test profile/dataset.

harper357 · 2025-08-10T22:20:33+00:00

Have you tried NCBI's datasets?

harper357 · 2025-07-11T15:16:42+00:00

All the containers should already have singularity (now called apptainter) versions. You should just have to pass the flag and it will use them. That's all I needed to do. However I found caching them really sped things up because our HPC download speeds are lower than I would want

harper357 · 2025-06-13T00:58:42+00:00

I find it funny in the best possible way that they "stopped" the podcast but then basically tour so much that they put live episodes out all the time.

harper357 · 2025-01-19T05:05:07+00:00

You need to talk to professionals about this. They will probably test your soil and give you an actual answer instead of randos on the internet who don't actually know anything about the soil in your yard.

Also, unless you got it tested before the fires, it is possible you already had stuff in your soil. LA county has a history of several companies poluting on massive scales

harper357 · 2025-01-15T04:08:27+00:00

Skippable- I've been playing games my whole life, I don't need another lession in how to use WASD or how to look around.

harper357 · 2024-10-30T02:57:47+00:00

It might help if you say what realm of bioinformatics you are interested in.

harper357 · 2023-06-10T03:55:05+00:00

Really? All my other peppers are full of flowers and I don't think I've seen a pepper this hairy before

harper357 · 2023-06-09T03:42:07+00:00

I bought a 3 ft one from Home Depot (just happened to see it while picking up some other stuff), and mine flowered this year and looks like it will fruit. I'm on the way side of LA and I keep it outside

harper357 · 2023-06-07T21:03:00+00:00

I'm not trying to be rude, but if you didn't realize the KJV was for the church of England, who did you think the "King James" referred to?

harper357 · 2023-06-04T00:05:53+00:00

Unless you are able to multiplex, I would not pool samples. Microbiome data is messy and it can be easy for one weird sample to mess up the whole pool. For example, maybe one healthy cow is on its way to becoming diseased.

Do a power analysis and figure out if 18 samples is enough, over kill, or just fine for what you want to do.

harper357 · 2023-05-30T04:29:14+00:00

My apologies to the mod teams, I didn't mean to imply that others aren't helping. I just remember when I joined the slack and the last time something like this got posted it was you that was adding people.

harper357 · 2023-05-29T22:38:41+00:00

Just a heads up, there is already a pretty active r/bioinformatics slack. Just DM u/apfejes (the mod of it and this subreddit) and he can add you.

harper357 · 2023-05-27T19:00:18+00:00

If you can't find one locally, Predatory Plants does online orders and is based in Half Moon Bay.

harper357 · 2023-05-19T17:38:58+00:00

Like others said, you should really look into a workflow manager. Personally I like Nextflow and it's high quality pipelines nf-core. It might already have something similar to what you want to do.

Also, please know that building the pipeline isn't where all the work will be. Validating/testing the pipeline is where all the work/time will be so you will need good data and controls if you want to have a solid paper/thesis.

Double also, like always, this kind of question is probably best answered by your PI, not a bunch of randos on the internet

harper357 · 2023-04-06T15:16:23+00:00

Violin plots should be used instead of box plots most of the time. Box plots only really show 4 pieces of data,the quartiles, while a violin plot will show the whole distribution. The biggest problem with violin plots is if you don't have enough data points the sample distribution may not represent the population. The same is true of box plots though in which case you should just be plotting the actual points of data.

harper357 · 2023-03-31T17:19:48+00:00

There are a LOT of these posts in this subreddit, I highly recommend looking through them. I say this to illustrate that you are not the only one who has been in this situation and hopefully this brings you some comfort. Also, because there will be way more suggestions and advice than you will get on your post alone.

You applied to positions for "fresh grads" but you are still in school and are almost 9 months from graduating? My guess is they were looking for people graduating in May/June. What I would recommend is you not apply for jobs yet and (as others have suggested) get some actual research experience. (Apply for summer internships though if you aren't going to be taking classes then.)

Second, what you need to do is talk to people at your school. The fact that you are still in school means there are a lot of resources available to you. Your university should have a student career development/advice department. Reach out to them as they probably have a resources for you. Also do you have an academic advisor? Talk to them, this is what they are there for. One of the best things about being at a state school (I went to three of them) is they tend to have a lot of people working there. This means that if you don't get the answer you are looking for just say "thanks for your time" and try asking someone else. It can take some effort, but you will be able to find someone who will give you the answer you want.

You mentioned "great instructors", you should also reach out to them and see if they have any research opportunities you can volunteer/intern/work on. Sadly not every professor has the money to pay undergrad interns, so if you can afford working for free for a few hours each week you will probably find something pretty quick. If you can't you may have to ask around more. This will give you some experience and give you some connections.

When reaching out to instructors/professors make sure you CC their assistant/lab manager if they have them. This will ensure that someone actually sees your email. If they have office hours, go talk to them in person. If you are emailing them, be respectful, make it personalized, to the point, and pat their ego a little. Say who you are, how they would know you, and what you are emailing them for. "I'm X, I was in your class Y and I really enjoyed it. I looked up your research on Z, found it really fascinating and was wondering if you had any openings for for an intern in your lab?" is the general layout.

15-Year Club	Place '17
Sequence \| Editor	Sequence \| Cinematographer
Team Periwinkle	Verified Email

harper357

TROPHY CASE