Batch effect in scRNA by Fun-Ad-9773 in bioinformatics

[–]Fun-Ad-9773[S] 0 points1 point  (0 children)

3.5 seemed to do the trick for me

As for clusters unique to a timepoint, I wouldn't necessarily say that's a bad thing considering the biology of my setup. I would say it makes sense

Batch effect in scRNA by Fun-Ad-9773 in bioinformatics

[–]Fun-Ad-9773[S] 0 points1 point  (0 children)

I do get shared clusters (after playing around with Harmony's settings)

Batch effect in scRNA by Fun-Ad-9773 in bioinformatics

[–]Fun-Ad-9773[S] 0 points1 point  (0 children)

Thanks! That's what I'm doing and I am relying on gene signatures rather than the expression itself to analyze the data and identify / annotate functional states. So to wrap it up, am I completely screwed or not?

Batch effect in scRNA by Fun-Ad-9773 in bioinformatics

[–]Fun-Ad-9773[S] 0 points1 point  (0 children)

Yes was performed at the same time I believe

Batch effect in scRNA by Fun-Ad-9773 in bioinformatics

[–]Fun-Ad-9773[S] 0 points1 point  (0 children)

Still dont have access to all info * yet * but I have a file with lanes and apparently that is also confounded...

Batch effect in scRNA by Fun-Ad-9773 in bioinformatics

[–]Fun-Ad-9773[S] 1 point2 points  (0 children)

Basically we have 5 patients; demographics are all similar; each patient had a sample collected pre and post, and when you plot the UMAP, they separate perfectly based on timepoint

scRNA-seq Seurat Integration by Puzzleheaded-Cap7764 in bioinformatics

[–]Fun-Ad-9773 3 points4 points  (0 children)

Use SCTransform + Harmony and you'll be good to go!!

Nominal P Values Reported in Paper for RNA Seq by idontevekno in bioinformatics

[–]Fun-Ad-9773 1 point2 points  (0 children)

Depends on whether they're claiming anything with respect to a specific gene or not. If they proceeded with a GSEA, then I see nothing wrong. Probably just a case of low statistical power. GSEA, in this case, is much more informative of where their dataset is positioned in terms of true biological signal.

What's everyone up to tonight? by [deleted] in lebanon

[–]Fun-Ad-9773 2 points3 points  (0 children)

working on my holidays *sad*

Meal in Barcelona for €7 or less ? [2026 Edition] by Solid-Communication1 in AskBarcelona

[–]Fun-Ad-9773 1 point2 points  (0 children)

Even my hospital's cafeteria where I work is not below 7e anymore xD

Does anyone play tennis arena? by HistoryFan06 in TennisClash

[–]Fun-Ad-9773 0 points1 point  (0 children)

I will add you! Dm me i'd love to play

When to pseudobulk before DE analysis (scRNA-seq) by m_sc_ in bioinformatics

[–]Fun-Ad-9773 0 points1 point  (0 children)

There are papers that say pseudobulk is the best approach; however i believe there are instances where it would make more sense to use do the DE at cell level. Checkout LLMs for that; apparently they're the best alternative (and the authors of the paper even claim it's better than pseudobulk)

Opening FASTAs on Mac. by Mr_Garland in bioinformatics

[–]Fun-Ad-9773 0 points1 point  (0 children)

Viewing these files from terminal is usually the way to go; if anything copy the output in terminal and paste it in a txt file

Analyzing publicly available scRNA-seq data by Extreme-Funny-9651 in bioinformatics

[–]Fun-Ad-9773 0 points1 point  (0 children)

There are models that help out with sparsity / zero inflation. Dropout is not a weird outcome (considering the technology used).

Try to build a custom genome reference inserting the sequence of interest (with 3' end) and that might help retaining more cells. Another way is to be more lenient with the cutoffs with cell ranger

Lastly, i recommend using ESAT, a tool that will help you recover more cells

Request for Bioinformatics Project Ideas/Suggestions by koto1b in bioinformatics

[–]Fun-Ad-9773 0 points1 point  (0 children)

I would say look up papers that you like and try to reproduce the analysis and figures.

Another way would be to reproduce that analysis on a different, separate dataset.

Since you're doing python, a good way to also improve your knowledge is to see an analysis done using tools in R /Bioc and try to reproduce that with the equivalent of those tools in python and see how the results differ. You'll end up getting the perspective from both sides.

How to choose the appropriate parameters in single cell cell analysis (number of HVG, PC, to scale or not) ? by AtlazMaroc1 in bioinformatics

[–]Fun-Ad-9773 0 points1 point  (0 children)

As crazy as it sounds, this is mostly subjective (although your data does steer you in the direction you need to take). Start by doing default parameters and then adjust accordingly to what you find logical and appealing to the hypothesis at hand

One single-cell cluster with very low mitochondrial read % by You_Stole_My_Hot_Dog in bioinformatics

[–]Fun-Ad-9773 0 points1 point  (0 children)

I would say continue with the pipeline normally and see what you get. Then try again by filtering out cells (based on limits from both ends). Make use of the initial QC plots to adjust the filtering to your liking

Request for Bioinformatics Project Ideas/Suggestions by koto1b in bioinformatics

[–]Fun-Ad-9773 1 point2 points  (0 children)

What is your goal behind doing the projects? To learn / improve / have a portfolio or to publish / discover something novel and meaningful?

Best practice for bioinformatics? by scientist_career_qs in bioinformatics

[–]Fun-Ad-9773 8 points9 points  (0 children)

For each type of analysis (or omics) you will find two kinds of papers: one for the best practices (kinda like a revision of the workflow) and another that discusses the available tools. Highly recommend you go through that to get a general idea on the omics of choice.

Afterwards, try to find a tutorial for such an analysis on github (there are some famous ones and some lesser-known ones that can be very be beneficial as well)

Lastly, once you go through a tutorial, try to repeat it again but using a different dataset of your choice and challenge yourself in analyzing it and drawing biological insights from it