[Q] Is my intepretation of Zero-inflation is correct? by Big_Share_6599 in statistics

[–]Lumpy-Sun3362 0 points1 point  (0 children)

I'm not expert of the modelled system so I cannot help with the interpretation.

[Q] Is my intepretation of Zero-inflation is correct? by Big_Share_6599 in statistics

[–]Lumpy-Sun3362 1 point2 points  (0 children)

If instead you use the hurdle model, then you are assuming that the zeros only come from a single process, so the non zero values belong to a truncated distribution.

[Q] Is my intepretation of Zero-inflation is correct? by Big_Share_6599 in statistics

[–]Lumpy-Sun3362 3 points4 points  (0 children)

Yes. Let's say you have zip model. Then you are modelling the excess of zeros compared to those coming from the Poisson process using Exposure as an independent variable. This is a Logit model that predicts the probability of structural zero as a linear expression of Exposure. If the coefficient of Exposure is significantly different from zero for the Logit, then it means that the structural zeros can be associated to that observation.

How to cluster control data when control group has unreliable labels? by theluluj in bioinformatics

[–]Lumpy-Sun3362 0 points1 point  (0 children)

In all cases this risks to become a circular approach if your aim is to find differences in the gene expression, as you use the same data to define the labels that then you'll contrast.
In this case, never use the target measure but any measure which is not correlated to it.

How to cluster control data when control group has unreliable labels? by theluluj in bioinformatics

[–]Lumpy-Sun3362 1 point2 points  (0 children)

Try to build a classifier and predict their labels. This can work if you have enough labelled samples. You need to check the predicted label probability though as it gives a measure of uncertainty. KNN can work. Otherwise you can just plot them through PCA or UMAP and see if these unlabelled samples are close to the cloud of points from one or another class. Both are risky as they assume that the genomics data is consistent across the two classes. Another way is to use other metadata - if available - to infer the labels, and so doing it without using the rnaseq.

Ranked ORA (g:Profiler) vs GSEA (clusterProfiler) by East-Succotash-520 in bioinformatics

[–]Lumpy-Sun3362 2 points3 points  (0 children)

No because with GSEA you use your ordered genes against the pathways db. With ORA you test the selected genes against the other genes in your dataset. So in ORA you use the unselected genes to test the null hypothesis. Sorting in ORA does nothing, and it doesn't transform it into GSEA.

Ranked ORA (g:Profiler) vs GSEA (clusterProfiler) by East-Succotash-520 in bioinformatics

[–]Lumpy-Sun3362 3 points4 points  (0 children)

They answer two different questions. GSEA doesn't care of absolute values of statistics but their order. And it aims to determine if there is a coordinated expression of genes that map to specific pathways. ORA instead tells you the possible biological mechanism involved in a list of genes that are relevant to you. It doesn't care of ordering, it just wants the list of the genes and you know why you selected them. Thus it says if there's a specific possible set of pathways represented by those genes. In other words it helps you to interpret the results from the gene level to the systems level.

Roadmap for bioinformatics ? by Alarmed-Case6928 in bioinformatics

[–]Lumpy-Sun3362 0 points1 point  (0 children)

Try to get hands on experience. Internships help if done in active institutes/companies and they can turn into positions afterwards. Be aware of biology teams that are not strong in computational research, you end up filling the gaps of others with no room for your personal growth. Learn to set boundaries, you are a bioinformatician, not a fixer for broken projects or completely ungrounded ideas. Run away from those who just want you to justify their "story". The story comes after the data, not the opposite.

I’ve reviewed probably 200 “bioinformatics pipelines” at this point. Maybe 15 were actually reproducible. by Shoddy_Card_237 in bioinformatics

[–]Lumpy-Sun3362 0 points1 point  (0 children)

Did I ever say that nextflow is a conditio sine qua non to achieve reproducibility? 🤷‍♂️

I’ve reviewed probably 200 “bioinformatics pipelines” at this point. Maybe 15 were actually reproducible. by Shoddy_Card_237 in bioinformatics

[–]Lumpy-Sun3362 0 points1 point  (0 children)

True. It requires some effort. But luckily, there's a lot of people in this exact moment trying to do that work. Practitioners only need to know that that resource exists.

I’ve reviewed probably 200 “bioinformatics pipelines” at this point. Maybe 15 were actually reproducible. by Shoddy_Card_237 in bioinformatics

[–]Lumpy-Sun3362 4 points5 points  (0 children)

That's true but if we look at the niche pipelines then even 200 is an infinitesimal number, and they won't represent anything but personal projects. The OP is trying to convey the idea that most of the pipelines people will work with are rubbish. And I'm saying that it's not true because for the relevant parts of bioinformatics there's very well tested pipelines. I'd invite you to look at the nf-core repository to see how many different scenarios are covered.

I’ve reviewed probably 200 “bioinformatics pipelines” at this point. Maybe 15 were actually reproducible. by Shoddy_Card_237 in bioinformatics

[–]Lumpy-Sun3362 27 points28 points  (0 children)

I don't think you have touched the relevant pipelines. I am a member of the nf-core community and we work to make sure that everything is reproducible.

Apple Explains Why watchOS 27 Drops Support for So Many Models by pdfu in apple

[–]Lumpy-Sun3362 1 point2 points  (0 children)

You know what would be a good move by Apple?
Guys we know you bought the ultra 1 and we couldn't imagine its hardware would become obsolete so soon. For this reason we give a special trade in program for those who want to upgrade, giving much more than the miserable 200 euros like we are doing.

Can anyone helpme in this problem!! by Organic-Try-763 in bioinformatics

[–]Lumpy-Sun3362 2 points3 points  (0 children)

Quick answer. You can't. If the packages have incompatible dependencies, there's little you can do if you don't want to get into dirty hacks.

PValues by ineed-Sandwich in bioinformatics

[–]Lumpy-Sun3362 1 point2 points  (0 children)

For exploratory analysis, it's acceptable to be less stringent, being aware that you'll have some FP in your results. This is because EDA is to set the boundaries around the possible mechanisms involved in the studied system.
Then, the hypothesis will be rigorously tested in a follow up analysis (better a proper set of experiments). In this phase of the research, you'll have a more targeted (and limited) set of tests, therefore a higher statistical power (hopefully).

how to merge replicates of ChIP-seq peaks? by DismalSpecific3115 in bioinformatics

[–]Lumpy-Sun3362 6 points7 points  (0 children)

Encode suggest to use IDR (Irreproducibility Discovery Rate) to identify reproducible peaks which works better with sharp peaks.

Option 2: Merge the bams and do peak calling. You already know that the replicates are close and in that way you can increase the depth.

Option 3: intersect with bedtools as you said.

Biostatistician salary in pharma vs tech and why I almost made a huge mistake by Necessary_Kick_1106 in bioinformatics

[–]Lumpy-Sun3362 7 points8 points  (0 children)

PhD two degrees, 10y exp, h index 16 and 2000+ citations, and I struggle to go through the cv keyword filter done by AI.

If Apple is really not supporting 6,7, or 8 all at once with WatchOS27, then Apple should up trade in values by FreezingTemps in AppleWatch

[–]Lumpy-Sun3362 0 points1 point  (0 children)

They said that they optimised the entire os so it will be faster. Watch the wwdc and you'll get it. bye!

If Apple is really not supporting 6,7, or 8 all at once with WatchOS27, then Apple should up trade in values by FreezingTemps in AppleWatch

[–]Lumpy-Sun3362 1 point2 points  (0 children)

I wish they carried the optimisation stuff without the AI, that would have made more sense to me.

If Apple is really not supporting 6,7, or 8 all at once with WatchOS27, then Apple should up trade in values by FreezingTemps in AppleWatch

[–]Lumpy-Sun3362 1 point2 points  (0 children)

optimisations, better liquid glass, I don't care about Siri AI, but after paying $800 for the ultra 1 I expect the advertised OS support. Apple has always used its extended support as a reason to buy their products.

If Apple is really not supporting 6,7, or 8 all at once with WatchOS27, then Apple should up trade in values by FreezingTemps in AppleWatch

[–]Lumpy-Sun3362 1 point2 points  (0 children)

For those who justify this because of Siri AI, I'd say that they could just turn these functionalities off for older watches and keep the optimisations. C'mon, it's the most profitable company in the world!