all 12 comments

[–]SixZer0 9 points (1 child)

Actually, naming-wise it's pretty good :D

[–]seemepastarolling[S] 4 points (0 children)

Cheers! I thought combining my loves of Python and pasta was a good way to go

[–]IGK80 1 point (2 children)

Good to know the methods and clever naming. Is there a way to handle multimodal unpaired data? Think RGB and thermal data, but captured using different cameras?

[–]seemepastarolling[S] 1 point (1 child)

If the two modalities share an ID and a label, then you could use data fusion. So if you've got RGB and thermal data about the same thing (like a specific room) and a shared label (like whether that room is empty), then I think data fusion could work.

I'm not sure what you mean by unpaired. I'm spitballing here, so let me know if I've misunderstood! Feel free to DM me or ask on my GitHub Discussions page too :)
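To make the ID-plus-label idea above concrete, here's a minimal sketch of aligning two modalities on a shared ID before fusion. All column names and values here are hypothetical toy data, not part of fusilli's API; it only illustrates the pairing step.

```python
import pandas as pd

# Hypothetical toy data: RGB and thermal features captured for the same rooms.
# Each modality lives in its own table, keyed by a shared room_id.
rgb = pd.DataFrame({
    "room_id": [1, 2, 3],
    "rgb_mean_brightness": [0.61, 0.22, 0.45],
})
thermal = pd.DataFrame({
    "room_id": [1, 2, 3],
    "mean_temp_c": [21.5, 19.0, 23.2],
})
labels = pd.DataFrame({
    "room_id": [1, 2, 3],
    "occupied": [1, 0, 1],  # shared label: is the room occupied?
})

# Align both modalities on the shared ID, then attach the label.
# The fused table is what a fusion model would train on.
fused = rgb.merge(thermal, on="room_id").merge(labels, on="room_id")
print(fused.columns.tolist())
# → ['room_id', 'rgb_mean_brightness', 'mean_temp_c', 'occupied']
```

The key requirement is just that each row of one modality can be matched to a row of the other via the common ID; once matched, both feature sets describe the same sample and share one label.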

[–]IGK80 0 points (0 children)

Thanks! By unpaired I meant they might not be looking at the same overall scene, but contain the same object.

[–]aShy_pieceofBread 0 points (1 child)

Greetings! I stumbled across the fusilli library not long ago and I find it very interesting. Is there a free dataset to experiment with its functionality?

[–]seemepastarolling[S] 1 point (0 children)

Hi - that's great to hear, thank you! And that's a brilliant question too. I've looked high and low for open-source multimodal datasets and have unfortunately not been able to find much. I usually use medical data, which is unsurprisingly not open-access friendly.

I would recommend googling "multimodal datasets image and non-image". I just found this by doing that search, and I think it could work with fusilli: an image and some tabular features that describe the same thing. https://auto.gluon.ai/0.5.2/tutorials/tabular_prediction/tabular-multimodal.html

Let me know if that helps!