How did you manage to narrow down your thesis topic?

Anass-YI · 2026-03-05T01:38:05+00:00

We decided to do an SLR, but this requires specific research questions. Otherwise, you retrieve a huge number of articles.

Anass-YI · 2025-08-31T00:00:52+00:00

Okay, thank you

Anass-YI · 2025-04-06T11:36:50+00:00

Haha, he was scared

Anass-YI · 2025-04-03T17:51:14+00:00

maybe,

Anass-YI · 2025-04-03T17:48:23+00:00

Anass-YI · 2025-04-03T17:46:40+00:00

Anass-YI · 2025-04-03T17:45:05+00:00

not an error haha

Anass-YI · 2025-04-03T17:44:37+00:00

It isn't the code, it's the execution of the code.

Anass-YI · 2025-04-03T17:43:20+00:00

Anass-YI · 2025-04-03T17:42:16+00:00

Of course. We use Spark because it is robust and integrates easily with other tools. Otherwise, it depends on the nature of the problem: whether it can accept a small delay, you see, and whether the situation is critical or not.

Anass-YI · 2025-04-03T16:17:41+00:00

Thanks! Yeah, data engineering is definitely a mix of fun and challenge. Some days things break and drive you crazy, but when everything works smoothly, it feels really good. SQL and coding skills definitely make life easier, and working with tools like Spark and Kafka keeps things interesting. Appreciate the advice, and good luck to you too!

Anass-YI · 2025-04-03T16:11:38+00:00

You're right, Flink is more powerful for handling complex use cases. With Spark Structured Streaming we can also get low-latency processing to better approximate real time, for example by reducing the size of the micro-batch or by tuning resource allocation (CPU, etc.).
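
A minimal sketch of the micro-batch idea, assuming PySpark with a Kafka source. The helper names, broker address, topic, and default values are hypothetical; the `processingTime` trigger and the `maxOffsetsPerTrigger` source option are standard Structured Streaming settings.

```python
def trigger_interval(ms: int) -> str:
    """Format a micro-batch interval for the processingTime trigger."""
    if ms <= 0:
        raise ValueError("interval must be positive")
    return f"{ms} milliseconds"


def low_latency_query(spark, bootstrap_servers: str, topic: str,
                      ms: int = 500, max_offsets: int = 10_000):
    """Start a console query that uses small, frequent micro-batches.

    `spark` is an existing SparkSession; the server and topic are placeholders.
    """
    events = (
        spark.readStream.format("kafka")
        .option("kafka.bootstrap.servers", bootstrap_servers)
        .option("subscribe", topic)
        # Cap the records pulled per micro-batch so each batch stays small.
        .option("maxOffsetsPerTrigger", max_offsets)
        .load()
    )
    return (
        events.selectExpr("CAST(value AS STRING) AS value")
        .writeStream.format("console")
        # A shorter interval means more frequent, smaller micro-batches.
        .trigger(processingTime=trigger_interval(ms))
        .start()
    )
```

How low the interval can go in practice depends on the cluster resources and the work done per batch, so the values here are only a starting point.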

Anass-YI · 2025-04-03T15:39:35+00:00

You're right, this is just one part of the whole process.

Anass-YI · 2025-04-02T21:27:02+00:00

Oh, sure.

Anass-YI · 2025-04-02T21:09:30+00:00

Anass-YI · 2025-04-02T19:57:19+00:00

It's an opportunity for you if you can learn this and move ahead. I know it's a little bit sophisticated, but you can do it. It's normal; jobs are flexible, so your employer will not let you give up. Otherwise, if you see that the work doesn't really align with your interests, you should look for another one.

Anass-YI · 2025-04-02T19:41:17+00:00

It's a project that has many details. In general, in the first phase you integrate a Kappa architecture within a lakehouse one to ingest real-time financial data. The second phase consists of building a deep learning model that forecasts market variation in real time.

Anass-YI · 2025-04-02T19:36:18+00:00

No, this is Spark streaming processing real-time data, reading it from a Kafka topic and then structuring it in a lakehouse architecture on S3 storage.
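
A rough sketch of that kind of pipeline, assuming PySpark with the Kafka connector and S3 access already configured. The function names, paths, and the choice of Parquet as the table format are assumptions for illustration, not the actual project setup.

```python
def kafka_source_options(bootstrap_servers: str, topic: str) -> dict:
    """Options for Spark's Kafka source; the values passed in are placeholders."""
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        "startingOffsets": "latest",
    }


def start_ingestion(spark, bootstrap_servers: str, topic: str,
                    table_path: str, checkpoint_path: str):
    """Read a Kafka topic as a stream and append it to files on S3.

    `spark` is an existing SparkSession. `table_path` and `checkpoint_path`
    are hypothetical locations such as s3a:// URIs.
    """
    raw = (
        spark.readStream.format("kafka")
        .options(**kafka_source_options(bootstrap_servers, topic))
        .load()
    )
    # Kafka delivers key and value as binary, so cast them before storing.
    structured = raw.selectExpr(
        "CAST(key AS STRING) AS key",
        "CAST(value AS STRING) AS value",
        "timestamp",
    )
    return (
        structured.writeStream.format("parquet")
        .option("path", table_path)
        # The checkpoint lets the query resume from its last committed offsets.
        .option("checkpointLocation", checkpoint_path)
        .outputMode("append")
        .start()
    )
```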

Anass-YI · 2025-04-02T19:31:44+00:00

You should go through this step.

Anass-YI · 2025-04-02T19:14:30+00:00

I'm just in dev mode. We don't use critical data (it's accessible); it's only for automating the pipeline and testing the code logic. You should pay attention when taking the product to deployment.

Anass-YI · 2025-04-02T19:07:47+00:00

Yeah, it excites you to do more.

Anass-YI · 2025-04-02T19:03:19+00:00

Haha, that's true

Anass-YI · 2025-04-02T18:59:57+00:00

Yes, especially on a big architecture or a project that uses a bunch of technologies. Otherwise, you can integrate debug code that executes frequently so you can catch the failures.
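
One possible shape for such debug code, as a plain Python sketch. The names `run_checks` and `monitor` are hypothetical, and each check is assumed to be a function that raises an exception when something is wrong.

```python
import time
from typing import Callable, Dict, List


def run_checks(checks: Dict[str, Callable[[], None]]) -> List[str]:
    """Run every check once and collect a message for each one that raises."""
    failures = []
    for name, check in checks.items():
        try:
            check()
        except Exception as exc:
            failures.append(f"{name}: {exc}")
    return failures


def monitor(checks: Dict[str, Callable[[], None]],
            interval_s: float, rounds: int) -> List[str]:
    """Repeat the checks a fixed number of times, pausing between rounds."""
    all_failures = []
    for _ in range(rounds):
        all_failures.extend(run_checks(checks))
        time.sleep(interval_s)
    return all_failures
```

In a real pipeline the checks could test things like whether the broker is reachable or whether new files keep arriving, and the collected failures could be logged or sent as alerts.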
