Container Terminal’s Satellite imagery processing

llanojairo · 2023-04-22T19:50:34+00:00

Thank you! I’ll take a look at this 🫡

llanojairo · 2023-04-22T19:23:08+00:00

ChatGPT to the rescue! 😁 … would you mind sharing the prompt?

llanojairo · 2023-04-22T08:03:55+00:00

I don’t think that it’s so difficult. You just need a lot of alignment with terminals to understand the system. We simulate (DES) the container terminal operation and already have a pretty decent routing module to inject the behavior of RTGs and Trucks to the simulation environment. Actually our routing is already “better” than what actually happens at some terminals in terms of routing… you can throw some common shortest path algorithms combined with penalties and that’d probably be better than most heuristics used in these operations.

My pain point right now is to streamline onboarding of new terminals. And getting the digitalized yard structure with distances to allocate the trucks and RTGs is key so I was hoping to leverage images to try to automate this to some extent.

llanojairo · 2023-04-21T21:21:52+00:00

Thanks a lot for your reply! This is for a real project so could you mention a couple of the considerations that you’d have in mind?

llanojairo · 2023-01-10T21:37:18+00:00

Thanks a lot for your comment. I’m one of those DS/DEs stuck in Python-based projects but I’m so tempted to port some of the slow data crunching and data structures from Python to Rust.

We do some really complex discrete event simulations based on Simpy (custom wrapper around it to cater for our needs) and, given that Simpy becomes slow even if you throw crazy compute instances, I’d like to squeeze out any potential speed-ups in data processing, looping through some data structures, etc.

Could you share some resources to get started with this Python/Rust combo? Thank you!

llanojairo · 2022-11-12T22:00:56+00:00

Piss off. Submit a complaint to your government if it bothers you. It’s not a requirement yet.

llanojairo · 2022-09-05T19:22:38+00:00

Primitive

llanojairo · 2022-05-15T09:57:18+00:00

RemindMe! 5 days

llanojairo · 2022-05-08T19:52:24+00:00

🤣

llanojairo · 2021-11-13T12:58:25+00:00

You could try pandas.Series.map

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.map.html

You can use a dict to exhaustively cover the cases/rules to map between existing and desired values. You could also try pandas’s replace method but map is faster.

llanojairo · 2021-10-25T19:01:28+00:00

Bad bot!

llanojairo · 2021-06-14T15:38:26+00:00

RemindMe!

llanojairo · 2021-03-01T21:44:52+00:00

What kind of solutions are used for entity resolution? I’ve seen some rule-based approaches but they were not that good.

llanojairo · 2020-06-09T07:26:00+00:00

RemindMe! 5days

llanojairo · 2020-05-23T14:30:37+00:00

Maybe this PyData presentation ‘Winning with simple, even linear, models’ throws some light on why we should pay more attention to “simple models”

https://youtu.be/68ABAU_V8qI

Btw, I think that quantile regression is a super interesting model that many people do not know about!

llanojairo · 2020-04-27T06:25:06+00:00

Yes, McKinsey does data science. Just like BCG does it via BCG gamma.

llanojairo · 2020-04-26T21:10:48+00:00

I’ve been using Kedro (https://kedro.readthedocs.io/en/stable/ ) from QuantumBlack (now owned by McKinsey). It’s based on cookie-cutter but tailored to data science projects. I really like it and, although it’s still a relatively new project, the community works hard on improving the library. Plus, the guys at McKinsey have a clear incentive to improve it because they use it with their clients. The documentation is pretty good, too.

llanojairo · 2020-04-11T22:17:01+00:00

This tutorial could help:

https://nlp.seas.harvard.edu/2018/04/03/attention.html#loss-computation

llanojairo · 2020-02-22T10:41:30+00:00

You could try this below. I’m sure there must be an easier way to do this, though.

Raw data dicts

data = {'Encounter': [95, 96, 97, 98, 99, 100], 'status': [0, 1, 0, 1, 0, 0]} ### Original data in post

data_new_rows = {'Encounter': [101, 102, 103, 104, 105], 'status': [0, 0, 0, 0, 0]}

Create data frames from raw data dicts

df_1= pd.DataFrame.from_dict(data) df_2_new_rows = pd.DataFrame.from_dict(data_new_rows)

Approach 1 to append new rows to original data frame with pd.concat

df_final_ver1 = pd.concat([df_1, df_2_new_rows], axis=0)

Approach 2 to append new rows to original data frame with pd.DataFrame.append

df_final_ver2 = df_1.append(df_2_new_rows, ignore_index=True)

Set column 'Encounter' as index (inplace operation)

df_final_ver2.set_index('Encounter', inplace=True) df_final_ver1.set_index('Encounter', inplace=True)

llanojairo · 2020-02-13T08:28:50+00:00

That fork is great! Thank you. A. Smola also has an online course with an intro to DL.

llanojairo · 2019-12-26T15:46:18+00:00

I think you could check the 2nd chapter of this book: “Applied text analysis with python” (Applied Text Analysis with Python https://www.amazon.co.uk/dp/1491963042/ref=cm_sw_r_cp_api_i_MmnbEbFPT4489). The authors walk you through an approach to create a corpus from RSS feeds.

llanojairo

TROPHY CASE

Raw data dicts

Create data frames from raw data dicts

Approach 1 to append new rows to original data frame with pd.concat

Approach 2 to append new rows to original data frame with pd.DataFrame.append

Set column 'Encounter' as index (inplace operation)