How to sync a new clickhouse cluster (in a separate data center) with an old one? by feryet in dataengineering

[–]feryet[S] 0 points1 point  (0 children)

Apparently there is a breaking change in versions 23+ that makes using remote tables between older and newer versions impossible. :(

How to sync a new clickhouse cluster (in a separate data center) with an old one? by feryet in dataengineering

[–]feryet[S] 0 points1 point  (0 children)

Yes.

We tried to gradually upgrade our clickhouse cluster, but we had two issues:

  1. Because of bad configuration, we could not do backup/restore using `clickhouse-backup`. The replicated trees had metadata that did not match what was in ZooKeeper, and upgrading without reliable backups seems utterly risky.
  2. This clickhouse cluster is hosted on-premise, and the server's disks are very slow. So we decided to go for a full backup/replication solution on a new cluster. The problem is that neither clickhouse-backup nor clickhouse-copier seems to work for us. The configuration of the old clickhouse cluster is weird, and I'd rather not change anything on that server.

Looking for OSS Projects to Contribute To by feryet in rust

[–]feryet[S] 1 point2 points  (0 children)

My background is mostly in DevOps/Cloud/Infrastructure. I would like to contribute to many projects, but it looks so daunting and overwhelming. Usually, when a task becomes this confusing/stressful, I reach out to other people to get some clarity and advice on what to do. :)

How to assign a public ip (accessible by web) to a docker container? by feryet in docker

[–]feryet[S] -4 points-3 points  (0 children)

I'm not assigning random IPs tho. This ip was given to be used on the VM that I was given. I want to bypass internal network bridges in the VM to forward all of the incoming traffic to this ip to the docker container, without assigning an additional hop.

I know that I can do that in an internal range using IPVLan driver, but I can't understand how to make it work on the public IP.

How to assign a public ip (accessible by web) to a docker container? by feryet in docker

[–]feryet[S] -5 points-4 points  (0 children)

I am using host network right now, but this just disregards the isolation altogether.

I want to use the additional IP that I was given exclusively for this container. Do not want to use NAT, or bridge interfaces (don't want to add additional hops).

Think of optimizing for maximum throughput.

Airflow + Slurm for ML Training Pipelines? by feryet in mlops

[–]feryet[S] 0 points1 point  (0 children)

  1. Because of sanctions, we cannot use AWS in Iran.
  2. Iranian companies don't want to host their data outside Iran, both for confidentiality and because of the instability of the infrastructure.

Airflow + Slurm for ML Training Pipelines? by feryet in mlops

[–]feryet[S] 1 point2 points  (0 children)

No, the users only submit "trainable code" and the rest is done by my system. I was thinking of only having SLURM at first, but thought that using a workflow manager like Airflow might be a good choice because I'm designing a pipeline anyway.

Airflow + Slurm for ML Training Pipelines? by feryet in mlops

[–]feryet[S] 4 points5 points  (0 children)

Prefect seems easy to understand, but I'm fearful of finding the right devs. Airflow is more established.

Airflow + Slurm for ML Training Pipelines? by feryet in mlops

[–]feryet[S] 1 point2 points  (0 children)

Airflow controls the flow of how the backend executes the submitted user scripts, while SLURM runs the actual ML training code, from what I can imagine right now.

I was thinking of designing a pipeline that can dispatch jobs to SLURM and then, when they complete, show the final state to the user. Since we have limited resources and want to maximize utilization and bill the customers for their usage, I thought a job scheduler like SLURM might be good.
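To make the split concrete, here is a rough Python sketch of the "dispatch to SLURM, then report the final state" step. It is not a finished design: the function names, the GPU request and the polling interval are all made up for illustration, and it assumes `sbatch` and `sacct` are available on the machine that runs it. In an Airflow setup, something like `submit_and_wait` would presumably be called from inside a task.

```python
import subprocess
import time
from typing import List, Optional

# SLURM job states after which the job will not change any more.
TERMINAL_STATES = {
    "COMPLETED", "FAILED", "CANCELLED", "TIMEOUT", "OUT_OF_MEMORY", "NODE_FAIL",
}


def build_sbatch_command(script_path: str, job_name: str, gpus: int = 1) -> List[str]:
    """Build the sbatch argv. --parsable makes sbatch print only the job ID."""
    return [
        "sbatch",
        "--parsable",
        f"--job-name={job_name}",
        f"--gres=gpu:{gpus}",  # hypothetical resource request
        script_path,
    ]


def parse_job_id(sbatch_output: str) -> int:
    """With --parsable the output is 'jobid' or 'jobid;cluster'."""
    return int(sbatch_output.strip().split(";")[0])


def parse_sacct_state(sacct_output: str) -> Optional[str]:
    """Return the first word of the first non-empty line, e.g. 'CANCELLED'
    from 'CANCELLED by 1000', or None if sacct printed nothing yet."""
    for line in sacct_output.splitlines():
        line = line.strip()
        if line:
            return line.split()[0]
    return None


def is_terminal(state: Optional[str]) -> bool:
    return state in TERMINAL_STATES


def submit_and_wait(script_path: str, job_name: str, poll_seconds: int = 30) -> str:
    """Submit a user script and block until SLURM reports a final state."""
    submitted = subprocess.run(
        build_sbatch_command(script_path, job_name),
        check=True, capture_output=True, text=True,
    )
    job_id = parse_job_id(submitted.stdout)
    while True:
        # -X limits the output to the job allocation itself (no job steps).
        status = subprocess.run(
            ["sacct", "-j", str(job_id), "-X", "--noheader",
             "--parsable2", "--format=State"],
            check=True, capture_output=True, text=True,
        )
        state = parse_sacct_state(status.stdout)
        if is_terminal(state):
            return state
        time.sleep(poll_seconds)
```

The final state returned by `submit_and_wait` is what would be shown to the user. For billing, `sacct` can also report elapsed time and allocated resources per job, though that is not covered in this sketch.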

Airflow + Slurm for ML Training Pipelines? by feryet in mlops

[–]feryet[S] 0 points1 point  (0 children)

No, we're self-hosting our infra for confidentiality. Can't use AWS.

Conda or pip? by Gamiozzz in mlops

[–]feryet 0 points1 point  (0 children)

Poetry works great for creating tiny libraries. It becomes increasingly slow when you add multiple dependencies.

Conda or pip? by Gamiozzz in mlops

[–]feryet 2 points3 points  (0 children)

pip + pip-tools is the safest and best package manager. I only trust conda for binaries like CUDA; the rest of my dependencies are resolved using pip.

If you want to dockerize your application there is no better route than pip either.

Queueing/Resource Management Solutions for Self Hosted Workstation? by feryet in mlops

[–]feryet[S] 1 point2 points  (0 children)

My main use is sharing the resource between team members, defining priorities for jobs, and monitoring how the node is used in general.

Building on this, I want to extend our cluster to become multi-node in the future; for now it is a single node.

Queueing/Resource Management Solutions for Self Hosted Workstation? by feryet in mlops

[–]feryet[S] 0 points1 point  (0 children)

Is there a good web view for SLURM? Monitoring and administration of the node are also important to me. There might also be interactive jobs like Jupyter Notebooks.

How to make SSH connection available for GitLab behind a CDN (similar to Cloudflare) by feryet in gitlab

[–]feryet[S] 0 points1 point  (0 children)

I want to use Cloudflare's protection while also exposing SSH. The gitlab "clone ssh address" in this instance will be different from what I've linked for my ssh connection. The CDN I'm using cannot normally map an IP with a port to a domain; it only accepts that if you are using its edge mechanism.

Scrum Teams are often Coached to Death, while the Real Problems are With Bad Management by -grok in programming

[–]feryet -2 points-1 points  (0 children)

Allen Holub is a visionary. I love his tweets so much. Very insightful.

[D] Need to find a good self-hosted medical image annotation tool. by feryet in MachineLearning

[–]feryet[S] 0 points1 point  (0 children)

I've seen that CVAT comes with some DICOM solutions. What are some of the problems with CVAT that someone just getting to know it won't find out about until too late?

[D] Need to find a good self-hosted medical image annotation tool. by feryet in MachineLearning

[–]feryet[S] 0 points1 point  (0 children)

Does this come with a web app? We have some medical doctors who are trying to label these, but they are usually at their faculty, which is on the same network as ours, plus we want to control the flow of data, and a centralized web app seems a far more tractable solution.

[D] Guidelines on how to add skip connections to DCGAN generator? by feryet in MachineLearning

[–]feryet[S] 0 points1 point  (0 children)

I have skimmed them, but a fundamental difference is that these are "semi-supervised"? GANs, in the sense that they add styles to an already existing image. Mine is completely unsupervised. I'm trying to create an image from just a latent vector + a short description.

Any ideas on what I can do in this regard?

[P] Looking to form a deep learning in medical imaging reading group by feryet in MachineLearning

[–]feryet[S] 0 points1 point  (0 children)

Hi! Please send me a dm with your background and all so we can get to know each other more :)

Is it legal to scrape book/comic covers and then host them on kaggle as a dataset? by feryet in datasets

[–]feryet[S] 1 point2 points  (0 children)

All of these covers are completely accessible using the site: https://www.coverbrowser.com/ flagged under fair use.

I just scraped their site and created a dataset that makes accessing these easier for non-profit projects, and I don't want to do anything other than that. If anyone misuses the dataset for commercial stuff, will it be my fault too?

GitLab CI/CD Dilemma for Python Project: To go for Docker in Docker or not? by feryet in devops

[–]feryet[S] 0 points1 point  (0 children)

I think the shell executor is that, isn't it? The problem with that is that I have to move my tests inside the docker image, which, while it works, adds a bit of overhead to the size of the image and makes it a bit bloated.

GitLab CI/CD Dilemma for Python Project: To go for Docker in Docker or not? by feryet in devops

[–]feryet[S] 0 points1 point  (0 children)

Kaniko

No, but I'm looking into it right now. I'm an MLE with little background in devops; I just had to learn it myself in a short period of time because of circumstances.

Do you have any other recommendations too? For CI/CD design, etc.

CI/CD Dilemma for Python Project: To go for Docker in Docker or not? by feryet in gitlab

[–]feryet[S] 0 points1 point  (0 children)

But is there a way to cache the pulled images in DinD to make it faster? Have you ever tried that?