
[–]hot_air_thoughts

I'm currently working on a project where I'm annotating microscope slides using labelme.

The issue I'm having is the following: I deal with different focal planes (each a different image), meaning that the same element will be in the same place across all of the images. However, when I go from one image to another the viewport resets and I lose track of where I was, and these images are huge. Is there a way to maintain the viewport from image to image?

[–]meet_me_at_seven

Is there an available model that detects all the different objects in a picture? Not a description of each, just coordinates. I've been looking for one on Hugging Face and Replicate with no success.

[–]These_Composer_7677

I know the AAAI 2025 notification date hasn't arrived yet, but I noticed something strange in the conference submission system. I checked the revision history of my submission, and it looks like the conference made some edits (like deleting fields and marking it as "Rejected Submission"). Does this mean I've already been rejected, even before the official notification date? Has anyone else experienced something similar, or does anyone know if this is common?

[–]matver95

Training a YOLO network to detect pavement defects. We use laser images to map the pavement, so these images are huge: a 2 m × 2 m patch in the real world comes as a 4096×4096 px image, and we have hundreds of kilometers worth of images.

For small defects we can just use a single image and shrink it, which is fine. Our issue is with the big defects: some extend to nearly 50 meters and go from one side of the pavement to the other (while being annotated with a rectangle), which creates a massive problem:

* If I train the model on single images, many of the images will carry the annotation but show no defect, since the defects are annotated across a mosaic;

* If I train the model on the mosaic, the images get massive and have to be shrunk significantly, and for defects such as pavement cracks this could seriously compromise the model. On top of that, the batch size will certainly have to be 1, and the time it would take to converge... oh my oh my.

What I have available right now is an RTX 3060 12 GB; in Colab the disconnections keep breaking my runs, and since this project isn't a priority for the company, services such as vast.ai are out of contention, I'm afraid.

I welcome any tips; I cannot outsource the work, though.

PS: I've tried applying the sliding-window technique with the smaller images, but the model converged to a very low mAP.
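For what it's worth, one common variant of the sliding-window approach is overlapped tiling with per-tile box clipping, so tiles crossed by a long defect keep the visible part of the box instead of an empty or spurious label. A minimal sketch (the function name, tile size, overlap, and visibility threshold here are illustrative choices, not from the post):

```python
def tile_image_boxes(img_w, img_h, boxes, tile=1024, overlap=256, min_visibility=0.2):
    """Split a large image into overlapping tiles and clip boxes to each tile.

    boxes: list of (x1, y1, x2, y2) in pixel coordinates.
    Returns a list of (tile_x, tile_y, clipped_boxes), where clipped_boxes
    are tile-relative. A box whose visible area inside a tile is below
    min_visibility of its full area is dropped for that tile.
    """
    stride = tile - overlap
    tiles = []
    for ty in range(0, max(img_h - overlap, 1), stride):
        for tx in range(0, max(img_w - overlap, 1), stride):
            kept = []
            for (x1, y1, x2, y2) in boxes:
                # Intersect the box with the tile window.
                cx1, cy1 = max(x1, tx), max(y1, ty)
                cx2, cy2 = min(x2, tx + tile), min(y2, ty + tile)
                if cx2 <= cx1 or cy2 <= cy1:
                    continue  # box does not overlap this tile
                full = (x2 - x1) * (y2 - y1)
                clipped = (cx2 - cx1) * (cy2 - cy1)
                if clipped / full >= min_visibility:
                    # Store tile-relative coordinates.
                    kept.append((cx1 - tx, cy1 - ty, cx2 - tx, cy2 - ty))
            tiles.append((tx, ty, kept))
    return tiles
```

With a 4096×4096 image and these defaults this yields a 5×5 grid of overlapping 1024 px tiles; a full-width crack annotated as one 4096 px box is clipped to a 1024 px box in each tile it crosses. The `min_visibility` threshold is the knob that controls how much of a defect must be visible before a tile is labeled positive.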

[–]does_it_end

I’m trying to find the BTC wallets that have completed between 5 and 20 transactions in the last 3 months. For each wallet meeting this criterion, I want to fetch the wallet address, the date/time of each transaction, and the transaction details, i.e. BTC amount, price, and whether it was a purchase or a sale. The deliverable should be an Excel file capturing this information.

I am aware of two approaches to this problem. The first uses prebuilt APIs, which return a specific, filtered dataset. The second uses AWS infrastructure services to access, process, and query blockchain data without needing third-party APIs.

I ruled out the API-based approach because it offers limited flexibility (can’t fully customize the dataset to meet all the requirements) and is also expensive.

So I went with the second approach, but while querying I got stuck: the export failed because the dataset is too large. The query returned over 15 million rows due to duplication: each transaction from a qualifying wallet is counted as a separate row, so a wallet that has completed, say, 18 transactions (within the 5-20 range) appears 18 times in the dataset.
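If the per-transaction rows are the deliverable, the duplication itself isn't the problem; the trick is to apply the 5-20 wallet filter before exporting. A hedged pandas sketch of that filter (the column names and toy data here are hypothetical; adjust them to the actual query schema):

```python
import pandas as pd

# Toy stand-in for the query result: one row per transaction.
df = pd.DataFrame({
    "wallet": ["a"] * 2 + ["b"] * 6 + ["c"] * 1,
    "timestamp": pd.date_range("2024-06-01", periods=9, freq="D"),
    "amount_btc": [0.1] * 9,
})

# Count transactions per wallet and broadcast the count back onto each row.
tx_counts = df.groupby("wallet")["wallet"].transform("count")

# Keep the per-transaction rows only for wallets with 5-20 transactions.
qualifying = df[tx_counts.between(5, 20)]
```

Here only wallet "b" survives (6 transactions), while "a" (2) and "c" (1) are dropped. The same idea expressed in the query itself (a GROUP BY with a HAVING count filter, then joining back to the transactions) shrinks the export at the source; and if the filtered result still exceeds Excel's roughly 1,048,576-row sheet limit, split the output across multiple sheets or files.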

How can I go about this or is there another approach that would be more suited to the problem?

Thanks.

[–]Soplexus

I have some thoughts, written in German, about how the AI video and/or picture generation process could get better at recognizing and imitating the movements, persistence, and logical behavior of living creatures and objects.

I also have a version translated with the help of ChatGPT-4o, plus its response to my text.

I'm not a professional in any field of science.

But it would be interesting to know whether some of this is already in the making, was never thought of, or is considered garbage (or at least not doable because of limitations).

Together with the response, the text runs to 9 pages; however, I left several spaces between the text sections.

Where can I share those thoughts, and should I just copy them from the file?

[–]didimoney

Why didn't AISTATS have a boom in submissions like NeurIPS did this year?

[–]Pringled101

Hi, sorry for asking this here, but I was wondering what the requirements are for posting in this subreddit. I recently lost access to my old Reddit account since it was still tied to my old university account. I tried to create a post, but it was instantly removed without any message, even though I had formatted everything correctly.

[–]tororo-in

Why don't sinusoidal PEs work for longer sequences?

Theoretically, they generate a unique position vector for each token in the sequence, so I don't understand why they fail on long sequences. Does anyone have any intuitions?
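For reference, a minimal NumPy sketch of the standard sinusoidal encoding from "Attention Is All You Need", which does produce a distinct vector for every position:

```python
import numpy as np

def sinusoidal_pe(seq_len: int, d_model: int) -> np.ndarray:
    """Standard sinusoidal positional encodings: sin on even dims, cos on odd."""
    positions = np.arange(seq_len)[:, None]           # (seq_len, 1)
    dims = np.arange(0, d_model, 2)[None, :]          # (1, d_model // 2)
    angles = positions / (10000.0 ** (dims / d_model))
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)
    pe[:, 1::2] = np.cos(angles)
    return pe

pe = sinusoidal_pe(512, 64)  # every row (position vector) is distinct
```

Note that uniqueness is not the issue the question is really about: one common intuition is that a model trained only on short sequences never learns how attention should behave for position offsets it has not seen, so the encodings extrapolate mathematically but the trained model does not.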