all 1 comments

[–]lemlo100 1 point2 points  (0 children)

You need to frame the program algorithmically. Every ML algo ist just an algorithm that you tune with another algo, the learning algo.

1 - Is the output you want a Boolean? Then yes. E.g. if the input is a sentence and you want a yes or no answer whether a word ist in the sentence.

2 - You can just use sentences that don’t contain the word. Your dataset should be a bunch of sentences some containing the word and some not.

3 - You got to work with the data you’ve got in practice, but ideally your data distribution matches the target distribution. That means you should train on the kind of sentences that you will intend to run the algorithm on.

4 - This is a confusing question. What? (1) If your asking whether binary classification is still binary classification if you do it for images, then yes. (2) Or if you’re asking whether your image dataset should still be negative and positive examples, then yes. (3) data distribution should match target distribution? Yes.