And since we are talking about this in relation to anti-virus topics, we will call the meaningful text “clean” and the rubbish “malicious”. Our task is to develop a machine learning algorithm that would distinguish one from the other. The text that real people write looks like this: Objective: to distinguish meaningful text from rubbish The complexity of this algorithm is toy - but it allows us to draw the most real conclusions.
And the algorithm will be described as real, falling under the definition of machine learning. And since technology, it’s not difficult to explain all this in human language, which is what we are going to do now. As usual, there is no magic here, all are the same technology.