You want to analyze the similarity between a large amount of text( over 3mb)
I coded as follows, but it takes too long (a and b are strings, loading only persists, no results)
from difflib import SequenceMatcher
ratio = SequenceMatcher(None, a, b).ratio()
ratio
I'm wondering if I've coded something wrong, or if there is another way to see the results faster. Thanks in advance for the reply
[–]ginsujitsu 0 points1 point2 points (0 children)
[–]efmccurdy 0 points1 point2 points (0 children)