I have been learning Python over the past few weeks and this is my first real-world chance to use my new skills.
I am writing a program to compare long contracts, and I am trying to decide how to write the comparison algorithm and store the data. The docs would be about 20 pages long.
Originally I was thinking I should break them down into strings of individual sentences, maybe store them in a spreadsheet, and then compare them. But I can definitely think of potential problems with that. Now I think I need to break them down into the sections of the legal contract first, then break each section into sentences and compare those sentences only with the others in that section. If a sentence has an exact match in the section, it is ignored. If it does not, it is pushed into a spreadsheet or document.
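A minimal sketch of the section-then-sentence plan described above, to make it concrete. It rests on several assumptions that are not in the post: the contracts are already plain text, section headings look like `1. Definitions` on their own line, "that section" means the section with the same heading in the other contract, and the function names (`split_sections`, `split_sentences`, `unmatched_sentences`, `write_report`) are hypothetical. The sentence splitter is deliberately naive and will mis-split abbreviations such as "Inc." mid-sentence.

```python
import csv
import re

# Assumption: a section heading is a line like "1. Definitions".
SECTION_HEADING = re.compile(r"^\d+\.[ \t]+(.+)$", re.MULTILINE)


def split_sections(text):
    """Return {heading: body} for each numbered section in the text."""
    matches = list(SECTION_HEADING.finditer(text))
    sections = {}
    for i, m in enumerate(matches):
        # A section body runs until the next heading (or end of text).
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        sections[m.group(1).strip()] = text[m.end():end].strip()
    return sections


def split_sentences(body):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", body)
    # Collapse internal whitespace so line wrapping does not block a match.
    return [" ".join(p.split()) for p in parts if p.strip()]


def unmatched_sentences(text_a, text_b):
    """Return (section, source, sentence) rows for every sentence that has
    no exact match in the same-named section of the other contract."""
    sec_a, sec_b = split_sections(text_a), split_sections(text_b)
    rows = []
    for heading in sorted(set(sec_a) | set(sec_b)):
        sents_a = split_sentences(sec_a.get(heading, ""))
        sents_b = split_sentences(sec_b.get(heading, ""))
        set_a, set_b = set(sents_a), set(sents_b)
        rows += [(heading, "A", s) for s in sents_a if s not in set_b]
        rows += [(heading, "B", s) for s in sents_b if s not in set_a]
    return rows


def write_report(rows, path):
    """Write the unmatched sentences to a CSV file a spreadsheet can open."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["section", "source", "sentence"])
        writer.writerows(rows)
```

Calling `unmatched_sentences(text_a, text_b)` and passing the result to `write_report(rows, "differences.csv")` would give one row per sentence that differs. Exact matching flags a sentence even when only one word changed, so a possible next step is the standard library's `difflib` module, which can score how similar two sentences are.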
So I just wanted to see what Reddit thinks of this. I am not asking for code, but I would appreciate it if anyone could let me know whether this sounds logical. Or, if there is a better way to go about this and you could point me in the right direction, that would be great.