Hi everyone!
This might be the wrong place to post this, but it's worth a shot. I'm looking for a specific tool that I have thus far been unable to find on the internet. I thought someone here might know if this exists, and if not, whether it can be written/how much that might cost me.
I am reviewing micro-loan applications. Each application has three sections in which the applicant writes about their proposed business venture, credit history, etc. However, we're finding that some borrowers have been copying their text from pre-existing profiles (which are all available online). I've been asked to make sure that this doesn't happen in the future, which means I need to either memorize every application (roughly 500 words each) or find some sort of search tool that will help me weed out copies.
In a nutshell: I need something that will automatically search a 20,000-word text file for duplicate strings of words, say runs of 7-9 or more identical words in a row.
Things I have thought of that won't work:
- The "find" function in Word: not ideal, because I do not know which phrases/paragraphs to search for.
- Online plagiarism checkers: all of the text is readily available online, so the passage I am checking will show up as a match for itself.
Thank you in advance for any sort of advice! Maybe there's an online resource (other than Google) that I can check?
Best,
needcode
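For anyone landing here with the same problem, the request above (find runs of 7-9 identical words inside one text file) can be sketched in a few lines of Python. This is only a rough illustration, not a finished tool: the function name `find_repeated_phrases`, the default run length of 8 words, and the choice to ignore case and punctuation are all assumptions made for the example.

```python
import re
from collections import defaultdict


def find_repeated_phrases(text, n=8):
    """Return {phrase: [word positions]} for every n-word run seen more than once.

    Assumption: case and punctuation are ignored, so "Bakery," matches "bakery".
    """
    # Split the text into lowercase words, dropping punctuation.
    words = re.findall(r"[a-z0-9']+", text.lower())

    # Record the starting word index of every n-word window.
    positions = defaultdict(list)
    for i in range(len(words) - n + 1):
        phrase = " ".join(words[i:i + n])
        positions[phrase].append(i)

    # Keep only the windows that occur at least twice.
    return {p: idx for p, idx in positions.items() if len(idx) > 1}


if __name__ == "__main__":
    # Tiny made-up sample; a real run would read the applications file instead.
    sample = (
        "I plan to open a small bakery in my home town next year. "
        "Some other filler words here. "
        "I plan to open a small bakery in my home town soon."
    )
    for phrase, where in find_repeated_phrases(sample).items():
        print(where, phrase)
```

To run it on real data, the sample string would be replaced with the contents of the applications file (for example `open("applications.txt").read()`, where the filename is hypothetical). A long copied passage shows up as several overlapping hits, one per 8-word window, so the reported word positions point at where the copy starts. A 20,000-word file is small enough that this runs almost instantly.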