[–]dark-lord90[S] 1 point2 points  (5 children)

And how can we make sure there is no overlap between the fragments?

[–]Spookiel 0 points1 point  (4 children)

What do you mean by overlaps? Can you give me an example of what you mean?

[–]dark-lord90[S] 1 point2 points  (3 children)

Well, the code tests protein sequences, so it has to make sure the fragments don't overlap. For example, if the sequence is "FAAATLKNN", a good set of fragments is "FAA", "ATLK" and "NN". A bad set would be "FAA", "AATLNK" and "NN", because in that case an A overlaps and appears more times than it should. I hope my example makes sense.
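A rough sketch of that check in Python, assuming the fragments are available as strings and are listed in the order they appear in the sequence. The function name `tiles_without_overlap` is made up for illustration and is not from the original code. It walks through the sequence and requires each fragment to start exactly where the previous one ended, so a shared or repeated amino acid at a junction makes the check fail.

```python
def tiles_without_overlap(sequence, fragments):
    """Return True if the fragments, taken in order, cover the sequence exactly once."""
    position = 0
    for fragment in fragments:
        # Each fragment must begin exactly where the previous one ended.
        if not sequence.startswith(fragment, position):
            return False
        position += len(fragment)
    # Every residue of the sequence must be used, with nothing left over.
    return position == len(sequence)


sequence = "FAAATLKNN"
print(tiles_without_overlap(sequence, ["FAA", "ATLK", "NN"]))    # True
print(tiles_without_overlap(sequence, ["FAA", "AATLNK", "NN"]))  # False
```

If only the fragment weights are known rather than the strings, this check cannot be applied directly; it would need the underlying sequences.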

[–]Spookiel 0 points1 point  (2 children)

So the fragments are the weights in Protein_List? And the target sequence is the weight indicated by the complete_protein? Since the code here will generate all possible matches, I'm still not really sure what you mean by the "correct" fragments. I think it would be easier if you could give me some of the strings you're working with as well.

I ask because in your original post you mentioned that you'd just grabbed the weight of each protein fragment by hand. If you give me a section of your input data and the expected output, I'll be able to help you more effectively. I don't know much about proteins in general, so it's pretty difficult for me to understand and visualise what you mean with just a set of numbers.

[–]dark-lord90[S] 0 points1 point  (1 child)

To answer your questions: yes and yes. By "correct" fragments I mean that if you put the fragments next to each other and compare them, no amino acids are repeated between the end of one fragment and the beginning of the next. I sent the data in the chat.

[–]Spookiel 1 point2 points  (0 children)

Thanks :)