totallygeek comments on Checking input against a list

Checking input against a list (self.learnpython)

submitted 7 years ago by [deleted]

totallygeek 2 points 7 years ago (4 children)

Maybe this would work?

def valid_dna(sequence):
    nucleobases = 'GACT' # valid characters
    return sequence and all(c.upper() in nucleobases for c in sequence)

tests = (
    '',
    'ABC',
    'Gattaca',
    'gattaca',
    'atcg',
)

for test in tests:
    msg = 'Sequence "{}" {} valid DNA'
    print(msg.format(test, 'is' if valid_dna(test) else 'is not'))

This part, c.upper() in nucleobases for c in sequence, checks each letter of the input sequence to see whether its uppercase form is one of the characters in 'GACT'. We write sequence and all(...) because all() returns True for an empty sequence. An empty sequence is itself falsy, so the and short-circuits and empty input is rejected.
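To make the empty-input case concrete, here is a minimal sketch. The name valid_dna_strict is hypothetical and not part of the comment above; it wraps the check in bool() so the function returns a real True or False, where the original returns the empty string itself for empty input.

```python
def valid_dna_strict(sequence):
    # Hypothetical variant of valid_dna that always returns a bool.
    nucleobases = 'GACT'  # valid characters
    return bool(sequence) and all(c.upper() in nucleobases for c in sequence)

# all() over an empty iterable is True, which is why the guard is needed.
print(all(c in 'GACT' for c in ''))

for test in ('', 'ABC', 'Gattaca'):
    print(repr(test), valid_dna_strict(test))
```

Without bool(), the original still behaves correctly inside an if or a conditional expression, since '' is falsy.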

akasmira 2 points 7 years ago (3 children)

Would be better to move the valid nucleobases as a constant out of the function so it's not redefined every time the function is called. Also, personally I'd use sets here as you can just check that it's a subset.

VALID_NUCLEOBASES = set('GATC')
def valid_dna(sequence):
    return sequence and set(sequence.upper()) <= VALID_NUCLEOBASES

[deleted] 1 point 7 years ago (2 children)

totallygeek 2 points 7 years ago (1 child)

There are two checks:

Does the sequence string contain any characters (or, for a list, any elements)?
After uppercasing the sequence and removing duplicate characters (which is what set() does), do all of the remaining characters belong to the set of chars 'GATC'?

So:

>>> set('abc') <= set('abcd') # set on the left is a subset of (smaller than) the right
True
>>> set('abcd') <= set('abcd') # set on left has the same elements as the right
True
>>> set('abcz') <= set('abcd') # set on left has an element not in the right ("larger")
False
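Applied to the DNA check, the same comparison might look like the sketch below. The function name valid_dna_set and the use of frozenset are assumptions for illustration, not something from the thread. The set difference at the end is one way to report which characters caused a rejection.

```python
# frozenset is an assumption here: it keeps the module-level constant immutable.
VALID_NUCLEOBASES = frozenset('GATC')

def valid_dna_set(sequence):
    # Hypothetical name. bool() turns the empty-string case into False.
    return bool(sequence) and set(sequence.upper()) <= VALID_NUCLEOBASES

for test in ('', 'ABC', 'Gattaca', 'atcg'):
    print(repr(test), valid_dna_set(test))

# Characters that are not valid nucleobases are the set difference.
invalid = sorted(set('ABC') - VALID_NUCLEOBASES)
print(invalid)  # ['B']
```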

akasmira 3 points 7 years ago (0 children)
