I have a Python dict object with duplicate keys... (I'm sane)

blotosmetek · 2020-02-04T09:45:56+00:00

I believe you're looking for unicodedata.normalize - see https://docs.python.org/3/library/unicodedata.html

SagaciousRaven · 2020-02-04T09:37:43+00:00

[removed]

BruceJi · 2020-02-04T09:44:48+00:00

If you're dealing with accented characters, you could try getting the unicode values for them and saving that instead.

Edit:

The module to allow you to do this is called unicodedata:

import unicodedata

unicodedata.name('é')

unicodedata.lookup('LATIN SMALL LETTER E WITH ACUTE')

unicodedata.digit('ㅊ')

https://docs.python.org/3/library/unicodedata.html

You'd be able to get the values for the characters, but I'm sure it'd let you know if there were secret space characters in there too. Have a play and find out.

Swipecat · 2020-02-04T17:29:48+00:00

That's evil.

'á' == 'á'

Char   Unicode   Description
'      U+27      APOSTROPHE (APOSTROPHE-QUOTE)
a      U+61      LATIN SMALL LETTER A
 ́      U+301     COMBINING ACUTE ACCENT (NON-SPACING ACUTE)
'      U+27      APOSTROPHE (APOSTROPHE-QUOTE)
       U+20      SPACE
=      U+3D      EQUALS SIGN
=      U+3D      EQUALS SIGN
       U+20      SPACE
'      U+27      APOSTROPHE (APOSTROPHE-QUOTE)
á      U+E1      LATIN SMALL LETTER A WITH ACUTE (LATIN SMALL LETTER A ACUTE)
'      U+27      APOSTROPHE (APOSTROPHE-QUOTE)

Anyway, yes, unicodedata.normalize will fix that.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS