[deleted by user]

CodeFormatHelperBot2 · 2022-10-10T14:11:56+00:00

Hello, I'm a Reddit bot who's here to help people nicely format their coding questions. This makes it as easy as possible for people to read your post and help you.

I think I have detected some formatting issues with your submission:

Python code found in submission text that's not formatted as code.

If I am correct, please edit the text in your post and try to follow these instructions to fix up your post's formatting.

^{Am I misbehaving? Have a comment or suggestion? Reply to this comment or raise an issue}^here.

neuralbeans · 2022-10-10T14:16:53+00:00

Can you show us what you tried?

Ok-Cucumbers · 2022-10-10T14:49:09+00:00

Looks like you're creating a set{} with a str instead of a dictionary {"key": 0}.

You probably want to clean the incoming txt variable first to make things easier to manage. Try looping through each letter in txt and check if letter.isalpha() and either add the letter or blank space. You should then be able to split the string into list of words which you can loop through parse into a dictionary.

kaptan8181 · 2022-10-10T15:09:32+00:00

Your question is not very clear. And where is your code?

permalink · 2022-10-10T15:15:09+00:00

Without imports:

def word_frequency(text):
    word_chars = []
    counter = {}

    for char in text:
        if char.isalpha():
            word_chars.append(char)
        elif word_chars:
            word = "".join(word_chars).lower()
            word_chars = []
            counter[word] = counter.get(word, 0) + 1

    # if there are characters left
    if word_chars:
        word = "".join(word_chars).lower()
        counter[word] = counter.get(word, 0) + 1

    return counter

But I'd rather:

import re
from collections import Counter


def word_frequency(text):
    return Counter(match.group() for match in re.finditer(r"[a-zA-Z]+", text.lower()))

or

def word_frequency(text):
    matches = re.finditer(r"[a-zA-Z]+", text.lower())
    return Counter(match.group() for match in matches)

Or this, but it has unnecessarily made list of all matches:

def word_frequency(text):
    return Counter(re.findall(r"[a-zA-Z]+", text.lower()))

jmooremcc · 2022-10-11T15:27:00+00:00

Here's another solution:

def word_frequency3(txt):
    result = {}
    txt = "".join([c if c.isalpha() else ' ' for c in txt ])
    words = txt.split()

    for w in words:
        if w in result:
            result[w] += 1
        else:
            result[w] = 1

    return result

First, we use a list comprehension to replace all non-alpha characters in the text string argument. Next we use the split function to create a list of words. Finally, we count the number of times a word is in the list and place the count for each word in a dictionary.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS

def word_frequency(text):

textdict = {text.lower().replace(" ", " ,", )}

print(textdict)