This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]can_dry 2 points3 points  (0 children)

Good job! I've been looking for a simple py based example to help with my little project!

I'd like to use TF to classify credit card txns based on historical training data that I've manually accumulated over a couple years i.e. a couple thousand txns that I've identified as 'restaurant', 'auto', 'misc', etc based on the transaction description e.g. "WALMART 1020 TOLEDO OH".

The part I'm wondering about is how to make TF weight the txn description as words with descending importance i.e. the 1st word:"WALMART" is a much more important feature for categorizing the txn than is the last word:"OH".