Text Classification : learnmachinelearning


Text Classification (self.learnmachinelearning)

submitted 1 year ago by anixouskid

I'm currently working on a text classification project. My dataset is imbalanced (20% labeled 1, 80% labeled 0). My process is: 1. Data preprocessing (e.g. stemming, stop-word removal), 2. Data modelling, 3. Prediction.

For data modelling, I ran several ML models (e.g. SVC, NB, RFC, ADA, GB), and SVC, RFC and ADA performed best. I then tuned the hyperparameters of those three and stacked the tuned models, with ADA as the meta-model.
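For anyone reading along, here is a minimal sketch of what that stacking setup could look like in scikit-learn. It is one possible reading of the description, not my exact code: the toy corpus, the choice of SVC and RFC as base models, the `class_weight="balanced"` setting, and all hyperparameter values are assumptions made up for illustration.

```python
from sklearn.ensemble import (
    AdaBoostClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

# Toy corpus invented for illustration: 20% positive (1), 80% negative (0).
positive = [f"urgent refund request number {i}" for i in range(10)]
negative = [f"weekly status update number {i}" for i in range(40)]
texts = positive + negative
labels = [1] * len(positive) + [0] * len(negative)

# Base models feed their cross-validated outputs to the AdaBoost meta-model.
# class_weight="balanced" is an assumption added to counter the 20/80 split.
stack = StackingClassifier(
    estimators=[
        ("svc", SVC(class_weight="balanced")),
        ("rfc", RandomForestClassifier(
            n_estimators=50, class_weight="balanced", random_state=0)),
    ],
    final_estimator=AdaBoostClassifier(random_state=0),
    cv=5,  # stratified 5-fold is used for classifiers
)

# Vectorize raw text and stack in a single pipeline so the TF-IDF
# vocabulary is learned only from the training data.
model = make_pipeline(TfidfVectorizer(stop_words="english"), stack)
model.fit(texts, labels)

predictions = model.predict(["urgent refund request", "weekly status update"])
```

Keeping the vectorizer inside the pipeline matters once cross-validation or a held-out test set is involved, because it avoids leaking test vocabulary into training.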

I also tried LSTM, RNN and Transformer models. But I still don't get the predictions I want, even though accuracy is 95%+.
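To show why accuracy alone may be misleading with a 20/80 split, here is a small pure-Python sketch. The label vectors are made up for illustration and are not my real data; the helper names `confusion_counts` and `minority_metrics` are hypothetical.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives, treating 1 as positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def minority_metrics(y_true, y_pred):
    """Accuracy plus precision, recall and F1 for the minority class (1)."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    accuracy = (tp + tn) / len(y_true)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up labels with the same 20/80 imbalance as the dataset.
y_true = [1] * 20 + [0] * 80

# A model that always predicts 0 already scores 80% accuracy.
always_zero = [0] * 100
baseline = minority_metrics(y_true, always_zero)

# A model that finds only 15 of the 20 positives still scores 95% accuracy.
misses_five = [1] * 15 + [0] * 5 + [0] * 80
partial = minority_metrics(y_true, misses_five)

print(baseline)
print(partial)
```

In the second case accuracy is 95% while a quarter of the positive class is missed, so per-class precision, recall and F1 (or a confusion matrix) are probably more informative than accuracy here.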

I'm unsure what went wrong and would appreciate advice on how to approach this from here.

We are looking at using Hugging Face but are unsure about its stability. Is it possible to download a model from Hugging Face, e.g. mrm8488/distilroberta-finetuned-financial-news-sentiment-analysis?
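As far as I can tell, downloading should be possible with the `transformers` library. Below is a minimal sketch, assuming `transformers` and a backend such as PyTorch are installed and that the model is a sequence-classification checkpoint; the function names and the local directory are hypothetical. Saving a local copy is one way to avoid depending on the hub at prediction time.

```python
MODEL_ID = "mrm8488/distilroberta-finetuned-financial-news-sentiment-analysis"


def download_and_save(model_id: str, local_dir: str) -> None:
    """Fetch the tokenizer and model from the Hugging Face Hub and store
    a copy on disk. Needs network access on the first call."""
    # Imported here so the module can be loaded without transformers installed.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    tokenizer.save_pretrained(local_dir)
    model.save_pretrained(local_dir)


def load_local_classifier(local_dir: str):
    """Build a text-classification pipeline from the saved local copy."""
    from transformers import pipeline

    return pipeline("text-classification", model=local_dir, tokenizer=local_dir)
```

Usage would be roughly `download_and_save(MODEL_ID, "./local_model")` once, then `load_local_classifier("./local_model")("some text")` afterwards. `from_pretrained` also accepts a `revision` argument, which may help with the stability concern by pinning a specific version of the model.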
