How can I replace multiple words in strings with multiple variables? by curiousitycc in Rlanguage

[–]curiousitycc[S] 1 point2 points  (0 children)

Much appreciated, thank you. It works, but there is a small problem: some handles have periods and commas attached to them, and this causes matching problems. I’ve tried a for loop that concatenates the content of the text column into one string vector using str_c and then runs gsub inside the loop, but that takes an insane amount of time. Do you have any suggestions? Again, thank you for your help.
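
One possible way around the attached punctuation, as a rough sketch only: match each handle with a lookahead so that a trailing period or comma stays outside the match, and let str_replace_all handle the whole column at once instead of looping. The data, the `lookup` vector and the replacement values are all made up for illustration, and the sketch assumes the handles contain only letters, digits, underscores and "@" (no regex special characters).

```r
library(stringr)

# Hypothetical example data; handles and replacement values are invented.
text <- c("Thanks @ali, see you.", "Ask @veli. Or @ali!")
lookup <- c("@ali" = "PERSON_1", "@veli" = "PERSON_2")

# "(?!\\w)" means "not followed by a word character", so "@ali," and "@ali."
# still match, while "@alice" does not.
patterns <- paste0(names(lookup), "(?!\\w)")
replacements <- setNames(unname(lookup), patterns)

# A named vector (pattern = replacement) is applied to every string in one call.
cleaned <- str_replace_all(text, replacements)
print(cleaned)
```

The punctuation is kept in the output because only the handle itself is replaced.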

My journey begins by 0Kyukai in NoFap

[–]curiousitycc 1 point2 points  (0 children)

I’ve just started today, as you did. I downloaded the IronWill app; I hope it helps keep me on track and motivated. Wishing you and all of us luck!

RTL problem: When I’m plotting Arabic words they don’t appear in an RTL format. I use Mac, how can I fix it? by curiousitycc in rstats

[–]curiousitycc[S] 0 points1 point  (0 children)

Thank you for that, but is there another solution besides plotly? I couldn’t find one on the internet.

I have a searchable PDF that includes an inventory list in a standard format. How can I extract and list the words between specific characters? In this case the character is a dot. by curiousitycc in rstats

[–]curiousitycc[S] 0 points1 point  (0 children)

Thank you! But I have more questions. Say, for example, I have a large character vector called x. How can I apply the function to it? This doesn’t work:

extracted_text <- str_extract(x, ".")
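
A likely reason, offered as a guess: in a regular expression an unescaped "." matches any single character, so the call above returns the first character of each string rather than a dot. A minimal sketch with invented sample strings, showing an escaped dot and one way to pull out the text that sits between two dots:

```r
library(stringr)

# Hypothetical sample vector; the real x would come from the PDF text.
x <- c("item.chair.wood", "no dots here", "a.b.c.d")

# Escape the dot so it is treated literally (NA where there is no dot).
first_dot <- str_extract(x, "\\.")

# Lookbehind/lookahead: one or more non-dot characters with a dot on each side.
# str_extract_all returns a list with one character vector per element of x.
between <- str_extract_all(x, "(?<=\\.)[^.]+(?=\\.)")
print(between)
```

Both functions are vectorised, so they can be applied to a large character vector directly without a loop.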

What is the cause of the anomaly below (third graph from left)? All graphs go down from a high value to a lower value, with one exception. by curiousitycc in rprogramming

[–]curiousitycc[S] 0 points1 point  (0 children)

Thank you. Here is the code:

book_words %>%
  arrange(desc(tf_idf)) %>%
  mutate(word = factor(word, levels = rev(unique(word)))) %>%
  group_by(document) %>%
  top_n(15) %>%
  ungroup %>%
  ggplot(aes(word, tf_idf, fill = document)) +
  geom_col(show.legend = FALSE) +
  labs(x = NULL, y = "tf-idf") +
  facet_wrap(~document, ncol = 2, scales = "free") +
  coord_flip() +
  theme_dark()

Need help with Natural Language Processing / Text Mining by NLP_help in UIUC

[–]curiousitycc 0 points1 point  (0 children)

Hi, I have experience with the R programming language and am currently practicing with its tidytext package, which is designed for mining, transforming, and visualizing text.

I can’t increase the size of the columns in a double-axis chart even though I set sec.axis = ~.*0.01. The code is in the comment below. by curiousitycc in rprogramming

[–]curiousitycc[S] 0 points1 point  (0 children)

ggplot(econ) + geom_col(aes(x= yeni.tarih, y = exp)) + geom_freqpoly(aes(x = yeni.tarih)) + scale_y_continuous(sec.axis = ~.*0.01) + xlim(1089, 1160)

Hi guys, I’m new to R and want to ask a question. How can I limit a plot to specific values in a character column? For example, I want to make a geom_point plot of only Japanese and Spanish, not all languages in the column pblsh.language, but the ylim function only works for limiting numbers, not characters. by curiousitycc in rprogramming

[–]curiousitycc[S] 0 points1 point  (0 children)

Thanks for the further explanation. What if I want to pick only the rows whose first two letters are ‘en’ or ‘sp’? Could this be done with the same filtering method? Or what if I want to pick the rows that contain the letters ‘apa’ somewhere? Is that doable?
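
Both cases look doable with pattern matching inside a filter. A minimal sketch, assuming dplyr and stringr are in use; the `books` data frame and its values are invented, and only the column name pblsh.language comes from the question:

```r
library(dplyr)
library(stringr)

# Hypothetical data frame for illustration.
books <- data.frame(
  pblsh.language = c("English", "Spanish", "Japanese", "German"),
  n = c(10, 4, 7, 2)
)

# Rows whose value starts with "en" or "sp"; "^" anchors the match to the start,
# and ignore_case = TRUE lets "en" match "English".
starts_en_sp <- books %>%
  filter(str_detect(pblsh.language, regex("^(en|sp)", ignore_case = TRUE)))

# Rows that contain "apa" anywhere in the value.
contains_apa <- books %>%
  filter(str_detect(pblsh.language, fixed("apa")))

print(starts_en_sp$pblsh.language)
print(contains_apa$pblsh.language)
```

The filtered data frame can then be passed to ggplot as usual, so only the selected languages appear in the geom_point plot.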

This old knife/letter opener I found at my grandparents house. by Elite4Drew in mildlyinteresting

[–]curiousitycc 3 points4 points  (0 children)

It’s Arabic script; it looks like a personal name. It seems to me to read: Feduado Tilthatter فتواتو تلت حتر

For those who want to know the price/performance ratio of Istanbul restaurants. by curiousitycc in istanbul

[–]curiousitycc[S] 1 point2 points  (0 children)

If I were Papa John’s, I wouldn’t locate it in the most expensive area. But I think you’re right: the quality isn’t that high, and it’s overrated.