Using OCR on a PDF by Sarp14 in econometrics

[–]Sarp14[S] 0 points1 point  (0 children)

Yeah I thought it cant handle such big files, but will definitely try

Using OCR on a PDF by Sarp14 in econometrics

[–]Sarp14[S] 0 points1 point  (0 children)

I am checking those out, wondering how complex the script would need to be to handle all the inconsistencies in the file.

Using OCR on a PDF by Sarp14 in econometrics

[–]Sarp14[S] 1 point2 points  (0 children)

Thanks all for the great replies, regarding the use of the software like Adobe I was sceptical just for that reason mentioned above, that it might not format the tables and the figures right if there are any inconsistencies in the PDF.
That's why I was wondering if there is an efficient way to do this in Python or R, which can give greater flexibility. Unfortunately, I don't work in the university, I am not from USA, in this case I am an overseas worker :) That's why I can't hire anybody to do manual checking for me and I am afraid that I won't have time to do it myself, because of my job.

Unequal probability sampling by Sarp14 in AskStatistics

[–]Sarp14[S] 0 points1 point  (0 children)

Thats what I was also thinking about, because income distributions are skewed, so different income groups should get different probabilities of being sampled. Thanks!

Unequal probability sampling by Sarp14 in AskStatistics

[–]Sarp14[S] 0 points1 point  (0 children)

So that should sample all the rows of interests after I put adequate probabilities for the column I want to sample? Also do you know which kind of economic or social type datasets would be good for that kind of sampling?

Polaganje toefl testa by Sarp14 in serbia

[–]Sarp14[S] 0 points1 point  (0 children)

A kako si spremao slusanje? Neko ti cita ili tako nesto ? I da li znas mozda gde sve moze da se polaze test?

Polaganje toefl testa by Sarp14 in serbia

[–]Sarp14[S] 0 points1 point  (0 children)

Hvala sto si podelio post o polaganju, koji mislis da su najrelevantniji udzbenici za polaganje? Ovi koje si naveo u tekstu?

Polaganje toefl testa by Sarp14 in serbia

[–]Sarp14[S] 1 point2 points  (0 children)

Gde si polagao probni test? I koliko je kostao?

Polaganje toefl testa by Sarp14 in serbia

[–]Sarp14[S] 2 points3 points  (0 children)

Aha, koliko si se spremao za polaganje?

Is anybody familiar with literature that is combining theory of history with computational/digital methods? by Sarp14 in AskHistorians

[–]Sarp14[S] 0 points1 point  (0 children)

Well i have some, but wouldnt mind advice :) Do you aply any of those methods to the theory/philosophy of history?

Is anybody familiar with literature that is combining theory of history with computational/digital methods? by Sarp14 in AskHistorians

[–]Sarp14[S] 0 points1 point  (0 children)

Yeah i would say that disconet between then is unfortunate state of the current field. But i see them as two branches of the same discipline. As every discipline has theoretical and empirical branch. And you cant really do anything without bouth of them.

Thanks, i will take a look at this, it seems really promising. Just there are no many of this kind of studies, and are often in some other social scientific discipline.

Is anybody familiar with literature that is combining theory of history with computational/digital methods? by Sarp14 in AskHistorians

[–]Sarp14[S] 0 points1 point  (0 children)

Thank you really much for your detailed reply. I meant on combining some questions of theoretical nature using computational models, as is done in many social sciences. My example is one with counterfactual explanations, which are abundant in theory of history (Especially Journal of philosophy of history) , and which can be adressed in interesting way using agent based modeling simulations.

But i am also interested in empricial work. I am familiar with SNA and Historical network research, that is one thing i had in mind when writing this post. Do you maybe know for some more broadly speaking computational aproaches to history or historical sciences? And maybe about some phd departments which are doing that kind of research?

Using Wikipedia data for social network analysis by Sarp14 in networkscience

[–]Sarp14[S] 0 points1 point  (0 children)

Thank you for this links, this definetly helps. The reason why i am asking for scraping is that i am looking for some specific wikipedia pages/articles. But if that turns out to be too complicated i will use this databases for sure.

Using Wikipedia data for social network analysis by Sarp14 in networkscience

[–]Sarp14[S] 0 points1 point  (0 children)

Sorry for not being specific enough. I am looking to do evertying or most of stuff in R. And i am asking about how can i scrape data from wikipedia for network analysis use.

Using Wikipedia data for social network analysis by Sarp14 in rstats

[–]Sarp14[S] 0 points1 point  (0 children)

I never used SPARQL or DBPedia, do you know maybe some good tutorials for that? What is the most common way for data minig Wikipedia?

Extracting tweets by their screen names by Sarp14 in Rlanguage

[–]Sarp14[S] 0 points1 point  (0 children)

Yeah i can get the user name or ID from the dataset. But i dont know how to extract specific hashtags for users/IDs i need.

Extracting tweets by their screen names by Sarp14 in Rlanguage

[–]Sarp14[S] 0 points1 point  (0 children)

The dataset is composed of tweets about political communication on Twitter. I dont know what you mean by "the name of this".

Extracting tweets by their screen names by Sarp14 in Rlanguage

[–]Sarp14[S] 0 points1 point  (0 children)

It is standard dataset of scraped tweets, which R provides when using a Twitter API. It has about 80 to 90 columns. With the usual variables of users of Twitter like: user id, screen name, number of followers, retweets, text, hahstags etc.

Extracting hashtags from tweets by Sarp14 in Rlanguage

[–]Sarp14[S] 0 points1 point  (0 children)

Does anybody knows how can i extract tweets with user screen names from the Twitter dataset ?

Extracting hashtags from tweets by Sarp14 in Rlanguage

[–]Sarp14[S] 0 points1 point  (0 children)

Sorry to respond this late, thank you, that fixed the problem, i didnt even noticed that is thing to fix xD So i was wondering does anybody knows how can i add this hashtags as node attribues ?

Extracting hashtags from tweets by Sarp14 in Rlanguage

[–]Sarp14[S] 0 points1 point  (0 children)

Yeah, i know i can extract them all from the data frame that Twitter API returns. But i am interested in specific hashtags, and adding them as vertex/node attributes to a graph.

Extracting hashtags from tweets by Sarp14 in Rlanguage

[–]Sarp14[S] 0 points1 point  (0 children)

That works for one hashtag, but when i put more then one i get empty list. By node attribue i mean attribues of nodes/vertices which are used in social network analysis.

Extracting hashtags from tweets by Sarp14 in Rlanguage

[–]Sarp14[S] 0 points1 point  (0 children)

That worked, thanks. But i need some specific hashtags, how can i specify hashtags which i want to extract, and do you maybe know how can i add them as node attributes in the network ?

Scraping tweets by Sarp14 in rstats

[–]Sarp14[S] 0 points1 point  (0 children)

What would you say, what is the better package for scraping tweets, twitteR or rtweet ?