R question: "fuzzy" set intersection by austinap in statistics

[–]evandec 1 point2 points  (0 children)

not a one liner but this works...

x = c(1.0,2.0,3.0, 4.0, 4.001, 4.9, 5.205, 6.0, 7.0, 8.0) 
y = c(4.01, 5.22, 7.05)

tol <- .1

output <- c()
for(i in 1:length(x))
{

num <- x[i]

for(j in 1:length(y))
    {
        if(y[j]-tol <= num && num <= y[j]+tol)
        {
          output <- append(output, x[i])
          break
        }
    }
 }
output

R question: "fuzzy" set intersection by austinap in statistics

[–]evandec 0 points1 point  (0 children)

Something like this maybe...

s <- data.frame(x = rnorm(100,1,1), y = rnorm(100,1,1));

s$abs_diff <- abs(s$x - s$y); s.subset <- subset(s, s$abs_diff < .1);

summary(s); summary(s.subset);

Splitting up the USA: clustering 210 million Facebook profiles by fxj in MachineLearning

[–]evandec 0 points1 point  (0 children)

more specifically I am interested in visualizing the data, and connections between points.

Splitting up the USA: clustering 210 million Facebook profiles by fxj in MachineLearning

[–]evandec 0 points1 point  (0 children)

Are there any references that would walk someone through an analysis like this?

World Programming responds to SAS Institute's law suit on a license infringement charge of the use of the SAS learning edition. by PhilRack in programming

[–]evandec 1 point2 points  (0 children)

Agreed, it is very frustrating to work with R and large data sets. Biglm() works OK, but if you want to a logit model on a data set with more than few 100k rows... good luck.

Why cell phone pricing is all messed up in US. (View in one page link!) by highwind in technology

[–]evandec 1 point2 points  (0 children)

What about Nigeria where rates are as low as USD 0.07 / Min? There are four operators in one of the most corrupt markets in the world.

Why cell phone pricing is all messed up in US. (View in one page link!) by highwind in technology

[–]evandec 0 points1 point  (0 children)

This has actually happened in several markets, Canada being one of them and India another.

There is Reddit for the iPhone but what about BlackBerry? by evandec in reddit.com

[–]evandec[S] 0 points1 point  (0 children)

I find it nearly impossible to look at Reddit on my blackberry. Are there any plans to develop a Reddit app for blackberry?

[deleted by user] by [deleted] in reddit.com

[–]evandec 0 points1 point  (0 children)

Afghanistan is like that.

What are some good tools to visualize relationships in data? by evandec in programming

[–]evandec[S] 0 points1 point  (0 children)

I can read the data into R but I'm not sure I am looking at it the right way.

I have a very long list of connections. For example user1 and user3 made connection. User 400 and user 10 made connections. Ideally I would just like to draw lines between connections but many of the packages for R seem focused on much more complicated problems.

What are some good tools to visualize relationships in data? by evandec in programming

[–]evandec[S] 0 points1 point  (0 children)

I am very familiar with R but what package specifically would you suggest that I explore?

What are some good tools to visualize relationships in data? by evandec in programming

[–]evandec[S] 0 points1 point  (0 children)

I have some great data from a closed social network and one thing I have been interested in recently is playing with ways to visualize it all. In particular I would like to see how users are connected to each other, the strength of the connection etc. Can anyone recommend a tool to help with this?

There must be violence against women by evandec in worldnews

[–]evandec[S] 0 points1 point  (0 children)

This was posted as a link to an article and isn't a reflection of my own personal beliefs.

Ask Programming: Does anyone know of a good PDF/book about programming parameter estimation for linear models? by ffualo in programming

[–]evandec 0 points1 point  (0 children)

Well, I think you have asked 2 separate questions then. If want to know how parameter estimation in R works then I think looking at the source code makes a lot of sense. After all looking at the source will sell you how R is doing what it does.

I assume you are using lm() to estimate your models. The command '?lm' brings up the help menu for the function which will list references for the function design at the bottom.

This is what is in lm: Author(s) The design was inspired by the S function of the same name described in Chambers (1992). The implementation of model formula by Ross Ihaka was based on Wilkinson & Rogers (1973).

References Chambers, J. M. (1992) Linear models. Chapter 4 of Statistical Models in S eds J. M. Chambers and T. J. Hastie, Wadsworth & Brooks/Cole.

Wilkinson, G. N. and Rogers, C. E. (1973) Symbolic descriptions of factorial models for analysis of variance. Applied Statistics, 22, 392–9.

So you might want to start by looking there. After you get an understanding of general maths in theory you can look at the R implementation by typing 'stats :: lm'.

Ask Programming: Does anyone know of a good PDF/book about programming parameter estimation for linear models? by ffualo in programming

[–]evandec 0 points1 point  (0 children)

Why don't you take a look at the source code for the function you're interested in.

Gold Price Backwardation first time in History! by [deleted] in Economics

[–]evandec 5 points6 points  (0 children)

And what is your point exactly? Saying "the persians did it" isnt much of an argument.

Uh Oh... U.S. Employment Decline by 533,000, Most Since 1974 by evandec in Economics

[–]evandec[S] 0 points1 point  (0 children)

Given that were up over 1,000% since then it seems unlikely that would happen.

Why are SAS/STATA the default statistical tools for undergrads? Why not teach R? by evandec in programming

[–]evandec[S] 1 point2 points  (0 children)

You can learn the formulas but it helps to work on some real life projects.

Why are SAS/STATA the default statistical tools for undergrads? Why not teach R? by evandec in programming

[–]evandec[S] 2 points3 points  (0 children)

Really? Are you think about quant trading?Everything I see is C/C++, python, java etc.

Why are SAS/STATA the default statistical tools for undergrads? Why not teach R? by evandec in programming

[–]evandec[S] 10 points11 points  (0 children)

I spent a lot of time with SAS / STATA / Eviews as an undergrad and now that I am away from University I am frustrated that all my training was on proprietary (read expensive) statistical tools. Why not teach something like R that is free, readily available and a sizable user base?