There are many discussions about scoping, environments and functions already. See e.g. here or here. However, I am not sure I have found a good solution to the following problem:
df <- data.frame(id=rep(LETTERS[1:2],each=2), x=1:4)
d <- -1
myfun <- function(df, d){
require(plyr)
new.dat <- ddply(df, .(id), transform, x=x*d)
return(new.dat)}
myfun(df, 1)
You can easily verify that the globally defined d=-1 was used, instead of the d=1 as provided in the argument. (If no globally defined d exists, then a object not found message is returned) The big question is now: how do I make the d argument to the function used instead of the globally defined d?
I was under the impression that the following should work:
myfun2 <- function(df, d){
here <- environment()
new.dat <- ddply(df, .(id), transform, x=x*with(here,d))
return(new.dat)}
myfun2(df, 1)
It is my understanding that with(here, d) retrieves the object d from the environment here. So, the result should be 1. An error is returned, though, saying
Error in eval(substitute(expr), data, enclos = parent.frame()) :
invalid 'envir' argument of type 'closure'
I am not sure I understand why this does not work, and I would be happy if anyone could shed some light on this, or if you could provide alternative solutions. Note that wrapping the entire ddply-statement into with(...) does not seem to help either.
A solution that does work is to attach the current environment inside the function:
myfun3 <- function(df, d){
here <- environment()
attach(here)
new.dat <- ddply(df, .(id), transform, x=x*d)
detach(here)
return(new.dat)
}
but I don't like this solution since it works by masking the globally defined d with the local d, which I think is not very elegant.
Any comments / pointers are appreciated.