From: David Rosenberg
Subject: Re: Use R to manage results from GNU Parallel
Date: Mon, 6 Jan 2014 00:26:48 -0500

Is it possible to do an automatic fall back onto, say, read.csv if
data.table or plyr is not installed?
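
A minimal sketch of such a fallback, assuming the results are in a comma-separated file with a header line. The name read_results is hypothetical and not part of the checked-in code; requireNamespace(), data.table::fread() and read.csv() are the only library calls used.

```r
# Hypothetical helper: use data.table::fread when the package is
# installed, otherwise fall back to base R's read.csv.
read_results <- function(path) {
  if (requireNamespace("data.table", quietly = TRUE)) {
    # data.table = FALSE returns a plain data.frame, so both
    # branches hand back the same kind of object.
    data.table::fread(path, data.table = FALSE)
  } else {
    utils::read.csv(path, stringsAsFactors = FALSE)
  }
}
```

Because the check happens at call time, the script needs no hard dependency on data.table.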
Why:
rownames(raw) = 1:nrow(raw)
Why not:
rownames(raw) = NULL
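
A small illustration of the two forms on a made-up data frame (the contents of raw here are only an assumption for the example). Both reset the row names to 1..n; the NULL form does not depend on nrow(raw) being at least 1, whereas 1:nrow(raw) evaluates to c(1, 0) for an empty data frame.

```r
# Made-up data: take rows 3 and 1, so the row names are "3" and "1".
raw <- data.frame(x = c(10, 20, 30))[c(3, 1), , drop = FALSE]

a <- raw
rownames(a) <- 1:nrow(a)   # explicit renumbering

b <- raw
rownames(b) <- NULL        # let R regenerate the default row names

same_names <- identical(rownames(a), rownames(b))
```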
> 2) When stdout is empty, I don't include any entries. Another possibility
> would be to include NAs, but that would take a few more lines of code.

I am not sure what the correct R approach is. The UNIX approach would be
no entries. So only if there is an R tradition of returning NAs should
you consider changing that.
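
To make the two options concrete, here is a sketch under the assumption that the stdout of each job is available as one string per job. The names stdout_by_job, to_rows and keep_empty are hypothetical and are not taken from the checked-in code.

```r
# Hypothetical input: one captured stdout string per job.
stdout_by_job <- list(job1 = "3\n4", job2 = "", job3 = "7")

to_rows <- function(out, keep_empty = FALSE) {
  rows <- lapply(names(out), function(job) {
    if (!nzchar(out[[job]])) {
      # Empty stdout: either no entry at all (UNIX style) or one NA row.
      if (!keep_empty) return(NULL)
      values <- NA_character_
    } else {
      values <- strsplit(out[[job]], "\n", fixed = TRUE)[[1]]
    }
    data.frame(job = job, value = values, stringsAsFactors = FALSE)
  })
  # Drop the jobs that produced no entry before binding the rest.
  do.call(rbind, Filter(Negate(is.null), rows))
}

no_entries <- to_rows(stdout_by_job)                   # job2 is absent
with_na    <- to_rows(stdout_by_job, keep_empty = TRUE) # job2 has value NA
```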
One of the things that convinces me is reproducible
measurements/timings. I have too many times been tricked by the common
wisdom that used to be true, but which no longer is true (Recent
example UUOC: http://oletange.blogspot.dk/2013/10/useless-use-of-cat.html).
My gut feeling is that if the data is not in disk cache, then disk I/O
will be the limiting factor, but I would love to see numbers to
(dis)prove this.
I have commented the code and checked it in:
git clone git://git.savannah.gnu.org/parallel.git
/Ole