Hacker Newsnew | past | comments | ask | show | jobs | submit | mapcar's commentslogin

There are many great things about the open nature but it also leads to a very diverse experience, depending on which subreddit and thread you visit. Discussions range from asinine to profound, and casual visitors are often quick to generalize their limited evaluation to the whole site.


It's amazing to think that R (or S) had data frames since the 70s and only now are other languages implementing them. There are some quirks of course, and pandas introduced some convenient features. But the R community has also provided its own improvements in the way of data.table, and now, dplyr.


Data.table package by matt dowle definitely deserves a mention! Its fast and I like the indexing functonalities it provides. The benchmark timings are pretty impressive.


@ajinkyakale, thanks. What'd be also interesting is to benchmark memory usage in addition to runtime.


I should have mentioned you (arun_sriniv) as the co-developer of data.table! Thanks for all the hard work. And yes, memory usage will be interesting as that is the bottleneck when it comes to large dataset. I am working on something on those lines. Will post something soon :)


No worries :-). And glad to hear you're working on it! Let me know if I can be of any help.


How about PyCharm or Eclipse+PyDev (I've personally heard more praise for the former)? I use emacs and ess or python-mode so can't comment on the IDEs too much but being able to use the same platform for both has been convenient for me.


and if you overwrite variables like "c", you can always invoke the original concatenation function as "base::c".


I agree, I've read other gripes about R function documentation but it's one of the better ones for community software. Python's documentation seems focused on implementation from a programmer's perspective, but often not as helpful for actual application of the function.


I understood that at least a part of the larger programming community felt that the non-concurrent GC was limiting its future growth and shopped elsewhere for a production language.


Does lubridate solve the unintended timezone conversions of POSIXct classes?


Can you provide a bit more detail?


In the R Help Desk 2004 (http://www.r-project.org/doc/Rnews/Rnews_2004-1.pdf), Gabor Grothendieck recommends chron over POSIXct classes on account of the time zone conversions which occur when the tz attribute of the latter object is not "GMT". Will this not be a problem with lubridate? Thanks in advance.


I've never found that to be a problem in practice. Do you have an example where it's bitten you in practice?

(Also you should use UTC and not GMT)


Hi Hadley, yes for instance

> as.chron("1970-01-01")+unclass(as.chron("2001-04-01")) [1] 04/01/01

> as.POSIXct("1970-01-01","EST")+unclass(as.POSIXct("2014-06-01","EST")) [1] "2014-06-01 05:00:00 EST"

If there is any conversion necessary it is difficult to get back the original intended time.


What does adding two dates together mean?


Isn't this the conventional way of converting variables which have been coerced to their numeric representations back to time/date classes?


good point. unless it is vi or emacs...


I understand there have been studies showing people don't estimate angles as well as linear distances but people go too far out of their way to avoid pie charts when parts within whole data is shown. The 1D mosaic plot is effectively a stacked bar chart but there I suspect there is also some bias given toward the largest component such that the relative proportions of the smaller components are not well discriminated.


Stephen Few has an excellent overview of why pie charts should be avoided: http://www.perceptualedge.com/articles/visual_business_intel...

Neither pies nor mosaic charts should be used if you need exact readings of the data --- though mosaic plots do have the advantage of being rectangular, making segments much easier to label with values than the segments of pie charts.


I disagree partly. It's easier to estimate relative shares of a total from a pie chart compared to his linear examples.


I am also not so convinced. I still like a simple pie-chart.


Can't you just create an alias for

`emacs -nw -Q` or `emacsclient -nw`

when you want to open an instance in a terminal?


I could, of course, but I like it less bloated - 70 shared libraries less.)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: