The folks over at RStudio have been killing it lately. A few months ago they released their integration with knitr and RMarkdown. We use it here at tumblr for report generation and I absolutely love it.
Most recently, they released Shiny, which lets you build interactive visualizations in the browser with minimal effort. I haven’t played around with it yet but I’m really looking forward to doing so.
If you’ve built any interesting Shiny apps yet, leave a link in the comments. I’d love to see them.
A great pandas-to-R reference page from gappy.
(I’d love to see this as a github gist rather than read-only google doc)
DataGotham is a 1.5 day conference focusing on the data analysis community in New York City. The focus will be on tutorials, stories, and ideas rather than tools and “enterprise solutions.”
The call for proposals was just posted, so get over there and share your work.
I’m looking forward to this talk.
(Source: workstream-piccolbo)
R Flavored Markdown is a plain-text formatting syntax for creating documents that can be rendered to HTML. In fact it’s like HTML, but simpler. R Flavored Markdown is a variant of original Markdown with a few additional features
This is a really exciting step towards reproducible research. The markdown code below creates this output:
# Normal Distributions Functions in R
Density, distribution function, quantile function and random
generation for the normal distribution with mean equal to ‘mean’
and standard deviation equal to ‘sd’.
Use them this way:
```{r}
dnorm(x, mean = 0, sd = 1, log = FALSE)
pnorm(q, mean = 0, sd = 1, lower.tail = TRUE, log.p = FALSE)
qnorm(p, mean = 0, sd = 1, lower.tail = TRUE, log.p = FALSE)
rnorm(n, mean = 0, sd = 1)
```
The math behind the code:
$$latex
f(x) = \frac{1}{(\sigma\sqrt{2 \pi})} e^{-(\frac{(x - \mu)^2}{2 \sigma^2})}
$$
There is still room at the NY Open Statistical Meetup tomorrow to hear Jeff and others talk about creating dynamic reports in R.
This is a pretty interesting site with 90 2-minute R tutorial videos. I’ve only watched a couple but the narration is pretty high-octane as far as programming tutorial videos go.
Some examples:
A video of Hadley Wickham (author of ggplot2, plyr, reshape, stringr, lubridate, etc) talking about “tidy data” at the NYC Open Statistical Programming Meetup.
Highly recommended for anyone who works with data from multiple sources that comes in various structures.
The talk from the NYC R meetup was really great.