I'm a data engineer at Tumblr and this is my blog. I write mostly about personal projects, data science, R/Python, and various curiosities.
I’ve got a few cool things coming up:
DataKind - Sept. 7-9. I’ll be leading a team of volunteer data hackers to work with NYC governmental agencies and make sense of the data that they collect. DK will be announcing all of the details soon, but it sounds like there are a lot of interesting projects to work on over the course of the weekend. My group’s work from the last DataDive was presented at the United Nations General Assembly, and others’ work on NYC’s Stop and Frisk policy has been getting a lot of media attention lately.
DataGotham Conference - Sept. 13-14. DG is a conference celebrating the data community in NYC. There is a great list of speakers and tutorials so far, primarily by engineers, researchers, and data scientists at New York-based institutions. Follow @DataGotham to see when the schedule is announced, but it sounds like I’ll be on a really interesting panel.
Bad Data Handbook - Nov. 2012 (est). I contributed a chapter to this collection of tips, tricks, and “war stories” on working with disorganized, inconsistent, and overall messy data.