i am a senior research scientist at microsoft research in new york city, where my work in the area of computational social science involves applications of statistics and machine learning to large-scale social data. i was previously a member of the social dynamics group at yahoo! research. i received my ph.d. from columbia university's physics department and am an adjunct professor in the university's applied math department. please see my resume for more project and background information.
this site serves several purposes, from presenting and organizing my current research and teaching efforts to publishing code and tips that i hope others will find useful.
i bookmark lots of references on pinboard, occasionally tweet things, post random tidbits on tumblr, and share photos on flickr.
my latest geek tips, also available on twitter, tumblr, or as plain text:
20.02.28.18. rstats: quick hack for per-group summaries df %>% group_by(x) %>% do(tidy(.$y)) (h/t @stefanmherzog)
20.02.27.18. rstats: use na.locf from the zoo package to carry forward observations http://bit.ly/2CnwcsI
20.01.04.18. rstats: try geom_quasirandom or geom_beeswarm for column / violin scatter plots http://bit.ly/2lUsJIr
20.01.04.18. rstats: try theme_ipsum for ggplot http://bit.ly/2CredxX
20.01.04.18. rstats: use geom_lollipop and geom_dumbbell for easy dot plots http://bit.ly/2qmNNfp