i am a principal research scientist at microsoft research in new york city, where my work in the area of computational social science involves applications of statistics and machine learning to large-scale social data. i am also an adjunct assistant professor in columbia university's applied math department and i run the microsoft research data science summer school to promote diversity in computer science. i was previously a member of the social dynamics group at yahoo! research. i received my ph.d. from columbia university's physics department. see my curriculum vitae for more information.
this site serves several purposes, from presenting and organizing my current research and teaching efforts to publishing code and tips that i hope others will find useful.
i bookmark lots of references on pinboard, occasionally tweet things, post random tidbits on tumblr, and share photos on flickr.
my latest geek tips, also available on twitter, tumblr, or as plain text:
20.10.08.19. rstats: knit a regular R script to html with knitr::stitch("script.R") http://bit.ly/2IxcT1N
20.10.02.19. rstats: use ggstatplot for easy statistical plots and reporting http://bit.ly/2mXfxGk
20.09.23.19. latex: watermark along a left/right margin http://bit.ly/2m8UnUW
20.09.20.19. shell: split a multi-page pdf into separate jpgs convert x.pdf x-%04d.jpg http://bit.ly/2V4YDCe
20.08.27.19. rstats: set the default level of a factor using fct_relevel http://bit.ly/324o35p