Founded by Josh Reich and Drew Conway, the New York Open Statistical Programming Meetup started as the New York R Meetup with a handful of people in an office at Union Square Ventures. Since then it has grown to over 8,000 members and has been hosted at NYU, Columbia, AOL, iHeartRadio, eBay, Work-Bench and other locations.

Our mission is to spread knowledge of statistical programming techniques in open-source languages such as R, Python, Julia and Go, and data science in general. Another important aspect is community building and socializing. The meetups start with pizza, followed by a 45-90 minute talk, ending with a trip to the local bar.

Attending

To attend please visit the meetup page.

Presentations and Videos

Whenever possible we make presentations available at the Presentations page.

We now stream and host videos of meetups on Facebook and YouTube and older videos are scattered on a variety of services. They are also listed on the Presentations page.

Jobs

Job openings and other announcements are on the meetup discussion board.

Upcoming Meetup

Life in the Fast Lane: data.table Intro and Best Practices

September 17, 2018 06:30:00 PM

As a counterpoint to JD Long's dplyr talk (https://www.meetup.com/nyhackr/events/250182828/) Bill Gold will show the same functionality in data.table.

Thank you to Google for hosting us this month.

To learn more about the meetup and see videos of past presentations visit nyhackr.org.

About the Talk:

data.table, an extension of R's data.frame, is a popular package that enables the user to focus more on finding new insights and less on managing data and also provides fast data processing capabilities.

Reasons to consider using data.table include: it’s fast, really fast; it works great with large datasets; it offers concise and easy to read code.

We will show fast manipulation of 100GB+ of data and share best practices and sample code. For some people data.table syntax may at first appear less than intuitive so our goal is to reduce data.table’s learning curve by connecting data.table syntax with SQL and dplyr equivalents.

About Bill:

Bill Gold’s career intersects data science, management consulting and technology. He has delivered $500MM+ in ROI to financial services and healthcare clients. Across the customer lifecycle he’s developed and deployed: hundreds of models (some patented), commercial software and data science infrastructure with trillions of transactions and hundreds of terabytes of data, all servicing thousands of data scientists.

Bill guest lectures as Columbia and NYU. He has a BS in Electrical Engineering from Hofstra University.

Pizza (nyhackr.org/pizzapoll.html) begins at 6:30, the talk starts at 7, then after we head to the local bar.

Website

The nyhackr website was built as a RMarkdown website and the source code can accessed by the community on GitHub.

How to contribute

If you wish to contribute to the website the process is pretty simple.

  1. Fork and clone the repository (an example can be found here)
  2. Create a new branch for your changes (warning, this step cannot be done in RStudio!)
  3. Make your changes. You can build and view your local version by using rmarkdown::render_site()
  4. When you are done, submit a pull request. Your changes might not appear on the public site right away as we have a development version for making sure changes don’t break the site.