Books
Data Science at the Command Line
http://datascienceatthecommandline.com/
Data Science Challenges
http://inverseprobability.com/2016/07/01/data-science-challenges
Foundations of Data Science
- intro: by John Hopcroft and Ravindran Kannan
- book: https://www.cs.cornell.edu/jeh/book2016June9.pdf
Tutorials
Intelligent Data Analysis
Scaling Data Science in Python Tutorial
- github: https://github.com/chdoig/dss-scaling-tutorial
- slides: https://speakerdeck.com/chdoig/scaling-ds-in-python
HackerMath: Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker’s Way
- intro: This is the repository for the full day workshop conducted at Fifth Elephant 2016 (https://fifthelephant.in/2016/)
- github: https://github.com/amitkaps/hackermath
Python Data Science Tutorials: common data analysis and machine learning tasks using Python
Getting Started with Data Science @ MSU Data Science
Helping our new Data Scientists start in Python: A guide to learning by doing
Courses
General Assembly’s Data Science course in Washington, DC
- course page: https://generalassemb.ly/education/data-science/washington-dc/
- github: https://github.com/justmarkham/DAT8
140.711/2 Advanced Data Science I/II
- homepage: http://jtleek.com/advdatasci/
Data Science: From data to knowledge (September 12 to 14, 2016)
http://data-science-ce-course.tk/
Harvard CS109 Data Science
- homepage: http://cs109.github.io/2015/
- class material: http://cs109.github.io/2015/pages/videos.html
- github: https://github.com/cs109/2015
Tools
Provision the Linux Data Science Virtual Machine
- blog: https://azure.microsoft.com/en-gb/documentation/articles/machine-learning-data-science-linux-dsvm-intro/
- github: https://github.com/Azure/azure-content/blob/master/articles/machine-learning/machine-learning-data-science-linux-dsvm-intro.md
Deep Learning tools for the Data Science Virtual Machine
Scrape Tools
Portia
Portia: Visual scraping for Scrapy
- intro: Portia is a tool that lets you visually scrape websites without any programming knowledge
- homepage: http://scrapinghub.com/portia/
- github: https://github.com/scrapinghub/portia/
Scrapely: A pure-python HTML screen-scraping library
- intro: Scrapely is a library for extracting structured data from HTML pages. Given some example web pages and the data to be extracted, scrapely constructs a parser for all similar pages.
- github: https://github.com/scrapy/scrapely
Blogs
Comprehensive learning path – Data Science in Python
http://www.analyticsvidhya.com/learning-paths-data-science-business-analytics-business-intelligence-big-data/learning-path-data-science-python/
Top Data Scientists to Follow & Best Data Science Tutorials on GitHub
http://www.analyticsvidhya.com/blog/2015/07/github-special-data-scientists-to-follow-best-tutorials/
Non-Mathematical Feature Engineering techniques for Data Science
Learning Path To Become Data Scientist – Step by Step Guide
http://www.bigdataanalyticsguide.com/2016/07/09/learning-path-become-data-scientist-step-step-guide/
Data Science Basics: An Introduction to Ensemble Learners
- intro: Beginners, Boosting, Data Science, Ensemble methods
- blog: http://www.kdnuggets.com/2016/11/data-science-basics-intro-ensemble-learners.html
Resources
Awesome Data Engineering: A curated list of data engineering tools for software developers
Data science blogs
data-science-practice-handbook: Awesome and Complete Practice Handbook For Data Science
If you want to learn Data Science, start with one of these programming classes
Data Science, Machine Learning, and Artificial Intelligence Resources
Data Science and Robots
http://brohrer.github.io/blog.html
Data Sources for Cool Data Science Projects
- part 1: http://blog.thedataincubator.com/2014/10/data-sources-for-cool-data-science-projects-part-1/
- part 2: http://blog.thedataincubator.com/2014/10/data-sources-for-cool-data-science-projects-part-2/
- part 3: http://blog.thedataincubator.com/2016/10/data-sources-for-cool-data-science-projects-part-3/
- part 4: http://blog.thedataincubator.com/2016/10/data-sources-for-cool-data-science-projects-part-4/