diff --git a/CNAME b/CNAME
new file mode 100644
index 00000000..0fb637a2
--- /dev/null
+++ b/CNAME
@@ -0,0 +1 @@
+datasciencemasters.org
\ No newline at end of file
diff --git a/README.md b/README.md
index 17c118db..9d1e4d15 100644
--- a/README.md
+++ b/README.md
@@ -1,28 +1,20 @@
-Find me on Twitter [@clarecorthell](http://bit.ly/clarecorthelltwitter)!
+The Curriculum for learning Data Science, the Open Source way.
-I'm the cofounding partner of [summer.ai](http://bit.ly/summerai), a Machine Intelligence Consulting firm. Talk to us at **hello[at]summer.ai**!
+## The Internet is Your Oyster
-## The Open-Source Data Science Masters
+I didn't want to wait. I wanted to work on things I care about **now**. Why sleep through grad school lectures tomorrow when you can hack on interesting questions today?
-The open-source curriculum for learning Data Science. Foundational in both theory and technologies, the OSDSM breaks down the core competencies necessary for making use of data.
+With Coursera, ebooks, Stack Overflow, and GitHub -- all free and open -- how can you afford not to take advantage of an open source education?
-### The Internet is Your Oyster
-
-With Coursera, ebooks, Stack Overflow, and GitHub -- all free and open -- how can you afford not to take advantage of an open source education?
-
-### The Motivation
+## The Motivation
We need more Data Scientists.
> ...by 2018 the United States will experience a shortage of 190,000 skilled data scientists, and 1.5 million managers and analysts capable of reaping actionable insights from the big data deluge.
--- [McKinsey Report Highlights the Impending Data Scientist Shortage](http://bit.ly/datascienceshortage) 23 July 2013
-
-> There are little to no Data Scientists with 5 years experience, because the job simply did not exist.
+-- [McKinsey Report Highlights the Impending Data Scientist Shortage](http://blog.gopivotal.com/news-2/mckinsey-report-highlights-the-impending-data-scientist-shortage) 23 July 2013
--- David Hardtke [How To Hire A Data Scientist](http://bit.ly/howtohireadatascientist) 13 Nov 2012
-
-### An Academic Shortfall
+## An Academic Shortfall
Classic academic conduits aren't providing Data Scientists -- this talent gap will be closed differently.
@@ -32,224 +24,14 @@ Classic academic conduits aren't providing Data Scientists -- this talent gap wi
> And there’s yet another trend that will alleviate any talent gap: the democratization of data science. While I agree wholeheartedly with Raden’s statement that “the crème-de-la-crème of data scientists will fill roles in academia, technology vendors, Wall Street, research and government,” I think he’s understating the extent to which **autodidacts – the self-taught, uncredentialed, data-passionate people – will come to play a significant role in many organizations’ data science initiatives.**
--- James Kobielus, [Closing the Talent Gap](http://bit.ly/closingthetalentgap) 17 Jan 2013
-
-### Ready?
-
-***
-
-## The Open Source Data Science Curriculum
-
-Start here.
-
-**Intro to Data Science** [UW / Coursera](http://bit.ly/uwintrodatascience)
- * *Topics:* Python NLP on Twitter API, Distributed Computing Paradigm, MapReduce/Hadoop & Pig Script, SQL/NoSQL, Relational Algebra, Experiment design, Statistics, Graphs, Amazon EC2, Visualization.
-
-**Data Science** / Harvard [Video Archive](http://bit.ly/harvarddatasciencevideos) & [Course](http://bit.ly/harvarddatasciencecourse)
- * *Topics:* Data wrangling, data management, exploratory data analysis to generate hypotheses and intuition, prediction based on statistical methods such as regression and classification, communication of results through visualization, stories, and summaries.
-
-**Data Science with Open Source Tools** [Book ```$27```](http://bit.ly/book-datasciencewithopensourcetools)
- * *Topics:* Visualizing Data, Estimation, Models from Scaling Arguments, Arguments from Probability Models, What you Really Need to Know about Classical Statistics, Data Mining, Clustering, PCA, Map/Reduce, Predictive Analytics
- * *Example Code in:* R, Python, Sage, C, GNU Scientific Library
-
-### A Note About Direction
-This is an introduction geared toward those with at least **a minimum understanding of programming**, and (perhaps obviously) an interest in the components of Data Science (like statistics and distributed computing).
-Out of personal preference and need for focus, I geared the original curriculum toward **Python tools and resources**. R resources can be found [here](http://bit.ly/osdsm-rresources).
-
-### Math
-
-[★ What are some good resources for learning about numerical analysis? / Quora](http://www.quora.com/What-are-some-good-resources-for-learning-about-numerical-analysis)
-
-* **Linear Algebra & Programming**
- * Linear Algebra [Khan Academy / Videos](https://www.khanacademy.org/math/linear-algebra)
- * Linear Algebra / Levandosky [Stanford / Book ```$10```](http://amzn.to/1kIfmmI)
- * Linear Programming (Math 407) [University of Washington / Course](http://bit.ly/course-uw-linearprogramming)
- * The Manga Guide to Linear Algebra [Book ```$19```](http://amzn.to/1n4hM5l)
-
-* **Statistics**
- * Statistics I [Princeton / Coursera](http://bit.ly/course-princeton-stats)
- * Stats in a Nutshell [Book ```$29```](http://amzn.to/1iMnx2X)
- * Think Stats: Probability and Statistics for Programmers [Digital](http://bit.ly/ebook-thinkstats) & [Book ```$25```](http://amzn.to/RcVnTf)
- * Think Bayes [Digital](http://bit.ly/ebook-thinkbayes) & [Book ```$25```](http://amzn.to/1hmy4Cr)
-
-* **Differential Equations & Calculus**
- * Differential Equations in Data Science [Python Tutorial](http://bit.ly/ipynb-differentialeq)
-
-* **Problem Solving**
- * Problem-Solving Heuristics "How To Solve It" [Polya / Book ```$10```](http://amzn.to/1mqJRSi)
-
-### Computing
-
-Get your environment up and running with the [Data Science Toolbox](http://datasciencetoolbox.org)
-
-* **Algorithms**
- * Algorithms Design & Analysis I [Stanford / Coursera](http://bit.ly/coursera-algo)
- * Algorithm Design, Kleinberg & Tardos [Book ```$125```](http://amzn.to/1iMnWm5)
-
-* **Distributed Computing Paradigms**
- * *See Intro to Data Science [UW / Lectures on MapReduce](http://bit.ly/uwintrodatascience)
- * Intro to Hadoop and MapReduce [Cloudera / Udacity Course](http://bit.ly/udacity-hadoopmapreduce) *includes select free excerpts of Hadoop: The Definitive Guide [Book ```$29```](http://amzn.to/1i7wgLv)
-
-* **Databases**
- * Introduction to Databases [Stanford / Online Course](https://bit.ly/introdatabases)
- * SQL School [Mode Analytics / Tutorials](http://sqlschool.modeanalytics.com/)
- * SQL Tutorials [SQLZOO / Tutorials](http://bit.ly/tut-sqlzoo)
-
-* **Data Mining**
- * Mining Massive Data Sets / Stanford [Coursera](https://www.coursera.org/course/mmds) & [Digital](http://bit.ly/ebook-miningmassivedata) & [Book ```$58```](http://amzn.to/1txocpo)
- * Mining The Social Web [Book ```$30```](http://amzn.to/1mqxAsB)
- * Introduction to Information Retrieval / Stanford [Digital](http://bit.ly/ebook-stanford-inforetrieval) & [Book ```$56```](http://amzn.to/1mWbnUT)
-
-_OSDSM Specialization: [Web Scraping & Crawling](https://github.com/datasciencemasters/go/blob/master/specializations.md#web-scraping--crawling)_
-
-* **Machine Learning**
-
- _Foundational & Theoretical_
- * Machine Learning [Ng Stanford / Coursera](http://bit.ly/stanford-ml)
- * A Course in Machine Learning [UMD / Digital Book](http://ciml.info/)
- * The Elements of Statistical Learning / Stanford [Digital](http://bit.ly/ebook-elemstatlearn) & [Book ```$80```](http://amzn.to/1hmyKry) & [Study Group](http://www.reddit.com/r/eosl)
- * Machine Learning [Caltech / edX](http://bit.ly/caltech-ml)
-
- _Practical_
- * Programming Collective Intelligence [Book ```$27```](http://amzn.to/1mqxYqW)
- * Machine Learning for Hackers [ipynb / digital book](http://nbviewer.ipython.org/github/carljv/Will_it_Python/blob/master/MLFH/CH1/chapter1.ipynb)
- * Intro to scikit-learn, SciPy2013 [YouTube tutorials](http://bit.ly/scikit-video-tuts)
-
-* **Probabilistic Modeling**
- * Probabilistic Programming and Bayesian Methods for Hackers [GitHub / Tutorials](http://bit.ly/ipnb-probabilisticprogramming)
- * Probabilistic Graphical Models [Stanford / Coursera](http://bit.ly/stanford-pgm)
-
-* **Deep Learning (Neural Networks)**
- * Neural Networks [Andrej Karpathy / Python Walkthrough](http://karpathy.github.io/neuralnets/)
- * Neural Networks [U Toronto / Coursera](http://bit.ly/utoronto-neuralnets)
-
-* **Social Network & Graph Analysis**
- * Social and Economic Networks: Models and Analysis / [Stanford / Coursera](http://bit.ly/stanford-socialeconnetworks)
- * Social Network Analysis for Startups [Book ```$22```](http://amzn.to/1jySCCT)
-
-* **Natural Language Processing**
- * From Languages to Information / Stanford CS124 [Materials](http://web.stanford.edu/class/cs124/)
- * NLP with Python (NLTK library) [Digital](http://bit.ly/ebook-nltk), [Book ```$36```](http://amzn.to/1iMrDIp)
-
-* **Analysis**
- * Python for Data Analysis [Book ```$24```](http://amzn.to/Q2pI5I)
- * Big Data Analysis with Twitter [UC Berkeley / Lectures](http://bit.ly/cal-course-bigdatatwitter)
- * Exploratory Data Analysis [Tukey / Book ```$81```](http://amzn.to/1kNUEPa)
- * An Example Data Science Process [ipynb](http://nbviewer.ipython.org/github/Jay-Oh-eN/happy-healthy-hungry/blob/master/h3.ipynb)
+-- James Kobielus, [Closing the Talent Gap](http://www.ibmbigdatahub.com/blog/data-scientist-closing-talent-gap) 17 Jan 2013
-### Data Design
+## The Open Source Curriculum
-* **Visualization**
-
- _Foundational Information Design Books_
-
- * Envisioning Information [Tufte / Book ```$36```](http://amzn.to/Sn0QI4)
- * The Visual Display of Quantitative Information [Tufte / Book ```$27```](http://amzn.to/1q5FB91)
-
- _Theoretical Courses / Design & Visualization_
-
- * Data Visualization [University of Washington / Slides & Resources](http://bit.ly/uw-dataviz)
- * Berkeley's Viz Class [UC Berkeley / Course Docs](http://bit.ly/cal-viz)
- * Rice University's Data Viz class [Rice University / Slides](http://bit.ly/riceu-viz)
-
- _Practical Visualization Resources_
-
- * D3 Library / Scott Murray [Blog / Tutorials](http://bit.ly/tut-scottmurray-d3)
- * Interactive Data Visualization for the Web / Scott Murray [Online Book](http://bit.ly/interactive-data-viz-web) & [Book `$26`](http://amzn.to/1oK1xCN)
-
-_OSDSM Specialization: [Data Journalism](https://github.com/datasciencemasters/go/blob/master/specializations.md#data-journalism)_
-
-#### **Python** (Learning)
- * Learn Python the Hard Way [Digital](http://bit.ly/ebook-learnpyhardway) & [Book ```$23```](http://amzn.to/1hmzGw9)
- * Python [Class / Google](http://bit.ly/T4j40A)
- * Think Python [Digital](http://bit.ly/ebook-thinkpy) & [Book ```$34```](http://amzn.to/1ktQ5ZU)
- * Introduction to Computer Science and Programming [MIT OpenCourseWare / Lectures](http://bit.ly/mit-cs)
-
-#### **Python** (Libraries)
-Installing Basic Packages [Python, virtualenv, NumPy, SciPy, matplotlib and IPython ](http://bit.ly/scientific-py-install) & [Using Python Scientifically](http://bit.ly/lecture-scipy)
-
-[Command Line Install Script](https://github.com/fonnesbeck/ScipySuperpack) for Scientific Python Packages
-
- * [Pandas Cookbook](https://github.com/jvns/pandas-cookbook) (data structure library)
-
-_More Libraries can be found in the ["awesome machine learning"](https://github.com/josephmisiti/awesome-machine-learning#python) repo & in related [specializations](https://github.com/datasciencemasters/go/blob/master/specializations.md)_
-
- * **Data Structures & Analysis Packages**
- * Flexible and powerful data analysis / manipulation library with labeled data structures, statistical functions, etc. [pandas](http://bit.ly/py-pandas) & Tutorials [Python for Data Analysis / Book](http://amzn.to/Q2pI5I)
-
- * **Machine Learning Packages**
- * [scikit-learn](http://bit.ly/py-scikit) - Tools for Data Mining & Analysis
-
- * **Networks Packages**
- * [networkx](http://bit.ly/py-networkx) - Network Modeling & Viz
-
- * **Statistical Packages**
- * [PyMC](http://bit.ly/py-pymc) - Bayesian Inference & Markov Chain Monte Carlo sampling toolkit
- * [Statsmodels](http://bit.ly/py-statsmodel) - Python module that allows users to explore data, estimate statistical models, and perform statistical tests
- * [PyMVPA](http://bit.ly/py-mvpa) - Multivariate Pattern Analysis in Python
-
- * **Natural Language Processing & Understanding**
- * [NLTK](http://bit.ly/py-nltk) - Natural Language Toolkit
- * [Gensim](http://bit.ly/py-gensim) - Python library for topic modeling, document indexing and similarity retrieval with large corpora. Target audience is the natural language processing (NLP) and information retrieval (IR) community.
-
- * **Live Data Packages**
- * [twython](http://bit.ly/py-twython) - Python wrapper for the Twitter API
-
- * **Visualization Packages**
- * [matplotlib](http://www.ast.uct.ac.za/~sarblyth/pythonGuide/PythonPlottingBeginnersGuide.pdf) - well-integrated with analysis and data manipulation packages like numpy and pandas
- * [Orange](http://bit.ly/software-orangeviz) - Open source data visualization and analysis for novice and experts. Data mining through visual programming or Python scripting. Components for machine learning. Add-ons for bioinformatics and text mining
-
-* **IPython Data Science Notebooks**
- * [Data Science in IPython Notebooks](http://bit.ly/ipynb-ds) (Linear Regression, Logistic Regression, Random Forests, K-Means Clustering)
- * [A Gallery of Interesting IPython Notebooks - Pandas for Data Analysis](https://github.com/ipython/ipython/wiki/A-gallery-of-interesting-IPython-Notebooks#pandas-for-data-analysis)
-
-#### Datasets are now [here](http://bit.ly/osdsm-datasets)
-
-#### R resources are now [here](http://bit.ly/osdsm-rresources)
-
-### Data Science as a Profession
-
- * Doing Data Science: Straight Talk from the Frontline [O'Reilly / Book ```$25```](http://amzn.to/1vAIscK)
- * The Data Science Handbook: Advice and Insights from 25 Amazing Data Scientists [Book ```$22```](http://amzn.to/1J7lILJ)
-
-### Capstone Project
-* Capstone Analysis of Your Own Design; [Quora](http://bit.ly/quora-toyproblems)'s Idea Compendium
-* Healthcare Twitter Analysis [Coursolve & UW Data Science](http://bit.ly/project-healthcare-twitter-analysis)
-* Analyze your LinkedIn Network [Generate & Download Adjacency Matrix](http://socilab.com/)
-
-***
-### Resources
-
-#### Read
-* [DataTau](http://bit.ly/datatau) - The "Hacker News" of Data Science
-* [Wikipedia](http://bit.ly/1kKg0gD) - The free encyclopedia
-* [The Signal and The Noise - Nate Silver ```$15```](http://amzn.to/1hoxQoG) - Bestseller Pop Sci
-* [Zipfian Academy's List of Resources](http://bit.ly/1qoF1We)
-* [A Software Engineer's Guide to Getting Started with Data Science](http://bit.ly/1jwgV4p)
-* [Data Scientist Interviews / Metamarkets](http://bit.ly/1r1tJot)
-* [/r/MachineLearning](http://bit.ly/1uANaEM)
-
-#### Watch
-* [The Life of a Data Scientist / Josh Wills](https://www.youtube.com/watch?v=h9vQIPfe2uU)
-
-#### Learn
-* [Metacademy](http://bit.ly/metacademy) - Search for a concept you want to learn
-* [Coursera](http://bit.ly/coursera-online-courses) - Online university courses
-* [Wolfram Alpha](http://bit.ly/wolframalpha-torus) - The smart number and info cruncher
-* [Khan Academy](http://bit.ly/khan-academy-lifeinsurance) - High quality, free learning videos
-
-***
-
-### Notation
-Non-Open-Source books, courses, and resources are noted with ```$```.
+**[Start Here](http://datasciencemasters.org)**.
## Contribute
-Please Contribute Your Ideas -- **this is Open Source!**
-
-Please **showcase your own specialization & transcript** by submitting a markdown file pull request in the ```/transcripts``` directory with your name! e.g. [```clare-corthell-2014.md```](http://bit.ly/U6yVMU)
-
-[Follow me on Twitter @clarecorthell](http://bit.ly/clarecorthelltwitter)
+I'm not the expert -- please Share and Contribute. **It's Open Source**!
-Email me at [github@clarecorthell.com](mailto:github@clarecorthell.com)
+Follow me on Twitter [@clarecorthell](http://twitter.com/clarecorthell)
diff --git a/fonts/OpenSans-Bold-webfont.eot b/fonts/OpenSans-Bold-webfont.eot
new file mode 100644
index 00000000..e1c76744
Binary files /dev/null and b/fonts/OpenSans-Bold-webfont.eot differ
diff --git a/fonts/OpenSans-Bold-webfont.svg b/fonts/OpenSans-Bold-webfont.svg
new file mode 100644
index 00000000..364b3686
--- /dev/null
+++ b/fonts/OpenSans-Bold-webfont.svg
@@ -0,0 +1,146 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/OpenSans-Bold-webfont.ttf b/fonts/OpenSans-Bold-webfont.ttf
new file mode 100644
index 00000000..2d94f062
Binary files /dev/null and b/fonts/OpenSans-Bold-webfont.ttf differ
diff --git a/fonts/OpenSans-Bold-webfont.woff b/fonts/OpenSans-Bold-webfont.woff
new file mode 100644
index 00000000..cd86852d
Binary files /dev/null and b/fonts/OpenSans-Bold-webfont.woff differ
diff --git a/fonts/OpenSans-BoldItalic-webfont.eot b/fonts/OpenSans-BoldItalic-webfont.eot
new file mode 100644
index 00000000..f44ac9a3
Binary files /dev/null and b/fonts/OpenSans-BoldItalic-webfont.eot differ
diff --git a/fonts/OpenSans-BoldItalic-webfont.svg b/fonts/OpenSans-BoldItalic-webfont.svg
new file mode 100644
index 00000000..8392240a
--- /dev/null
+++ b/fonts/OpenSans-BoldItalic-webfont.svg
@@ -0,0 +1,146 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/OpenSans-BoldItalic-webfont.ttf b/fonts/OpenSans-BoldItalic-webfont.ttf
new file mode 100644
index 00000000..f74e0e3c
Binary files /dev/null and b/fonts/OpenSans-BoldItalic-webfont.ttf differ
diff --git a/fonts/OpenSans-BoldItalic-webfont.woff b/fonts/OpenSans-BoldItalic-webfont.woff
new file mode 100644
index 00000000..f3248c11
Binary files /dev/null and b/fonts/OpenSans-BoldItalic-webfont.woff differ
diff --git a/fonts/OpenSans-Italic-webfont.eot b/fonts/OpenSans-Italic-webfont.eot
new file mode 100644
index 00000000..277c1899
Binary files /dev/null and b/fonts/OpenSans-Italic-webfont.eot differ
diff --git a/fonts/OpenSans-Italic-webfont.svg b/fonts/OpenSans-Italic-webfont.svg
new file mode 100644
index 00000000..29c7497f
--- /dev/null
+++ b/fonts/OpenSans-Italic-webfont.svg
@@ -0,0 +1,146 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/OpenSans-Italic-webfont.ttf b/fonts/OpenSans-Italic-webfont.ttf
new file mode 100644
index 00000000..63f187e9
Binary files /dev/null and b/fonts/OpenSans-Italic-webfont.ttf differ
diff --git a/fonts/OpenSans-Italic-webfont.woff b/fonts/OpenSans-Italic-webfont.woff
new file mode 100644
index 00000000..469a29bb
Binary files /dev/null and b/fonts/OpenSans-Italic-webfont.woff differ
diff --git a/fonts/OpenSans-Light-webfont.eot b/fonts/OpenSans-Light-webfont.eot
new file mode 100644
index 00000000..837daab8
Binary files /dev/null and b/fonts/OpenSans-Light-webfont.eot differ
diff --git a/fonts/OpenSans-Light-webfont.svg b/fonts/OpenSans-Light-webfont.svg
new file mode 100644
index 00000000..bdb67265
--- /dev/null
+++ b/fonts/OpenSans-Light-webfont.svg
@@ -0,0 +1,146 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/OpenSans-Light-webfont.ttf b/fonts/OpenSans-Light-webfont.ttf
new file mode 100644
index 00000000..b50ef9dc
Binary files /dev/null and b/fonts/OpenSans-Light-webfont.ttf differ
diff --git a/fonts/OpenSans-Light-webfont.woff b/fonts/OpenSans-Light-webfont.woff
new file mode 100644
index 00000000..99514d1a
Binary files /dev/null and b/fonts/OpenSans-Light-webfont.woff differ
diff --git a/fonts/OpenSans-LightItalic-webfont.eot b/fonts/OpenSans-LightItalic-webfont.eot
new file mode 100644
index 00000000..f0ebf2c0
Binary files /dev/null and b/fonts/OpenSans-LightItalic-webfont.eot differ
diff --git a/fonts/OpenSans-LightItalic-webfont.svg b/fonts/OpenSans-LightItalic-webfont.svg
new file mode 100644
index 00000000..60765da8
--- /dev/null
+++ b/fonts/OpenSans-LightItalic-webfont.svg
@@ -0,0 +1,146 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/OpenSans-LightItalic-webfont.ttf b/fonts/OpenSans-LightItalic-webfont.ttf
new file mode 100644
index 00000000..5898c8c7
Binary files /dev/null and b/fonts/OpenSans-LightItalic-webfont.ttf differ
diff --git a/fonts/OpenSans-LightItalic-webfont.woff b/fonts/OpenSans-LightItalic-webfont.woff
new file mode 100644
index 00000000..9c978dc3
Binary files /dev/null and b/fonts/OpenSans-LightItalic-webfont.woff differ
diff --git a/fonts/OpenSans-Regular-webfont.eot b/fonts/OpenSans-Regular-webfont.eot
new file mode 100644
index 00000000..dd6fd2cb
Binary files /dev/null and b/fonts/OpenSans-Regular-webfont.eot differ
diff --git a/fonts/OpenSans-Regular-webfont.svg b/fonts/OpenSans-Regular-webfont.svg
new file mode 100644
index 00000000..01038bb1
--- /dev/null
+++ b/fonts/OpenSans-Regular-webfont.svg
@@ -0,0 +1,146 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/OpenSans-Regular-webfont.ttf b/fonts/OpenSans-Regular-webfont.ttf
new file mode 100644
index 00000000..05951e7b
Binary files /dev/null and b/fonts/OpenSans-Regular-webfont.ttf differ
diff --git a/fonts/OpenSans-Regular-webfont.woff b/fonts/OpenSans-Regular-webfont.woff
new file mode 100644
index 00000000..274664b2
Binary files /dev/null and b/fonts/OpenSans-Regular-webfont.woff differ
diff --git a/fonts/OpenSans-Semibold-webfont.eot b/fonts/OpenSans-Semibold-webfont.eot
new file mode 100644
index 00000000..289aade3
Binary files /dev/null and b/fonts/OpenSans-Semibold-webfont.eot differ
diff --git a/fonts/OpenSans-Semibold-webfont.svg b/fonts/OpenSans-Semibold-webfont.svg
new file mode 100644
index 00000000..cc2ca427
--- /dev/null
+++ b/fonts/OpenSans-Semibold-webfont.svg
@@ -0,0 +1,146 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/OpenSans-Semibold-webfont.ttf b/fonts/OpenSans-Semibold-webfont.ttf
new file mode 100644
index 00000000..6f150731
Binary files /dev/null and b/fonts/OpenSans-Semibold-webfont.ttf differ
diff --git a/fonts/OpenSans-Semibold-webfont.woff b/fonts/OpenSans-Semibold-webfont.woff
new file mode 100644
index 00000000..4e47cb1a
Binary files /dev/null and b/fonts/OpenSans-Semibold-webfont.woff differ
diff --git a/fonts/OpenSans-SemiboldItalic-webfont.eot b/fonts/OpenSans-SemiboldItalic-webfont.eot
new file mode 100644
index 00000000..50a8a6f7
Binary files /dev/null and b/fonts/OpenSans-SemiboldItalic-webfont.eot differ
diff --git a/fonts/OpenSans-SemiboldItalic-webfont.svg b/fonts/OpenSans-SemiboldItalic-webfont.svg
new file mode 100644
index 00000000..65b50e2a
--- /dev/null
+++ b/fonts/OpenSans-SemiboldItalic-webfont.svg
@@ -0,0 +1,146 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/OpenSans-SemiboldItalic-webfont.ttf b/fonts/OpenSans-SemiboldItalic-webfont.ttf
new file mode 100644
index 00000000..55ba3120
Binary files /dev/null and b/fonts/OpenSans-SemiboldItalic-webfont.ttf differ
diff --git a/fonts/OpenSans-SemiboldItalic-webfont.woff b/fonts/OpenSans-SemiboldItalic-webfont.woff
new file mode 100644
index 00000000..0adc6df1
Binary files /dev/null and b/fonts/OpenSans-SemiboldItalic-webfont.woff differ
diff --git a/fonts/copse-regular-webfont.eot b/fonts/copse-regular-webfont.eot
new file mode 100644
index 00000000..af1f5e6e
Binary files /dev/null and b/fonts/copse-regular-webfont.eot differ
diff --git a/fonts/copse-regular-webfont.svg b/fonts/copse-regular-webfont.svg
new file mode 100644
index 00000000..1e920b51
--- /dev/null
+++ b/fonts/copse-regular-webfont.svg
@@ -0,0 +1,247 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/copse-regular-webfont.ttf b/fonts/copse-regular-webfont.ttf
new file mode 100644
index 00000000..434b208e
Binary files /dev/null and b/fonts/copse-regular-webfont.ttf differ
diff --git a/fonts/copse-regular-webfont.woff b/fonts/copse-regular-webfont.woff
new file mode 100644
index 00000000..a9a0450e
Binary files /dev/null and b/fonts/copse-regular-webfont.woff differ
diff --git a/fonts/quattrocentosans-bold-webfont.eot b/fonts/quattrocentosans-bold-webfont.eot
new file mode 100644
index 00000000..c041ed97
Binary files /dev/null and b/fonts/quattrocentosans-bold-webfont.eot differ
diff --git a/fonts/quattrocentosans-bold-webfont.svg b/fonts/quattrocentosans-bold-webfont.svg
new file mode 100644
index 00000000..fb162e94
--- /dev/null
+++ b/fonts/quattrocentosans-bold-webfont.svg
@@ -0,0 +1,247 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/quattrocentosans-bold-webfont.ttf b/fonts/quattrocentosans-bold-webfont.ttf
new file mode 100644
index 00000000..7389c879
Binary files /dev/null and b/fonts/quattrocentosans-bold-webfont.ttf differ
diff --git a/fonts/quattrocentosans-bold-webfont.woff b/fonts/quattrocentosans-bold-webfont.woff
new file mode 100644
index 00000000..fc14168b
Binary files /dev/null and b/fonts/quattrocentosans-bold-webfont.woff differ
diff --git a/fonts/quattrocentosans-bolditalic-webfont.eot b/fonts/quattrocentosans-bolditalic-webfont.eot
new file mode 100644
index 00000000..7c1aa7aa
Binary files /dev/null and b/fonts/quattrocentosans-bolditalic-webfont.eot differ
diff --git a/fonts/quattrocentosans-bolditalic-webfont.svg b/fonts/quattrocentosans-bolditalic-webfont.svg
new file mode 100644
index 00000000..9070a8b3
--- /dev/null
+++ b/fonts/quattrocentosans-bolditalic-webfont.svg
@@ -0,0 +1,248 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/quattrocentosans-bolditalic-webfont.ttf b/fonts/quattrocentosans-bolditalic-webfont.ttf
new file mode 100644
index 00000000..9766a17a
Binary files /dev/null and b/fonts/quattrocentosans-bolditalic-webfont.ttf differ
diff --git a/fonts/quattrocentosans-bolditalic-webfont.woff b/fonts/quattrocentosans-bolditalic-webfont.woff
new file mode 100644
index 00000000..c436da06
Binary files /dev/null and b/fonts/quattrocentosans-bolditalic-webfont.woff differ
diff --git a/fonts/quattrocentosans-italic-webfont.eot b/fonts/quattrocentosans-italic-webfont.eot
new file mode 100644
index 00000000..379b3834
Binary files /dev/null and b/fonts/quattrocentosans-italic-webfont.eot differ
diff --git a/fonts/quattrocentosans-italic-webfont.svg b/fonts/quattrocentosans-italic-webfont.svg
new file mode 100644
index 00000000..b613779e
--- /dev/null
+++ b/fonts/quattrocentosans-italic-webfont.svg
@@ -0,0 +1,247 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/quattrocentosans-italic-webfont.ttf b/fonts/quattrocentosans-italic-webfont.ttf
new file mode 100644
index 00000000..c7ba47a1
Binary files /dev/null and b/fonts/quattrocentosans-italic-webfont.ttf differ
diff --git a/fonts/quattrocentosans-italic-webfont.woff b/fonts/quattrocentosans-italic-webfont.woff
new file mode 100644
index 00000000..3798881e
Binary files /dev/null and b/fonts/quattrocentosans-italic-webfont.woff differ
diff --git a/fonts/quattrocentosans-regular-webfont.eot b/fonts/quattrocentosans-regular-webfont.eot
new file mode 100644
index 00000000..346db6fe
Binary files /dev/null and b/fonts/quattrocentosans-regular-webfont.eot differ
diff --git a/fonts/quattrocentosans-regular-webfont.svg b/fonts/quattrocentosans-regular-webfont.svg
new file mode 100644
index 00000000..34709243
--- /dev/null
+++ b/fonts/quattrocentosans-regular-webfont.svg
@@ -0,0 +1,247 @@
+
+
+
\ No newline at end of file
diff --git a/fonts/quattrocentosans-regular-webfont.ttf b/fonts/quattrocentosans-regular-webfont.ttf
new file mode 100644
index 00000000..e4146706
Binary files /dev/null and b/fonts/quattrocentosans-regular-webfont.ttf differ
diff --git a/fonts/quattrocentosans-regular-webfont.woff b/fonts/quattrocentosans-regular-webfont.woff
new file mode 100644
index 00000000..09ed324d
Binary files /dev/null and b/fonts/quattrocentosans-regular-webfont.woff differ
diff --git a/images/arrow-down.png b/images/arrow-down.png
new file mode 100644
index 00000000..585b0bdd
Binary files /dev/null and b/images/arrow-down.png differ
diff --git a/images/background.png b/images/background.png
new file mode 100644
index 00000000..b63b420f
Binary files /dev/null and b/images/background.png differ
diff --git a/images/bg_hr.png b/images/bg_hr.png
new file mode 100644
index 00000000..514aee50
Binary files /dev/null and b/images/bg_hr.png differ
diff --git a/images/blacktocat.png b/images/blacktocat.png
new file mode 100644
index 00000000..e160053a
Binary files /dev/null and b/images/blacktocat.png differ
diff --git a/images/body-background.png b/images/body-background.png
new file mode 100644
index 00000000..d6a152f1
Binary files /dev/null and b/images/body-background.png differ
diff --git a/images/bullet.png b/images/bullet.png
new file mode 100644
index 00000000..22ea5437
Binary files /dev/null and b/images/bullet.png differ
diff --git a/images/hr.gif b/images/hr.gif
new file mode 100644
index 00000000..bdb4168d
Binary files /dev/null and b/images/hr.gif differ
diff --git a/images/icon_download.png b/images/icon_download.png
new file mode 100644
index 00000000..5a793f17
Binary files /dev/null and b/images/icon_download.png differ
diff --git a/images/nav-bg.gif b/images/nav-bg.gif
new file mode 100644
index 00000000..47439656
Binary files /dev/null and b/images/nav-bg.gif differ
diff --git a/images/octocat-logo.png b/images/octocat-logo.png
new file mode 100644
index 00000000..28a3ad1b
Binary files /dev/null and b/images/octocat-logo.png differ
diff --git a/images/octocat-small.png b/images/octocat-small.png
new file mode 100644
index 00000000..66c25398
Binary files /dev/null and b/images/octocat-small.png differ
diff --git a/images/sprite_download.png b/images/sprite_download.png
new file mode 100644
index 00000000..f9f8de24
Binary files /dev/null and b/images/sprite_download.png differ
diff --git a/index.html b/index.html
new file mode 100644
index 00000000..9027d7f6
--- /dev/null
+++ b/index.html
@@ -0,0 +1,546 @@
+
+
+
+
+
+
+
+
+
+
+ The Open Source Data Science Masters
+
+
+
+
+
+
+The open-source curriculum for learning Data Science. Foundational in both theory and technologies, the OSDSM breaks down the core competencies necessary to make data useful.
+
+
+The Internet is Your Oyster
+
+
+With Coursera, ebooks, Stack Overflow, and GitHub -- all free and open -- how can you afford not to take advantage of an open source education?
+
+
+The Motivation
+
+
+We need more Data Scientists.
+
+
+
+...by 2018 the United States will experience a shortage of 190,000 skilled data scientists, and 1.5 million managers and analysts capable of reaping actionable insights from the big data deluge.
+Classic academic conduits aren't providing Data Scientists -- this talent gap will be closed differently.
+
+
+
+Academic credentials are important but not necessary for high-quality data science. The core aptitudes – curiosity, intellectual agility, statistical fluency, research stamina, scientific rigor, skeptical nature – that distinguish the best data scientists are widely distributed throughout the population.
+
+
+We’re likely to see more uncredentialed, inexperienced individuals try their hands at data science, bootstrapping their skills on the open-source ecosystem and using the diversity of modeling tools available. Just as data-science platforms and tools are proliferating through the magic of open source, big data’s data-scientist pool will as well.
+
+
+And there’s yet another trend that will alleviate any talent gap: the democratization of data science. While I agree wholeheartedly with Raden’s statement that “the crème-de-la-crème of data scientists will fill roles in academia, technology vendors, Wall Street, research and government,” I think he’s understating the extent to which autodidacts – the self-taught, uncredentialed, data-passionate people – will come to play a significant role in many organizations’ data science initiatives.
+Topics: Data wrangling, data management, exploratory data analysis to generate hypotheses and intuition, prediction based on statistical methods such as regression and classification, communication of results through visualization, stories, and summaries.
+Topics: Visualizing Data, Estimation, Models from Scaling Arguments, Arguments from Probability Models, What you Really Need to Know about Classical Statistics, Data Mining, Clustering, PCA, Map/Reduce, Predictive Analytics
+
+Example Code in: R, Python, Sage, C, GNU Scientific Library
+
+
+
+A Note About Direction
+
+
+This is an introduction geared toward those with at least a minimum understanding of programming, and (perhaps obviously) an interest in the components of Data Science (like statistics and distributed computing).
+Out of personal preference and need for focus, I geared the original curriculum toward Python tools and resources. R resources can be found here.
+Flexible and powerful data analysis / manipulation library with labeled data structures, statistical functions, etc. pandas & Tutorials Python for Data Analysis / Book
+
+Gensim - Python library for topic modelling, document indexing and similarity retrieval with large corpora. Target audience is the natural language processing (NLP) and information retrieval (IR) community.
+matplotlib - well-integrated with analysis and data manipulation packages like numpy and pandas
+
+Orange - Open source data visualization and analysis for novice and experts. Data mining through visual programming or Python scripting. Components for machine learning. Add-ons for bioinformatics and text mining
+Non-Open-Source books, courses, and resources are noted with $.
+
+
+Contribute
+
+
+Please Contribute Your Ideas -- this is Open Source!
+
+
+Please showcase your own specialization & transcript by submitting a markdown file pull request in the /transcripts directory with your name! e.g. clare-corthell-2014.md