What is Data Science?

Drew Conway's Data Science Venn Diagram

From Drew Conway's blog in 2013

MA346 at Bentley

  • Mathematics
    • Functions and relations -- the foundation for the course
    • Review of GB213
    • Previews of MA239, MA347, etc.
  • Technology
    • Extending CS230 with depth in data science-specific Python tools
    • Computational notebooks
    • GitHub
  • Visualization
    • New plot types (violin, bee swarm, box, joint, pair, heatmap, maps, etc.)
    • Creating interactive online dashboards
  • Communication
    • Technical writing
    • Clarity and brevity
    • Knowing the target audience

A Data Science Workflow

  1. Obtain data that you hope will help answer the question.
  2. Explore the data to understand it.
  3. Clean and prepare the data for analysis.
  4. Perform analysis, model building, testing, etc.
  5. Draw conclusions from your work.
  6. Report those conclusions to the relevant stakeholders.