DS-100: Data Speak Louder than Words

🎓 • Boston University

Course Overview

Introductory data science course emphasizing real-world problem solving. Covers Python, Jupyter, exploratory data analysis, statistics, and visualization—modeled after Berkeley’s Data 8.

Teaching Impact

Fall 2021 – Present: Authored and launched the course as one of CDS’s inaugural undergraduate offerings. Taught six times. Oversaw new instructors during 2024. Currently directing a GenAI-integrated redesign.

Foundational course in the undergraduate data science program.

Learning Objectives

By the end of this course, students will be able to:

  1. Data Analysis Fundamentals: Apply statistical thinking to real-world datasets
  2. Python Programming: Use Python for data manipulation and analysis
  3. Data Visualization: Create compelling visualizations for data storytelling
  4. Reproducible Research: Build analysis workflows using Jupyter notebooks
  5. Critical Thinking: Evaluate data quality and interpret results responsibly

Course Innovation

  • Real-world datasets: Students work with actual data from public health, social science, and business contexts
  • Jupyter-first approach: All coursework conducted in interactive notebooks
  • GenAI integration: Currently redesigning to incorporate responsible AI practices
  • Berkeley Data 8 model: Adapted curriculum from one of the most successful introductory data science programs

Technologies Used

  • Python: Primary programming language
  • Jupyter Notebooks: Development and presentation environment
  • Pandas: Data manipulation and analysis
  • NumPy: Numerical computing
  • Matplotlib/Seaborn: Data visualization
  • Statistical inference: Hypothesis testing and confidence intervals