DS-100: Data Speak Louder than Words
🎓 • Boston University
Course Overview
Introductory data science course emphasizing real-world problem solving. Covers Python, Jupyter, exploratory data analysis, statistics, and visualization—modeled after Berkeley’s Data 8.
Teaching Impact
Fall 2021 – Present: Authored and launched the course as one of CDS’s inaugural undergraduate offerings. Taught six times. Oversaw new instructors during 2024. Currently directing a GenAI-integrated redesign.
Foundational course in the undergraduate data science program.
Learning Objectives
By the end of this course, students will be able to:
- Data Analysis Fundamentals: Apply statistical thinking to real-world datasets
- Python Programming: Use Python for data manipulation and analysis
- Data Visualization: Create compelling visualizations for data storytelling
- Reproducible Research: Build analysis workflows using Jupyter notebooks
- Critical Thinking: Evaluate data quality and interpret results responsibly
Course Innovation
- Real-world datasets: Students work with actual data from public health, social science, and business contexts
- Jupyter-first approach: All coursework conducted in interactive notebooks
- GenAI integration: Currently redesigning to incorporate responsible AI practices
- Berkeley Data 8 model: Adapted curriculum from one of the most successful introductory data science programs
Technologies Used
- Python: Primary programming language
- Jupyter Notebooks: Development and presentation environment
- Pandas: Data manipulation and analysis
- NumPy: Numerical computing
- Matplotlib/Seaborn: Data visualization
- Statistical inference: Hypothesis testing and confidence intervals