D'Math University | Statistics & Data Science · Programme #15

MSc Data Science

A cutting-edge one-year postgraduate programme that trains the next generation of senior data scientists and AI professionals. With a 98% placement rate and a defining industry capstone, graduates enter the workforce with both deep technical expertise and real project experience.

Postgraduate 1 Year Online Industry Capstone

Specialist Modules

£65k

Average Graduate Salary

50+

Industry Partners

98%

Placement Rate

Programme Overview

The MSc Data Science delivers advanced training across the full data science stack — from distributed cloud computing and database engineering to deep learning and NLP. Semester 1 covers core technical modules; Semester 2 adds specialisations in NLP, data privacy, and optimisation for ML. The programme concludes with a 12-week capstone project embedded within a partner organisation — a live professional experience that frequently results in direct graduate hires.

Partner organisations for capstone projects include firms across fintech, healthcare analytics, e-commerce, climate science, and government data agencies.

What You'll Learn

Deep Learning: CNNs, RNNs, transformers, and large language model fine-tuning
Advanced Databases & Cloud: Distributed systems, AWS/GCP, data lake architecture
Predictive Analytics: Forecasting, time series, regression with model deployment
NLP & Text Mining: Tokenisation, embeddings, sentiment, named entity recognition
Data Engineering: Pipeline design, orchestration (Airflow), streaming (Kafka)
Optimisation for ML: Gradient descent variants, hyperparameter search, AutoML
Data Privacy & Governance: Differential privacy, federated learning, data stewardship
Capstone Project: Industry-embedded 12-week applied project with real deliverables

Core Curriculum

🤖

Deep Learning

Architecture design, training techniques, regularisation, and deployment of deep neural networks for vision and language tasks.

🗃️

Advanced Databases & Cloud

Distributed SQL and NoSQL systems, cloud-native data architecture (AWS Redshift, BigQuery), and data lake design.

📊

Applied Statistics

GLMs, mixed models, and advanced regression techniques applied to real datasets from healthcare, finance, and social science.

📈

Predictive Analytics

Forecasting methods — exponential smoothing, ARIMA, Prophet, gradient-boosted trees — with production deployment.

🌐

Distributed Computing

Apache Spark, Hadoop, Dask, and Kubernetes for scalable data processing across cloud clusters.

🔬

NLP & Text Mining

Classical NLP pipelines through to transformer models (BERT, GPT) — fine-tuning and deploying LLMs for domain tasks.

💻

Data Engineering

Building robust ETL/ELT pipelines using Apache Airflow, dbt, and modern data stack tools for analytics engineering.

🎓

Industry Capstone

12-week embedded project with a partner organisation, delivering an analytics product with full documentation and presentation.

Course Catalogue

Click any course to view its objective and learning outcomes.

DSC 501 Mathematical Foundations for Data Science +

Objective

To consolidate the linear algebra, calculus and probability essential for advanced ML.

Learning Outcomes

Apply matrix calculus to backpropagation.
Use SVD and PCA for dimensionality reduction.
Apply convex analysis to optimisation.
Use measure-theoretic probability where appropriate.
Implement vectorised computations efficiently.

DSC 502 Statistical Learning Theory +

Objective

To analyse the theoretical foundations of supervised learning.

Learning Outcomes

Apply VC dimension and Rademacher complexity.
Use PAC learning bounds.
Apply concentration inequalities.
Analyse generalisation gap.
Discuss bias-variance trade-off rigorously.

DSC 503 Advanced Machine Learning +

Objective

To master state-of-the-art supervised and unsupervised learning algorithms.

Learning Outcomes

Apply gradient boosting and ensemble methods.
Use kernel methods and SVMs.
Apply unsupervised clustering and dimensionality reduction.
Use semi-supervised and active learning.
Tune hyperparameters with Bayesian optimisation.

Interactive Activity — 2×2 Matrix Transformation

Set the entries of a 2×2 matrix. Watch how it transforms the unit square. Determinant = signed area of the transformed square.

a = 1.0 b = 0.5 c = -0.3 d = 1.0

Interactive Activity — Vector Field & Gradient Visualizer

Pick a scalar field f(x,y). Gradient arrows point in the direction of steepest ascent. Click anywhere to drop a particle that follows the gradient.

f(x,y) =

Click on the plot to drop a particle.

DSC 504 Deep Learning +

Objective

To design, train and deploy modern neural networks at scale.

Learning Outcomes

Implement CNNs, RNNs and Transformers.
Apply transfer learning effectively.
Use distributed training on GPUs.
Apply self-supervised pretraining.
Deploy models for inference.

Interactive Activity — 2×2 Matrix Transformation

Set the entries of a 2×2 matrix. Watch how it transforms the unit square. Determinant = signed area of the transformed square.

a = 1.0 b = 0.5 c = -0.3 d = 1.0

Interactive Activity — Vector Field & Gradient Visualizer

Pick a scalar field f(x,y). Gradient arrows point in the direction of steepest ascent. Click anywhere to drop a particle that follows the gradient.

f(x,y) =

Click on the plot to drop a particle.

Interactive Activity — Gradient Descent on a 2D Loss Surface

Click anywhere on the surface to drop a starting point. Animation traces the descent path on the chosen loss function. Adjust the learning rate to see how step size affects convergence.

Loss: η = 0.10

Click on the loss surface to drop a starting point.

Interactive Activity — Linear Classifier Decision Boundary

Click to add red or blue points. Adjust weights w₁, w₂, b to position the decision line w₁x + w₂y + b = 0. Misclassified points highlight red.

w₁ = w₂ = b =

DSC 505 Big Data Engineering +

Objective

To process data at scale using modern distributed systems.

Learning Outcomes

Apply Spark for distributed processing.
Use Kafka and Flink for streaming.
Build data lakehouse architectures.
Apply orchestration via Airflow.
Manage cloud data infrastructure.

Interactive Activity — Distribution Plotter

Pick a distribution and adjust its parameters. Read off mean and variance directly from the plot.

Audit ML systems for bias.
Apply differential privacy.
Use federated learning.
Discuss interpretability methods.
Comply with GDPR and similar regulations.

Interactive Activity — Truth Table Builder

Type a logical expression using p, q, r and operators (AND, OR, NOT). The truth table generates instantly.

Operators: AND OR NOT XOR -> <->

DSC 511 Advanced Causal Inference +

Objective

To estimate causal effects using modern statistical methods.

Learning Outcomes

Apply potential-outcomes framework.
Use propensity scores and matching.
Apply instrumental variables.
Use difference-in-differences.
Apply causal ML methods.

Interactive Activity — Distribution Plotter

Pick a distribution and adjust its parameters. Read off mean and variance directly from the plot.

Distribution: p1 = 0.0 p2 = 1.0

Interactive Activity — Central Limit Theorem Simulator

Sample n values, take their average, repeat. The histogram of averages converges to a normal distribution — CLT in action.

Source: Sample size n = 10

Total sample means: 0

Interactive Activity — Hypothesis Testing Visualizer

Set null hypothesis μ₀, sample mean x̄, sample size n and σ. The activity computes the z-statistic, p-value (shaded tail), and tells you whether to reject H₀ at significance α.

μ₀ = x̄ = σ = n =

α = 0.050 Tail:

DSC 512 Master's Capstone +

Objective

To deliver an industry-grade end-to-end data science project.

Learning Outcomes

Frame a complex business problem.
Build production-quality ML systems.
Communicate with stakeholders.
Document for handoff and reproducibility.
Present to a technical and lay audience.

Career Pathways

🤖

Senior Data Scientist

Lead data science squads at scale-ups and enterprise firms, owning end-to-end ML model lifecycle and business impact.

⚙️

ML Engineer

Design and productionise ML systems using MLOps frameworks, serving models at scale with monitoring and retraining pipelines.

🚀

AI Product Manager

Bridge technical AI capabilities and business strategy, defining AI product roadmaps at technology companies.

🏢

Chief Data Officer

Executive-level leadership of data strategy, governance, and analytics capability within large organisations.

🏗️

Data Architect

Design enterprise-scale data platforms, warehouses, and lake architectures that support thousands of analytical users.

🔬

Research Scientist

Conduct applied ML research at corporate labs (Google Brain, Meta AI) or academic research centres.

Top Universities Offering This Programme

MIT Columbia University Carnegie Mellon University University College London Imperial College London University of Edinburgh ETH Zürich TU Munich University of Toronto NTU Singapore

Why D'Math University

98% Placement Rate

Our dedicated careers team, alumni network, and capstone partnerships deliver extraordinary graduate employment outcomes.

Full-Stack Data Training

From statistical modelling to cloud infrastructure and deep learning — graduates are equipped across the entire data science spectrum.

Live Industry Capstone

Real problem, real data, real client — the capstone is the centrepiece of the programme and a launching pad for careers.

Expert Mentorship

Each student is paired with an industry mentor from our partner network for guidance throughout the programme.

Enrol in MSc Data Science →

September and January intakes — early applications receive priority capstone placement