BSc Data Science
One of the fastest-growing undergraduate programmes in the world — combining programming, statistics, machine learning, and data engineering into a comprehensive three-year degree. Graduates are in extraordinary demand across every sector of the modern economy.
Programme Overview
The BSc Data Science is built on four foundational pillars: mathematics and statistics, programming and engineering, machine learning, and data ethics. Year 1 introduces Python, probability, and databases. Year 2 progresses to machine learning, big data frameworks, and visualisation. Year 3 features advanced ML, NLP electives, and a substantial industry-partnered capstone project with a real client and live dataset.
Students gain access to our curated repository of over 10,000 labelled datasets spanning healthcare, finance, climate, and social science — used in every module for hands-on learning.
What You'll Learn
- Python for Data Science: NumPy, pandas, scikit-learn, data pipelines
- Machine Learning: Supervised, unsupervised, and reinforcement learning
- Database Systems: SQL, NoSQL, relational design, query optimisation
- Big Data Technologies: Hadoop, Spark, cloud platforms (AWS/GCP)
- Data Visualisation: matplotlib, Tableau, D3.js, storytelling with data
- Bayesian Methods: Probabilistic modelling, uncertainty quantification
- Ethics in Data Science: Fairness, bias, privacy, GDPR compliance
- Linear Algebra: Vector spaces, matrix decomposition, SVD for ML
Python for Data Science
End-to-end Python skills — from scripting and data cleaning with pandas to building and evaluating ML pipelines.
Statistics & Probability
Distributions, estimation, hypothesis testing, and regression — the statistical backbone of every data science workflow.
Machine Learning
Linear models, decision trees, SVMs, neural networks, and ensemble methods with cross-validation and hyperparameter tuning.
Database Systems & SQL
Relational schema design, advanced SQL queries, indexing strategies, and introduction to NoSQL (MongoDB, Cassandra).
Linear Algebra for Data Science
Matrices, eigenvalues, SVD, PCA, and tensor operations — essential mathematical tools for ML and neural networks.
Big Data Technologies
Distributed processing with Apache Spark, Hadoop ecosystem, cloud storage, and real-time streaming with Kafka.
Data Visualisation
Principles of visual communication, interactive dashboards with Tableau and Power BI, and custom D3.js charts.
Ethics in Data Science
Algorithmic fairness, bias auditing, data privacy legislation (GDPR/CCPA), and responsible AI deployment frameworks.
Click any course to view its objective and learning outcomes.
DSC 101 Mathematical Foundations for Data Science +
Objective
To establish the linear algebra, calculus and probability used throughout data science.
Learning Outcomes
- Compute matrix operations relevant to data representations.
- Apply derivatives and gradients to optimisation.
- Use probability rules and standard distributions.
- Manipulate large vectors and matrices efficiently.
- Translate ML notation into vector-matrix form.
DSC 102 Programming Foundations (Python) +
Objective
To gain fluency in Python for data manipulation, analysis and machine learning.
Learning Outcomes
- Write idiomatic Python with collections and comprehensions.
- Use NumPy and pandas for vectorised data work.
- Apply control flow and functions for clean code.
- Manage projects with packaging and virtual environments.
- Debug and profile Python programs.
DSC 103 Linear Algebra for Data Science +
Objective
To develop the algebraic toolbox needed for ML and modern statistics.
Learning Outcomes
- Compute eigendecomposition and SVD.
- Apply PCA for dimensionality reduction.
- Use matrix factorisation for recommender systems.
- Solve least-squares problems analytically and numerically.
- Interpret latent-space embeddings.
DSC 104 Probability & Statistics for Data Science +
Objective
To master the inferential and probabilistic reasoning behind ML models.
Learning Outcomes
- Compute Bayes' rule for posterior updates.
- Apply maximum likelihood and MAP estimation.
- Use bootstrap and resampling for inference.
- Quantify model uncertainty with credible intervals.
- Compare frequentist and Bayesian perspectives.
DSC 201 Database Systems +
Objective
To design, query and manage data stores for analytics and ML pipelines.
Learning Outcomes
- Model entities into normalised relational schemas.
- Write SQL including joins and window functions.
- Use NoSQL and graph stores where appropriate.
- Optimise queries with indexing and partitioning.
- Build ETL/ELT pipelines for ML training data.
DSC 202 Data Wrangling & Visualisation +
Objective
To clean, transform and visualise messy real-world datasets effectively.
Learning Outcomes
- Identify and treat missing data and outliers.
- Reshape data using long/wide pivots.
- Visualise distributions, relationships and trends.
- Build interactive dashboards.
- Document data lineage and quality issues.
DSC 203 Machine Learning I — Supervised Learning +
Objective
To master the core supervised learning algorithms and evaluation methodology.
Learning Outcomes
- Apply linear and logistic regression with regularisation.
- Train decision trees, ensembles and SVMs.
- Tune hyperparameters with cross-validation.
- Evaluate using appropriate metrics and curves.
- Diagnose bias-variance issues.
DSC 204 Machine Learning II — Deep Learning +
Objective
To build, train and evaluate neural networks for vision, text and tabular tasks.
Learning Outcomes
- Implement neural networks with PyTorch or TensorFlow.
- Apply CNNs to image data and RNNs/Transformers to text.
- Use transfer learning effectively.
- Combat overfitting with dropout, augmentation and early stopping.
- Profile and optimise training on GPUs.
DSC 301 Big Data Engineering +
Objective
To process, store and serve data at scale using modern cloud infrastructure.
Learning Outcomes
- Use Spark for distributed data processing.
- Build streaming pipelines with Kafka or Flink.
- Deploy lakehouse architectures.
- Apply infrastructure-as-code for reproducible environments.
- Compare cloud platforms for cost and performance.
DSC 302 NLP & Text Analytics +
Objective
To analyse and model text data with classical and neural approaches.
Learning Outcomes
- Apply text preprocessing and tokenisation.
- Compute TF-IDF and word embeddings.
- Train classifiers for sentiment, topic and intent.
- Use transformer models for sequence tasks.
- Evaluate NLP systems with task-appropriate metrics.
DSC 303 Data Ethics, Privacy & Fairness +
Objective
To address ethical, regulatory and fairness considerations in data science work.
Learning Outcomes
- Audit ML systems for bias and disparate impact.
- Apply differential privacy and federated learning.
- Discuss interpretability and explainability methods.
- Comply with GDPR and similar regulations.
- Build data-ethics review processes.
AND OR NOT XOR -> <->
DSC 304 Data Science Capstone +
Objective
To deliver an end-to-end data-science project from data acquisition to deployment.
Learning Outcomes
- Frame a real-world problem as a data-science task.
- Build, validate and deploy an analytic model.
- Manage project timelines and stakeholder expectations.
- Document code, data and decisions reproducibly.
- Present results to a technical and lay audience.
Data Scientist
Build predictive models and extract intelligence from large-scale datasets at technology companies, banks, and consultancies.
Machine Learning Engineer
Develop and deploy ML models at scale, integrating statistical models into production software systems.
Business Intelligence Analyst
Create dashboards and analytical reports that drive strategic decisions across sales, marketing, and operations.
Data Engineer
Design and maintain scalable data pipelines, warehouses, and ETL systems that power analytics organisations.
Product Analyst
Use data to guide product strategy, A/B test features, and understand user behaviour at tech and SaaS companies.
AI Researcher
Pursue research in applied machine learning, contributing to academic publications or corporate research labs.
Why D'Math University
Industry-First Curriculum
Modules co-designed with data teams at leading technology firms, ensuring skills are current and immediately deployable.
10,000+ Dataset Library
Unparalleled access to labelled real-world datasets from healthcare, climate science, finance, and social media.
Cloud Lab Environment
Every student gets a cloud computing environment (AWS-based) from Day 1, with credits for Spark and GPU-accelerated workloads.
Industry Capstone
Year 3 capstone projects are completed with a real client organisation, with many leading to direct job offers upon graduation.
Rolling admissions — apply today and begin within 30 days