CDS — Certified Data Scientist (CDS)

“Turn data into decisions. Turn decisions into impact.”

A comprehensive data science certification aligned to ACM Data Science standards, IBM Professional Data Science, and Google Data Analytics — covering the complete data science lifecycle from wrangling through machine learning to MLOps deployment.

Programme Details Information
Level
University & Professional
Audience
Analysts, researchers, engineers, business professionals, and postgraduate students moving into data science
Standards
ACM Data Science Curriculum · IBM Professional Data Science · Google Data Analytics · J-PAL Statistical Standards
Duration
6 months
Format
Self-paced · Live instructor-led · Cohort-based · Blended
Assessment
Deployed end-to-end data science project + technical report (minimum 75%)
Certificate
CDS Certificate — Ukeh-Adah Alliance Services Ltd

Course modules

Module 1: Foundations of Data Science | Outcomes: Describe the data science lifecycle and apply ethical data principles · Retrieve and explore data confidently using SQL and Python

Data science lifecycle: problem definition through to actionable insight · Types of data, sources, public APIs, and open datasets · Ethics in data science: bias, fairness, privacy, and GDPR compliance · Tools setup: Python, Jupyter, Anaconda, and VS Code · SQL for data retrieval: SELECT, JOINs, GROUP BY, and subqueries

 

Module 2: Data Wrangling & Exploratory Analysis | Outcomes: Clean and prepare messy real-world datasets for analysis · Conduct thorough EDA to surface key patterns and insights

Data collection: APIs, web scraping, and public datasets · Data cleaning: missing values, duplicates, outliers, and imputation · Exploratory data analysis: distributions, correlations, and patterns · Feature engineering: encoding, scaling, and transformation · Pandas advanced: groupby, merge, pivot, apply, and method chaining

 

Module 3: Statistics for Data Science | Outcomes: Apply inferential statistics to test business and research hypotheses · Design and analyse A/B tests using Bayesian and frequentist approaches

Descriptive statistics and common probability distributions · Inferential statistics: hypothesis testing, p-values, and effect sizes · Confidence intervals, power analysis, and sample size calculation · Bayesian statistics: Bayes’ theorem, prior/posterior, and MCMC intro · A/B testing: design, analysis, and interpretation

 

Module 4: Machine Learning — Supervised | Outcomes: Train, evaluate, and tune supervised machine learning models · Select the right evaluation metric for classification and regression

Regression: linear, polynomial, ridge, lasso, and elastic net · Classification: logistic regression, KNN, decision trees, and SVM · Ensemble methods: random forest, gradient boosting, XGBoost, LightGBM · Model evaluation: accuracy, precision, recall, F1, ROC-AUC · Hyperparameter tuning: Grid Search, Random Search, and Optuna

 

Module 5: Machine Learning — Unsupervised & Advanced | Outcomes: Apply clustering and dimensionality reduction to real datasets · Build and train basic neural networks using TensorFlow/Keras

Clustering: K-Means, DBSCAN, hierarchical, Gaussian mixture models · Dimensionality reduction: PCA, t-SNE, UMAP, and autoencoders · Anomaly detection: isolation forest and one-class SVM · Recommender systems: collaborative and content-based filtering · Deep learning introduction: ANNs, CNNs, RNNs with TensorFlow/Keras

 

Module 6: Data Visualisation & Storytelling | Outcomes: Create interactive dashboards using Plotly/Dash or Power BI · Communicate data insights to both technical and non-technical audiences

Matplotlib and Seaborn: publication-quality statistical visualisations · Plotly and Dash: interactive web-based dashboards · Tableau or Power BI: executive-level business intelligence dashboards · Storytelling with data: choosing the right chart and building a narrative · Presenting findings to non-technical audiences with clarity and impact

 

Module 7: Deployment & MLOps | Outcomes: Deploy machine learning models as production APIs using FastAPI · Monitor deployed models for drift and performance degradation

Model serialisation: pickle, joblib, and ONNX formats · Building ML APIs with FastAPI: endpoints, validation, and async · Deploying models: Hugging Face Spaces, AWS SageMaker, or GCP Vertex AI · Model monitoring: data drift, concept drift, and performance degradation · MLflow: experiment tracking, model registry, and reproducibility

 

Module 8: Capstone Project | Outcomes: Deliver a complete, deployed data science project with technical documentation · Present data science findings and methodology to a professional audience

End-to-end project on a real-world dataset of your choice · Problem definition, EDA, feature engineering, modelling, and evaluation · Deployed interactive dashboard or REST API endpoint · Technical report: methodology, results, limitations, and recommendations · Panel presentation with Q&A from data science reviewers

 

Outcomes

Execute the full data science lifecycle from collection to model deployment · Apply supervised and unsupervised ML to real-world datasets · Build and deploy ML APIs and interactive dashboards · Communicate insights compellingly to technical and non-technical audiences · Achieve a credential benchmarked against ACM, IBM, and Google Data Analytics standards

 

Certification requirement

Complete all 8 modules, deploy an end-to-end data science project with an interactive dashboard or API, and submit a technical report (minimum 75%).

 

Career pathways

Data Scientist, Machine Learning Engineer, Data Analyst, Business Intelligence Analyst, AI Researcher, Quantitative Analyst. Average starting salary: $75,000–$120,000 USD.

 

Training format

One-on-one live sessions via Zoom or Google Meet · Small group sessions (2–5 people) · Corporate group training (6+ people)

Minimum sessions

3 sessions minimum for any tool (we do not do one-off sessions — we ensure you actually learn)

Session duration

90 minutes per session — focused, practical, no wasted time

Turnaround

Personalised training plan delivered within 24 hours of your request

Your data welcome

Bring your own dataset, your own project, your own real-world task — we train you on what you actually need to do

Corporate packages

Available for companies and institutions wanting to upskill entire teams on specific tools — custom pricing and delivery

 

Certificate

Certificate of Tool Training issued on completion — specific tool named, hours completed, skills covered

 

“Your tool. Your data. Your timeline. Your way. That is the Ukeh-Adah promise.”

“Enrol Now — Join Thousands of Students and Researchers Worldwide”

“Get Certified. Build Skills. Change Your Future.”

IITA-CGIAR Research Fellow · CAC Registered · Over 15 Years of Excellence · Globally Recognised Certificates