DAVID MUNA MWANGI

Nairobi,30

Summary

Data Scientist with close to half a decade of experience in Business Intelligence, AI, Machine Learning and Deep Learning, helping organizations become data-driven through teamwork, collaboration, quality assurance, and clear communication with stakeholders. Has a strong grasp of Data Science techniques and learns new ones continuously. David is currently looking for a Data Science, ML, or AI-related position in a promising environment with more opportunities for learning and growth.

Overview

9 years of professional experience

Work History

Data Science Consultant

Upwork/LinkedIn
01.2021 - Current
  • Built machine learning models in Python & R
  • Performed Project Scoping, Data Cleaning & Preparation, EDA, Data Visualization, and Hypothesis Testing
  • Researched & documented statistical concepts & techniques in Data Science/ML
  • Transformed raw data into actionable insights by utilizing advanced statistical methods, effectively guiding business strategy with empirical evidence.

AI/Deep Learning/Machine Learning Contractor

pOrbis Group, Inc
04.2023 - 10.2023
  • Built a model-based reinforcement learning model on top of an existing NLP implementation
  • Performed Deep Learning, NLP model enhancements, and fine-tuning
  • Improved model performance by over 25%, leading to the acquisition of more clients.

Sr. Data Scientist/AI/Machine Learning Software Engineer (Contract)

pOrbis Group, Inc
01.2022 - 07.2022
  • Built an end-to-end machine learning project using Python, MongoDB & Postman
  • Tested API endpoints and overall functionality of projects for quality assurance
  • Led a team of Junior ML Engineers and ideated projects and deliverables
  • Created Jira tickets for each team member and myself for project management
  • Led daily standup/scrum for Data Science and Machine Learning team
  • Refactored code for existing projects and upgraded libraries/dependencies
  • Continuously improved training materials and training sessions
  • Achievements: Received an award from pOrbis titled “Distinguished Support in AIDA Project using AI and Machine Learning” on 1st July 2022, following the second quarter of 2022.

Python ML/DL Engineer / Data Scientist (Contract)

pOrbis Group, Inc
11.2021 - 01.2022
  • Developed ML models, tuned their hyperparameters, and expanded the models' scope after refactoring
  • Performed ideation of projects and deliverables
  • Refactored code in existing projects and upgraded libraries/dependencies
  • Developed training materials and trained Data Science and Machine Learning Engineers
  • Fixed backlog errors on pending projects from 2020 to date.

Remote Call Centre Agent

CURB Metropolitan Transit Association
05.2019 - 09.2019
  • Captured data on booking, rescheduling & cancellation of trips on the CRM page
  • Retrieved and keyed in data on Curb website
  • Booked and rescheduled trips/cabs for MTA Broker Services
  • Handled customer queries and dispositioned calls using ViciDial.

Research Assistant

Tade Group LLC under PEPFAR (President's Emergency Plan for AIDS Relief)
03.2018 - 07.2018
  • Collected data on 8 indicators supported by PEPFAR in 400+ facilities
  • Located and recorded GPS coordinates of unmapped health facilities
  • Provided daily & weekly reports on the status of data collected in Health Facilities.

Teller/Customer Service Representative

SBM Bank Kenya (Chase Bank Kenya (IR))
02.2015 - 07.2016
  • Ensured confidentiality & integrity of client transaction data on Flexcube & Sybrin
  • Handled customer inquiries & processed cheques, TT, FX, RTGS/EFT
  • Achievements: Cross-sold credit cards and loan facilities to clients who came to my teller booth
  • Best performing Teller in terms of total transactions, turnover, and productivity from August 2015 to February 2016 while at Chase Bank Kenya (IR).

Education

Data Science

Moringa School
Nairobi, Kenya
12.2020

BSc. Food Science and Nutrition

Jomo Kenyatta University of Agriculture and Technology
Nairobi, Kenya
07.2013

Skills

  • Data Structures and Algorithm Development
  • Data Processing and Feature Engineering
  • Exploratory Data Analysis
  • Model Training and Evaluation
  • Communication and Collaboration
  • Machine Learning
  • Deep Learning
  • Performance Tuning
  • Model Integration and Deployment
  • Natural Language Processing
  • Cloud Computing
  • Research and Innovation
  • Statistics

Accomplishments

  • Received an award from pOrbis titled “Distinguished Support in AIDA Project using AI and Machine Learning” on 1st July 2022, following the second quarter of 2022: https://www.linkedin.com/posts/porbis-group-inc_awards-porbis-activity-6952471061225492480-7cOv?utm_source=share&utm_medium=member_desktop
  • Cross-sold credit cards and loan facilities to clients who came to my teller booth.
  • Best performing Teller in terms of the total number of transactions, turnover, and productivity from August 2015 to February 2016 while at Chase Bank Kenya (IR).

Technical Tools

  • Python, SQL and R
  • TensorFlow
  • Statsmodels
  • scikit-learn (sklearn)
  • Git and GitHub
  • AWS
  • Microsoft Azure
  • Docker
  • Kubernetes
  • VS Code

Highlighted Projects

  • Contracted Data Scientist/AI/ML/DL Engineer, Swift Spread Mapping Project for pOrbis Group in Houston, Texas, 04/2022 - 07/2022. Took a Flask machine learning mapping project from the ideation phase to deployment in the production environment using Python, Flask, an AWS S3 bucket, MongoDB and Postman. Built an LSTM model with TensorFlow and other required libraries for a pipeline that accepts an uploaded Excel file or PDF/JPEG/JPG/PNG and uses OCR to detect account numbers, descriptions and amount/credit/debit figures in images and PDFs. The pipeline then modifies the file's layout and account configuration and maps accounts in three stages: first from previously saved mappings, then by lookup against a training file stored in a MongoDB collection, and finally with a machine learning model. Each mapping is recorded with its confidence, status, allocation percent and source. Worked on replicating the development environment and tested the API endpoints with Postman for deployment into production. This allowed pOrbis's dealers to automatically map the amounts in their trial balance files to the correct cell in our front-end UI based on the account descriptions they provide, across hundreds of thousands of records from OEM dealers, making their work easier.
  • Contracted Data Scientist/AI/ML/DL Engineer, AIDA Time Series Forecasting for pOrbis Group in Houston, Texas, 11/2021 - 02/2022. I worked with a high-dimensional dataset (>3,000 features, subject to the curse of dimensionality) containing hundreds of thousands of records collected over a span of 8 years, analyzing trends and relationships between features and how they affect the outcome variables. My job was to clean the data and use PCA to reduce the >3,000 variables to fewer features with high information value and high explained variance. To achieve this, I performed univariate, bivariate, and multivariate exploratory data analysis in addition to Feature Importance Analysis. I identified Granger-causal relationships among important features across different times and created multiple-output regression models as well as multi-step forecasting models for the variables at different times of the year. I delivered the work as interactive notebooks that a user can filter to get the desired analysis, and used LSTM Deep Learning models for the time series forecasting.
  • Data Scientist, Parametric Survival Analysis for Same Day Auto Finance in Texas, Upwork, 09/2021. I performed Parametric Survival Analysis in R for a client from Upwork and obtained a 5-Star rating. The dataset had fewer records than features, with >35% missing values. My aim was to produce survival curves estimating how many months it would take for a particular record/client to default on their payments. I used libraries such as smoothsurvreg and survival, among others, and exceeded the client's expectations. I used the weight of evidence technique to deal with missing values, created uniform bins, and calculated estimated hazard outputs from day 0 to 48 in the form of a table, followed by the respective cumulative hazards for each record at month 48. I then transposed columns to rows, created equally sized bins based on cumulative hazards, grouped by the bins, and compared each bin with the mean of the target variable Prepaid. The resulting survival curve estimates had an accuracy of > 85%.
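
The three-stage account mapping described in the Swift Spread Mapping project above can be illustrated with a minimal Python sketch. It is not the production code: the function and field names, the 0.8 review threshold, the sample account codes, and the keyword-based stand-in for the trained model are all assumptions made for illustration, and the allocation percent is left out for brevity.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Mapping:
    account: str       # target account the description maps to
    confidence: float  # 1.0 for exact lookups, model score otherwise
    source: str        # "existing", "training_lookup", or "model"
    status: str        # "mapped" or "needs_review"


def normalize(description: str) -> str:
    """Lower-case and collapse whitespace so lookups ignore formatting."""
    return " ".join(description.lower().split())


def map_description(
    description: str,
    existing: dict[str, str],
    training: dict[str, str],
    model: Callable[[str], tuple[str, float]],
    threshold: float = 0.8,  # assumed cut-off, not from the project
) -> Mapping:
    """Try saved mappings, then the training lookup, then the model."""
    key = normalize(description)
    if key in existing:
        return Mapping(existing[key], 1.0, "existing", "mapped")
    if key in training:
        return Mapping(training[key], 1.0, "training_lookup", "mapped")
    account, score = model(key)
    status = "mapped" if score >= threshold else "needs_review"
    return Mapping(account, score, "model", status)


def toy_model(text: str) -> tuple[str, float]:
    """Hypothetical stand-in for the trained model: keyword match only."""
    if "cash" in text:
        return "1000-Cash", 0.9
    return "9999-Unclassified", 0.3


if __name__ == "__main__":
    # Hypothetical lookup tables keyed by normalized description.
    existing = {"petty cash": "1010-Petty Cash"}
    training = {"parts inventory": "1400-Inventory"}
    for text in ["Petty  Cash", "Parts Inventory", "Cash in bank", "Misc"]:
        print(text, "->", map_description(text, existing, training, toy_model))
```

Checking the cheap exact lookups before calling the model keeps previously confirmed mappings stable and reserves the model, with its lower confidence, for descriptions not seen before.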

References

  • Tushar Chaudhary, Data Scientist & ML Engineer, pOrbis Group LLC, t.chaudhary@porbis.com
  • Jedidah Ochieng', Technical Mentor, Moringa School, 0706601058, jedidahakinyi@gmail.com
