
Analytical and impact-driven professional with 5+ years’ experience in Data Scient and Programme Monitoring, Evaluation and Learning (MEL) in development finance with interest on the banking and financial services sector. Currently pursuing an MSc in Data Science, with hands-on skills in Python, SQL, NoSQL, Power BI, and R. Adept at designing data frameworks, analyzing large datasets, and transforming data into actionable insights. Demonstrated success in stakeholder engagement, Key Performance Indicator (KPI) reporting, financial forecasting, and dashboard creation—now applying these skills to predictive modeling, customer analytics, credit risk, and fraud detection in a data-driven environment.
-Data Science & Statistical Modeling (Python pandas, NumPy scikit-learn, matplotlib, seaborn, Excel, R, SQL, SPSS, Power BI, Tableau)
-Machine Learning & Automation (Logistic Regression, Random Forests, Decision Trees, Naive Bayes, Support Vector Machines (SVM), Clustering, Feature Engineering, Anomaly Detection)
-Model Deployment (Streamlit dashboards, Flask APIs))
-Big Data & Cloud (Basic knowledge of PySpark and Apache Spark (via self-study), Familiarity with cloud platforms (Google Cloud, AWS fundamentals – Coursera)
-Data Handling (Feature Engineering, ETL, Data Cleansing, Data Wrangling)
-SQL & Data Management (MongoDB, Hive, Pig, PySpark (learning))
-Data Visualization & Reporting(Power BI Dashboards, Business Intelligence Tableau Reports, KPI Presentations, Matplotlib, Seaborn)
-Data Engineering & Warehousing (ETL Pipelines, SQL for Data Extraction and Transformation, Hadoop, Scala, Data Cleaning, Schema Design (Academic Projects)
-Methodologies (Agile Delivery, PRINCE2, Test-Driven Analysis)
-Stakeholder Engagement & Communication
1. Power BI and Tableau Performance Tracking and Monitoring Dashboards
Designed and deployed live dashboards to track program performance tracking and monitoring indicators, financial disbursements, and real-time field data.
2. Telecom Customer Churn Dataset
Built a customer churn prediction model using Python, applying machine learning algorithms (Naïve Bayes, SVM, Logistic Regression, Decision Tree, Random Forest), with data preprocessing, exploratory analysis, and model evaluation to improve accuracy and support business decision-making.
3. Stock Market Prediction Using LSTM and Sentiment Analysis
Developed a hybrid deep learning model using Long Short-Term Memory (LSTM) to predict stock trends on the Nairobi Securities Exchange, integrating financial time series data with sentiment analysis from news and social media. Improved predictive accuracy over traditional models (ARIMA, GARCH) by capturing both technical patterns and investor sentiment.
4. ETL and Data Warehousing Mini-Project (SQL)
Designed and implemented an ETL pipeline using SQL to extract, transform, and load customer records into a normalized schema. Focused on banking customer segmentation and product performance analysis.
5. Transaction Anomaly Detection (PySpark, Academic Simulation)
Used PySpark to process simulated transactional data and detect anomalies via clustering. Demonstrated Spark-based processing on large-scale data as part of a cloud computing course project.
Networking
Travelling
Community Development
Reading
Young Women Empowerment