Experienced data scientist with a strong background in statistical analysis, machine learning, and data visualization. Proficient in Python, R, SQL, and various data processing tools, with a focus on delivering actionable insights. Collaborative approach and adaptability consistently lead to impactful results in dynamic environments. Recognized for problem-solving abilities and innovative thinking in leveraging data to drive business decisions.
Key Accomplishments :
1. HIV Data analytics & use, quality assurance and HIV Modelling in Kenya
· Ensure that the Country Office and national partners have a timely and accurate measurement of change in conditions at the national and County levels, including both monitoring of HIV trends and through spectrally specific management information systems (e.g. DHIS, EID, Viral load, TB Tibu System)
· Provide Kenya HIV Prevalence estimates for both National and county planning. This is done jointly with CDC, UNAIDS, UNICEF, NASCOP, NACC and other HIV partners Using HIV mathematical modeling tools - EPP Spectrum
· Support UNICEF in National and county-level data collection and analysis on HIV at the outcome/impact levels of relevant indicators in collaboration with international partners.
· Provide technical support on Exploratory data analysis using Python, Jupyter, Anaconda, R and R studio for programming.
· Using python conduct Predictive analytics regression analysis, PCA, Survival analysis, to extract useful insights that inform strategic program decisions
· Using Python, Jupyter, and Scikit-learn built time series forecasting models for HIV commodities supply prediction and anomaly detection.
· Big data processing using My SQL database with Python and R under Hadoop and PySpark framework.
· Perform biannual Data Quality Assessments in the 47 counties in both HTS, PMTCT and Care &Treatment
2. Programme management, monitoring and delivery of results
· Ensure that the Country Office and partners have necessary information to assess progress towards expected results established in work plans, with special attention paid to identifying proper HIV indicators and means of verification during the planning phases, measuring progress in removing bottlenecks and barriers, and measuring the quality of MOH/ UNICEF implementation of its commitments to the host nation.
· Lead on supporting county teams in developing a system for monitoring and evaluating HIV program performance.
3. Research, surveys and evaluations
· Working with NASCOP, NACC, KNBS, CDC, UNAIDS, WHO and her partners within the technical working groups for conducting AIDS Impact Assessments in Kenya (PHIA/KAIS) for 2017. These include PHIA protocols, informed consent forms,Sampling, Questionnaires, Standard Operating Procedures, training materials and training plans, TOTs, data management and Analysis, country-specific work plans and corresponding budgets the collection, analysis and synthesis of data.
· Designed and maintained largescale databases for unstructured and structured data, and for data collection mobile apps
· Exploratory Data Analysis (EDA) to identify trends and correlations in survey data
· Developed data Visualization dashboards, Analytics & Storytelling to communicate insights using Tableau, Power BI and Python Pandas, matplotlib, and seaborn)
· Use of a centralized, integrated, and publicly accessible data repository for storing and maintaining all national health data-DHIS.
4. County Capacity development and support
· Strengthening the capacity of NBS and MOH Technical programs in the field of data management and alignment between health actors for transparency among partners.
· Revise Monitoring of MOH HIV tools with NASCOP, NACC, MOH and partners, Train MOH staff on HIV tools & Guidelines and Support in Rolling out in the county.
· Use of Visualization technologies, such as Tableau, PowerBI, Bokeh,Dash, Shinny, Plotly, Flask, Google charts, and Dundas BI for Visualizationand methodologies to unlock the value in UNICEF data
· Analyzing qualitative and quantitative data from EMR (CPAD, Open-MRS) systems and Clinical Studies Scientifically for Cohort data using SPSS, SAS and STATA;
· Exploratory Data Analysis (EDA) to identify trends and correlations in survey data
· Developed data Visualization dashboards, Analytics & Storytelling to communicate insights using Tableau, Power BI and Python Pandas, matplotlib, and seaborn)
· Using Python, Jupyter, and Scikit-learn built time series forecasting models for HIV commodities supply prediction and anomaly detection.
· Analyze and interpret Big survey data /complex datasets in My SQL database using Python and R under Hadoop and PySpark framework.
· Development and implementation of frameworks, and strategic plans in health information systems e.g. M&E plans, Registers for routine data collection and program monitoring; developing plans to ensure data quality.