Open to Opportunities

Brian Kipkemboi

Data Science Student ยท Analytics, Machine Learning, NLP & Human-Centred Problem Solving

I am a Master of Data Science student at James Cook University with a strong interest in applied analytics, machine learning, natural language processing, and real-world problem solving. I bring frontline human services experience to every model I build.

MDS
James Cook Uni
About Me

Where Data Science
Meets Human Impact

๐ŸŽ“

Academic Foundation

Master of Data Science (Professional) at James Cook University, Cairns - building deep expertise in machine learning, NLP, statistical analysis, and applied analytics.

โšก

Technical Capability

Proficient in Python, R, and SQL. Experienced with end-to-end pipelines - from web scraping and data wrangling to model training, evaluation, and visualisation.

๐Ÿค

Human-Centred Lens

Background in support work, youth services, and community environments - bringing stakeholder awareness, structured reporting, and communication to technical work.

๐ŸŒ

Cross-Cultural Perspective

Kenyan-Australian perspective spanning healthcare, international development, and community services - informing a uniquely empathetic approach to data problems.

I am an emerging data scientist who combines rigorous technical training with deep human services experience. My academic work at James Cook University has equipped me with strong foundations in machine learning, NLP, statistical modelling, and data visualisation, while my professional background in homelessness case management, residential youth support, and community care has given me something most data scientists lack: the ability to see the people behind the data.

I am particularly interested in health data, social impact analytics, intelligent systems, and innovation that makes real differences in people's lives. My projects span sentiment analysis, computer vision, IoT systems, and AI-powered advocacy tools, each one grounded in a genuine need I've observed firsthand.

What sets me apart is the ability to combine technical execution with clear communication and thoughtful design. I build models that work, dashboards that tell stories, and presentations that move stakeholders to action. I write code and I write compellingly about what that code means.

Technical Arsenal

Skills & Competencies

A full-stack data science toolkit, from statistical foundations to deployed applications paired with the professional strengths that make technical work meaningful.

๐Ÿ’ป
Programming & Languages
Python R SQL JavaScript HTML / CSS Bash
๐Ÿง 
Data Science & ML
EDA Statistical Analysis Machine Learning NLP Sentiment Analysis Clustering Regression Classification Deep Learning Computer Vision Topic Modelling Feature Engineering
๐Ÿ› 
Tools & Frameworks
Pandas NumPy Scikit-learn PyTorch TensorFlow NLTK Matplotlib Seaborn Plotly Tableau BeautifulSoup Selenium Flask OpenCV YOLO Git Jupyter AWS
โœฆ
Professional Strengths
Communication Structured Reporting Problem Solving Stakeholder Awareness Data Storytelling Crisis Management Cross-Cultural Collaboration Case Management Ethical Reasoning Technical Writing
Featured Work

Projects

Every project starts with a real question. These represent end-to-end data science - from problem framing through deployment.

NLP / Sentiment
๐Ÿ’ฌ

NLP Sentiment Analysis Pipeline

Full 11-step data science pipeline analysing 1,200+ social media comments. Dual-model scoring with VADER and TextBlob, ensemble classification, LDA topic modelling, temporal trend analysis, and word cloud generation.

VADER TextBlob LDA NLTK Web Scraping
Health / Social Impact
๐Ÿฅ

AI-Powered Benefits Advocacy Engine

Designed an intelligent system that scans government eligibility rules, matches client profiles, and identifies unclaimed benefits for vulnerable populations. Grounded in frontline homelessness case work and real service data.

RAG React Eligibility Matching Social Impact
Dashboard / IoT
๐Ÿ“Š

Real-Time Agricultural Quality Dashboard

Deployed a two-device IoT system using computer vision (YOLO + OpenCV) with LoRa mesh telemetry. Live Flask dashboard showing quality percentages, stock levels, and sales inference from edge devices.

YOLOv8 OpenCV Flask LoRa Raspberry Pi
Machine Learning
๐Ÿค–

Crop Disease Classification Model

Built a MobileNetV2 transfer learning model for agricultural disease detection from leaf images, with multilingual treatment recommendations. Targeted at smallholder farmers in resource-limited settings.

MobileNetV2 Transfer Learning TensorFlow Agriculture
Innovation / AI
โšก

GPT-Style Language Model Trilogy

Three progressive notebooks building transformer models from scratch: character-level GPT, document-parsing RAG with cross-attention, and web-scraping GPT with hybrid TF-IDF + BM25 retrieval.

PyTorch Transformers RAG From Scratch
Career Path

Experience

A trajectory from frontline human services to applied data science - each role building on the last.

2026 โ€” Present
AI Research Intern
JCU Innovation & Industry Lab โ€” Mind Scrapers Team
  • WIL placement (EG5300) under Dr. Samantha J. Horseman, completing 200 research hours
  • Building real-time computer vision + IoT monitoring systems deployed at field locations
  • Collaborating with industry partners on maritime AI, edge computing, and agricultural technology
  • Presenting at investment workshops, vibe coding panels, and CAVE 3.0 immersive lab launches
05/2024 โ€” Present
Homelessness Case Manager & Residential Youth Worker
Anglicare Far North Queensland โ€” Cairns
  • End-to-end case management for individuals experiencing or at risk of homelessness
  • Comprehensive intake, strengths-based assessments, and individualised care planning
  • Advocacy with housing, mental health, AOD, DFV, and legal service providers
  • Crisis management, de-escalation, and independent living skills development
  • Accurate documentation and KPI-driven outcome reporting within quality frameworks
10/2023 โ€” 06/2024
Residential / Youth Support Worker
Enabling Pathways โ€” 24/7 Respite Accommodation
  • Round-the-clock support for young people in residential care settings
  • Behavioural support, shift reporting, and multi-agency coordination
Kenya โ€” Prior
Healthcare & International Development
Reale Hospital ยท Nuru International Kenya
  • Foundational experience in healthcare administration and community development
  • International development programme delivery in rural Kenyan communities
Academic Background

Education

Master of Data Science (Professional)
James Cook University - Cairns Campus
In Progress
Comprehensive training in machine learning, NLP, statistical modelling, health data analytics, and applied research methods. Key coursework includes MA5851 (NLP & Machine Learning), health data capstone, and WIL industry placement (EG5300).
Certificate III in Individual Support
Community Services
Completed
Foundation qualification in disability, aged care, and community support work. Providing the human-centred lens that informs ethical data science practice and stakeholder-aware solution design.
Self-Directed Technical Development
Continuous Learning
Ongoing
Karpathy's "Neural Networks: Zero to Hero" curriculum. Three progressive GPT notebooks built from scratch. AWS SageMaker, Tableau, and cloud deployment. Open-source contributions and independent research in low-resource NLP.
Get In Touch

Let's Work Together

Open to data science roles, research collaborations, and partnerships - especially where analytics meets real-world human impact.

๐Ÿ“
Cairns, Queensland, Australia
๐Ÿ“ง
briankipkemboi52@gmail.com
๐Ÿ“ฑ
0448 748 242