Applied ML Engineer · AI Researcher
I build production ML systems at a national scale and publish the research behind the decisions.
End-to-end ML systems, open-source tools, and applied engineering.
CDIS Architecture
Designed & implemented Timor-Leste's national meteorological platform serving DNMG, part of UNEP's Green Climate Fund project. Live production at 97.64% task / 96.20% DAG success rate over 1.4+ years, processing 1 TB+/day across 5 NWP models (GFS, ECMWF-IFS/AIFS, ICON, and UK Met) and 2 satellites (Himawari-9 and GK2A). Key optimizations: NetCDF reads 30s → 80ms (375×); observation queries 4s → 800ms (5×).
CDIS CAP Alert System
Built the CAP v1.2 stack: pycap-validator (Python package for OASIS schema enforcement + digital signature verification), Django RSS backend, MQTT pub/sub, and n8n multi-channel dissemination (email, Facebook, Twitter/X). Deployed in production at DNMG Timor-Leste and Bangladesh Meteorological Department via Grameenphone for population-scale automated alerting.
Bengali Land Record OCR: Methodology
Production VLM system for extracting JL and plot numbers from Bangladesh government land-record scans at scale. Fine-tuned Qwen3-VL-2B/8B with sibling lookup + symbolic verifier + conformal prediction. Achieved 98.9% JL precision @ 94% auto-accept and 96.7% plot precision @ 92% auto-accept at 1.83s/image on a 396,802-image corpus across 6 regions.
DART Architecture
First-author ML theory + multi-domain empirical research formalizing SCDR as a new problem class. Proves no single-decoder loss can simultaneously achieve high CSI and Bias ≈ 1 (Theorem 1). DART (dual-decoder + gradient isolation) delivers 32–74% bias reduction at statistically equivalent CSI on Himawari atmospheric data (n=5 seeds, |t| up to 23.3); +0.049 CSI@10 on ShanghaiTech crowd density; beats pretrained RainNet on its own DWD radar data with ~7.6× less compute. Manuscript in preparation.
Boundary IoU across the 9-model series — every yolo11-seg variant plateaus at the 0.167 ceiling; v6 Mask2Former (Swin-L) breaks it to 0.210.
An instance-segmentation study delineating smallholder paddy parcels from 10 cm aerial imagery over Dhamrai, Bangladesh. A controlled 9-model series shows that every yolo11-seg variant — plus pseudo-labelling, RGB+nDSM multi-channel fusion (LiDAR-derived canopy height), and a small-patch specialist — plateaus at a Boundary IoU ceiling of 0.167. Mask2Former (Swin-L) is the first architecture to break it, reaching BIoU = 0.210 (+26% relative) at confidence 0.80. Built on 4,662 hand-digitised ground-truth parcels across 41 labelled tiles with matching LiDAR point clouds, trained on an NVIDIA GH200. Ships as a fully reproducible study (all 8 checkpoints, code, and a paper scaffold) with a live Streamlit demo.
System architecture — browser geolocation + SSE, FastAPI, and a LangGraph state machine over DeepSeek and BMD WRF forecasts.
A production-shaped weather assistant for Bangladesh that always replies in Bengali and answers weather questions only. Built as a single stateful LangGraph: a guardrail/intent node gates off-topic queries and routes service questions to a context-injected FAQ; a resolve_location node uses browser geolocation or pauses the graph with a human-in-the-loop interrupt() to ask the user; and a ReAct agent loop (DeepSeek) calls a weather tool backed by Bangladesh Meteorological Department WRF forecasts via the BDServers API at upazila resolution. FastAPI streams tokens over SSE; SqliteSaver checkpointing enables multi-turn memory and resume, with node-level retry policies for reliability. Thoroughly documented with architecture, graph-flow, state-schema, and request-sequence diagrams.
Multi-channel conversational sales AI on LangGraph + FastAPI + RAG over 10K+ products. Built for multi-tenant scale; 4 paying enterprise customers. NVIDIA Inception Program (2025). On course for Bangladesh Innovation Grant.
Production hybrid RAG chatbot for Bangladesh Dept. of Livestock Services. Qwen2.5-7B Q4_K_M on a 4GB GPU; BGE-M3 dense + BM25 sparse + RRF fusion + bge-reranker-v2-m3 cross-encoder; Redis-backed sliding-window session memory; Bangla/English/Banglish transliteration-before-retrieval. Achieved 71% beta satisfaction from production rating API.
Refactored the World Meteorological Organization's wis2downloader from a monolithic architecture to a distributed Celery + async + spatial-filtering pipeline. Reduced data ingestion latency from ~4 hours to 130ms. Merged upstream; adopted by Timor-Leste's national meteorological agency and other NMHSs.
AI GeoLAB Ltd
Designed and shipped the Khatian VLM cascade for Bengali land-record extraction at Bangladesh government scale: fine-tuned Qwen3-VL-2B/8B with sibling lookup + symbolic verifier + conformal prediction, achieving 98.9% JL precision at 94% auto-accept on 396,802 images across 6 regions.
Regional Integrated Multi-Hazard Early Warning System (RIMES)
Architected a 3-tier backend: Django REST API, Martin vector tile server (PostgreSQL-backed), and TiTiler raster tile server, containerized with Docker and deployed on HPC infrastructure. Built NLAS (71% beta satisfaction), Cirrus AI (full fine-tune via frontier distillation), and Heatshield (ICDDR,B). Developed the CAP v1.2 infrastructure deployed in Timor-Leste and Bangladesh. Open-source: wis2downloader refactor (4hr → 130ms, WMO).
Interactive Cares
Scaled platform 13× (8K → 100K users) and 8× DAU. Designed geo-distributed CDN, load balancer, and API consolidation. Co-led Accelerating Asia victory (Top 9 of 500+ global startups, 2023). Grew engineering team from 2 to 8; introduced CI/CD, cutting deployment time 60%.
Shipday, Inc. (Remote)
Integrated AI assistant chatbot into production platform. Built internal analytics dashboard improving marketing campaign ROI by 25%.
Survey of Bangladesh, Ministry of Defence — NSDI Project
Engineered geospatial automation pipeline reducing manual processing by 80% (saved 2,400+ person-hours/year). Built validation system processing 50GB+/month for 43 government departments.
Bangladesh University of Engineering and Technology (BUET)
GRE 329 (Q170, V159) · IELTS 8.5
Founder & Lead Architect · Founded July 2025
Enterprise AI platform built for operational scale.
Founded and building an end-to-end enterprise AI platform. Responsible for full-stack architecture, model development, and customer delivery.
Peer-reviewed papers, conference proceedings, and preprints.
arXiv:2509.09195 · Under preparation for ICLR 2027
Heliyon, 10(1), e23308 · Elsevier
Capacity Building Training, National Center of Meteorology · UAE
Climate Services User Forum, South Asian Hydromet Forum (SAHF) · Malé, Maldives