Search-indexed profile page

Projects and Case Studies | Bikash Sapkota

Public-safe case studies covering mobility data foundations, people-flow analytics, optimization systems, and document intelligence.

Mobility Data Foundation: GPS-scale geospatial pipelines

Problem: Raw mobility inputs are noisy, high-volume, and difficult to reuse across research, consulting, analytics, and product workflows.

Solution: Designed reproducible ingestion and transformation layers that validate, normalize, aggregate, enrich, and prepare location records for downstream analytics.

Impact: Reduced repeated preprocessing work by standardizing analytical datasets. Improved confidence in outputs through validation, documentation, and repeatable transformations. Created foundations for maps, dashboards, CSV exports, and model features.

Stack: Python, Spark, AWS S3, AWS Glue, GeoPandas, H3, Data quality

People-Flow Analytics Products: Urban planning and location intelligence

Problem: Urban and mobility teams need reliable views of movement patterns without manually interpreting raw GPS trajectories.

Solution: Built analytical outputs for OD movement, road-volume intensity, and POI-footfall patterns using spatial aggregation and map-ready data products.

Impact: Translated movement signals into decision-ready geospatial outputs. Supported stakeholder workflows across research, consulting, and product teams. Made repeated mobility analysis easier to reproduce and compare over time.

Stack: Python, H3, GeoPandas, ClickHouse, Maps, CSV products

Energy Pricing Optimization System: Optimization and platform integration

Problem: Pricing workflows needed optimization logic that could be integrated into operational systems and maintained over time.

Solution: Implemented optimization modules, API interfaces, and OpenADR-related integrations while improving architecture reliability.

Impact: Moved optimization research closer to production workflows. Created integration surfaces for client platforms. Reduced operational risk by refactoring away single points of failure.

Stack: Python, MINLP, REST APIs, OpenADR, Optimization, System refactoring

Document Intelligence Pipeline: OCR and operational automation

Problem: Scanned claims required structured extraction and classification before they could move efficiently through operational workflows.

Solution: Combined OCR extraction, text classification, rule-based extraction, NER, and interface improvements to support faster manual review.

Impact: Improved the path from scanned documents to structured operational data. Reduced friction for manual keying workflows. Connected ML extraction with practical back-office usability.

Stack: OCR, Tesseract, FineReader, WEKA, Random Forest, NER