NeuralOps

A multi-agent orchestration platform for an enterprise logistics company. Replaced a team of 12 manual operators with AI agents handling 40,000+ decisions per day.

AI AgentsLangGraphFastAPI2024

Outcome98.4% decision accuracy · 12× throughput · $2.1M annual savings

ClientEnterprise logistics operator (NDA)

IndustryLogistics & Supply Chain

Duration14 weeks

Team4 engineers, 1 product lead

Challenge

The client's operations team was processing 40,000+ route-planning, load-balancing, and exception-handling decisions per day. Each required a trained operator to review data from three systems, apply business rules, and make a judgement call. The team of 12 operators was at capacity, error rates were climbing, and scaling headcount further was economically unsustainable. They needed a system that could make these decisions autonomously - without sacrificing accuracy.

Approach

Week 1–2: Decision mapping

We embedded with the operations team for two weeks, shadowing operators and documenting every decision type, the data inputs required, the business rules applied, and the edge cases that required human judgement. We identified 23 distinct decision types, categorised them by complexity and frequency, and defined which could be fully automated versus which required a human-in-the-loop.

Week 3–5: Architecture design

We designed a multi-agent system using LangGraph for orchestration. A routing agent triaged incoming decision requests and dispatched them to specialist sub-agents - one for route optimisation, one for load balancing, one for exception handling. A supervisor agent monitored outputs and escalated to human review when confidence dropped below threshold. All agents shared a structured working memory via PostgreSQL.

Week 6–11: Build and integration

We built each specialist agent with a dedicated tool set: the route optimisation agent had access to maps APIs, historical route performance, and real-time traffic. The load balancing agent had access to warehouse inventory, vehicle capacity, and driver availability. All tool calls were structured with strict typing and logged in full. We integrated with the client's existing ERP via a FastAPI middleware layer.

Week 12–14: Eval, hardening, and handoff

We ran the system in shadow mode for two weeks - making decisions in parallel with human operators without acting on them. We compared outputs, identified disagreements, and tuned the agents. When accuracy exceeded 97% across all decision types, we began a phased rollout: AI-first with human review on exceptions, then full autonomy with random auditing. We built a monitoring dashboard so the operations manager could see decision volume, accuracy rates, and escalation patterns in real time.

Results

98.4%Decision accuracy

40,000+Daily decisions processed

12×Throughput increase

$2.1MAnnual cost savings

12 → 2Operator team reduction

1.8sAverage decision latency

Stack

PythonLangGraphOpenAI GPT-4FastAPIPostgreSQLRedisDockerAWS ECS

"We were sceptical that AI could handle the edge cases our best operators deal with. Hostwire proved us wrong. The system makes better decisions than most of our team, and the two operators who remain are focused on the genuinely hard problems."

Head of OperationsEnterprise Logistics Client

More work

PulseAI

A real-time market intelligence copilot for a VC fund - ingesting 10,000+ sources daily, extracting signals, and generating investment memos on demand.

View case study

Customer support team working at screens

Reachly

Tiered AI support deployment for a D2C skincare brand handling 1,200 to 1,800 support tickets a month. AI handles Tier 1 autonomously, drafts Tier 2 for human review, and routes Tier 3 directly to agents, with no bot loops.

View case study

ChainVault Protocol

A DeFi yield-optimization protocol with dynamic rebalancing across 8 chains. Designed the full tokenomics, smart contracts, and risk management layer.

View case study