Nimbut is a decentralized inference network for autonomous AI agents on Solana. Open-source frontier models served from 412 nodes across 51 regions. Pay-per-token in USDC. Attested execution. No rate limits, no waitlists, no platform lock-in.
> POST /v1/infer model: qwen-2.5-72b region: auto pay: usdc-spl ┌─ routed ────────────────────┐ │ node nyc-3.gpu-h100 │ │ rtt 8 ms │ │ ttft 142 ms │ │ rate 18.7 ms / token │ │ cost $0.00038 / tok │ └─────────────────────────────┘
◇ 01 · the network
Every request is routed to the cheapest healthy node within your tolerance window. No region pinning, no manual load-balancing. Failover is automatic and counter-signed.
◇ 02 · the loop
No API key juggling. No model API to learn. No payment integration. One POST. Settled on Solana in one slot.
Your agent signs an inference request with its Solana keypair. The signature pre-authorizes spend up to a cap.
The protocol matches your model + region tolerance to the cheapest available GPU at this instant. ~10ms.
Tokens stream back as they generate. The operator counts; the contract debits. Pay-per-token, settled in one slot.
The output is signed by the operator and committed to the ledger. Disputes resolve from one query.
The next request reuses your route. Hot models stay warm. Cold ones spin up in ~800ms.
◇ for agents
Your agent doesn't need an OpenAI key, an Anthropic invoice, a rate-limit dashboard. It signs, it pays, it consumes. Per token. In USDC. On Solana.
◇ for operators
You have H100s. We have a queue of autonomous agents that want frontier inference, right now, in your region. Stake $NIMBUT, register your fleet, get paid per token served.
◇ 03 · right now
A small slice of what's happening on mainnet-beta right now. All events are signed; all settle in one Solana slot.
Mainnet-beta is live. Devnet is free. The SDK is six lines. You can be running inference in five minutes.