Rent an RTX 4080 for production diffusion and 7B/13B serving. 16 GB GDDR6X with 9,728 Ada cores — the cheapest card that runs SDXL batch-4 at ~6.5 it/s, serves Llama-3 8B FP16 via vLLM, and finishes 8B QLoRA fine-tunes in 2–3 hours of spot rental. Spun up in under 90 seconds, billed per-minute, paid in BTC, USDT/USDC or CLORE. The 16 GB production tier.
Three years in, the 4080 is still one of the most rented cards on the network. NVLink-capable, 16 GB, and cheap — ideal for hobbyists, students, and side projects.
16 GB is the floor where vLLM with 16-request KV cache fits cleanly for 8B FP16 serving — the spec where hobby diffusion turns into real production batch work. ~70% of 4090 throughput at ~55% of the rental price, and FP8 inference paths supported.
SD 1.5, SDXL, ComfyUI workflows. Blender Cycles with OptiX delivers solid 1080p–4K renders at hobbyist-friendly cost.
vLLM and TGI containers run 7B–13B FP16 models with comfortable batch sizes. The cheapest path to production-grade open-source inference.
Older silicon, but 16 GB is 16 GB. For workloads that fit, the 4080 is the cheapest path to a real GPU. Specs from Nvidia's reference sheet.
// prices are spot-market lows · refreshed every 60 s
Every server is priced by its host. These are the live floors across the marketplace — you'll see hundreds of variants once you're in.
No sales call. No quota request. No three-week procurement. The first four commands are all you need.
Filter the marketplace by RTX 4080, country, GPU count, reliability score, network speed.
Choose a Docker image — PyTorch, vLLM, ComfyUI, Blender — or paste your own.
You get a public endpoint, an SSH key, and Jupyter on port 8888 in under 90 s.
Per-minute billing rounds to the second. Stop the instance and the meter stops with it.
Pick the 4080 when 16 GB is enough — SDXL batch-2, 7B fine-tuning, 13B INT8 inference. ~70% of 4090 throughput at ~55% of the rental price. Step up to 4090 for 24 GB and 70B INT4 work.
Consumer cards on CLORE.AI cover most hobby and indie workflows: Stable Diffusion 1.5 and SDXL, ComfyUI/Automatic1111, Flux.1, LoRA and QLoRA fine-tuning of 7B-13B LLMs, Whisper transcription, video transcoding, Blender Cycles, and game-server hosting. Anything that fits in 8-32 GB VRAM and runs in Docker runs here. You get full root SSH plus a Jupyter template if you want one.
Cold-start lands in roughly 60-90 seconds for a typical Docker image: server allocation, container pull, GPU passthrough, SSH up. Pre-cached templates (PyTorch, ComfyUI, vLLM, Ollama) are faster because the image is already on the host. Once running you pay per minute, so a 10-minute experiment costs ten minutes of rental, not an hour.
On-demand is a fixed per-hour price the host sets; the rental cannot be revoked while you have funds. Spot is auction-style: you bid, the highest bidder runs, and a higher bidder can preempt you. Spot is typically 30-50% cheaper. CLORE.AI charges 2.5% on spot and 10% on on-demand, split 50/50 with the host.
Spot prices on CLORE.AI usually beat RunPod community pricing because there is no centralized markup; you rent directly from the host with a 2.5% spot fee. Vast.ai is the closest comparison, and on consumer cards CLORE.AI is generally within a few cents per hour. Hold CLORE in your wallet for Proof of Holding and you stack up to 50% off the marketplace fee.
Yes. Point at any registry - Docker Hub, GHCR, Quay, your private registry - then set env vars, port forwards, and your SSH public key in the rent dialog. Templates on the platform are just preset configs; nothing is locked down. You get full root inside the container with GPU passthrough.
16 GB Ada — the production pick for SDXL/Flux at scale, 7B fine-tunes, and 13B INT8 inference.
16 GB fits 8B FP16 plus 16-request KV cache — the cheapest card to run a real serving stack.
Read the guide →Batch-4 generation pipeline for client work — 16 GB clears all VAE/CLIP/UNet caches simultaneously.
Read the guide →8B fine-tunes complete in 2–3 hours of 4080 spot rental — fits 8K context with gradient checkpointing.
Read the guide →Side-by-side specs across the consumer tier. Click any row to see that GPU.
Step-by-step guides verified on CLORE.AI hardware. Pick a workload, copy the docker image, ship in minutes.
Per-minute payouts in BTC, USDT, USDC or CLORE. No listing fee, no contracts, withdraw any time.
Hosts around the world are accepting workloads right now. Sign up, top up your wallet, and the next hour is yours.