Rent Tesla V100
Tesla V100 32GB · live marketplace — from $0.28 / hr

Rent a Tesla V100
by the minute.
Not the month.

Rent a Tesla V100 for legacy training and FP32 scientific compute. 32 GB HBM2 at 900 GB/s — pre-Hopper Transformers pipelines run unchanged, FP32 CFD and molecular dynamics workloads sustain ~14 TFLOPS, batch Whisper transcription at 9× realtime. Retired hyperscaler silicon, re-listed at attractive rates by hosts who picked up bulk inventory. Billed per minute, paid in BTC, USDT/USDC or CLORE.

Per-minute billing · SSH + Docker + Jupyter · Spot & on-demand · 12 regions
DEPLOYMENT MANIFEST REQ-Tesla V100-2026-04-29-#82144
01 INSTANCE CONFIGURED
GPU NVIDIA Tesla V100 SXM2
COUNT ×8 · NVLink
VRAM 256 GB HBM2
FABRIC NVLink · 300 GB/s
02 REGION & SLA VERIFIED
REGION US-EAST-2 · Tier III+
UPTIME 99.95% SLA
NETWORK 10 Gbit · <2 ms
ATTEST. SOC 2 · ISO 27001
03 COMMERCIAL PRICED
SPOT $0.28/hr
ON-DEMAND $2.40/hr
◊ EXECUTABLE CLR · clore.ai/v100 · per-minute billing 29.04.2026 · 14:22 UTC
$0.28/hr
Starting spot price
32GB
HBM2 VRAM per card
12
Regions with Tesla V100 hosts
<90s
Cold-start to ready
workloads

Training-grade silicon,
by the minute.

Tesla V100s are what you rent when minutes matter. FP16 mixed precision on Volta's tensor cores cuts memory and roughly doubles throughput on Transformer workloads; HBM2 keeps the SMs fed; NVLink lets eight cards train as one.
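
A minimal sketch of that FP16 pattern in PyTorch AMP; the model, sizes, and step count are placeholders, not a tuned recipe:

import torch

model = torch.nn.Linear(4096, 4096).cuda()          # placeholder model
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
scaler = torch.cuda.amp.GradScaler()                # loss scaling for FP16

x = torch.randn(64, 4096, device="cuda")
target = torch.randn(64, 4096, device="cuda")

for step in range(10):
    optimizer.zero_grad(set_to_none=True)
    with torch.cuda.amp.autocast():                 # FP16 on Volta; no BF16 path on V100
        loss = torch.nn.functional.mse_loss(model(x), target)
    scaler.scale(loss).backward()                   # scaled loss avoids FP16 underflow
    scaler.step(optimizer)
    scaler.update()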

Cheapest HBM + NVLink card listed

The only sub-$0.30 spot listing with HBM memory and NVLink support — meaningful when bandwidth-bound legacy code paths or FP32 scientific simulations need server-grade interconnect without paying A100 rates. Plenty of supply from hyperscalers retiring 2018-era inventory in 2026.

Spot $0.28/hr vs $0.78 A100 40GB

FP32 scientific compute

CFD and molecular dynamics code that depends on full-rate FP32 sustains ~14 TFLOPS through PyTorch, cuFFT, and cuSPARSE; server-grade HBM bandwidth at the lowest rate on the marketplace.

~14 TFLOPS sustained FP32

Batch transcription

32 GB HBM2 fits Whisper large-v3 with big batches; faster-whisper on CTranslate2 FP16 sustains ~9× realtime at batch 16. Ideal for transcription backlogs where latency is not critical.

~9× realtime, batch 16
why Tesla V100

The cheapest HBM GPU
you can rent by the minute.

Volta introduced the tensor core and trained the first generation of large foundation models. Specs from Nvidia's V100 SXM2 datasheet; pricing reflects the lowest live spot floor.

Tesla V100 32GB A100 80GB RTX 5090 RTX 4090
Architecture Volta Ampere Blackwell Ada Lovelace
CUDA cores 5,120 6,912 21,760 16,384
VRAM 32 GB HBM2 80 GB HBM2e 32 GB GDDR7 24 GB GDDR6X
Memory bandwidth 900 GB/s 1,935 GB/s 1,792 GB/s 1,008 GB/s
FP16 tensor (dense) 125 TFLOPS 312 TFLOPS ~210 TFLOPS ~165 TFLOPS
From / hr (spot) $0.28 $0.92 $0.39 $0.31

// prices are spot-market lows · refreshed every 60 s

pricing

Two ways to rent.
Pay only for the minutes you use.

Every server is priced by its host. These are the live floors across the marketplace — you'll see hundreds of variants once you're in.

Spot

$0.28 / hr
≈ 0.0000025 BTC · 187 CLORE
  • Lowest possible rate
  • Per-minute billing
  • Can be interrupted by on-demand renter
  • Best for batch training, rendering
Browse spot Tesla V100s
MOST RENTED

On-demand

$2.40 / hr
≈ 0.0000214 BTC · 1,603 CLORE
  • Guaranteed availability
  • No preemption, ever
  • Per-minute billing
  • Best for inference, dev work, demos
Rent on-demand
Pay with
Bitcoin on-chain
CLORE native token
USDT / USDC ERC-20 · BEP-20
workflow

Four steps to a running Tesla V100.

No sales call. No quota request. No three-week procurement. Four steps are all you need.

01 / FILTER

Pick your card

Filter the marketplace by Tesla V100 32GB, country, GPU count, reliability score, network speed.
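
The same filter can be scripted against the public API (see the CLORE API guide below). The endpoint path, auth header, and response fields in this sketch are assumptions; check the API docs for the real schema.

import requests

API_KEY = "your-api-key"
resp = requests.get(
    "https://api.clore.ai/v1/marketplace",      # assumed endpoint path
    headers={"auth": API_KEY},                  # assumed auth header
    timeout=30,
)
resp.raise_for_status()

v100s = [
    s for s in resp.json().get("servers", [])   # assumed response shape
    if "Tesla V100" in str(s.get("gpu", ""))    # assumed field name
]
print(f"{len(v100s)} V100 hosts online")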

02 / RENT

Click rent

Choose a Docker image — PyTorch, vLLM, ComfyUI, Blender — or paste your own.

$ clore rent --gpu "Tesla V100 32GB"
03 / CONNECT

SSH or Jupyter

You get a public endpoint, an SSH key, and Jupyter on port 8888 in under 90 s.

04 / STOP

Stop anytime

Per-minute billing, metered to the second. Stop the instance and the meter stops with it.
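
A worked example of the metering, using the spot floor from this page:

seconds = 37 * 60 + 30          # a 37 min 30 s spot run
cost = seconds / 3600 * 0.28    # $0.28/hr spot floor
print(f"${cost:.4f}")           # -> $0.1750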

faq

Questions hosts and renters ask.

Should I still pick V100 over A100 in 2026?

Only for legacy code paths or budget-constrained FP32 scientific workloads. For transformer training, the A100 40GB is faster, has TF32, and isn't much more expensive. Pick V100 when the price gap matters more than throughput.

Can I pretrain a 70B model from scratch on this GPU?

Single-card, no - 70B pretraining needs an 8-GPU node minimum. CLORE.AI lists 8x H100, 8x H200, and 8x B200 pods with NVLink fabric for exactly this. A100 80GB pods run 70B FSDP training but at lower throughput than Hopper-class. For multi-week training, contact host operators for reserved-instance terms - listed in the marketplace under 'Reserved'.

What's the FP8 throughput here vs A100 80GB?

Neither card does FP8 - V100 tops out at FP16 tensor cores, A100 at BF16/TF32. H100 introduces FP8 with TransformerEngine and roughly 4x the BF16 training throughput at 2x the rental price - so ~2x perf-per-dollar on FP8-eligible workloads. H200 matches H100 compute but adds 141 GB HBM3e. B200 doubles H100 FP8 again with 192 GB HBM3e. Pick by VRAM and bandwidth ceiling, not just sticker FLOPS.

Does this support NVLink-Switch / NVSwitch fabric?

V100 SXM2 nodes use NVLink 2.0 at 300 GB/s per GPU; first-gen NVSwitch shipped only in 16-GPU DGX-2-class systems. Further up the stack, 8-GPU H100 SXM, H200 SXM, and B200 nodes ship with NVSwitch fabric - 900 GB/s peer bandwidth on H100/H200, 1.8 TB/s 5th-gen NVLink on B200. PCIe variants (H100 PCIe, A100 PCIe) have NVLink Bridge in pairs only. Multi-node fabric (NVLink-Switch across racks) is available on B200 hyperscale pods - filter by 'NVSwitch' in the marketplace.

Are 8-GPU pods available for FSDP and DeepSpeed training?

Yes. Multi-GPU listings expose all cards in a single rental as a coherent node with NVSwitch (where present), shared NVMe scratch, and InfiniBand or 100 GbE fabric for multi-node training. The standard PyTorch torchrun, DeepSpeed, and Megatron-LM launchers run unmodified. Filter the marketplace by GPU count to find 8x A100, 8x H100, 8x H200 nodes.
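
A minimal FSDP skeleton of the kind that runs unmodified on these nodes; the model is a placeholder and the script assumes a torchrun launch:

# train_fsdp.py -- launch: torchrun --nproc_per_node=8 train_fsdp.py
import os
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

dist.init_process_group("nccl")                  # NCCL rides the NVLink fabric
torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

model = torch.nn.Sequential(                     # placeholder model
    torch.nn.Linear(4096, 4096), torch.nn.ReLU(), torch.nn.Linear(4096, 4096),
).cuda()
model = FSDP(model)                              # shard parameters across all 8 GPUs

opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
x = torch.randn(8, 4096, device="cuda")
model(x).square().mean().backward()              # dummy loss for illustration
opt.step()
dist.destroy_process_group()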

What's the HBM bandwidth comparison vs the predecessor?

V100 (HBM2, 900 GB/s) -> A100 40GB (HBM2e, 1,555 GB/s) -> A100 80GB (HBM2e, 1,935 GB/s) -> H100 (HBM3, 3,350 GB/s) -> H200 (HBM3e, 4,800 GB/s) -> B200 (HBM3e, 8,000 GB/s). Each generation roughly doubles bandwidth or VRAM; KV-cache-bound serving and bandwidth-bound training scale almost linearly with this number.
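
A rough back-of-envelope for why that scaling holds: decode is memory-bound, so single-stream tokens/s is bounded by bandwidth divided by the bytes of weights read per token. The 7B FP16 model here is an illustrative assumption:

# Upper-bound decode throughput for a memory-bound LLM: BW / bytes-per-token.
weights_gb = 7e9 * 2 / 1e9          # 7B params x 2 bytes (FP16) = 14 GB per token

for gpu, bw in [("V100", 900), ("A100 80GB", 1935), ("H100", 3350),
                ("H200", 4800), ("B200", 8000)]:    # GB/s, from the chain above
    print(f"{gpu:10s} <= {bw / weights_gb:5.0f} tok/s (batch 1, single stream)")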

workload spotlight

Real numbers on the Tesla V100.

32 GB HBM2 Volta — retired hyperscaler silicon priced for legacy training and FP32 scientific compute.

Legacy HuggingFace training
Transformers + fp16 + DDP
~0.6× A100 40GB on BERT-large

Pre-Hopper Transformers pipelines run unchanged — V100 is the cheapest card with HBM and NVLink support.

Read the guide →
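
A minimal Trainer sketch in the spirit of that guide; the dataset slice and hyperparameters are placeholders, and fp16=True is what routes matmuls to Volta's tensor cores:

from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)
from datasets import load_dataset

tok = AutoTokenizer.from_pretrained("bert-large-uncased")
model = AutoModelForSequenceClassification.from_pretrained("bert-large-uncased")

ds = load_dataset("glue", "sst2", split="train[:1%]")   # tiny slice for illustration
ds = ds.map(lambda b: tok(b["sentence"], truncation=True,
                          padding="max_length", max_length=128), batched=True)

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=16,
    fp16=True,                       # FP16 mixed precision (no bf16 on V100)
    num_train_epochs=1,
)
Trainer(model=model, args=args, train_dataset=ds).train()
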
FP32 scientific simulation
PyTorch FP32 + cuFFT / cuSPARSE
~14 TFLOPS sustained FP32

CFD / molecular dynamics workloads that depend on FP32 — V100 is the cheapest HBM card with full FP32.

Read the guide →
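
A small FP32 spectral kernel of the kind those workloads are built from; torch.fft dispatches to cuFFT, and the grid size is illustrative:

import torch

field = torch.randn(256, 256, 256, device="cuda", dtype=torch.float32)
spec = torch.fft.rfftn(field)                    # real-to-complex 3-D FFT in FP32
field_back = torch.fft.irfftn(spec, s=field.shape)
torch.cuda.synchronize()
print(torch.allclose(field, field_back, atol=1e-3))  # round-trip check
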
Whisper-large transcription batch
faster-whisper + CTranslate2 fp16
~9× realtime, batch 16

32 GB HBM2 fits large-v3 + big batches — attractive for batch transcription where latency is not critical.

Read the guide →
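
A minimal faster-whisper call consistent with that stack; the audio path is a placeholder:

from faster_whisper import WhisperModel

model = WhisperModel("large-v3", device="cuda", compute_type="float16")
segments, info = model.transcribe("episode_001.wav", beam_size=5)  # placeholder file

print(f"detected language: {info.language}")
for seg in segments:
    print(f"[{seg.start:7.2f} -> {seg.end:7.2f}] {seg.text}")
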
datacenter comparison

Datacenter-tier comparison.

Side-by-side specs across the datacenter tier. Click any row to see that GPU.

GPU · HBM · Mem BW (GB/s) · FP8 TFLOPS · BF16 TFLOPS · NVLink BW · Transformer Engine · Spot $/hr
Tesla V100 (this page) · 32 GB HBM2 · 900 · n/a · n/a · 300 GB/s · no · $0.28
A100 40GB · 40 GB HBM2e · 1,555 · n/a · 312 · 600 GB/s · no · $0.78
A100 80GB · 80 GB HBM2e · 1,935 · n/a · 312 · 600 GB/s · no · $0.92
H100 · 80 GB HBM3 · 3,350 · 1,979 · 989 · 900 GB/s · yes · $1.89
H200 · 141 GB HBM3e · 4,800 · 1,979 · 989 · 900 GB/s · yes · $2.40
B200 · 192 GB HBM3e · 8,000 · ~4,500 · ~2,250 · 1,800 GB/s · yes · $3.40
workload guides

Run these on your rented Tesla V100.

Step-by-step guides verified on CLORE.AI hardware. Pick a workload, copy the docker image, ship in minutes.

Language Models
llama.cpp server
GGUF quantized inference with HTTP/OpenAI-compatible API.
Training
HF Transformers training
Train and fine-tune with the Trainer API.
Training
Jupyter for ML training
Notebook-driven training and experimentation.
Audio / Voice
Whisper transcription
OpenAI Whisper-large for speech-to-text.
Image Generation
A1111 WebUI on CLORE.AI
The classic SD WebUI with extensions and LoRA.
Language Models
Mistral / Mixtral
Run Mistral 7B and Mixtral 8x7B / 8x22B.
Advanced
CLORE API integration
Programmatic order creation via the public API.
See all guides →
other gpus

Compare with similar cards.

A100 40GB
40 GB · from $0.78/hr
Rent →
A100 80GB
80 GB · from $0.92/hr
Rent →
RTX 4090
24 GB · from $0.31/hr
Rent →

Your training run
is 60 seconds away.

Hosts around the world are accepting workloads right now. Sign up, top up your wallet, and the next hour is yours.