Advanced15 minGPU & AgentsDev Preview
GPU Matrix Multiplication
Runs a 512x512 PyTorch matrix multiplication on the available accelerator (CUDA, MPS, or CPU) by declaring @app.step(gpu='A100'), demonstrating how to route steps to GPU worker pools.
Coming SoonDev Preview
This content is part of an upcoming preview program. Request early access
#distributed#gpu#python#pytorch
flow.py