tensor_parallel

vllbc 收录于大模型分布式 LLM

2025-07-23 约 42 字预计阅读 1 分钟次阅读

参考

The Ultra-Scale Playbook: Training LLMs on GPU Clusters
💥 Training Neural Nets on Larger Batches: Practical Tips for 1-GPU, Multi-GPU & Distributed setups | by Thomas Wolf | HuggingFace | Medium
Training extremely large neural networks across thousands of GPUs.