Training | Active
TWIST
High-Efficiency LLM Training System via Strand Interleaving on NVIDIA Hopper GPUs.
llm, training, distributed-systems, hopper
MLSys @ USTC
Systems for distributed LLM training and efficient AI infrastructure.
High-Efficiency LLM Training System via Strand Interleaving on NVIDIA Hopper GPUs.
Adaptive clustering system for large-scale ML workloads with dynamic resource scheduling.