AI Inference Vs Training: A Clear-Cut Guide and How to Optimize Both
Leo
Oct 22, 2025
A GPU performance optimization pioneer who believes "computing resources should flow as efficiently as utilities." Specializes in transforming complex cluster management into actionable solutions.
1.Senior Engineer, AWS EC2 GPU Instance Optimization Team (5 years)
2.Led a 40% failure-rate reduction project for a 1,000-GPU cluster in autonomous driving
3.Core designer of WhaleFlux’s scheduling algorithms
1.MS Distributed Systems, Carnegie Mellon University
2.BS Computer Science, Tsinghua University