The Blog



Share

Deploy large models on Amazon SageMaker using DJLServing and DeepSpeed model parallel inference