> **Some clever networking hacks open the door**
>
> AI search provider Perplexity's research wing has developed a new set of software optimizations that allow trillion-parameter or larger models to run efficiently across older, cheaper hardware using a variety of existing network technologies, including Amazon's proprietary Elastic Fabric Adapter.
>
> These innovations, detailed in a paper [published](https://arxiv.org/abs/2510.27656) this week and [released](https://github.com/perplexityai/pplx-garden) on GitHub for further scrutiny, present a novel approach to one of the biggest challenges in serving mixture-of-experts (MoE) models at scale: memory and network latency.
>
> **Mo parameters, mo problems**
>
> **[...read more at theregister.com](https://www.theregister.com/2025/11/05/perplexity_1t_parameter_models_aws_efa/)**