Red Hat Expands AI Collaboration with AWS
- Red Hat AI on AWS Trainium and Inferentia AI chips to provide customers with greater choice, flexibility and efficiency for production AI workloads
Red Hat has announced a deeper partnership with Amazon Web Services (AWS) to enhance enterprise-grade generative AI capabilities. The collaboration focuses on delivering efficient AI inference at scale using AWS’s custom silicon and Red Hat’s open-source platforms. New tools and integrations aim to help organizations modernize infrastructure while balancing performance, cost, and flexibility.
Collaboration Details
The partnership brings together Red Hat's AI Inference Server and AWS's purpose-built AI chips, Inferentia2 and Trainium3. By integrating these technologies, customers can run generative AI models with lower latency and improved cost efficiency; Red Hat cites performance gains of 30–40% compared with GPU-based Amazon EC2 instances. The collaboration also extends to Red Hat OpenShift, with an AWS Neuron operator enabling smoother deployment of AI workloads on AWS infrastructure.
Red Hat customers will gain easier access to AWS’s high-capacity accelerators through this initiative. The company has also released the amazon.ai Certified Ansible Collection, allowing orchestration of AI services on AWS. Contributions to the open-source vLLM project further strengthen inference and training capabilities, with Red Hat acting as a leading commercial contributor. vLLM serves as the foundation for llm-d, now available as a supported feature in Red Hat OpenShift AI 3.
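For teams adopting the amazon.ai Certified Ansible Collection mentioned above, installation follows the standard Ansible Galaxy workflow. The sketch below is a minimal `requirements.yml`; the collection name comes from the announcement, while the version pin is illustrative only:

```yaml
# requirements.yml -- standard ansible-galaxy requirements format.
# The amazon.ai collection name is taken from Red Hat's announcement;
# the version constraint shown here is a placeholder, not a real release.
collections:
  - name: amazon.ai
    version: ">=1.0.0"   # hypothetical -- check the certified collection catalog
```

The collection would then be installed with `ansible-galaxy collection install -r requirements.yml`, after which its modules and roles become available to playbooks that orchestrate AI services on AWS.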
Availability and Industry Impact
The AWS Neuron community operator is already accessible in the Red Hat OpenShift OperatorHub. Support for AWS AI chips in Red Hat’s AI Inference Server will enter developer preview in January 2026. Executives from Red Hat and AWS emphasized that the collaboration provides enterprises with scalable, cost-effective AI solutions. Industry analysts noted that the partnership reflects a broader trend of organizations shifting from experimental AI projects to sustainable production environments.
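Community operators surfaced in OperatorHub are typically installed through the Operator Lifecycle Manager by creating a `Subscription` resource. The manifest below is a sketch of that standard OLM pattern; the package name and channel for the AWS Neuron operator are assumptions, not values confirmed by the announcement:

```yaml
# Standard OLM Subscription shape for installing an OperatorHub operator.
# The package name "neuron-operator" and channel "alpha" are assumptions --
# confirm both in the OperatorHub listing before applying.
apiVersion: operators.coreos.com/v1alpha1
kind: Subscription
metadata:
  name: neuron-operator
  namespace: openshift-operators
spec:
  channel: alpha                       # hypothetical channel name
  name: neuron-operator                # hypothetical package name
  source: community-operators          # community operators catalog source
  sourceNamespace: openshift-marketplace
```

Applying a manifest like this with `oc apply -f` would instruct OLM to install and keep the operator updated from the chosen channel.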
IDC forecasts that by 2027, 40% of organizations will adopt custom silicon, including Arm processors and AI-specific chips, to meet growing performance demands. This prediction underscores the importance of collaborations like the one between Red Hat and AWS, which aim to optimize infrastructure for AI workloads. Red Hat's long-standing relationship with AWS has already supported customers from the data center to the edge. The latest expansion highlights how open-source platforms and purpose-built hardware are converging to accelerate enterprise AI adoption.
