City
Epaper

Google expands cost-effective AI-optimised infrastructure portfolio for customers

By IANS | Updated: August 30, 2023 15:50 IST

New Delhi, Aug 30 Google on Wednesday expanded its artificial intelligence (AI)-optimised infrastructure portfolio that is both cost-effective ...

Open in App

New Delhi, Aug 30 Google on Wednesday expanded its artificial intelligence (AI)-optimised infrastructure portfolio that is both cost-effective and scalable for its Cloud customers.

The company is expanding its AI-optimised infrastructure portfolio with 'Cloud TPU v5e', the most cost-efficient, versatile, and scalable Cloud TPU to date, which is also now available in preview.

"Cloud TPU v5e is purpose-built to bring the cost-efficiency and performance required for medium- and large-scale training and inference. TPU v5e delivers up to 2x higher training performance per dollar and up to 2.5x inference performance per dollar for LLMs and gen AI models compared to Cloud TPU v4," Google said in a blogpost.

According to the company, TPU v5e is also incredibly versatile, with support for eight different virtual machine (VM) configurations, ranging from one chip to more than 250 chips within a single slice, allowing customers to choose the right configurations to serve a wide range of LLM and gen AI model sizes.

Cloud TPU v5e also provides built-in support for leading AI frameworks such as JAX, PyTorch, and TensorFlow, along with popular open-source tools like Hugging Face’s Transformers and Accelerate, PyTorch Lightning, and Ray.

Moreover, the tech giant announced that its A3 VMs, based on Nvidia H100 GPUs, delivered as a GPU Supercomputer, will be generally available next month to power customers large-scale AI models.

"Today, we’re thrilled to announce that A3 VMs will be generally available next month. Powered by Nvidia’s H100 Tensor Core GPUs, which feature the Transformer Engine to address trillion-parameter models, Nvidia’s H100 GPU, A3 VMs are purpose-built to train and serve especially demanding gen AI workloads and LLMs," Google said.

The A3 VM features dual next-generation 4th Gen Intel Xeon scalable processors, eight Nvidia H100 GPUs per VM, and 2TB of host memory.

Built on the latest Nvidia HGX H100 platform, the A3 VM delivers 3.6 TB/s bisectional bandwidth between the eight GPUs via fourth-generation Nvidia NVLink technology.

Disclaimer: This post has been auto-published from an agency feed without any modifications to the text and has not been reviewed by an editor

Tags: congresspitrodadelhimodideepikabjpwest-bengaldeepika-padukoneajay-devgnthakur
Open in App

Related Stories

PunePCMC Elections 2026: BJP Secures First Win in Pimpri-Chinchwad Before Polls As Ravi Landge Elected Unopposed

MumbaiBMC Election 2026: Nomination Scrutiny Shocks Major Parties as Congress, BJP, AAP Candidates Face Rejection

NationalDelhi Shocker: Man Stabbed to Death Near Shastri Park, Police Launch Probe

MumbaiBJP Candidate Ravi Raja Files Nomination From Ward 185 for BMC Election 2026

MumbaiBMC Election 2026: BJP Finalises 66 Candidates for Mumbai Civic Polls, Who Received AB Forms? See Full List

Technology Realted Stories

TechnologyNew PLI approvals to deepen value chains in components manufacturing: Industry

TechnologyNifty, Bank Nifty hit record highs, Sensex up 0.67 pc

TechnologyIndia rises from 123rd to 8th globally in WHO pharmacovigilance contributions: Nadda

TechnologyMaruti Suzuki India ends 2025 with production crossing record 22.55 lakh vehicles

TechnologyPriyanka Chaturvedi seeks urgent govt attention on AI apps on X sexualising women