Microsoft Azure expands availability of OpenAI GPT-4 and GPT-35-Turbo models to more regions

Published on August 8, 2023

by Davesh Beri

Microsoft has expanded its Azure AI infrastructure, making OpenAI’s GPT-4 and GPT-35-Turbo models available in more regions worldwide. The service now covers Australia East, Canada East, East US 2, Japan East, and UK South.
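For readers who deploy by region, here is a minimal sketch that maps the five regions named above to the lowercase identifiers Azure tooling typically expects. The `NEW_REGIONS` table and the `region_id` helper are illustrative names, not part of any Azure SDK, and the identifiers should be confirmed against your own subscription (for example with `az account list-locations`), since model availability can differ by region and subscription.

```python
# Illustrative lookup table: display name -> Azure region identifier.
# The names come from the article; the identifiers follow Azure's usual
# lowercase, no-space convention. Verify before relying on them.
NEW_REGIONS = {
    "Australia East": "australiaeast",
    "Canada East": "canadaeast",
    "East US 2": "eastus2",
    "Japan East": "japaneast",
    "UK South": "uksouth",
}


def region_id(display_name: str) -> str:
    """Return the region identifier for a display name, or raise KeyError."""
    return NEW_REGIONS[display_name]


def is_newly_covered(identifier: str) -> bool:
    """Check whether an identifier is one of the regions listed in the article."""
    return identifier.lower() in NEW_REGIONS.values()
```

A deployment script could call `is_newly_covered("eastus2")` as a quick sanity check before requesting a GPT-4 deployment in that region.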

The recently launched ND H100 v5 Virtual Machine series features NVIDIA H100 Tensor Core GPUs and Quantum-2 InfiniBand networking, and is built for complex AI workloads. This seems to be Microsoft’s response to the growing complexity of AI applications.

The ND H100 v5 VMs include the following features:

Eight NVIDIA H100 Tensor Core GPUs per VM for faster AI performance.
4th Gen Intel Xeon Scalable processors.
NVIDIA Quantum-2 ConnectX-7 InfiniBand provides 400Gb/s of networking per GPU.
PCIe Gen5 provides 64GB/s of host-to-GPU bandwidth per GPU.
DDR5 memory for large datasets and high data transfer speeds.
Performance tests show up to a 6x speedup in matrix operations when using the 8-bit FP8 data type.
ND H100 v5 VMs achieve a 2x speedup in large language models.
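To put the per-GPU figures in context, the short sketch below multiplies them out across the eight GPUs in one VM. It uses only the numbers quoted in the list above. The variable and function names are illustrative, and the totals are simple arithmetic, not measured throughput.

```python
# Figures quoted in the feature list above.
GPUS_PER_VM = 8
INFINIBAND_GBIT_PER_GPU = 400  # gigabits per second, per GPU
PCIE_GBYTE_PER_GPU = 64        # gigabytes per second, per GPU


def aggregate_per_vm(per_gpu: float, gpus: int = GPUS_PER_VM) -> float:
    """Scale a per-GPU figure to the whole VM (theoretical peak only)."""
    return per_gpu * gpus


# 8 x 400 Gb/s = 3200 Gb/s, i.e. 3.2 Tb/s of InfiniBand per VM.
infiniband_tbit_per_vm = aggregate_per_vm(INFINIBAND_GBIT_PER_GPU) / 1000

# 8 x 64 GB/s = 512 GB/s of host-to-GPU PCIe bandwidth per VM.
pcie_gbyte_per_vm = aggregate_per_vm(PCIE_GBYTE_PER_GPU)

# Bits to bytes: 400 Gb/s is 50 GB/s per GPU, for comparison with PCIe.
infiniband_gbyte_per_gpu = INFINIBAND_GBIT_PER_GPU / 8

print(f"InfiniBand per VM: {infiniband_tbit_per_vm} Tb/s")
print(f"PCIe per VM: {pcie_gbyte_per_vm} GB/s")
print(f"InfiniBand per GPU: {infiniband_gbyte_per_gpu} GB/s")
```

Note the unit difference: the InfiniBand figure is in gigabits while the PCIe figure is in gigabytes, so the two are closer in magnitude than the raw numbers suggest.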

Azure OpenAI Service has nearly tripled its customer base since the last update. The service currently serves over 11,000 customers, adding an average of 100 new customers daily this quarter.

Businesses can use GPT-4 for tasks like content generation, document intelligence, and customer service, potentially opening doors to innovation and transformation across various sectors.

via MSPowerUser
