DRIZTI - Delivering Personal Supercomputing
  • Home
  • HPCBOX
    • Case Studies
  • Solutions
  • Availability
    • Azure Marketplace
    • Fully Managed HPC
  • About
    • News
    • Partners
    • Press
    • Support
    • Contact
  • Blog

HPCBOX ready for Milan-X on Microsoft Azure

12/7/2021

 
This post is short since it's an update to a previous post which can be found here.

Early Holiday Gift

On November 8th, 2021, Microsoft delivered an early holiday gift by announcing the preview availability of an upgraded version of the Azure HBv3 virtual machines. The upgraded instance is powered by 3rd Gen AMD EPYC™ processors with AMD 3D V-Cache, codenamed “Milan-X”.

As a Microsoft Partner that stays at the forefront of HPC in the cloud with HPCBOX, one of the easiest HPC cloud platforms to use and scale, we at Drizti had to get our hands on this new hardware and make sure HPCBOX could support it as soon as it reaches GA. So we signed up for the private preview, and Microsoft was kind enough to give us access around late November/early December.

Technical Specification

I won't go into much detail on the technical specification of the upgraded SKU, since a lot of information and initial benchmark results have been provided on this Azure Blog and this Microsoft Tech Community article. In short, the upgraded SKU has the same InfiniBand capability as the original HBv3 (HDR), the same 448 GB of memory and the same local scratch NVMe. The main change is the switch to AMD EPYC™ processors with AMD 3D V-Cache, codenamed “Milan-X”, which significantly increases the L3 cache, to around 1.5 gigabytes on a dual-socket HBv3 instance.

Tests and Amazing Results

We performed a few tests to make sure that the HPCBOX platform is ready to support this upgraded SKU once it is released. Below we list the tests and the execution-time speedup we observed for each. I am not including case names and details here because some of them are customer model files. However, if you would like more details, feel free to contact us.
Picture
In the above screenshot, you can see 4 powered-on HBv3 compute workers on an HPCBOX cluster. These HBv3 instances are the upgraded version of the SKU and on the right you can see the CPU information.
Picture
Lower is better
The tests were run on an HPCBOX cluster using CentOS 7.X as the base image.
Test                    Speedup
ANSYS LSDYNA R12.0.0    1.51x
OpenFOAM v1906          1.34x
ANSYS CFX 2019R3        1.27x
ANSYS CFX 2021          1.64x
In the chart and table above, we can clearly see a good improvement in performance using Milan-X, with speedups between 1.27x and 1.64x. The larger cache seems to give a good boost to the applications we tested, and it appears likely that other workloads will benefit as well.
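To put the speedup figures in terms of wall-clock time, here is a minimal sketch. It assumes the speedup is defined as the run time on the original hardware divided by the run time on Milan-X, which the post does not state explicitly. Under that assumption, a 1.51x speedup means roughly a third less run time. The function names are illustrative only.

```python
from math import prod

# Speedups reported in the table above.
# Assumption: speedup = baseline run time / Milan-X run time.
speedups = {
    "ANSYS LSDYNA R12.0.0": 1.51,
    "OpenFOAM v1906": 1.34,
    "ANSYS CFX 2019R3": 1.27,
    "ANSYS CFX 2021": 1.64,
}


def runtime_reduction(speedup: float) -> float:
    """Fraction of wall-clock time saved for a given speedup factor."""
    return 1.0 - 1.0 / speedup


def geometric_mean(values) -> float:
    """Geometric mean, the usual way to average ratios such as speedups."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


for name, s in speedups.items():
    print(f"{name}: {s:.2f}x -> {runtime_reduction(s):.1%} less run time")

print(f"geometric mean speedup: {geometric_mean(speedups.values()):.2f}x")
```

The geometric mean is used rather than the arithmetic mean because the values are ratios; with only four cases it is a rough summary, not a prediction for other workloads.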

Availability in HPCBOX

Being at the forefront of HPC in the cloud, specifically on Microsoft Azure, Drizti is ready for the GA release of the upgraded HBv3 instances, and we will continue our tests and keep providing feedback to Microsoft. We have a few more tests to conduct, including making sure our AutoScaler performs as expected with the Milan-X instances. It appears that the new HBv3 will be a drop-in replacement in HPCBOX and will most likely just work out of the box with all the functionality delivered by HPCBOX.

Conclusion

Microsoft Azure has been the leader in delivering cutting-edge infrastructure for HPC in the cloud, and we are privileged to be partnering with the Azure HPC team. We had the opportunity not only to be a launch partner for the original HBv3 "Milan" instances, but now also to be one of the early partners testing compatibility of these upgraded SKUs with Milan-X.

AMD EPYC "Milan" was a big upgrade for HPC and with Milan-X, it appears that AMD has advanced further and delivered another cutting-edge CPU with amazing performance. 

Drizti, with our HPCBOX platform on Microsoft Azure, is able to deliver a fully interactive supercomputing experience for our users, and it's very exciting that we can upgrade our end users and accelerate their innovation in minutes just by upgrading their compute nodes!


Contact us to learn how you can get rid of in-house hardware and/or your existing cloud presence and get a seamless HPC upgrade to HPCBOX on Azure.
Picture

Author
Dev S.
Founder and CTO, Drizti Inc
All third-party product and company names are trademarks or registered trademarks 
of their respective holders. Use of them does not imply any affiliation or endorsement by them.

HPCBOX Intelligent AutoScaler supports low-priority instances

8/17/2021

 
This post is an update to the previous post announcing preview availability of the HPCBOX AutoScaler.

Low Priority Instances

As of 2021-08-17, the HPCBOX AutoScaler includes support for low-priority instances. Low-priority instances go by different names on different cloud platforms: spot instances on Azure and AWS, preemptible instances on Oracle OCI.
The general idea is that these instances have the same hardware configuration as standard instances, but the cloud vendor can preempt them at any time with short notice. Although these instances can be evicted, they are offered at a much lower price and are well suited to jobs which don't have a tight deadline.

Ideal Setup

Two functions are important for efficient use of low-priority instances: auto-selection and auto-rescheduling. Auto-selection means that the right low-priority hardware is selected based on the job type; for example, we would like to use specific GPGPU nodes for CUDA jobs, dense CPU nodes with a high-speed interconnect for CFD jobs, and so on. Auto-rescheduling means that the system automatically reschedules jobs when the cloud vendor is about to evict the instances and, for applications which support it, automatically restarts the jobs from the last save point.
We are pleased to announce that the HPCBOX AutoScaler supports both of these critical functions out of the box.
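To make the idea of auto-selection concrete, here is a minimal sketch of matching a job type to a class of low-priority hardware. This is not the HPCBOX implementation, which is not described in this post; the node class names, the `REQUIREMENTS` table and the `select_node_class` function are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeClass:
    name: str                 # hypothetical label, not an HPCBOX identifier
    has_gpu: bool
    fast_interconnect: bool


# Hypothetical low-priority node classes available on a cluster.
NODE_CLASSES = [
    NodeClass("gpu-lowpri", has_gpu=True, fast_interconnect=False),
    NodeClass("dense-cpu-lowpri", has_gpu=False, fast_interconnect=True),
]

# Hypothetical mapping from job type to the hardware features it needs.
REQUIREMENTS = {
    "cuda": {"has_gpu": True},
    "cfd": {"fast_interconnect": True},
}


def select_node_class(job_type: str) -> NodeClass:
    """Return the first node class that satisfies the job type's needs."""
    needs = REQUIREMENTS.get(job_type)
    if needs is None:
        raise ValueError(f"unknown job type: {job_type!r}")
    for node in NODE_CLASSES:
        if all(getattr(node, key) == value for key, value in needs.items()):
            return node
    raise LookupError(f"no node class satisfies job type {job_type!r}")


print(select_node_class("cuda").name)
print(select_node_class("cfd").name)
```

A table-driven mapping like this keeps the selection rule in data rather than code, so adding a new job type or node class does not require touching the selection logic.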

Example Scenario

Let us consider a use case to understand how the Intelligent AutoScaler in HPCBOX optimizes resource usage and efficiently combines three different classes of hardware on the same cluster.

The following picture represents an HPCBOX cluster which combines reserved resources (resources with a usage commitment, called Reserved Instances on Azure), standard pay-as-you-go resources and low-priority resources. To optimize the budget spend in such a configuration, one would want the reserved instances to be always powered on to provide a baseline capacity for the cluster, and to automate the use of pay-as-you-go and low-priority resources to minimize resource wastage. Furthermore, we could assume that the compute workers on this cluster are a combination of different hardware configurations; for example, on Azure, a combination of HB120rs_v2 and HB120-16rs_v3 (AMD EPYC “Rome” and “Milan” hardware).
Picture
Depending on the type of job that comes into the system, the AutoScaler either takes no action, powers on standard-rate PAYG hardware, or powers on low-priority PAYG hardware. It also automatically powers off the instances once jobs leave the cluster.
Picture
Picture
Picture
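The three outcomes described above (no action, standard PAYG, low-priority PAYG) and the automatic power-off can be sketched as a small decision function. The rule used here is an assumption for illustration only: a job that fits on free reserved capacity needs no action, a job with a tight deadline gets standard PAYG hardware, and everything else goes to low-priority hardware. The actual AutoScaler logic is not described in this post, and all names below are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    NO_ACTION = "no action"
    POWER_ON_STANDARD = "power on standard-rate PAYG"
    POWER_ON_LOW_PRIORITY = "power on low-priority PAYG"


@dataclass
class Job:
    cores: int
    tight_deadline: bool  # hypothetical flag supplied at submission time


def decide(job: Job, free_reserved_cores: int) -> Action:
    """Assumed rule: reserved capacity first, then deadline decides the class."""
    if job.cores <= free_reserved_cores:
        return Action.NO_ACTION
    if job.tight_deadline:
        return Action.POWER_ON_STANDARD
    return Action.POWER_ON_LOW_PRIORITY


def nodes_to_power_off(running_jobs: dict[str, int], reserved: set[str]) -> list[str]:
    """Idle nodes that are not part of the always-on reserved baseline."""
    return sorted(
        node for node, count in running_jobs.items()
        if count == 0 and node not in reserved
    )


print(decide(Job(cores=120, tight_deadline=False), free_reserved_cores=240))
print(decide(Job(cores=480, tight_deadline=True), free_reserved_cores=240))
print(decide(Job(cores=480, tight_deadline=False), free_reserved_cores=240))
print(nodes_to_power_off({"res-1": 0, "payg-1": 0, "lp-1": 2}, reserved={"res-1"}))
```

Keeping the reserved nodes out of the power-off list reflects the baseline-capacity idea from the scenario: they are already paid for, so switching them off saves nothing.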

Auto-Rescheduling

When the cloud provider needs the hardware back for users who are willing to pay more, our low-priority workers get evicted. However, the AutoScaler makes sure that user jobs get automatically rescheduled without any manual intervention.
Picture
Now comes the best part of the AutoScaler. HPCBOX can automatically attempt to restart the evicted nodes and, when it succeeds, the user jobs that were evicted restart automatically! All this is done with no manual intervention and this, we think, is really how an Intelligent HPC Cloud system should help users optimize their resource usage, deadlines and budgets!
Picture
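The reschedule-and-restart flow can be illustrated with a small in-memory simulation. This is only a sketch under stated assumptions: the `Scheduler` and `Job` classes are hypothetical, the eviction notice is modelled as a plain method call, and a job's "last save point" is just a step counter. How HPCBOX detects evictions and restarts applications is not covered in this post.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    total_steps: int
    checkpoint: int = 0            # last save point, in completed steps
    supports_restart: bool = True  # whether the application can resume


class Scheduler:
    """Toy scheduler: one job per node, evicted jobs go back to the queue."""

    def __init__(self):
        self.queue = deque()
        self.running = {}  # node name -> Job

    def submit(self, job):
        self.queue.append(job)

    def start_next(self, node):
        """Start the next queued job on a free node, if there is one."""
        if self.queue and node not in self.running:
            self.running[node] = self.queue.popleft()
            return self.running[node]
        return None

    def on_eviction_notice(self, node):
        """Requeue the job from a node that is about to be evicted."""
        job = self.running.pop(node, None)
        if job is None:
            return None
        if not job.supports_restart:
            job.checkpoint = 0  # no save point to resume from: start over
        self.queue.appendleft(job)  # evicted jobs keep their place in line
        return job


sched = Scheduler()
job = Job("cfd-run", total_steps=100)
sched.submit(job)
sched.start_next("lp-1")
job.checkpoint = 40                    # application wrote a save point
sched.on_eviction_notice("lp-1")       # cloud vendor reclaims the node
restarted = sched.start_next("lp-1")   # node came back, job restarts
print(f"{restarted.name} resumes from step {restarted.checkpoint}")
```

Putting the evicted job at the front of the queue is a design choice in this sketch, so that an eviction does not also cost the job its position behind newer submissions.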

Availability

HPCBOX AutoScaler is now available in preview and we would be very pleased to run a demo or perform a POC or pilot with you to optimize your cloud spend on HPC resources while making sure your jobs are always matched to the most suitable hardware. Schedule a meeting here.
Picture

Author

Dev S.
Founder and CTO, Drizti Inc


HPCBOX supports AMD EPYC Zen 3 “Milan” on launch day

3/15/2021

 
This is one of the most exciting posts I’ve written in my 15+ years in the High Performance Computing (HPC) industry.

AMD EPYC Zen 3 “Milan” is integrated, tested and available on the HPCBOX platform on Microsoft Azure, on LAUNCH DAY!

This seldom happens in the HPC world; very few users ever get to start production on day one of the release of a new and very impressive processor generation.

I will not do any technical comparison between Zen 3 and Zen 2, nor will I present any specific application benchmark numbers in this post. I am sure there’ll be many posts by AMD, Microsoft and other ISVs publicly sharing the performance boost they see for their codes with EPYC Zen 3 “Milan”. This post is more about HPCBOX and how we could deliver this upgrade experience for our users on launch day, without having to lay our hands on the physical hardware or be physically present in a datacenter (actually several, across multiple regions)! Awesome work and support from the Azure team!

Drizti was a launch partner for the new HBv3 instance size on Microsoft Azure. These instances are powered by the new EPYC 7xx3/Milan CPUs and HDR InfiniBand, and they sport very impressive dual NVMe drives which give a big performance boost to applications which use local scratch, specifically when the drives are striped.

Read more about HBv3 here

Unbelievably quick CPU generation upgrade

It all started with Drizti getting access to 1000+ cores of HBv3 instances for functional and compatibility testing, to make sure we would be ready for general availability (GA) of HBv3 on the launch day of the AMD EPYC “Milan” CPUs. We went through testing and added the necessary support within the HPCBOX platform to make sure we are able to use the cool new features available on the instances: automated striping of NVMe, MPI compatibility, the workflow component of HPCBOX, auto-scaling/shutdown/start and all other functions offered by the HPCBOX platform. In addition to this, we also ran some performance tests with different ISV and open-source codes, mainly to make sure the HPCBOX workflow engine can correctly handle the new instances and automatically optimize application pipelines to take advantage of the new hardware.
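For readers wondering what striping the two local NVMe drives involves, here is a rough sketch of one common Linux approach: a RAID 0 array built with `mdadm`, formatted and mounted as scratch. This is not necessarily how HPCBOX automates it; the device names, the `/dev/md0` array name, the XFS filesystem and the `/mnt/scratch` mount point are all assumptions. The function only builds and prints the commands and does not execute anything.

```python
import shlex


def striping_commands(devices, md_device="/dev/md0", mount_point="/mnt/scratch"):
    """Build (but do not run) commands to stripe NVMe drives into one scratch volume.

    All paths are illustrative defaults, not values taken from HPCBOX or Azure.
    """
    if len(devices) < 2:
        raise ValueError("striping needs at least two devices")
    return [
        # RAID level 0 = striping across all listed devices
        ["mdadm", "--create", md_device, "--level=0",
         f"--raid-devices={len(devices)}", *devices],
        ["mkfs.xfs", md_device],          # filesystem choice is an assumption
        ["mkdir", "-p", mount_point],
        ["mount", md_device, mount_point],
    ]


# Hypothetical device names for the two local NVMe drives.
for cmd in striping_commands(["/dev/nvme0n1", "/dev/nvme1n1"]):
    print(shlex.join(cmd))
```

Striping spreads reads and writes across both drives, which is why scratch-bound applications can benefit; the trade-off is that RAID 0 has no redundancy, which is usually acceptable for temporary scratch data.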

Some of the applications we tested were ANSYS CFX, ANSYS Fluent and OpenFOAM. At a high level, we can share that we are seeing impressive performance benefits from EPYC “Milan”, particularly for large runs. Also, for applications which are bound by local scratch, we expect users to get a good boost thanks to the striped NVMe drives available in HBv3.

We also ran a test to see how easy it would be to upgrade HPC for our users and were really impressed that we could switch our users from HBv2 to HBv3, meaning from “Rome” to “Milan” and to upgraded local scratch, in under 30 minutes! This is mainly due to the design of the HPCBOX platform: it is self-contained, which gives us the ability to fine-tune platform capabilities quickly without depending on external services being upgraded first.
Agile and Impressive!

In Action

Picture
ANSYS Fluent started on HPCBOX with the new HBv3 as compute nodes
Picture
ANSYS Fluent iterating on HPCBOX using 1200 cores of HBv3
Picture
ANSYS CFX on HPCBOX powered by 1200 HBv3 cores
Picture
OpenFOAM on HPCBOX utilizing 1200 HBv3 AMD EPYC Zen 3 "Milan" cores

Partnership with Microsoft

Drizti is very thankful to Microsoft for having given us this opportunity of being a launch partner for the new HBv3 series and for supporting us in making sure we are ready for HBv3 in HPCBOX on launch day. 

Next Steps

We are looking forward to having some of our users try out their models on the new instances and to helping them get a significant boost in productivity with the new hardware.
Please contact us here to learn how you can accelerate innovation with HPCBOX on Azure and how we can help you with Personal Supercomputing, keeping you on the latest and greatest HPC technologies, on launch day, tested and certified for use! You don’t need your own data center or even an in-house HPC team to get the benefits of supercomputers in your product development.

All you need to do is get in touch with us!
Picture


Author
Dev S.
Founder and CTO, Drizti Inc




Copyright © 2017-2022. Drizti Inc. All Rights Reserved.
We use icons from Icons8.


