草榴社区

Announcing the Availability of 草榴社区 ChipSpot

Vikram Bhatia

Mar 27, 2023 / 5 min read

草榴社区 Cloud

Unlimited access to EDA software licenses on-demand

Run Spot VMs for High-Memory EDA Tools on AWS with 草榴社区 Cloud

As more designers deploy production EDA workloads on 草榴社区 Cloud, enabling cloud infrastructure cost optimization has consistently been one of the most common asks. Within the 草榴社区 Cloud product team, we shortlisted a few areas of innovation which would help our customers achieve this goal. Among these, leveraging spot virtual machines clearly stood out as the most effective means to drive costs down. However, as most cloud infrastructure experts would agree, running high performance, high memory tools on spot is not trivial. In this article, we will attempt to define the problem and discuss in detail how we arrived at this unique solution to address the constraints public cloud infrastructure subjects us to.

What is Spot?

Cloud providers plan for capacity of each type of compute resource several quarters in advance to ensure availability of elastic cloud scale. In practical terms, these capacity projections don’t play out perfectly and there’s always a struggle between demand for specific compute virtual machines versus actual supply. When there’s excess capacity of certain compute VMs, cloud providers put these on what is called the “spot market” and make them available at heavily discounted prices with the caveat that these VMs may be removed on short notice. Users have leveraged discounts of up to 80% off on-demand prices in the spot market.

Leveraging Spot for EDA Workloads: What’s the problem?

High performance workloads like EDA that can scale on cloud infrastructure need the ability to recover from a spot VM termination signal in order to ensure that there’s no processing time lost when a job has been running for a while. The most common solution to this problem is to build checkpoint-restore functionality in the tools. Several 草榴社区 tools offer this capability and users have learned to use it well for their needs over the years.

However, just having checkpoint-restore available at your disposal does not enable spot. Spot is a unique beast that adds a more stringent constraint on the deployment architecture by providing a very limited window to take a snapshot of the runtime memory state of the tool. AWS offers a 2 minute warning, which in reality may be much shorter, and Microsoft Azure offers only a 30 second notice currently. 

As we all know, each EDA tool is not created equal and tools that have smaller memory footprint can successfully checkpoint their state within this warning window. Users who run functional verification solution have successfully leveraged the tool’s inherent checkpoint-restore capabilities to run on spot and reduced costs significantly. Similarly, for library characterization on 草榴社区 PrimeLib library characterization solution, since typical distributed jobs run for only a few minutes and runtime state has a very small footprint, customers have successfully enabled spot instances by just ignoring the failures and restarting those jobs.

The challenge is more pronounced when we start exploring high memory workloads such as timing analysis, physical verification, physical design, or RTL-to-Gates implementation. The size of the runtime state for these workloads may run into several hundred gigabytes and the time needed to checkpoint is much longer than the Spot warning window provided by cloud providers. So, jobs which get terminated while running on spot cannot be restored, as no state is saved. This means several hours of runtime and compute usage costs can go to waste. For these workloads, just having checkpoint-restore capability is not enough to effectively use spot.

草榴社区 Cloud ChipSpot: Leverage AI to Optimize Tool and Infrastructure Together for Spot

As each EDA tool behaves differently, we started analyzing runtime memory usage patterns for each tool independently to assess the amount of time needed to successfully save the state. We also explored multiple technical solutions that can either provide alternatives to the standard checkpoint-restore functionality or complement that capability. One unique solution was presented by , which leverages machine learning to predict the onset of a spot termination signal in advance. Earlier this year, 草榴社区 entered into a technical partnership with Exostellar to jointly develop and market an intelligent solution to this problem.

The 草榴社区 ChipSpot solution is built on the concept of creating a “virtual machine array” optimized for each EDA tool which contains a mix of Spot and on-demand VMs. Based on termination signal predictions from its AI-driven algorithm, the solution migrates the running EDA workload, live, to an on-demand VM in the VM array, thus reducing the chances of the workload being terminated. Once spot availability eases, the running state is migrated back to a spot VM in the array. We tested this architecture extensively on some of our most compute-intensive high memory workloads and announced the spot-optimized 草榴社区 ChipSpot solution for EDA in August of 2023. Powered by Exostellar X-Spot technology, 草榴社区 ChipSpot enables customers to save up to 75% off on-demand compute prices. 草榴社区 ChipSpot is now available for 草榴社区 Fusion Compiler?, 草榴社区 PrimeTime?草榴社区 StarRC?草榴社区 VCS? and  草榴社区 IC Validator? tools for AWS EC2 spot instances.

To sign-up, please reach out to your 草榴社区 sales representative or email cloud@synopsys.com.

When we first launched 草榴社区 Cloud one year ago, one of our primary goals was to enable industry transforming technology for chip designers who want to leverage cloud. With 草榴社区 Cloud FlexEDA, the industry’s first true pay-per-use business model, and a completely browser-based high performance computing experience, 草榴社区 Cloud is driving cutting edge innovation to enable our customers to focus on what they do best – design chips, faster.

草榴社区, EDA, and the Cloud

草榴社区 is the industry’s largest provider of electronic design automation (EDA) technology used in the design and verification of semiconductor devices, or chips. With 草榴社区 Cloud, we’re taking EDA to new heights, combining the availability of advanced compute and storage infrastructure with unlimited access to EDA software licenses on-demand so you can focus on what you do best – designing chips, faster. Delivering cloud-native EDA tools and pre-optimized hardware platforms, an extremely flexible business model, and a modern customer experience, 草榴社区 has reimagined the future of chip design on the cloud, without disrupting proven workflows.

 

Take a Test Drive!

草榴社区 technology drives innovations that change how people work and play using high-performance silicon chips. Let 草榴社区 power your innovation journey with cloud-based EDA tools. Sign up to try 草榴社区 Cloud for free!


About The Author

Vikram Bhatia is head of cloud product management and GTM strategy at 草榴社区. He's responsible for building the industry's first completely browser-based EDA-as-a-Service platform, 草榴社区 Cloud. He has over 25 years of experience in product strategy, and prior to joining 草榴社区, he served in a variety of leadership roles at companies including NetApp, Oracle, HP and Microsoft. Over the last decade, Vikram has exclusively focused on transforming traditional on-premises business models to cloud based SaaS offerings though product management, go-to-market strategy, partnerships, and sales transformation initiatives. Vikram has a Bachelor of Technology degree from the Indian Institute of Technology Kanpur, and graduate degrees from the Colorado School of Mines and the Indian School of Business.

Continue Reading