Tachyum Publishes Prodigy Performance Optimization Manual
[ Back ]   [ More News ]   [ Home ]
Tachyum Publishes Prodigy Performance Optimization Manual

LAS VEGAS — (BUSINESS WIRE) — December 17, 2024Tachyum® today announced that it has released a 1,600-page Performance Optimization Manual for its Prodigy® Universal Processor FPGA hardware.

Tachyum’s Prodigy Performance Optimization Manual provides detailed information on how to fully benefit from the performance features that are built into Prodigy. It includes the required design guidelines for the development of high-performance software for a broad range of applications, including cloud, AI, and HPC.

The manual outlines Prodigy’s revolutionary new Universal Processor microarchitecture, and intrinsic functions as well as processor instructions, throughputs, and latencies. Tachyum has also included Prodigy’s performance counters, which enable performance monitoring and analysis across a wide array of run-time events.

The Prodigy Performance Optimization Manual describes special considerations for performance optimization that include dispatch constraints, load/store alignment, optimizing memory routines, branch instruction alignment, special register access, register forwarding hazards, cache maintenance operations, and complex instructions.

“Software programmers, test engineers, compiler developers, and systems and solutions engineers will appreciate the opportunity to take this deep dive into how Prodigy offers inherent performance benefits for efficient processing of AI, cloud, and HPC workloads,” said Dr. Radoslav Danilak, founder and CEO of Tachyum. “Prodigy’s integrated features will help users achieve industry-leading compute efficiency to derive insights faster, to perform research faster, to generate results faster.”

The Prodigy Instruction Set Architecture (ISA) includes a large number of vector and matrix instructions that optimize the performance and efficiency of vector and matrix operations. The Prodigy ISA is a mix of RISC and CISC but doesn’t include any complex and/or long variable length, inefficient instructions that many CISC processors have. All instructions are either 32 or 64-bits wide, and some instructions include memory accesses to optimize performance.

As a Universal Processor offering industry-leading performance for all workloads, Prodigy-powered data center servers can seamlessly and dynamically switch between computational domains (such as AI/ML, HPC, and cloud) with a single homogeneous architecture. By eliminating the need for expensive dedicated AI hardware and dramatically increasing server utilization, Prodigy reduces CAPEX and OPEX significantly while delivering unprecedented data center performance, power, and economics. Prodigy integrates 192 high-performance custom-designed 64-bit compute cores, to deliver up to 4.5x the performance of the highest-performing x86 processors for cloud workloads, up to 3x that of the highest performing GPU for HPC, and 6x for AI applications.

Follow Tachyum

https://x.com/Tachyum
https://www.linkedin.com/company/tachyum
https://www.facebook.com/Tachyum/

About Tachyum

Tachyum is transforming the economics of AI, HPC, public and private cloud workloads with Prodigy, the world’s first Universal Processor. Prodigy unifies the functionality of a CPU, a GPU, and a TPU in a single processor to deliver industry-leading performance, cost and power efficiency for both specialty and general-purpose computing. As global data center emissions continue to contribute to a changing climate, with projections of their consuming 10 percent of the world’s electricity by 2030, the ultra-low power Prodigy is positioned to help balance the world’s appetite for computing at a lower environmental cost. Tachyum received a major purchase order from a U.S. company to build a large-scale system that can deliver more than 50 exaflops performance, which will exponentially exceed the computational capabilities of the fastest inference or generative AI supercomputers available anywhere in the world today. When complete in 2026, the Prodigy-powered system will deliver a 25x multiplier vs. the world’s fastest conventional supercomputer – built just this year – and will achieve AI capabilities 25,000x larger than models for ChatGPT4. Tachyum has offices in the United States, Slovakia and the Czech Republic. For more information, visit https://www.tachyum.com/.



Contact:

Mark Smith
JPR Communications
818-398-1424
marks@jprcom.com