Why Are Nvidia's T4 Tensor Core Cards Special?


Published: 5-10-2021



We’ve explained what “Tensor cores” are in a previous blog post, but what you may not know is that Nvidia is working hard to make these already blistering cores even faster. The T4 PCIe card was first introduced in 2018, but it’s still a product that many people have never heard of. Yet it can play a crucial part in modern data centers.

What Does the T4 Do?

The T4 is designed to accelerate processing tasks typically used in machine learning applications and other high-performance tasks using tensor math. It can also perform general-purpose GPU computing tasks using CUDA cores, so anything that’s been written to use CUDA on regular GPUs should work here as well.
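As a rough illustration of the kind of “tensor math” involved, the sketch below computes a fused matrix multiply-accumulate, D = A x B + C, on a small tile, with the inputs rounded to half precision first. It runs on the CPU in plain Python and only mimics the arithmetic pattern. The 4x4 tile size and the names `to_half` and `tile_mma` are assumptions made for this example, not a description of the T4’s internals or of any Nvidia API.

```python
import struct


def to_half(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def tile_mma(a, b, c):
    """Return D = A x B + C for square tiles given as lists of lists.

    A and B are rounded to half precision first (hypothetical choice for
    this sketch); the running sum is kept in Python's ordinary float.
    """
    n = len(a)
    a16 = [[to_half(v) for v in row] for row in a]
    b16 = [[to_half(v) for v in row] for row in b]
    d = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            acc = c[i][j]  # start from the accumulator tile C
            for k in range(n):
                acc += a16[i][k] * b16[k][j]
            d[i][j] = acc
    return d


# Example: identity x ones + ones gives a tile filled with 2.0.
identity = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
ones = [[1.0] * 4 for _ in range(4)]
result = tile_mma(identity, ones, ones)
```

Neural-network training and inference boil down to very large numbers of these small multiply-accumulate steps, which is why dedicated hardware for them pays off.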


What the T4 doesn’t do is connect to a display device and act as a GPU. It doesn’t have any back-panel IO at all and is designed to be installed in low-profile server systems.

The T4’s Specs

The T4 card is essentially what you get when you take an Nvidia RTX GPU and remove the graphics-card features, such as display outputs. What you have left are the CUDA cores, dedicated Tensor Cores and the ray-tracing acceleration hardware found in RTX GPUs. The T4 is based on the Turing architecture specifically, so its tensor and ray-tracing hardware matches that of equivalent RTX cards on a per-core basis.


The T4 in particular has:


  • 2560 CUDA cores.

  • 320 Turing Tensor cores.

  • 16GB of GDDR6 with ECC.


Compared to a regular x86 CPU typically used in servers, the T4 is much, much faster at processing jobs such as training neural nets or drawing inferences from data.


It’s about more than just hardware performance, however. The T4 card is passively cooled, low-profile and draws only 70W at its peak. In environments where the energy cost of computation is a major factor, this can make it far cheaper to run machine learning and GPGPU tasks on something like the T4 than on a typical server-grade CPU.
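To put the 70W figure in perspective, here is a small back-of-the-envelope sketch of the energy a card would use if it ran at peak power all year. Only the 70W value comes from the article. The electricity price of $0.12 per kWh and the function names are assumptions for illustration, and real costs depend on utilization, cooling and local rates.

```python
PEAK_WATTS = 70            # peak draw quoted in the article
HOURS_PER_YEAR = 24 * 365  # 8760 hours, ignoring leap years


def annual_energy_kwh(watts, utilization=1.0):
    """Energy used in one year, in kWh, at a given average utilization."""
    return watts * utilization * HOURS_PER_YEAR / 1000


def annual_cost(watts, price_per_kwh, utilization=1.0):
    """Yearly electricity cost for the given power draw and price."""
    return annual_energy_kwh(watts, utilization) * price_per_kwh


# Worst case: the card runs flat out around the clock.
t4_kwh = annual_energy_kwh(PEAK_WATTS)

# ASSUMPTION: $0.12 per kWh is a placeholder price, not a quoted rate.
t4_cost = annual_cost(PEAK_WATTS, price_per_kwh=0.12)

print(f"{t4_kwh:.1f} kWh per year, about ${t4_cost:.2f} at the assumed rate")
```

Plugging in the power draw and throughput of whatever you are comparing against gives a like-for-like cost per job rather than a headline figure.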

Who is the T4 For?

The T4 has a place both in server systems and in workstations, especially workstations used for machine learning tasks such as creating deepfakes or upscaling footage with high-end AI upscalers. The use cases for machine-learning acceleration are growing by the day, and adding a dedicated card to handle that work while you keep working in the foreground could be a cost-effective way to boost your available processing power.


For server owners in data centers, or perhaps just in SMEs or creative groups who need to share processing time, T4 cards and the like offer a way to accelerate offline rendering or machine-learning workloads in a small package. Many servers already have several low-profile PCIe slots to spare, which means that T4s can act as in-place upgrades and free up traditional CPUs for other tasks.

A Niche Card With Wide Applications

While a headless GPU packed with specialized silicon isn’t a component we’d recommend to every customer, the application of machine learning methods and the need to perform tensor math is growing rapidly. The T4 makes a whole lot of sense for a surprisingly large number of consumers.






LIST OF COMPATIBLE WORKSTATIONS


Titan W24 Octane - Intel Xeon W-2400 Series CPUs Content Creation Workstation PC for CAD, 3D and VR Design, up to 24 CPU Cores


Virtual and Augmented Reality are set to be two of the most important development areas in professional work, as well as entertainment, education, and many other use cases. The W24 is designed to get you in on the ground floor of VR design, and take you wherever you’d want to go in the future.





Intel Xeon W-2400 Series Processors Workstation Computer for:

AI • Machine Learning • 3D Rendering • CAD/CAM • GPU Parallel Computing • 3D Modeling • CGI • Computer Animation • Video Editing • Design & Visualization • Game Engineering

Starting Price: $4,195.00
Titan W422 Octane - Intel Xeon W-2200 Series Processors Workstation PC for VR Design, CUDA GPU Rendering up to 18 CPU Cores


We’ve carefully chosen the best processor and motherboard combination to let you use as much performance from your multi-GPU setup as possible. In fact, every component in this workstation has been chosen around the concept of multi-GPU computing. If you’re looking to crunch some GPU workloads, this is where to start.





Intel Xeon W-2200 Series Processors Workstation Computer for:


GPU Parallel Computing • Engineering CAD/CAM • Graphic Design • 3D Modeling • CGI • Computer Animation • Video Editing • Design & Visualization • Gaming

Starting Price: $4,995.00
Titan W34 Octane - Intel Xeon W-3400 Series Processors Workstation PC for Digital Animation, AI, Deep Learning up to 56 CPU Cores


We’re living in the AI age now, and if you want to develop or run AI models on your local system, you’ll need the hardware to pull it off in practical time frames. The Titan W34 Octane offers the latest Intel Xeon AI-acceleration technology to make short work of your AI projects.




Intel Xeon W-3400 Series Processors Workstation Computer for:

Architectural Engineering • GPU Parallel Computing • Deep Learning • AI • CAD/CAM • Graphic Design • 3D Modeling • CGI • Computer Animation • Video Editing • Design & Visualization • Medical Science

Starting Price: $4,995.00
Titan S34 - Intel Xeon W-3400 Series 4U Rackmount Workstation PC for 3D Animation, AI, Deep Learning up to 56 CPU Cores


3D Animation, AI, and Deep Learning are rapidly evolving and becoming critical components in numerous fields such as entertainment, healthcare, and scientific research. The Titan S34 is built specifically for these cutting-edge applications and designed to give you the best performance right out of the box.





4U Rackmount Workstation / Server Computer for:

Design & Visualization • Data Analysis • AI / Deep Learning Computing • Media / Video Streaming • Cloud Gaming • Animation and Modeling • 3D Rendering • Diagnostic Imaging • Machine Learning

Starting Price: $5,235.00
Titan A790 - AMD Ryzen Threadripper Pro 7000 Series Workstation PC - up to 96 cores


Our Titan A790 OCTANE PRO is a high-performance workstation PC powered by the AMD Ryzen Threadripper Pro 7000 Series, capable of handling up to 96 cores. This workstation is designed for professionals requiring extreme computing power at a highly-competitive price.





AMD Threadripper Pro 7000 Series Workstation Computer for:

3D Rendering • CAD/CAM • Product Design • 3D Modeling • CGI • Computer Animation • Video Editing • Design & Visualization • Machine Learning • Fluid Dynamics

Starting Price: $5,585.00
Titan A499 OCTANE PRO - AMD Ryzen Threadripper Pro 5000 WX Series Workstation PC - up to 64 cores

There are plenty of computers out there with “Pro” tacked on to the end of their names, but none like the Titan A499 Octane Pro deliver an equal amount of CPU threads and performance at a price that won’t make you feel like you’re paying for a brand. Like all Titan Workstations, this system is built using top grade internal components to ensure you get the maximum performance out of your processor.





AMD Threadripper Pro 5000 WX Series Workstation Computer for:

3D Rendering • Deep Learning • Data Analysis • AI • Machine Learning • Media / Video Streaming • Cloud Gaming • Animation and Modeling • Design & Visualization • Diagnostic Imaging

Starting Price: $5,974.25
Titan A790 OCTANE PRO - AMD Ryzen Threadripper Pro 7000 Series Workstation PC - up to 96 cores


Our Titan A790 OCTANE PRO is a high-performance workstation PC powered by the AMD Ryzen Threadripper Pro 7000 Series, capable of handling up to 96 cores. This workstation is designed for professionals requiring extreme computing power at a highly-competitive price.





AMD Threadripper Pro 7000 Series Workstation Computer for:

3D Rendering • CAD/CAM • Product Design • 3D Modeling • CGI • Computer Animation • Video Editing • Design & Visualization • Machine Learning • Fluid Dynamics

Starting Price: $6,085.00
Titan X550 - Dual 2nd Gen Intel Xeon Scalable Processors Workstation PC For High CPU / GPU Computing Server up to 56 CPU Cores


You want computing power and you want it now. Whether you need GPU performance, CPU performance or both, the X550 brings the cutting edge of technology straight to your desktop or server rack. Offering a staggering dual Xeon Scalable and Quad GPU configuration, this is one of the most serious number-crunching machines money can buy.





Dual 2nd Gen Intel Xeon Scalable Workstation Computer for:

Deep Learning • Data Analysis • AI • Machine Learning • Media / Video Streaming • Cloud Gaming • Animation and Modeling • Design & Visualization • 3D Rendering • Diagnostic Imaging

Starting Price: $6,880.00
Titan S600 - Dual AMD EPYC Milan CPUs + 8x GPUs Server PC for AI / Deep Learning HPC up to 128 cores - Supermicro 4124GS-TNR


The future of AI and Machine Learning is here with the Titan A600. This powerhouse workstation is dedicated to supporting the most demanding AI and Machine Learning systems. With the power of two AMD EPYC Milan CPUs and support for 8 GPUs, the A600 is a monster of deep learning capabilities. Even the most advanced AI would be in awe of the processing power housed within the A600.





4U Rackmount Workstation / Server Computer for:

Deep Learning • Data Analysis • AI • Machine Learning • Media / Video Streaming • Cloud Gaming • Animation and Modeling • Design & Visualization • 3D Rendering • Diagnostic Imaging

Starting Price: $9,717.50
Titan S575 - Dual 2nd Gen Intel Xeon Scalable CPUs + 10x GPUs Server PC for AI / Deep Learning HPC up to 56 Cores - Supermicro 4029GP-TRT


The X575 is a multi threaded, multi GPU capable system with the option to install up to 10 dual slot GPUs. Perfect for those who want GPU Supercomputing ability in a convenient rack-mounted form, the Titan X575 is a uniquely designed, flexible parallel processing workstation server. Up to 56 hyper-threaded Intel cores and ten GPUs mean no compromises for medical, nuclear, oil & gas or render farm parallel computing applications.





4U Rackmount Workstation / Server Computer for:

Deep Learning • Data Analysis • AI • Machine Learning • Media / Video Streaming • Cloud Gaming • Animation and Modeling • Design & Visualization • 3D Rendering • Diagnostic Imaging

Starting Price: $10,295.00
   
 