BRICS News Magazine
Nvidia unveils new GPU designed for long-context inference

Sophie Mueller

At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens.

Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated inference” infrastructure approach. For users, the result will be better performance on long-context tasks like video generation or software development.

Nvidia’s relentless development cycle has resulted in enormous profits for the company, which brought in $41.1 billion in data center sales in its most recent quarter.

The Rubin CPX is slated to be available at the end of 2026.

Nvidia unveils new GPU designed for long-context inference, by Russell Brandom. © 2025 TechCrunch Media LLC.
