Nvidia unveils new GPU designed for long-context inference
Topics
More from TechCrunch
Nvidia unveils new GPU designed for long-context inference
Newsletters
TechCrunch Daily News
TechCrunch Mobility
Startups Weekly
StrictlyVC
Related
Apple Intelligence: Everything you need to know about Apple’s AI model and services
Are bad incentives to blame for AI hallucinations?
Serverless cloud platform Koyeb now lets developers spin up Tenstorrent’s AI accelerators
Latest in AI
Nvidia unveils new GPU designed for long-context inference
Smart ring maker Oura’s CEO addresses recent backlash, says future is a ‘cloud of wearables’
Apple Intelligence: Everything you need to know about Apple’s AI model and services
Latest
AI
Amazon
Apps
Biotech & Health
Climate
Cloud Computing
Commerce
Crypto
Enterprise
EVs
Fintech
Fundraising
Gadgets
Gaming
Government & Policy
Hardware
Layoffs
Media & Entertainment
Meta
Microsoft
Privacy
Robotics
Security
Social
Space
Startups
TikTok
Transportation
Venture
Events
Startup Battlefield
StrictlyVC
Newsletters
Podcasts
Videos
Partner Content
TechCrunch Brand Studio
Crunchboard
Contact Us
In Brief Posted:
At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens.
Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated inference” infrastructure approach. For users, the result will be better performance on long-context tasks like video generation or software development.
Nvidia’s relentless development cycle has resulted in enormous profits for the company, which brought in $41.1 billion in data center sales in its most recent quarter.
The Rubin CPX is slated to be available at the end of 2026.
Topics
Founders: land your investor and sharpen your pitch. Investors: discover your next breakout startup. Innovators: claim a front-row seat to the future. Join 10,000+ tech leaders at the epicenter of innovation. Register now and save up to $668.Regular Bird rates end September 26
Newsletters See More Subscribe for the industry’s biggest tech news
Every weekday and Sunday, you can get the best of TechCrunch’s coverage.
TechCrunch Mobility is your destination for transportation news and insight.
Startups are the core of TechCrunch, so get our best coverage delivered weekly.
Provides movers and shakers with the info they need to start their day.
Nvidia unveils new GPU designed for long-context inference Russell Brandom 39 minutes ago Gadgets Smart ring maker Oura’s CEO addresses recent backlash, says future is a ‘cloud of wearables’ Sarah Perez 2 hours ago Apps Apple Intelligence: Everything you need to know about Apple’s AI model and services Brian Heater Amanda Silberling 2 hours ago X LinkedIn Facebook Instagram youTube Mastodon Threads Bluesky TechCrunchStaffContact UsAdvertiseCrunchboard JobsSite Map Terms of ServicePrivacy PolicyRSS Terms of UseCode of Conduct Apple Event 2025Oura RingNew EmojisPlexSnapTech LayoffsChatGPT © 2025 TechCrunch Media LLC.
About the Author
Sophie Mueller
View all articlesComments (0)
No Comments Yet
Be the first to share your thoughts on this article!