    IBM’s z17 Mainframe Designed for AI Workloads and Long-Term Enterprise Needs

    By EchoCraft AI | April 8, 2025 | 4 Mins Read

    IBM has announced the launch of its latest mainframe system, the IBM z17, a high-performance computing platform engineered to support artificial intelligence workloads at scale.

    IBM z17 Mainframe Key Takeaways

    Highlights

    AI-Ready Mainframe: The IBM z17, built on the second-generation Telum II processor, is engineered to support over 250 AI use cases—including generative models and AI agents—making it a future-ready platform for enterprise workloads.
    Performance Boost: With a 50% increase in inference operations over its predecessor (up to 450 billion inferences per day) and enhanced energy efficiency (7.5× acceleration with 5.5× less energy), the z17 offers a significant performance upgrade.
    Advanced AI Acceleration: Integration of IBM Spyre AI Accelerator chips (48 at launch, with plans to double) and an integrated DPU for enhanced I/O performance elevate the z17’s ability to handle complex, resource-intensive AI tasks.
    Scalability through Ensemble Architecture: Pairing Telum II processors with Spyre accelerators lets the z17 run ensemble AI, in which multiple models are combined to improve accuracy and resilience.
    Robust Security and Integration: Designed with full encryption and seamless interoperability with existing enterprise hardware and open-source ecosystems, the z17 ensures long-term reliability and security for critical data operations.
    Significant Infrastructure Investments: IBM’s commitment to scaling its AI infrastructure—using over 100,000 Nvidia H100 GPUs and planning a $10 billion data center—underscores the strategic importance of the z17 in modernizing enterprise IT.

    Built on IBM’s second-generation Telum II processor, the z17 is designed to address over 250 AI use cases, including generative models and AI agents.

    The system is part of IBM’s ongoing effort to align long-standing enterprise infrastructure with the evolving demands of AI-driven operations.

    Despite perceptions of mainframes as legacy systems, they remain a central part of global enterprise infrastructure.

    As of 2024, 71% of Fortune 500 companies continue to rely on mainframes, and the market itself is valued at approximately $5.3 billion, according to Market Research Future. IBM’s introduction of the z17 aims to modernize this established platform with capabilities tailored to the AI era.

    The IBM z17 delivers a notable performance boost, capable of executing up to 450 billion inference operations per day—representing a 50% increase over the performance of its predecessor, the z16, which launched in 2022.
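As a rough back-of-the-envelope check on these figures, the sketch below converts the daily peak into an average per-second rate and derives the z16 baseline implied by a 50% increase. It assumes the quoted peak is spread evenly over 24 hours, which the article does not state; the variable names are illustrative only.

```python
# Back-of-the-envelope arithmetic on the inference figures quoted above.
SECONDS_PER_DAY = 24 * 60 * 60

z17_per_day = 450e9          # up to 450 billion inferences per day (from the article)
increase_over_z16 = 0.50     # 50% increase over the z16 (from the article)

# Average per-second rate if the daily peak were spread evenly (assumption).
z17_per_second = z17_per_day / SECONDS_PER_DAY

# Baseline implied by the stated increase: z17 = z16 * (1 + 0.50).
implied_z16_per_day = z17_per_day / (1 + increase_over_z16)

print(f"z17: ~{z17_per_second / 1e6:.1f} million inferences per second")
print(f"implied z16 baseline: {implied_z16_per_day / 1e9:.0f} billion per day")
```

Under those assumptions the daily figure works out to a little over five million inferences per second on average.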

    The system also emphasizes security, with full encryption and seamless integration across existing enterprise hardware and open-source ecosystems, allowing for flexible AI deployment.

    According to Tina Tarquinio, Vice President of Product Management and Design for IBM Z, the z17 has been in development for five years, predating the 2022 surge in interest around generative AI.

    Tarquinio noted that extensive customer input—over 2,000 hours of research and interviews with more than 100 enterprise clients—helped shape the design, with common themes emerging around the need for enhanced performance, AI acceleration, and long-term flexibility.

    At launch, the z17 will support 48 IBM Spyre AI accelerator chips, with plans to increase capacity to 96 within the first year.

    This expansion is intended to accommodate increasingly complex and resource-intensive AI models. IBM has emphasized that the system includes built-in overhead to allow for future AI advancements, including larger local memory footprints and newer processing requirements.

    Energy efficiency is a key component of the z17’s design. IBM reports that the system delivers 7.5 times the AI acceleration of the z16 while using 5.5 times less energy than comparable platforms handling multi-model workloads.

    This balance of performance and energy efficiency may appeal to organizations seeking to scale AI operations without proportionally increasing energy consumption or operational costs.

    Introduction of the IBM Spyre AI Accelerator

    The Spyre AI Accelerator plays a central role in expanding the z17’s AI capabilities. Each Spyre chip features 32 AI-specific cores and supports up to 1TB of memory.

    These chips are engineered for handling complex tasks such as generative AI and large language models. Up to eight Spyre cards can be installed in a single I/O drawer, delivering significant computational power with a power consumption of no more than 75W per card.
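To put the Spyre numbers in perspective, the following sketch totals AI cores and worst-case card power for a full I/O drawer and for the 48- and 96-accelerator configurations mentioned earlier. It assumes one Spyre chip per card, which the article implies but does not state, and it counts only the 75W per-card ceiling, not the rest of the system.

```python
# Capacity arithmetic from the Spyre figures quoted in the article.
CORES_PER_CHIP = 32        # AI cores per Spyre chip (from the article)
MAX_WATTS_PER_CARD = 75    # upper bound per card (from the article)
CARDS_PER_DRAWER = 8       # maximum cards per I/O drawer (from the article)

def spyre_totals(cards: int) -> tuple[int, int]:
    """Return (total AI cores, worst-case watts), assuming one chip per card."""
    return cards * CORES_PER_CHIP, cards * MAX_WATTS_PER_CARD

for label, cards in [("one full drawer", CARDS_PER_DRAWER),
                     ("launch configuration", 48),
                     ("planned expansion", 96)]:
    cores, watts = spyre_totals(cards)
    print(f"{label}: {cards} cards, {cores} cores, up to {watts} W")
```

The helper `spyre_totals` is a hypothetical name for illustration and does not correspond to any IBM tooling.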

    Integrated DPU for Enhanced Data Processing

    The Telum II processor also introduces an integrated Data Processing Unit (DPU) designed for I/O acceleration. This integration enhances performance for data-intensive applications by speeding up complex networking and storage protocols.

    The DPU supports a 50% increase in I/O density compared to previous generations, enabling better scalability and more efficient data throughput within the system.

    Scalability Through Ensemble AI Architecture

    The combined use of Telum II processors and Spyre Accelerators allows the z17 to support ensemble AI approaches, where multiple models are integrated to boost accuracy and resilience.

    This architecture is intended to let the z17 scale with increasing AI complexity while maintaining performance and operational efficiency.
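The ensemble idea can be illustrated with a minimal sketch: several models score the same input and their outputs are combined, here by a weighted average. This is a generic illustration, not IBM's implementation; the two stand-in models, the weights, and the function names are all hypothetical.

```python
from typing import Callable, Sequence

# A model here is any function mapping a feature vector to a score in [0, 1].
Model = Callable[[Sequence[float]], float]

def fast_model(features: Sequence[float]) -> float:
    """Hypothetical lightweight model: clipped mean of the features."""
    mean = sum(features) / len(features)
    return min(max(mean, 0.0), 1.0)

def large_model(features: Sequence[float]) -> float:
    """Hypothetical heavier model: clipped maximum of the features."""
    return min(max(max(features), 0.0), 1.0)

def ensemble_score(models: Sequence[Model], weights: Sequence[float],
                   features: Sequence[float]) -> float:
    """Weighted average of the individual model scores."""
    total_weight = sum(weights)
    return sum(w * m(features) for m, w in zip(models, weights)) / total_weight

features = [0.2, 0.4, 0.9]
score = ensemble_score([fast_model, large_model], [0.5, 0.5], features)
print(f"ensemble score: {score:.2f}")
```

Other combination rules, such as majority voting or consulting the larger model only when the smaller one is uncertain, would fit the same structure; the weighted average is simply the shortest to show.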

    Although IBM has not released pricing information, the z17 will be generally available starting June 18, 2025.
