News Summary 

  • AMD introduces Ryzen AI 400 and Pro 400 series processors. The Ryzen AI 400 series targets consumer and commercial devices with up to 60 NPU TOPS for Co-Pilot+ PCs and AI features. The Pro 400 series is aimed at business users seeking enhanced manageability and security.  
  • AMD introduces new Ryzen AI Max+ SKUs, bringing high-performance AI and graphics to ultra-thin notebooks, workstations, and small-form-factor devices for creation, gaming, and AI development.  
  • AMD unveils the Ryzen AI Halo A Mini PC, delivering Ryzen AI Max+ performance for AI developers and offering an out-of-the-box experience that accelerates AI innovation at the edge.  
  • AMD announced the Ryzen 7 1950X3D, its flagship gaming processor based on Zen 5 architecture with AMD 3D V-Cache, designed for enthusiasts prioritizing top-tier gaming performance. This model stands apart from AI-focused SKUs.  
  • AMD sees strong year-on-year growth in OEM adoption of Ryzen AI processors, with more systems launching across consumer, commercial, and gaming segments throughout 2026.  
  • AMD announces AMD ROCm 7.2 software for Windows and Linux, bringing seamless support for Ryzen AI 400 series processors and inclusion in ComfUI.  

At CES 2026, AMD also revealed its latest mobile and desktop processors, expanding its client computing portfolio. This launch underscores AMD’s drive to lead in AI capabilities, premium gaming performance, and commercial-ready features, bringing these advances to more systems and users than ever before.  

AMD introduced the new AMD Ryzen AI400 series for Co-Pilot Plus PCs and Ryzen AI Max+ processors for premium ultra-thin and light notebooks and small-form-factor desktops. The company also announced the Ryzen AI Pro 400 series, enabling AI acceleration, modern security, and enterprise-class manageability to meet the needs of today’s business landscape. Recognizing AI as central to the PC experience, AMD is strengthening its hardware portfolio with AMD Ryzen AI Halo, the company’s first branded AI developer platform. AMD pairs this hardware with new ROCm 7.2 software support for Ryzen AI 400 series processors and an AI bundle for AMD Software Adrenaline Edition, ensuring AI adoption and development are more accessible than ever.  

AMD announces the Ryzen 7 9850X3D, an improved gaming CPU with a higher boost clock built on Zen 5 and 3D V-Cache. Radeon users get FSR Redstone for ML frame generation and upscaling in new AAA games.  

The PC is being redefined by AI, and AMD is leading that transformation, said Jack Huynh, senior vice president and general manager of the AMD Computing and Graphics Group. Across consumer, commercial, and enthusiast teams’ systems. We are delivering platforms that bring high-performance computing, leadership AI, interactive graphics, and a growing software ecosystem that strengthens developers and creators, so intelligence is built in, performance and effectiveness scale smoothly, and innovation reaches every form factor. Our full-stack approach is coming to life, enabling a smarter, faster, and more absorbing experience for users today and tomorrow.  

AMD ROCm Software Experience Developer Access 

AMD announced that AMD ROCm, the open software platform, now supports Ryzen AI 400 series processors and is available for download through Confi UI. The upcoming AMD ROCm software 7.2 release will extend compatibility throughout both Windows and Linux, and new PyTorch builds can now be easily accessed through AMD software for simplified deployment on Windows.  

Over the past year, AMD ROCm software has delivered up to a five times improvement in AI performance. Platform support has doubled across Ryzen and Radeon products in 2025, and availability now spans Windows and additional Linux distributions, resulting in a year-on-year increase of up to 10x in downloads. Together, these updates make AMD ROCm software a more powerful and accessible foundation for AI development, reinforcing AMD as a platform of choice for developers to build the next generation of intelligent applications. 

Source: AMD Expands AI Leadership Across Client, Graphics, and Software with New Ryzen, Ryzen AI, and AMD ROCm Announcements at CES 2026 

Built for reliable AI in production, GPT 5.4 offers stronger reasoning, dependable execution, and scalable agent workflows.  

We are excited to share that OpenAI’s GPT-5.4 is now available in Microsoft Foundry. This model helps organizations move from planning to reliably completing work in real production settings. As AI agents handle longer, more complex workflows, consistency and follow-through are just as important as they entail. GPT-5.4 offers stronger reasoning and built-in computer-use features to support automation and reliable execution across tools, files, and multi-step workflows.  

GPT-5.4 Enhanced Dependability in Production AI 

GPT-5.4 is designed for organizations running AI in production, where consistency, instruction-following, and context retention are crucial. It advances reasoning, coding, and agent workflows to help AI not just plan but complete tasks with fewer interruptions and less supervision.  

GPT-5.4 is more stable during extended interactions than earlier versions, giving teams confidence to depend on agent-based AI for daily needs.  

GPT-5.4 brings new capabilities created for production-grade AI:  

  • It provides more consistent reasoning, maintaining clear intent across complex, multi-step interactions.  
  • Instruction alignment is improved, with less need for tuning and oversight.  
  • Performance is faster, making workflows more responsive for real-time use.  
  • It includes built-in computer-use features for organizing tools, accessing files, extracting data, running code safely, and handling tasks between agents.  
  • Tool use is more reliable, reducing the need for prompt tuning and human monitoring.  
  • It generates higher-quality outputs like documents, spreadsheets, and presentations with a more consistent structure.  

These improvements ensure more predictable AI performance for longer, complex tasks.  

Turning Capabilities Into Actual Results 

GPT-5.4 delivers practical value in production, where reliable task completion is critical:  

  • Agent-driven workflows such as customer support, research assistance, and business process automation  
  • Enterprise Managed Work, including drafting documents, analyzing data, and generating presentation-ready outputs.  
  • Developer workflows spanning code generation, refactoring, debugging, support, and UI scaffolding  
  • Extended thinking tasks where logical uniformity must be preserved across longer interactions  

Teams using GPT 5.4 in production experience less task drift, fewer workflow failures, and improved output predictability.  

GPT 5.4 Pro: Deeper Analysis for Complex Decisions 

GPT 5.4 is an enhanced version designed for situations where deep, comprehensive analysis is critical, and it is optimized for reliable execution and follow-through in production tasks.  

Additional capabilities include:  

  • Multipath reasoning evaluation allows alternative approaches to be explored before selecting a final response.  
  • Greater cognitive depth supporting problems with trade-offs or multiple valid solutions  
  • Its improved stability supports sustained analytical tasks along long reasoning chains.  
  • It offers enhanced decision support when thoroughness outweighs speed.  

Organizations select GPT-5.4 Pro for thorough analysis of complex challenges, such as scientific research, while GPT-5.4 is ideal for reliable task execution with strong follow-through.  

Microsoft Foundry: Enterprise Grid Control From The Start 

Organizations access GPT-4 and GPT-5.4 via Microsoft Foundry, which provides controls for Responsible Production-Grade AI. Foundry simplifies policy enforcement, monitoring, versioning, and auditability, helping teams manage AI over time.  

With GPT-5.4 in Microsoft Foundry, organizations can add advanced agent features to existing systems and meet security, compliance, and operational requirements from the start. 

Source: Introducing GPT-5.4 in Microsoft Foundry 

AWS has launched a new, stateful runtime environment for AI agents in Amazon Bedrock, developed in collaboration with OpenAI. This environment supports long-running, complex AI workflows that keep context and memory across multiple steps and sessions.  

Key Features and Benefits 

This new stateful runtime marks a major change from traditional stateless AI runtimes, which handle each request separately.  

  • Persistent context and memory: agents maintain a consistent context, including conversation history, tool state, and identity boundaries. Developers no longer need to build external state management systems.  
  • Long Running Tasks: The runtime manages complex, asynchronous workloads and multi-step processes for hours or days. Agents work independently on projects like customer support, IT automation, or data processing.  
  • Simplified production deployment: teams focus on business logic as orchestration and state are fully managed by the new architecture.  
  • Integration with the AWS Ecosystem: This environment operates within the customer’s AWS environment and integrates with AWS security tools and identity systems, such as Amazon Cognito, as well as governance tools.  
  • Powered by OpenAI Models: The Runtime Environment incorporates models from OpenAI that have been specifically optimized for AWS and are offered through Amazon Bedrock as part of the AWS OpenAI partnership. Microsoft Azure will still be the only provider of stateless OpenAI APIs.  

Impact on Enterprise AI 

This AWS OpenAI partnership indicates a shift toward agentic infrastructure as a platform for businesses. This means:  

  • Accelerated development: It reduces the time to production-ready AI agents from months to weeks.  
  • Improved capabilities: It allows for more advanced applications, such as customer support across multiple systems, sales operations workflows, and internal IT automation with approvals and audits  
  • Governance and control: The architecture provides a managed pipeline using an AI-driven lifecycle (AI-DLC) framework that assesses agents for performance, cost, and security before deployment.  

The stateful runtime environment should be available through Amazon Bedrock in the next few months.  

AI agents are great at reasoning, but the real challenge is making sure they can reliably handle multi-step tasks over time using real tools and systems with proper controls.  

Today, we are making this easier by partnering with Amazon to launch a new stateful runtime environment that runs directly on Amazon Bedrock. AWS customers can use this environment, powered by OpenAI models and optimized for AWS, to support agent workflows with the state reliability and governance needed for production.  

Making It Easier to Bring Agents Into Production 

Many Agent prototypes based on stateless APIs tackle simple use cases:  

  • One prompt  
  • One answer  
  • Maybe one call to a tool.  

Production work is different. Real workflows unfold across many steps. They require context based on previous actions. They depend on multiple tool outputs, approvals, and system state, and need trusted guardrails to secure environments.  

With stateless APIs, development teams must build the orchestration layer themselves. They need to decide how to store state, call tools, handle errors, and safely resume long-running tasks.  

The Stateful Runtime Environment is built to make this easier. It runs within your AWS environment and works well with AWS services. Instead of assembling separate requests, your agents can now automatically handle complex tasks with context-carrying forward memory, workflow state, environment, use, and permission boundaries.  

What Can You Do With This 

Now it’s easier to build solutions like:  

  • multi-system customer support  
  • sales operational workflows  
  • internal IT automation  
  • financial processes that include approvals and audits  

Faster Time to Production for Multi-Step Workflows 

When the runtime manages orchestration and state across steps, teams can focus on the workflow and business logic rather than building additional support systems.  

Designed For Long-Running Tasks 

Stateful tasks are built to run reliably over time, keeping the context and control boundaries needed for multi-step work.  

AWS Native Deployment and Governance 

To understand how this stateful runtime can benefit your organization, reach out to your OpenAI team or request a contact from us today.

Source: Introducing the Stateful Runtime Environment for Agents in Amazon Bedrock 

NVIDIA recently released the GeForce Game Ready driver 595.59 to improve performance in Resident Evil: Requiem, but, according to a machine translation, this update has caused problems with RTX 3000-series and newer cards. Users report that the driver only detects one fan on their GPUs.  

Some people suspected that third-party apps like MSI AfterBurner were causing the issue; however, another user experienced the same problem even without AfterBurner installed.  

NVIDIA appears to have removed the driver update, as it is no longer on their website (a driver is the software that allows the operating system to communicate with your graphics card). If you have already installed the latest driver and are experiencing problems, you should roll back to the previous version. To do this in the NVIDIA app, click the three dots in the Drivers tab.  

If you do not have NVIDIA’s software open, Windows Device Manager, expand Display Adapters, and double-click your GPU in the Properties window. Go to the Drivers tab and select Roll Back Driver. If that option is unavailable, NVIDIA recommends uninstalling the GPU driver and reinstalling the latest available version to solve the issue, since the problematic driver has been removed.  

The new NVIDIA 595.71 driver has brought new problems not present in last week’s troubled 595.59 release, which NVIDIA pulled a few days ago. As a result, several users and at least one YouTuber have found that the 595.71 driver limits GPU overclocking on many RTX 40 and 50 series cards. The most affected models lose about 200 MHz of overclocking headroom compared to earlier drivers.  

The issue seems to be artificial voltage limits added to the in driver 595.71, either by mistake or on purpose. YouTuber bang4buckpcgamer showed that his Asus TUF Gaming RTX 5090 lost 65 mV of voltage headroom, keeping the card below 1 W. This change reduced his overclocking headroom by about 171 MHz, dropping from 3165 MHz to just under 3000 MHz. This only happens when the offset is about 150 MHz. With a 150 MHz offset or less, the GPU does not restrict voltage and can reach up to 1.060 V. Similar issues have also been shared on the NVIDIA forums. One user with an RTX 5080 reported that their GPU used to hit 3,100-3,200 MHz with previous drivers and can now only reach 2,395 MHz with 595.71.80 owner published their own 3DMark scores with the previous 591.86 driver compared to 591.71 with a hefty 450 MHz GPU overclock. They found the new driver was running the GPU 300 lower and pulling 43 fewer watts than from 403W to 360W.  

However, not all RTX 50 series GPUs appear to be affected. Curiously, three commentators on bang4buckpcgamer’s aforementioned YouTube video with a Gigabyte Aorus Master RTX 5090 graphics card report having no restrictions whatsoever. Another RTX 5090 owner with a PNY EPYC OC variant reported no issues achieving a max overclock of 3157 MHz with the latest driver. Two RTX 5070 owners, one with an Asus variant and the AMD MSI Gaming Trio OC, also reported no issues.  

This variability suggests that some owners may simply be lucky with their hardware, so their cards’ voltage and frequency scaling are not affected by this bug. Still, the new issue has led to many angry comments from gamers, with some blaming AI code for causing problems with Nvidia’s drivers. NVIDIA has officially not recognized the issue yet, but the artificial voltage limits do appear to be a bug rather than an official change. The NVIDIA patch notes don’t mention any new voltage limits, and certain GeForce RTX GPU models apparently aren’t subject to any limitations when running 595.71. We’ll have to see whether the company issues another corrective release or a fixed driver in the near future, along with any further explanation of the issue.

Source: Nvidia rolls back its latest driver update — Game Ready Driver 595.59 reportedly causes fan issues on RTX 3000, 4000, and 5000-series GPUs

Last week, Apple released the first iOS 16.4 beta, which included a bunch of new features and changes to Apple’s mobile software. Included was the news that Apple had started testing encrypted RCS messaging, but only between iPhones. Now, with the release of iOS 16.4 beta 2, that’s changed.  

At the time, I hoped Apple would soon test encryption for RCS messages between iPhones and Androids. I did not expect this to happen quickly, but I am glad it has.  

iPhone beta testers must have the latest iOS 16.4 beta installed to participate in cross-platform testing. Android users will need to have the latest version of Google Messages.  

Apple has affirmed that this feature will be in testing for a while. It will not ship the final version of iOS 26.4 and is not available for all devices and carriers. You have to be one of the few beta testers to send encrypted messages to your Android-using friends.  

Apple has previously confirmed that RCS won’t change the green bubble situation, so Android users will remain green regardless of encryption status, while iMessages are displayed in blue. However, testers will see a lock icon on all encrypted messages, indicating the security of their conversations.  

That change applies to our CS and iMessage, so there is absolutely no confusion. No lock means your messages are about as secure as an open gate.  

The benefit of RCS messaging between iPhone and Android is that all cross-platform issues are gone. Larger file-sharing limits mean photos and videos are not heavily compressed, and users get modern features like real-time typing indicators and reaction emojis.  

Soon, everyone will benefit from end-to-end encryption, which secures messages from anyone trying to intercept them. We do not know when it will be available to all. Apple has only said it will arrive in future releases of iOS, iPadOS, macOS, and watchOS 26.  

Ultimately, we will have to wait until Apple confirms everything works as intended.

Source: iOS 26.4 beta 2 now lets iPhones send encrypted RCS messages to Android — here’s how it works 

Meta engineers have launched KernelAgent, a multi-agent system that automates the creation and tuning of GPU kernels for AI workloads. This open-source tool, available in the Meta-pyTorch/KernelAgent GitHub repository, uses large language models and a hardware-guided feedback loop to generate fast, verifiable Triton kernels from PyTorch programs.  

Key Features 

  • Multi-agent system: KernelAgent splits the complex task of kernel optimization into dedicated roles. ProfilerAgent monitors and collects hardware performance data; JudgeAgent analyzes requests to identify areas for improvement; and the Optimization manager coordinates the workflow and decides which optimizations to pursue. These agents work together in cycles.  
  • Hardware-guided optimization: Instead of relying on static models like traditional compilers, KernelAgent bases its choices on real hardware performance data, including compute usage and memory bandwidth, gathered with NVIDIA Nsight Compute (NCU).  
  • Ongoing feedback loop: The system uses a closed-loop workflow.  
  1. Profiling: The system collects hardware metrics when it runs the kernel for the first time  
  1. Diagnosis: A powerful language model reviews the data to find performance bottlenecks.  
  1. Optimization: another large language model creates an improved kernel based on these suggestions.  
  1. Verification & Benchmarking: The system tests the new kernel to ensure it is accurate and performs well.  
  1. Iteration: The process repeats, and agents learn from previous successes and failures saved in shared memory.  
  • The Optimization Manager explores several optimization paths in parallel, keeping only the best-performing kernels.  
  • KernelAgent identifies and fuses parts of PyTorch programs, replacing them with optimized Triton kernels.  

Performance 

On 100 L1 KernelBench tasks, KernelAgent achieved 2.02x speedup over previous kernels and averaged 1.56x faster than the default torch. compile, reaching 89% of hardware efficiency on an H100 GPU.  

Optimizing GPU kernels is becoming more important for today’s AI workloads. As models get bigger and more specialized, performance increasingly depends on kernel efficiency rather than just the algorithms. However, manually tuning kernels requires significant expertise and an in-depth understanding of GPU hardware, memory, and performance trade-offs. The challenge only grows as more channels and kernels are added, and each new GPU architecture requires new optimization strategies.  

In practice, skilled kernel engineers use a step-by-step approach to optimize kernels. They profile kernels with tools such as NVIDIA Nsight Compute and examine hardware performance counters to identify bottlenecks and make targeted improvements.  

They ask questions like:  

  • Is the tiling strategy missing out on memory bandwidth?  
  • Does the kernel need a full redesign rather than just parameter tuning?  

Often, they have to evaluate several kernel designs, each with its own bottleneck, before finding one that fully utilizes the hardware. This process works well, but it usually takes days or even weeks.  

Modern compiler stacks have made big advances in automating kernel generation. For example, Torch.compile captures computation graphs and generates Triton kernels via graph transformations. Pattern matching and compiler rules, as well as other systems like TVM and XLA, employ similar tools to handle many common kernel patterns and deliver good performance from the start. Still, most compiler rules rely on static models rather than real measurements from running or actual hardware.  

KernelAgent seeks to automate this diagnosis-driven optimization process by harnessing real hardware signals to steer Kernel’s tuning of forward-pass (inference) kernels, which directly impact latency and throughput. This system is built on three fundamental principles:  

  • For every hardware decision, both bottleneck identification and optimization selection should be based on precise profiling data.  
  • Adapt multiple optimization tactics concurrently. Given identical hardware data, there may be several viable optimization pathways. KernelAgent evaluates these alternatives in parallel, saving time and synthesizing previous strategies to generate superior algorithms.  
  • Iterate by learning from every round using shared memory. Optimization agents review what succeeded or failed, storing insights collectively to inform future cycles and avoid repeating errors. 

SourceKernelAgent: Hardware-Guided GPU Kernel Optimization via Multi-Agent Orchestration 

The Gemini app provides embedding models that generate embeddings for text, images, video, and other content types. You can use these embeddings for activities such as semantic search, classification, and clustering, which often yield more accurate, context-aware results than keyword searches.  

The newest model, Gemini-Embedding-2-Preview, is the first from Gemini API to handle multiple content types, mapping text, images, video, audio, and documents into one shared embedding space. This enables searching, classification, and clustering across over 100 languages. For more details, check out the Multimodal Embedding section. If you only need text, Gemini-Embedding-001 remains available.  

If your product relies on retrieval, augmented generation (RAG) embeddings are crucial for making these systems more accurate, coherent, and context-aware for teams seeking a managed RAG solution. A file search tool makes RAG management easier and more affordable.  

Google has launched Gemini Embedding 2 for public previews, bringing enhancements over the previous version.  

As Google’s first native multimodal embedding model, Gemini Embedding 2 can map text, images, video, and documents into one shared embedding space. It was released alongside new AI features for Workspace apps.  

If you are new to this, embedding models are different from generative models like Gemini 3. Embedding models help computers understand context by turning text, images, or video into vectors, which are mathematical formats that computers can read and analyze. These embeddings yield more context-aware results across tasks such as semantic search, classification, and clustering than keyword-based methods.  

The first Google Embedding model only worked with text. Gemini Embedding 2 now supports text, images, videos, audio, and documents in a single unified embedding space across 100 languages. Below are the content limits:  

  • Text: up to 8192 tokens per request.  
  • Images: up to 6 images per request, supporting PNG and JPEG formats.  
  • Video: up to 120 seconds of video in MP4 or MOV format per request.  
  • Audio: processes and embeds audio data directly without needing transcriptions.  
  • Documents: can be PDFs up to 6 pages long.  

In a blog post, Google said the new model streamlines complex pipelines and enhances a wide variety of multi-modal downstream tasks from retrieval-augmented generation (RAG) and semantic search to sentiment analysis and data clustering. The model can analyze detailed relationships among different media types by accepting multiple media types in a single request, such as images and text.  

For example, Google noted that Gemini embeddings can help legal professionals find important information during the discovery phase of litigation. The multimodal embedding improves precision and recall across millions of records and enhances image and video search.  

Gemini embeddings (Gemini-embeddin-2-preview) are now available for public preview through Gemini, the Gemini API, and Vertex. The Gemini-embedding-001 model is still available for text-only needs.

SourceEmbeddings 

Google releases Gemini Embedding 2 AI model with multimodal support

Quantum computing is rapidly advancing, challenging enterprise security strategies to evolve just as quickly. This change prompts a critical question: Do you know what cryptography your business depends on today?  

Most organizations cannot fully answer this question. Quantum-computation methods pose a threat to today’s asymmetric cryptography, and new regulations, such as NIST’s PQC guidance, now require stronger systems. Discovering, analyzing, and planning for cryptographic changes is essential.  

Additionally, with the latest release of IBM Quantum Safe Explorer, clients running Z-Linux platforms, including IBM Linux, can now use it via the command line interface (CLI). This update delivers Quantum Safe visibility directly to mission-critical workloads, seamlessly extending these capabilities.  

Meet IBM Quantum Safe Explorer. 

IBM Quantum Safe Explorer enables developers and security teams to quickly pinpoint cryptographic elements in application code, APIs, and environments. This utility streamlines cryptographic inventory and assessment for evaluating quantum readiness.  

The tool helps track cryptographic assets, such as algorithms, keys, certificates, and libraries, across applications and infrastructure. It can generate cryptographic bills of materials (CBOMs) and assess cryptographic risks to prepare for quantum-safe transitions.  

Security and compliance professionals can use the tool to answer critical questions about quantum readiness.  

  • Users can identify which cryptographic algorithms are deployed.  
  • Assess whether they are outdated or insecure.  
  • Locate where these algorithms reside.  
  • Determine which applications use them and evaluate preparedness to adopt post-quantum solutions.  

With these new capabilities, clients using Z Linux and Linux One environments gain targeted answers to previously unresolved cryptographic questions, directly aligning with Quantum Safe Explorer’s core value.  

Unlocking Crypto Agility with IBM Quantum Safe Explorer 

Preparing for a quantum-safe future requires more than technical vulnerability scans. Organizations need visibility, agility, and management over encryption systems. IBM Quantum Safe Explorer supports diverse stakeholders, positioning CBOIMs at the center of risk management and long-term crypto-agility goals.  

Value Delivered At Two Levels 

  1. For InfoSec and DevSecOps Leads: Portfolio Visibility & Governance 

Security and development leads gain clear visibility into cryptographic risks across portfolios. CBs provide a unified inventory for audits and compliance, highlight high-risk areas, and automate inventory generation in each development cycle for continuous oversight.  

Dashboards display essential metrics, including new crypto components, changing risk scores, and remediation coverage across projects. This helps DevSecOps teams add cryptographic checks to the CI/CD pipeline with little disruption.  

  1. For Leadership & the C-suite, translating risk into action 

Executive dashboards distill technical details into clear business insights. Leaders quickly understand which applications depend on vulnerable cryptography, the completeness of cryptographic inventories, and areas needing agility improvements. Quantum Safe Explorer equips executives with compliance-ready evidence to demonstrate NIST quantum-safe conformance to regulators, supporting both current mandates and future requirements, such as the U.S. federal mandate for a cryptographic inventory by 2025. This enables leadership to prioritize risk mitigation and facilitate strategic planning for crypto agility investments.  

This shift allows leadership to proactively manage cryptographic risk at the business level, ensuring strategic alignment and operational readiness for evolving security requirements.  

Crypto Agility Anti-Patterns: What to Watch For 

Crypto agility requires the ability to update cryptographic algorithms across systems efficiently and reliably. Quantum Safe Explorer identifies any practices that may hinder this flexibility, including embedded algorithm versions, absent fallback mechanisms, or inconsistent library implementations.  

By identifying and mapping these issues to code paths, our solution provides teams with a clear plan for fixing them. This supports both current compliance and future resilience.  

New CLI support for Quantum Safe Explorer on Z Linux. 

This release adds CLI support for zLinux, making it easy to integrate into secure environments such as financial systems, public cloud platforms, and other regulated workloads.  

This means clients can now:  

  • Run cryptography discovery natively on Z Linux without moving data off the platform.  
  • Integrate QSE into CI/CD pipelines or system automation scripts.  
  • Generate CBOMs on demand for audit compliance or quantum readiness planning.  
  • Build a roadmap for replacing vulnerable cryptographic components over time.  

The CLI was designed for Enterprise DevOps teams to get started: install it on zLinux and configure the required access credentials. Add CLI commands to existing process scripts or automation workflows to perform cryptographic discovery. Generate CBO and reports, and track assets. Run these steps at defined intervals or during build and deployment stages to ensure up-to-date visibility.  

Why IBM LinuxOne 5? 

IBM Linux One 5 provides a secure platform for IBM Quantum Safe Explorer, allowing firms to prepare applications for the future with post-quantum cryptography.  

The system uses secure boot technology to prevent malware from being loaded during startup, improving cyber resiliency, and keeping the system safe. The crypto express 8s (CEX8s) hardware security module supports both classical and quantum-safe cryptography, meeting needs for confidentiality, integrity, and non-repudiation.  

LIDEX One 5 protects sensitive data both when stored and in use, thanks to its cybersecurity and privacy features. Its built-in crypto accelerators, confidential computing, and NIST-standardized post-quantum cryptography provide a strong foundation for quantum resistance in modern IT systems.  

Integration That Delivers: Security, Compliance, And Agility 

Combining IBM Quantum Safe Explorer with IBM LinuxOne 5 brings multiple technical benefits, such as:  

  • Quantum Safe Explorer leverages LinuxOne 5’s crypto accelerators and confidential computing, enabling deep cryptographic analysis and protecting sensitive data throughout its lifecycle.  
  • Simplified compliance: the combined solution makes it easier to comply with regulations such as PCI DSS, FIPS, GDPR, and the EU’s Digital Operations Resilience Act (DORA).  

What does this mean for LinuxOne clients? 

IBM LinuxOne clients are familiar with security and availability. These systems are built to process sensitive workloads with built-in encryption, hardware isolation, and high availability.  

However, even the most secure systems use cryptographic algorithms that may be decades old, and some of them are now at risk due to quantum computing. Many organizations also do not know where or how these algorithms are used.  

With this release, LinuxOne clients can now perform cryptographic discovery directly on the platform without exposing data. They can identify algorithm dependencies early, enabling crypto agility before major changes are necessary. This also helps them prepare for new NIST post-quantum standards.  

The solution makes it easier to meet compliance requirements for regulations such as FIPS, GDPR, and PCI DSS, which now require greater insight and control over cryptographic assets.  

To summarize, Quantum Safe Explorer on Z-Linux helps organizations improve their cryptography practices and plan for the future directly within their LinuxOne environments, securely, efficiently, and at scale.

Source: Extending quantum-safe visibility to LinuxONE: IBM Quantum Safe™ Explorer now available with CLI support on Z-Linux 

As the semiconductor industry moves from general-purpose computing toward specialized AI acceleration, AMD’s upcoming Gen 6 architecture, known as Morpheus, will introduce a major change: Coron, native support for IMT (for Bit integer) instructions.  

This shift toward INT4 is a significant evolution from previous architectures that emphasized FP16 and INT8 for machine learning. The transition reflects not only technological advancement but also aligns with industry trends for efficient edge inference.  

The Move to 4-bit Precision 

The main challenge for local AI, whether on a desktop PC or a workstation, is memory bandwidth and cache pressure. Large language models (LLMs) and diffusion models consume significant memory. With INT4 quantization, it can compress models much more than the current int8 standard.  

Instructions let the processor fit more data into the same amount of memory cache. For example, a model that once needed 16 GB of VRAM or system memory can now be compressed to 4-6 GB using 4-bit weights, with little loss in accuracy. For most consumer tasks with native hardware support, these operations avoid the usual quantization tax, which is the extra software work needed to convert 4-bit data back to 3 higher precision for calculations.  

Architectural Synergy: AVX 512 and the AI Engine 

NT4 support in Zen 6 is not simply an add-on; it is built into the updated AVX-512 execution units. By expanding the vector map to support 4-bit-packed integers, AMD delivers a significant boost in Token-Per-Second performance for running Local LLMs.  

Zen6 will also have closer integration between its x86 cores and the XDNA3 Neural Processing Units (NPU). The NPU manages ongoing background AI tasks, while the Zen6 cores use INT4 instructions for large on-demand tasks, such as real-time code completion or live translation. This hybrid setup keeps the CPU as a key part of the AI processing pipeline.  

Impact On Local AI Development 

For US developers using frameworks like PyTorch and TensorFlow, native int8 support makes it easier to run small language models (SLMs) like Llama3 or PHI3. In the past, running these models locally needed a high-end GPU. With Zen6, the CPU can handle adversarial inference on its own, reducing the need for cloud APIs and improving data privacy for businesses.  

Key Benefits of Gen 6 INT4 Support Include: 

  • Reduced memory bottlenecks: lower-precision data moves faster through the Infinity Fabric and memory controllers.  
  • Improved power efficiency: fewer bits per operation translates directly to lower joules per inference.  
  • Enhanced cache locality: more parameters fit in L2 and L3 caches, reducing the need to fetch data from slower system RAM.  

Conclusion: The Future of the AI PC 

By building iMT4 support into Zen6, AMD demonstrates that the AI PC is no longer just an idea it’s an imminent, practical reality. As late 2026 nears, Zen6 is poised to reshape the expectations for performance and autonomy in local AI. For developers and businesses, the barriers to running advanced AI locally are on the verge of vanishing.

Source: AMD Introduces Ryzen AI Embedded Processor Portfolio, Powering AI-Driven Immersive Experiences in Automotive, Industrial and Physical AI 

Policy in Amazon Bedrock AgentCore lets developers set up and enforce security controls for how AI agents interact with tools, creating a secure boundary around agent activities. AI agents can adapt to handle a range of tasks, from answering customer questions to automating workflows across multiple tools and systems, but this flexibility can also introduce new security risks as agents might misunderstand business rules or exceed their intended limits.  

In AgentCore, developers can build policy engines. These are software components that automatically enforce rules. Developers store explicit policies in these engines and connect them to gateways, which control and monitor the flow of requests. The system checks all agent traffic passing through Amazon Bedrock AgentCore gateways. It ensures each request complies with the defined policies before agents can access tools.  

Policies are written in Cedar, an open-source language for creating and enforcing authorization rules. This helps developers clearly define what agents can access and what actions they can take. Policy in AgentCore also lets developers write policies in plain English, so they do not have to use Cedar. The system deciphers these natural-language rules, generates possible policies, checks them against the tools set up, and uses automated checks to spot overly broad, overly strict, or impossible-to-make rules. This helps customers find and fix problems before policies are enforced.  

Policy in AgentCore provides detailed rights based on user identity and tool inputs, making it safer to use autonomous agents at scale by handling security outside the agency’s code. Developers can focus on building new features while maintaining strong security. This removes the need for custom security work and lowers the risk of agents bypassing policies.  

Key Benefits 

PolAgentCore policy delivers three main benefits for secure, scalable AI agent deployment. Fine-grained Control: Define the actions agents can take, the tools they can use, and the conditions under which they can use them.   

  • Deterministic Enforcement: Consistently enforce policies outside agent code for reliable security. Accessible Authoring: Create policies in English or Cedar for broad team adoption.EnfAll enforcement decisions are logged in CloudWatch for compliance purposes.  

Key Features 

Policy in AgentCore provides a full set of tools to manage agent interactions with policies. Main features include:  

  • Policy enforcement: the system checks all agent requests against set policies before granting access to tools.  
  • Access Controls: allow detailed permissions driven by user identity and tool input.  
  • Policy authoring: Write clear, validated policies in Cedar. You can also create policies in plain English, which the system translates and checks.  
  • Policy Monitoring integrates with Amazon CloudWatch (a monitoring service that collects and tracks metrics) to observe policy checks and decisions.  
  • Structure collaboration works with VPC security groups and other AWS security tools.  
  • Audit Logging keeps comprehensive logs of policy decisions for compliance and troubleshooting.

Source: Policy in Amazon Bedrock AgentCore: Control Agent-to-Tool Interactions