Oracle has expanded its infrastructure by deploying high-performance clusters directly into federal environments. This enables the government to keep sensitive data within its own secure, compliant systems across remote tactical sites and city offices, providing enhanced data residency and national security. Public agencies can now process large data sets locally, avoiding exposure over public internet connections. This decentralized sovereign model marks a significant shift in how governments manage sensitive digital assets.  

Hardening The Digital Perimeter At The Operational Edge. 

At the heart of this expansion is roving-edge infrastructure: portable, rugged server units that operate even when disconnected from central data centers. These units let military and emergency teams analyze in the field, enabling them to make real-time decisions during critical situations. This kind of tactical autonomy is vital for defense and disaster recovery, in which every second counts. It helps keep missions going even if regular communication lines are down.  

These edge systems meet strict security standards, impact level 5 and 6, to handle the Department of Defense’s most sensitive data. Oracle provides air-gapped hardware separated from outside networks and layers this with encrypted storage and secure boot, ensuring zero-trust protection. This gives agencies full data sovereignty and advanced solutions for logistical and planning challenges.  

Sovereign Compliance and Jurisdictional Integrity 

A primary driver of the 2026 rollout is the requirement for local data governance in the US, especially for agencies that must keep citizen data isolated from commercial servers. Oracle’s Sovereign AI Cloud provides exclusive environments managed by fully cleared US staff, preventing data mixing and supporting audit readiness through a transparent chain of custody.  

The sovereign model leverages inter-agency data sharing by allowing government departments to collaborate securely on shared cloud platforms. For instance, the Department of Energy and EPA can jointly run simulations on local servers, keeping data protected from public networks, and identity-based access controls ensure only authorized agency staff can access sensitive data, accelerating secure, efficient government collaboration.  

Accelerating Public Sector Innovation Via Localized Assets 

By placing high-performance hardware at the edge, Oracle is enabling real-time monitoring across smart city initiatives and public utility management. Local governments can use these sovereign clusters to regulate traffic flow, manage water distribution, and monitor electric grids in real time. Because the data is processed locally, the system can respond to environmental challenges in milliseconds. This reduces the risk of system-wide failures and improves the quality of life for citizens in urban and rural regions alike. It turns the edge of the network into an active engine of civic efficiency and technological growth.  

The expansion also comprises specialized foundational logic templates designed for government workflows. These templates allow agencies to quickly deploy automated systems to permit, process permit applications, manage social services, or analyze economic trends. By reducing the technical barrier to entry, Oracle helps smaller government entities use sophisticated digital tools previously reserved for large federal departments. The democratization of power ensures that every level of government can benefit from the latest architectural breakthroughs. It creates a stronger, more responsive public infrastructure that can adapt to the changing needs of the population.  

Future Proofing the National Infrastructure 

By moving the processing to the edge, Oracle emphasizes sustainability and resilience. Modern edge systems use high-efficiency cooling, low-power hardware, and renewable energy, maximizing longevity and responsible taxpayer investment, a forward-looking approach to national technology.  

Oracle’s 2026 plan includes quantum-resistant encryption to safeguard government data against future threats, ensuring long-term digital security. By deploying these protections now, Oracle demonstrates the importance of staying ahead in technology for national interests and privacy.  

The Unseen Architecture of National Security 

As these digital systems become central to governing, we witness a shift: civic infrastructure becomes more secure and responsive, addressing safety needs. The government office evolves from slow paper-based processes to efficient, reliable logic, replacing insecurity with confidence in robust protection.  

In the future, our democracy may be supported by secure, reliable systems that protect our progress and preserve our sovereignty. Our world is becoming increasingly connected and responsive, always ready to serve the public good. Clear, logical systems will help ensure the nation’s future is strong and transparent. We are building a world in which technology quietly supports our goal of a more secure, better-secured society. Now is the time for leaders, agencies, and organizations to seize this opportunity, leverage sovereign AI solutions, and initiate the next era of secure, responsive governance for all citizens.

Source: Oracle Introduces Fusion Agentic Applications for Finance and Supply Chain 

Intel set a new standard in AI performance by fine-tuning Llama 2 70B with low-rank adapters and training the MLPerf GPT-3 model using over 1,000 Gaudi 2 accelerators in the Intel Tiber development cloud, according to MLCommons’ latest benchmark results.  

What’s new: MLCommons has released the results of its MLPerf training v4.0 benchmark (an industry standard set of tests to measure machine learning training performance). Intel’s results highlight the options that Gaudi2 AI accelerators (specialized hardware components designed to accelerate AI tasks) offer businesses. Community-driven software (improvements and tools created by open-source contributors) makes generative AI development easier, and standard Ethernet networking (the common network technology used to connect computers and devices) enables flexible scaling for the first time. Intel submitted the results from a single Gaudi2 system with 1,024 accelerators on the Intel Tiber Developer Cloud, demonstrating Gaudi2’s performance and scalability, as well as the cloud’s ability to train the MLPerf GPT-3 175B parameter model (a benchmark test using a very large AI language model with 175 billion parameters).  

“The industry needs better generative AI solutions with high performance and efficiency. The latest MLPerf results from MLCommons highlight the unique value of Intel Gaudi as businesses seek more affordable, scalable systems with standard networking and open software. This makes generative AI more accessible to more customers.” – Zane Ball, Intel Corporate Vice President and General Manager, DCAI Product Management.  

Why it matters: Many customers want to use generative AI but face challenges with cost, scale, and development. Last year, only 10% of enterprises successfully launched GenAI projects. Intel’s AI solutions help businesses overcome these barriers. Gaudi AI is a scalable, accessible option for training large language models with 70-175 billion parameters. The upcoming Gaudi 3 accelerator will offer even better performance, openness, and choice for enterprise GenAI.  

How Intel Gaudi 2 MLPerf Results Show Transparency 

The MLPerf results confirm that Gaudi2 remains the only MLPerf benchmarked alternative to the Nvidia H100 for AI computing training GPT-3 on the Tiber Developer Cloud. Intel achieved a time-to-train of 66.9 minutes using 1024 Gaudi accelerators, highlighting strong scaling performance for very large language models in a cloud environment.  

The benchmark suite introduced a new test: fine-tuning the Llama 2 70B parameter model with low-rank adapters. Fine-tuning large language models is a common need for many customers and AI practitioners, making this a practical benchmark. Intel’s submission reached a time-to-train of 78.1 minutes on eight Gaudi 2 accelerators. For this, Intel used open-source software from OptiML (a toolkit for optimizing AI models for Habana accelerators), 03 from DeepSpeed (a tool for memory-efficient training), and FlashAttention-2 (a method to speed up attention mechanisms in transformer models). The benchmark task force, led by engineers from Intel’s Habana Labs (developers of the Gaudi accelerators) and Hugging Face (a provider of open-source AI tools), created the reference code and rules.  

How Intel Gaudi Delivers Value In AI 

High costs have kept many businesses out of the AI market, but Gaudi (Intel’s specialized AI hardware accelerator) is changing that. At Computex (an annual computer expo), Intel announced that a standard AI kit with eight Gaudi accelerators and a universal baseboard costs $65,000, about one-third the cost of similar platforms. A kit with eight Gaudi 3 accelerators (the next generation of Intel’s AI hardware) and a baseboard is listed at $125,000, about two-thirds the cost of comparable options.  

Growing momentum shows Gaudí’s value. Customers chose Gaudi for its price-performance benefits and accessibility, such as:  

  • Naver, a major South Korean cloud provider and search engine with over 600 million users, is building a new AI ecosystem. They are making it easier for customers to adopt large language models (advanced AI systems that understand and generate text) by reducing development costs and project timelines.  
  • AI Sweden, a partnership between the Swedish government and private companies, uses Gaudi (Intel’s AI accelerator hardware) to fine-tune models with municipal content (data from local governments). This helps improve efficiency and public services for people in Sweden.  

How Intel Type Developer Cloud Helps Customers Use Gaudi 

The Tiber Developer Cloud (Intel’s managed cloud platform) offers a managed, cost-effective platform for developing and deploying AI models, from single nodes to large clusters. In the Tiber Developer Cloud, Intel provides access to its accelerators (specialized AI processors), CPUs, GPUs, OpenAI software (artificial intelligence tools), and other services. Intel customer Seekr recently launched SeekrFlow, an AI development platform using Intel’s Developer Cloud to serve its clients.  

According to cio.com, Seekr cited cost savings of 40 to 400% from the Tiber developer cloud for select AI workloads compared to on-premises systems with other vendors, GPUs, and another cloud service provider, along with 20% faster AI training and 50% faster AI inference than other on-premises systems.  

What’s next: Intel plans to submit MLPerf results for the Gaudi3 AI accelerator in the next inference benchmark. Gaudi3 is expected to deliver stronger AI training and inference performance on key models and will be available from equipment manufacturers in fall 2024.

Source: Intel Gaudi Enables a Lower Cost Alternative for AI Compute and GenAI 

ServiceNow has launched a new framework to help traditional enterprise systems become more resilient and self-managing. This approach allows IT systems to automatically monitor and maintain their own performance, finding and addressing issues before users notice them. By combining monitoring tools with automation, the platform reduces the need for constant manual supervision. This matters more as hybrid cloud setups grow more complicated. The aim is to create digital systems that can fix themselves without ongoing human involvement.  

Establishing The Architecture Of Autonomous Resolution 

Automated root cause analysis drives this new system, scanning thousands of logs in real time to find sources of problems. Previously, IT teams spent hours manually sorting through data during outages. Now, ServiceNow’s platform quickly locates the exact code or hardware causing issues. Fast responses are especially important to keep finance and healthcare services running without interruption. Issues are fixed in seconds, not hours.  

Once a problem is detected, prescriptive remediation scripts automatically resolve it. For example, abnormal memory use prompts the system to restart services or reallocate resources. All fixes follow a closed-loop governance process to satisfy security rules, with every action recorded in an audit trail for supervisor review. This ensures transparency and accountability, even with a faster response time.  

ServiceNow Pushes Autonomous IT Systems Via Predictive Modeling. 

An essential aspect of predictive health monitoring is its ability to detect early warning signs. Instead of waiting for failures, the system uses past data to forecast hardware and database issues. Preemptive load balancing reallocates workloads to healthy systems before failures occur. This proactive approach allows IT teams to schedule maintenance and use real data.  

By learning what normal activity looks like, the system can spot suspicious changes from regular usage. If odd power usage or data changes happen, the platform isolates the affected area. Micro-segmentation blocks issues from spreading across the network. Ongoing monitoring ensures steady operation, providing greater resilience than systems that rely solely on manual checks.  

Orchestrating Multi-Cloud Environments With Fluid Logic 

Businesses often use many different cloud services and their own server rooms, creating technology silos. ServiceNow acts as a central management layer, linking these systems together. It handles company-wide updates and patches, keeping setups the same everywhere, and avoiding mismatched versions. As a result, one IT team can manage global systems just as easily as a single local server setup.  

This orchestration also optimizes cloud costs. By tracking resources, the system powers down idle setups, such as unused development systems, and restores them when needed. Smart resource management ensures businesses only pay for what they use, improving efficiency.  

The Evolution Of The Human System Partnership 

Moving to self-healing systems does not remove the need for skilled IT professionals. It lets them focus on safety, strategy, and design instead of routine tasks. They set rules and goals for autonomous operations. This change fosters creative problem-solving and promotes people-technology collaboration.  

As ServiceNow advances autonomous IT systems, we are entering an era of more responsive infrastructure. The system adapts to an organization’s needs, learning from its learning preferences over time to deliver a tailored experience. This smart automation helps technology support human goals. Problems no longer reach users—they are fixed before anyone notices. The technology now quietly handles itself with steady, reliable performance.  

The Unseen Architecture Of Perpetual Uptime 

As digital systems improve, we are entering an era marked by stable and reliable technology. Networks will become silent protectors, making outages rare so that continuous service becomes the norm. This change leads to a future where users depend on technology without worrying about technical failures.  

Looking forward, digital systems will keep the world running smoothly in the background. Success means providing invisible, nonstop support so people can focus on new ideas. As we develop self-fixing, strong technology, we move toward a future defined by constant reliability and added options for growth.

Source: Sorry, this path is closed, but the front door is open 

The Buzz 

  • CoreWeave extended its Q4 revenue projections and saw its backlog grow to nearly $67 billion, CNBC reported.  
  • Meta and OpenAI are major contributors to CoreWeave’s contract pipeline, reinforcing the company’s position in AI infrastructure.  
  • The backlog, equal to several years of current revenue, shows enterprises can reliably secure GPU computing power for ongoing and future projects, reducing uncertainty.  
  • These results reassure customers about the stability of AI infrastructure, supporting CoreWeave’s IPO timing and customers’ technology strategies.  

CoreWeave reported Q4 results above revenue expectations and revealed a $67 billion backlog, much larger than many tech companies’ yearly revenues. With Meta and OpenAI in the lead, these numbers show that enterprise AI spending is not only steady but growing faster than most expected just six months ago.  

CoreWeave gave Wall Street clear proof that AI infrastructure spending is here to stay. The company revealed a contract backlog of nearly $67 billion, a figure that changes how people view enterprise AI investment.  

Thursday’s results underscore a pivotal turning point for the AI infrastructure market. Amid debate over real versus speculative GPU demand, CoreWeave’s $67 billion signed backlog offers rare clarity: enterprises are securing capacity for the future.  

OpenAI and Meta are key customers in CoreWeave’s pipeline, though the company has not shared contract values for each. The involvement of both companies is significant. Meta’s need for AI-powered feeds, recommendation systems, and its Metaverse projects are well known. OpenAI, working to stay ahead in large language models and expand ChatGPT, is now one of the industry’s biggest users of computing power. The timing of CoreWeave’s success is especially notable. The company went public in late 2025 during a period when the market was cautious about AI infrastructure investments. Some doubted whether large spending by Microsoft, Google, and Amazon on their own data centers would leave room for specialized providers. CoreWeave’s backlog shows the opposite: demand has surpassed even the biggest companies’ efforts.  

CoreWeave’s business model is different from traditional cloud providers in important ways. While Amazon Web Services and Google Cloud offer general-purpose computing with GPUs as just one option, CoreWeave has focused on accelerated computing from the start. This specialization is important for customers who need to run large training jobs or serve inference at scale. Every part of CoreWeave’s system, from networking to cooling, was designed for GPU work. This focus has brought it in customers beyond just AI labs and big tech companies. Financial firms are running quantitative models, biotech companies are working on drug discovery, and media companies are creating visual effects, all of which need the GPU power CoreWeave offers. However, it is the AI workloads, training, fine-tuning, and, more recently, inference that have fueled the rapid growth seen in the backlog.  

The $67 billion backlog also shows how AI companies are planning their infrastructure. These are not short-term contracts for temporary capacity. Enterprises are taking multi-year commitments, expecting their computing needs to remain steady or increase. For CoreWeave, this long-term visibility changes the business outlook. The company can invest in new hardware and data center expansion with confidence that the revenue will come. Broader market implications extend beyond CoreWeave’s balance sheet. NVIDIA, which supplies the GPUs that power CoreWeave’s infrastructure, gets another validation point for its data center roadmap. The networking equipment providers, power infrastructure companies, and real estate developers building the physical plants that house these systems all benefit from the sustained demand signal.  

However, these results raise questions about market structure as CoreWeave, Lambda Labs, and others expand, and large cloud firms grow their GPU services. Competition for hardware and customers intensifies. NVIDIA’s latest GPUs remain in short supply, so every chip CoreWeave gets is one less for rivals.  

The earnings beat is as significant as the backlog. Surpassing revenue expectations, especially amid cost-cutting pressures faced by other cloud providers, affirms CoreWeave’s pricing power and the commitment of enterprise customers. The backlog represents real enterprise investment, not speculation.  

CoreWeave’s Q4 results do more than show one company’s results. They signal to customers that AI infrastructure investments are moving from trials to essential services. The $67 billion backlog, anchored by enterprise customers such as Meta and OpenAI, reassures customers of CoreWeave’s capacity to meet multi-year needs across the AI industry. Customers, whether startups or established firms, can plan with greater confidence that their growing GPU demands will be matched by available enterprise-grade infrastructure.

Source: CoreWeave’s $67B Backlog Signals AI Infrastructure Boom Isn’t Slowing 

Generative AI has advanced rapidly in the last three years. These advances have sparked innovation in nearly every industry, especially healthcare. Its applications now include summarizing patient-doctor visits, scheduling appointments, extracting key patient data for authorizations, and assisting with diagnosis and treatment plans.  

We are committed to offering technologies, tools, and resources to support healthier lives everywhere through bold and responsible AI use.  

Better health is a team effort. Many of our biggest breakthroughs have come from working with top clinical, public health, and academic organizations. Together, we have detected breast cancer as accurately as radiologists, sequenced genomes faster, and screened hundreds of thousands of patients for diabetic retinopathy.  

Healthcare’s digital transformation is just getting started, and we want to ensure that AI advances happen alongside the healthcare community, not just to it. With this in mind, let’s look at some of the most important developments we see as organizations start using AI.  

1. AI Agents Are Transforming Healthcare Workflows 

Across all industries, we are entering the era of AI agents powered by generative AI. These smart systems can access information, plan ahead, and take actions to achieve specific goals. This marks a big change from using AI as just a tool to seeing it as a partner.  

AI agents help new workflows get adopted more easily by providing clear, practical benefits. They address common concerns. For example, they reduce manual work. This lets healthcare workers spend more time on patient care and worry less about added workload or uncertain outcomes.  

In healthcare, AI agents help solve ongoing problems. They can reduce administrative work, which often limits time for patient care. Clinicians spend over a third of their week on tasks such as maintaining patient records, managing insurance forms, and handling documentation. AI agents can automate these jobs, such as scheduling, paperwork, and summarizing patient histories.  

For example, Highmark Health, a leading healthcare organization in the US, created an application that enables Allegheny Health Network clinicians to analyze medical records for potential issues and to suggest clinical guidelines to improve submissions. This has reduced administrative work and improved patient experiences. Bayer is also working on an AI innovation platform that uses generative AI to help builders and developers build apps for radiologists, making image and data analysis more efficient.  

By introducing these practical advantages, AI agents pave the way for new working methods that bolster organizational resilience, foster collaboration, and lead to measurable improvements in patient outcomes and care delivery.  

2. AI-Powered Search Is Improving Access to Information 

Healthcare providers face huge amounts of information from research papers and scattered patient records to new guidelines, and have little time to process it all and make good decisions. This problem is exacerbated by traditional keyword searches, which often struggle with complex medical terms and abbreviations.  

Semantic search powered by clinical knowledge graphs helps healthcare workers quickly find the most relevant and accurate information across multiple sources, including electronic health records, scanned documents, and more. For example, AI-powered search can find mentions of diabetes in patient records and display related information, such as the latest treatments, prescribed medications, test results, and common related conditions. However, healthcare organizations will need ways to ensure AI-generated answers are accurate, such as grounding responses in reliable data or providing citations to trusted sources.  

Many of our customers and partners are now combining AI-powered search with generative AI models, such as Google’s Gemini models. This lets clinicians ask questions about a patient’s record and get quick answers, making it easier to find exactly what they need. Meditech, a leader in electronic health records, has added advanced AI search and summarization for its Expanse EHR system. These new features give clinicians fast, easy access to complete patient information, so they can review past notes and confirm conditions like sepsis or surgical site infections in minutes rather than spending a long time on chart reviews.  

3. AI Platforms Are Essential for AI Success 

Generative AI is a powerful tool for increasing productivity and improving access to health information. To use AI successfully, organizations must invest in platforms that support easy deployment and management of generative AI solutions. This enables rapid progress from ideas to real outcomes.  

Tools need tools to build, test, and monitor models, and to address challenges such as bias, errors, and changes in model performance.  

Vertex AI 

Vertex AI provides a single environment to address these challenges. It offers features for thorough evaluation, bias detection, grounding, and ongoing monitoring. This helps keep AI outputs reliable and accurate. It also makes it easier to add AI into healthcare workflows and routines.  

Choosing the right platform helps organizations realize AI’s value quickly. Companies need models tested for bias, features that simplify adoption, and tools to connect AI to their data. Built-in governance, management, and security are also important.  

Since AI depends on data quality, leaders should assess how well the platform’s ecosystem supports secure data foundations. The launch of Gemini 2.0 is a big step forward. It brings new ways to process various types of data, including clinical records, operational data, notes, images, audio, and video. Healthcare organizations should combine different types of data and use advanced analytics and AI solutions to get the most from these advances.  

The Cloud Healthcare API, for instance, facilitates ingestion, storage, and management of important healthcare data types, including HL7v2, FHIR, DICOM, and unstructured text. Cloud Healthcare API also lets providers connect clinical and medical data to the full Google Cloud ecosystem, such as BigQuery and Vertex AI, to gain deeper insights through data analytics and AI.  

By embracing AI responsibly and strategically and working together, the healthcare industry can deliver better patient care, improve efficiency, and drive innovation for a healthier future.  

To learn more about the state of AI in healthcare, read our in-depth 2025 Healthcare Trends report.

Source: Healthcare’s AI transformation: Agents, search, and platforms 

As AI competition intensifies, many saw silicon’s physical limits as insurmountable. Taiwan Semiconductor Manufacturing Company, TSMC, has advanced these limits with the A16 process node: a 1.6 nm charge technology targeting mass production in the second half of 2026. This shift in chip design addresses not only further transistor miniaturization but also innovations in power delivery and thermal management.  

The A16 node stands out for breaking from traditional manufacturing methods. With its new Super Power Retail (SPR) technology, TSMC is tackling the power wall that has slowed the development of next-gen AI chips. By the end of 2025, major AI companies will have already shifted their hardware plans to match this 1.6 nm milestone. A16 is more than a minor upgrade; it will serve as the foundation for the next decade of generative AI and high-performance computing.  

The Technical Leap: Superpower Rail and the 1.6 Nm Frontier 

The A16 process marks TSMC’s entry into Angstrom-scale technology, employing an enhanced gate-all-around (GAA) nanosheet transistor. While the previous two nm (N2) node introduced GAAFETs, the A16 introduces the Super Power Rail, an advanced backside power delivery network that relocates power circuits beneath the silicon wafer. Unlike Intel’s PowerVia approach, TSMC’s SPR supplies power directly to the source and drain of each transistor.  

The direct contact method is more difficult to manufacture, but it offers significant electrical improvements by relocating power delivery to the back and keeping signal routing on the front. SPR eliminates routing congestion common in dense AI chips. A16 delivers 8 to 10% higher clock speeds at the same voltage and reduces power use by 15 to 20% compared to the N2P (2 nm enhanced) node. Logic density increases by 1.1 times, enabling more processing cores within the same footprint.  

Initial reactions from the semiconductor research community have been highly favorable, though some experts note the immense manufacturing hurdles. Moving power to the backside requires advanced wafer bonding and thinning technologies, techniques that must be executed with atomic-level precision. However, TSMC’s decision to stick with existing extreme ultraviolet (EUV) lithography tools for the initial A16 ramp, rather than immediately jumping to the more expensive High NA EUV machines, suggests an intentional strategy to maintain high yields while providing cutting-edge performance.  

The AI Gold Rush: NVIDIA, OpenAI, and the Battle for Capacity 

The A16 roadmap announcement has triggered a rush among top tech companies. NVIDIA, a leader in AI data centers, has reportedly secured early exclusive access to A16 for its 2027 Feynman GPU. For Nvidia, the 20% power savings from A16 is a key advantage, especially as data centers work to manage the heat and power needs of large H100 and Blackwell clusters.  

In a surprising strategic shift, OpenAI has also emerged as a key stakeholder in the A16 era, working alongside partners such as Broadcom and Marvell. OpenAI is reportedly developing its own custom silicon and an Extreme Processing Unit (XPU) optimized for its GPT-5 and Sora models. Using TSMC’s A16 node, OpenAI seeks to achieve a level of vertical integration that could eventually reduce its reliance on off-the-shelf hardware. Meanwhile, Apple, traditionally TSMC’s largest customer, is expected to use A16 for its 2027 M6 and A21 chips, ensuring its edge AI capabilities remain ahead of the competition.  

The competitive implications reach beyond chip designers to other foundries. Intel, which has been vocal about its five-node-in-four-years strategy, is currently shipping its 18A node with PowerVia technology. While Intel reached the market first with backside power, TSMC’s A16 is widely viewed as a more refined and efficient implementation. Samsung has also faced challenges, with reports showing that its 3nm GAA yields have trailed TSMC’s, leading some customers to migrate their 2026 and 2027 orders to the Taiwanese giant.  

Wider Significance: Energy Geopolitics and Scaling Principles 

The move to A16 and the Angstrom era has big effects on the wider AI world. By late 2025, AI data centers are expected to use almost half of all data center electricity worldwide. The efficiency gains from Super Power rail technology are not only a technical upgrade but also needed for economic and environmental reasons. For large companies like Microsoft and Meta, adopting A16 chips could save billions of dollars each year by reducing cooling and electricity costs.  

This development also underscores the semiconductor supply chain’s importance to global politics. TSMC’s market value hit a record $1.5 trillion in late 2025, underscoring its role as the foundry utility of the global economy. Still, having so much key technology in Taiwan is a strategic worry. To address this, TSMC is accelerating equipment upgrades in Arizona and Japan and aims to start A16 production in the US by 2028 to meet security needs for American AI labs.  

Compared to earlier milestones such as FinFET-to-GAAFET, A16 marks a technical shift. The industry priority is shifting from scaling for smaller features to architectural intelligence. Instead of focusing on transistor count increases (as in Moore’s Law), system-on-wafer scaling is now central. The methods for building, powering, and interconnecting chips are as technically crucial as transistor size.  

The Road to Sub-1nm: What Lies Beyond A16? 

Looking forward, the A16 node is just the start of the Angstrom era. TSMC is researching the A14 (1.4 nm) and A10 (1 nm) nodes targeting a launch in the late 2020s. These nodes are expected to employ new channel materials, such as two-dimensional semiconductors and molybdenum disulfide (MoS2), to overcome silicon’s scaling limits.  

In the short term, the industry will watch TSMC’s N2 node ramp in 2025. This will signal how well A16 might do. If TSMC keeps its usual yield rates with GAA FETs, moving to A16 and Super Power Rail in 2026 should go smoothly. Still, there are challenges, especially with packaging. As chips become more complex, advanced 3D packaging, such as CoWoS (chip-on-wafer-on-substrate), will be required. This packaging connects A16 chips to high bandwidth memory (HBM4), which could slow down the supply chain.  

Experts believe the A16’s success will open the door to new AI applications that were once too costly to run. This could mean real-time, high-quality video generation and autonomous agents capable of managing complex multi-step tasks. As hardware gets more efficient, the cost of running AI models or inference will fall, making advanced AI common in consumer electronics and industrial automation.  

Summary and Final Thoughts 

TTSMC’s A16 and Super Power Rail Technology signal a major advance for AI. By moving power delivery to the wafer’s back and reaching 1.6 nm, TSMC provides the thermal and electrical capacity critical for rapid AI growth. With mass production expected in late 2026, A16 is set to propel the next wave of AI innovation. 

For investors, the message is clear: new chip designs now drive the semiconductor industry. While Intel and Samsung are progressing, TSMC leads with its Angstrom roadmap, making it the top choice for AI companies. The coming yield reports from the 2 nm ramp will indicate if TSMC remains on track for A16.

Source: TSMC’s A16 Roadmap: The Angstrom Era and the Breakthrough of Super Power Rail Technology 

Today, we are announcing new capabilities in Azure AI Foundry. These features make it easier for developers to build, observe, and govern multi-agent systems. They also help organizations close the trust gap in AI.  

As more organizations adopt agentic AI, eight out of ten enterprises now use some form of agent-based AI, according to PwC. Managing these systems is becoming more complex. Developers deal with scattered tools, and organizations struggle to ensure agents act responsibly. Our latest Azure AI Foundry updates are designed to tackle these issues directly.  

Introducing Microsoft Agent Framework (Public Preview) 

The Microsoft Agent Framework, now in public preview, is an open source SDK and runtime. It makes it easier to manage multi-agent systems. The framework combines AutoGen, a previous Microsoft research project, with the enterprise features of Semantic Kernel. Together, these form one commercial-grade framework that brings the latest research directly to developers.  

With Microsoft Agent Framework, developers can:  

  • Start by experimenting locally, then deploy to Azure AI Foundry with built-in observability, durability, and compliance.  
  • Integrate any API using OpenAPI, work across different runtimes with Agent2Agent (A2A), and connect to tools on the fly with Model Context Protocol (MCP).  
  • Apply the latest multi-agent patterns, such as Magnetic One, and organize agents into workflows.  
  • Reduce switching between tools and platforms.  
  • Create multi-agent systems that connect Azure AI Foundry, Microsoft 365 Copilot, and other agent platforms.  

This framework helps developers stay focused. An industry study found that half of developers lose over 10 hours each week due to scattered tools. This shows why solutions that simplify work and improve the developer experience are needed.  

One organization that uses the Microsoft Agent Framework to reduce friction is KPMG. KPMG’s transformation began with KPMG Clara, its cloud-based smart auditing platform used on every KPMG audit worldwide.  

KPMG Clara AI aligns with the open-source Microsoft Agent Framework built on semantic kernel autogen convergence.  

This setup lets KPMG Clara AI link specialized agents to enterprise data and tools while using built-in safeguards and an open developer ecosystem. Open-source connectors enable agents to work with Azure AI Foundry and external systems, making it easier to scale multi-agent collaboration globally.  

Foundry agent service and Microsoft Agent Framework connect our signals to data and to each other. Performance and observability features in Azure AI Foundry give KPMG firms what they need to succeed in a regulated industry. – Sebastian Stockle, Global Head of Audit Innovation and AI at KPMG International.  

Contribute code and feedback to help shape agentic AI by engaging with the Microsoft Agent Framework.  

Multi-Agent Workflows (Private Preview) 

Building on the Microsoft Agent Framework, we are bringing these features to the cloud with multi-agent workflows in Foundry Agent Service. This new feature lets developers manage complex multi-step business processes using a structured workflow layer. It also maintains state throughout the process.  

With multi-agent workflows, your teams can:  

  • Coordinate multiple agents across long-running tasks with persistent state (stored information that persists as tasks continue) and context sharing (allowing agents to share relevant information during collaboration).  
  • Automate complex enterprise scenarios, including customer onboarding, financial transaction processing, and supply chain automation.  
  • Leverage built-in error handling, retries, and recovery to improve reliability at scale.  

You can create and debug workflows visually using the VS Code extension or Azure AI Foundry. Then, you can deploy, test, and manage them in Foundry along with your existing solutions.  

Several customers are currently piloting this feature, and broader availability is planned soon.  

Observability Across Popular Networks With OpenTelemetry Contributions. 

We are also improving multi-agent observability by contributing to open telemetry. This helps standardize tracing and telemetry for agentic systems.  

These improvements give teams better insight into agent workflows, tool calls, and collaboration. This is essential for debugging, optimization, and compliance. We worked with Outshift, Cisco’s incubation engine, to make these enhancements to open telemetry.  

Thanks to these updates, Azure AI Foundry now offers unified observability for agents built with different frameworks, including Microsoft Agent Framework, Langchain, LangGraph, and OpenAI Agents SDK.  

Voice Live API in Azure Foundry Is Now Generally Available 

More Meta agent workflows now start with voice inputs and end with voice outputs. We are pleased to announce that Voice Live API is now generally available. This tool lets developers and businesses build scalable production-ready voice AI agents. Voice Live API is a unified real-time speech-to-speech interface that combines speech-to-text (STT), generative AI models, text-to-speech (TTS) avatars, and conversational enhancements into a single low-latency pipeline.  

Companies like Capgemini, Healow, Astrotech, and Agora are using the Voice Live API to create customer service agents, educational tutors, HR assistants, and multilingual agents. Voice Live API is changing how developers build voice AI agents by offering an integrated, scalable, and efficient solution.  

Responsible AI Capabilities In Public Preview 

As agent observability and framework integration improve, it is just as important to ensure AI systems operate responsibly and securely, especially as they become a bigger part of key business processes.  

McKinsey’s 2025 Global AI Trust Survey found that the biggest barrier to AI adoption is the lack of governance and risk management tools. To address this, we will soon release the following responsible AI features in public preview: Task adherence: Help agents stay aligned with assigned tasks.  

  • Prompt shields with spotlighting help protect against prompt injection. They also spotlight risky behavior.  
  • PII detection identifies and manages sensitive data.  

These features are part of Azure AI Foundry, helping organizations build confidently and meet both internal and external standards.  

To further empower developers, Azure AI Foundry offers more than just a platform it’s a trusted agent factory. 

Azure AI Foundry is more than just a platform. It is a trusted agent factory for developers and businesses. Whether you are a CIO aiming to scale AI responsibly, a security architect focused on governance, or a developer building the next generation of intelligent agents, Azure AI Foundry gives you the tools, frameworks, and trust you need.  

Microsoft stands out in the AI landscape with its commitment to open standards, interoperability, and responsible AI. The Microsoft Agent Framework, now in public preview, is a unified enterprise-grade framework that integrates cutting-edge research and allows developers to seamlessly orchestrate multi-agent systems with built-in observability, durability, and compliance.  

Our framework, unlike others, supports integration with any API through OpenAPI. It also allows collaboration across runtimes with agent-to-agent (A2A) and dynamic tool connections using NCP. This helps developers avoid switching between tools and stay focused. It speeds up innovation.  

The open-source nature of the framework invites developers to contribute and shape the future of agentic AI. This makes it a truly collaborative, forward-thinking platform. With Microsoft, organizations can trust that their AI systems will be powerful, efficient, responsible, and secure. It addresses the top barriers to AI adoption identified in McKinsey’s 2025 Global AI Trust Survey.

Source: Introducing Microsoft Agent Framework 

Amazon Web Services is upgrading its data center connections to 1.6T optical networking standards to address the growing communication tasks as thousands of processors sync data across distributed clusters. Doubling the bandwidth from 800G to 1.6T enables faster server connections and supports large-scale computing, computing models that require a rapid exchange rate. As demand increases, A-AWS is moving to 1.6T optical networks to ensure hardware doesn’t slow future digital device services.  

The Physics Of High Velocity Data Transmission  

Switching to 1.6 terabit-per-second (1.6T) speeds changes how light carries information through fiber-optic cables. AWS uses 200G-per-lane technology, which means each channel can transmit 200 gigabits per second, allowing eight high-speed channels (or lanes) to run over one optical interface. This setup makes the networking equipment simpler and increases each rack’s capacity. It also delivers a more efficient fabric, fabric architecture-a network structure that connects servers and devices, allowing fast, flexible data movement to manage the unpredictable traffic of modern data processing.  

To manage the heat and power needs at these speeds, AWS is building co-packaged optics (CPO) directly into its custom networking switches. Traditional modules lose efficiency because electrical signals travel over copper before being converted to light. By placing the optical engine closer to the switch chip, AWS reduces signal degradation and lowers power consumption per gigabit. The photonic integration helps keep high-density data centers stable and enables the network to run at top speed without exceeding safety limits.  

Resolving the Latency Crisis in Distributed Computing 

A major challenge in contemporary computing is synchronous latency, in which processors must wait for data from other nodes before they can continue working. Even a few microseconds of delay in a cluster can result in thousands of idle cycles. The 1.6T standard uses ultra-low-latency protocols to deliver small, time-sensitive packets quickly. This supports the global state of a distributed task consistent across all our hardware.  

AWS’s move to 1.6T optical networks also relies on forward error correction (FEC) algorithms that detect and correct data errors in real time during transmission in high-speed optical systems. This prevents the need to resend packets and ensures data accuracy and integrity at terabit speeds, where even minor interference can cause major problems. By strengthening the communications layer, AWS delivers predictable, consistent network performance for enterprise customers, enabling researchers to run complex simulations with full confidence in data integrity.  

Scaling The Global Backbone For Uninterrupted Throughput 

The 1.6T optics upgrade extends beyond single-server racks and includes inter-availability zone (inter-AZ) connections. These long fiber paths link different data center campuses within the same geographic region, providing redundancy and load balancing. Upgrading these backbones lets AWS move tasks between buildings based on power or cooling needs, enabling seamless workload migration and uninterrupted computing. This creates a fluid infrastructure, an adaptable system that quickly responds to changing needs.  

AWS is also using next-generation optical amplifiers devices that simply amplify light signals to keep signals strong over long distances. These amplifiers increase the intensity of light pulses without adding the noise (unwanted interference) that often affects high-frequency waves. This enables high-fidelity connectivity, meaning reliable, clear data transfer across hundreds of miles of fiber, making a regional cluster work like one large computer. For users, this means speedier load times and more responsive apps, no matter where the hardware is. It moves the cloud closer to geographic transparency. The idea is that users do not notice where computing resources are physically located.  

Addressing The Economic Facts Of Infrastructure Growth 

Although moving to 1.6T requires a large capital investment the initial money spent on equipment  it lowers operational expenditure (OPEX), the ongoing cost of running the system per terabit of data, by sending more data through fewer cables. AWS cuts the cost of cable management (handling and maintaining wiring) and port maintenance. The energy savings from 200G-per-lane signaling also help reduce the network’s total carbon footprint, which is the environmental impact of greenhouse gas emissions. This sustainability is important for global organizations that need to report their environmental effects.  

The upgrade simplifies the hardware cycle by reducing upgrade frequency. By moving directly to 1.6T, AWS future-proofs its network for projected five-year data growth. This long-term technical deployment maintains the network’s competitive advantage. Partners and developers can now build more complex software with confidence that the network will scale to future needs. That stability supports technological risk-taking and drives industry innovation.  

The Crystalline Pulse of the New Internet 

As 1.6 terabit connections come online, the data center becomes more efficient and responsive. This shift lets us address delay as a technical challenge rather than a persistent issue. With time, bottlenecks can become rare, letting ideas move at the speed of light.  

In the future, invisible systems will support our daily lives with care. As the world becomes more connected and responsive, technology will finally keep pace with human thought.

Source: AWS News Blog 

High-performance computing is evolving as NVIDIA’s B300 design introduces a new thermal management method for data centers. Previously, data centers struggled to remove heat from dense servers, limiting both density and reliability. The B300 replaces traditional air cooling with built-in liquid cooling, directly boosting efficiency. This enables more transistors per chip without overheating, increasing computational power. By adding cooling channels to the silicon, NVIDIA enables processors to run at full speed, boosting performance and stability.  

Engineering the Shift to Native Liquid Cooling 

The B300 series uses a special direct-to-chip liquid cooling system that replaces large copper heat sinks with microchannel cold plates. These plates sit closely against the processor, letting a non-conductive coolant pull heat away much more efficiently than air. As a result, this design directly addresses the thermal resistance that usually builds up between the chip and its cooler, which can hinder consistent operation. By removing this barrier, the system maintains steady temperatures even during heavy workloads, ensuring the hardware remains reliable over time and delivers stable, long-lasting performance.  

NVIDIA’s B300 also includes a manifold-integrated chassis that simplifies cooling for large server racks. Rather than using separate hoses for each card, the rack itself delivers coolant to all the components, making large-scale deployment more efficient. This design reduces installation complexity and lowers the risk of leaks or flow issues, directly contributing to smoother operations and easier maintenance. The system is also built to manage the pressure drop associated with fast-moving coolant, ensuring even coolant distribution across all components and supporting maximum hardware uptime. This kind of integration is key to maintaining the dependability of cloud services and minimizing disruptions.  

Overcoming the Constraints of Air-based Dissipation  

Traditional air cooling has reached its air density ceiling because fans can’t move enough air to cool 1,000-watt processors, limiting server power and scalability. The B300 solves this by using liquid, which can carry away much more heat than air. Water and special coolants can remove up to four thousand times more heat than the same amount of air, directly enabling higher rack density. This means data centers can fit more compute power into less space and don’t need large HVAC systems or specialized hot-air setups, resulting in both cost and space savings. Switching to liquid cooling also solves the problem of server room noise. Air-cooled data centers rely on thousands of fast fans, which are loud and energy-intensive. The B300 uses quiet, low-speed pumps instead, so the system runs almost silently. Cutting parasitic power means more electricity is available for computing, improving efficiency. This also helps organizations run data centers more efficiently and sustainably.ise.  

Strengthening Reliability Through Thermal Stability 

Changes in temperature are a major cause of semiconductor failure, as components inside expand and contract. The B300 uses active thermal leveling to keep the temperature steady regardless of how the workload changes. This precise control means the processor remains within safe temperature limits. When the processor is idle, the system slows the coolant flow; when the workload increases, the flow speeds up immediately. This thermal equilibrium prevents tiny cracks and solder issues that often occur in air-cooled systems, directly expanding the mean time between failures (MTBF) of critical components.  

Keeping the temperature steady also helps B300 chips avoid the performance jitters caused by thermal throttling. In older systems, when fans couldn’t keep up, the processor would slow down, causing delays. In cloud applications, the B300’s liquid cooling keeps performance steady even beneath heavy loads, directly enabling consistent processing for time-sensitive tasks. Such reliability is necessary for mission-critical telemetry and real-time financial processing where every millisecond matters. Air-cooled systems just can’t deliver this level of stability in dense setups, risking performance.  

Simplifying Data Center Infrastructure Requirements 

Using NVIDIA’s B300 design lets data centers clear out a lot of clutter, leading to significant CapEx reductions. For example, without large air ducts and raised floors, new sites can have lower ceilings and simpler ventilation systems, both of which are tied directly to lower construction costs. Reduced infrastructure also makes it easier to reuse existing municipal spaces. Additionally, modular cooling units (CDUs) at the ends of server rows form a closed-loop system, where their proximity minimizes energy loss from coolant movement, increasing overall efficiency.  

The P300 hardware also includes predictive leak-detection sensors built into every unit’s firmware. These sensors detect even small changes in humidity or pressure and can shut down only the affected area before any damage occurs, directly reducing risk during operations. This self-heating infrastructure enables operators to use liquid cooling at scale without worrying about major leaks. The system can isolate a faulty part while the rest continue to run, maintaining high system availability even during maintenance and ensuring continuous operation.  

Defining The Horizon Of Sustainable Power 

As the need for computing power grows, thermal efficiency is becoming the main measure of success. The B300 moves away from brute-force cooling used in the past. It shows that the future of computing is about working in balance with the physical world, not just making faster chips. We are heading toward a time when data centers are quiet, liquid-cooled, and run smoothly with their surroundings. Soon, the idea of a cooling limit will look outdated.  

We are moving into a domain of thermal transparency in which machines no longer struggle with their own heat. The design of global networks now focuses on stability, long life, and quiet, steady power. Every bit of coolant and every microchannel in the silicon helps keep things safe and reliable. The system now runs smoothly and quietly, keeping pace with our digital needs. In the future, the systems that support our lives will value their internal balance as much as their performance. This clear approach means the cloud’s future will be as cool and reliable as the water that powers it. 

Source: Nvidia News 

A recent robotics patent offers a glimpse of the future of robotics. The new type of robot will move more like a human than current models do, thanks to greater humanlike movement capabilities and the ability to adapt and respond in real time to changing environments. 

The patent also provides insight into how this new technology will affect the operation of robots across many industries, including manufacturing and healthcare. 

Key patented features include 

  • – Robots can dynamically adjust movement using sensors. 
  • – Sensors provide feedback on balance while moving. 
  • – Integrated systems better coordinate robot movements. 

– Task execution is more precise, improving accuracy. 

With all of the above capabilities built into the next generation of robots, robots with this technology will be able to respond and adapt as humans do, rather than just carrying out predetermined motions. 

Why Human-Like Movement Matters 

Traditional robots are efficient but limited. Their movements are often mechanical, repetitive, and inflexible. 

Human-like movement introduces: 

  • Greater flexibility in tasks 
  • Safer interaction with humans 
  • Better performance in complex environments 

This is especially important in industries where precision and adaptability are critical. 

The Technology Behind It 

Several different types of advanced components form a system: 

  • Sensors to sense the robot’s environment 
  • AI systems for making decisions based on sensor information 
  • Mechanical systems that allow each element to move freely. 

Combining these components enables the robot to continuously adjust its behavior rather than being programmed to perform fixed tasks. 

Impact on Automation 

This cutting-edge technology will potentially change the face of automation. Some of the industries that stand to gain from this include: 

  • The manufacturing industry (more complex assembly jobs) 
  • The medical industry (assisting robots) 
  • The logistics industry (working in unpredictable environments) 

By allowing for more natural movement, robots will be able to perform tasks that were once considered too dangerous or difficult. 

Performance and Efficiency Gains 

Improved functionality also equals increased efficiency through adaptive movements. Benefits include: 

  • Fewer task execution errors 
  • Shorter response times 
  • Better energy use 

For companies, this will lead to increased productivity and decreased operating costs. 

Challenges Ahead 

While promising, the technology still faces hurdles: 

  • High development costs 
  • Complexity in implementation 
  • Need for extensive testing. 

It may take time for such systems to be widely adopted in commercial applications. 

What This Means for the US 

For the US, this patent signals continued leadership in robotics innovation. 

It could: 

  • Strengthen automation capabilities 
  • Support industrial growth 
  • Drive advancements in AI and engineering 

As competition in robotics increases globally, such innovations will play a key role in maintaining technological advantage. 

Conclusion 

By integrating adaptive, human-like motions into their designs. This patent represents another advancement towards machines that not only perform tasks but also emulate human-like movements & responses.

Source- Google Patents