Apple Metal 4 Adds Native Neural Scheduling For Mac GPUs

Apple Metal 4 now provides native low-latency neural scheduling for Mac GPUs, enabling developers to integrate AI features more efficiently.

Developers can now run Core ML models directly on the GPU alongside graphics and compute workloads. Notable features include:

MTL4Macmachinelearning

command encoder

integrated tensor support

improved synchronization

These enhancements allow AI workloads to operate independently of the CPU.

Here are some important updates in Metal 4 for AI and Graphics

Native Neural Scheduling with MTL4 Machine Learning Command Encoder Machine Learning Inference can be integrated into the rendering pipeline.

Tensor support: The API and Metal Shading Language now have built-in Tensor support, making machine learning workflows smoother.

Optimized workflow: machine learning compute and rendering instructions can now be combined, reducing requirements for CPU-side synchronization.

Game Porting Toolkit 3.0: This version adds experimental Metal IFX integration and supports more instruction sets.

Game Porting Toolkit 3.0: This version adds experimental Metal IFX integration and supports more instruction sets.

In summary, Metal 4 is designed for Apple silicon Macs and delivers advanced AI-powered graphics.

With Metal 4, you can now run Core ML models efficiently as part of your Metal workflow. This helps when your app needs to use model output in a Metal context, for example, rendering a scene or running a compute task. To add machine learning inference, convert your Core ML model into a MetalML package during development. Then use that package in a machine learning encoder at runtime.

Your app can handle rendering, compute, and machine learning tasks in a single command buffer, with no CPU wait or extra sync. When you run Core ML models on the GPU, your app sends inputs from a compute pass and quickly uses outputs from a machine learning pass.

Metal 4 adds new tensor types, which are multi-dimensional arrays used for machine learning model data. The Digital Sharing language now includes tensor operators and features such as cooperative tensors. This lets your shader code work with tensor data in parallel at any GPU stage.

Discover Metal 4

Metal 4 is designed to fulfill the demands of today’s applications. Its simpler API helps you get the best performance on Apple Silicon with less overhead and enhanced resource management. Compilation is now clearer and faster, with new options to reduce runtime compilation.

Metal 4 now fully supports machine learning, including native tensor support in the API and the Shading Language. You can add machine learning to your Metal application by running large networks from the command line or by adding inference operations immediately into your shader code.

Metal 4 additionally extends the MTL device that you already use today. You can incrementally adopt features that will most help your app or game in the order you need them.

Metal 4 Games

Create modern games that work well on all Apple devices. Metal 4 helps you handle large sets of resources more efficiently. With new sparse resource placement, you can better utilize system memory. Also, use familiar APIs to quickly port your games from other platforms to Metal.

The latest Metal compiler updates give you fine-grained control over shader compilation. Use a dedicated context to adjust the quality of service and ensure optimal performance. Quickly compile pipelines ahead of time to reduce run-time delays and share compilation results across render pipelines using a common Metal IR for faster development and testing.

Go Further With MetalIFX And Ray Tracing

Benefit from improved upscaling with integrated denoising for higher quality, sharper renders at high resolutions. Achieve higher frame rates using new frame interpolation, resulting in smoother animation without sacrificing visual quality.

Ray tracing now uses intersection function buffers for more flexible indexing, simplifying porting ray tracing code to Metal. Flags for acceleration structure builds let you choose between faster intersection computation or smaller memory footprints, optimizing performance for your specific needs.

Metal 4 Machine Learning

You can now add machine learning to your metal application. Tensors are a native resource type for working with data. You can encode machine learning commands into the same Metal command buffers and use the same barriers to sync work on large networks. You can also embed inference directly into your shaders with Metal Performance Primitives, which are optimized for all Apple platforms.

Game Porting Toolkit 3

Evaluate even more gains with an expanded instruction set, sparse resources, and experimental support for Metal IFX upscaling, denoising, and frame interpolation. Build and debug your app remotely from Microsoft Visual Studio. Supercharge your HLSL shaders with access to Apple GPU features such as: