u/thekhronosgroup

▲ 25 r/vulkan

New Vulkan sample: Rasterization Order Attachment Access

This sample demonstrates VK_EXT_rasterization_order_attachment_access, which enables framebuffer attachment reads from one fragment to the next in rasterization order, without requiring explicit synchronization or subpass self-dependencies. Techniques like programmable blending become far more practical.

The sample pairs the extension with VK_KHR_dynamic_rendering and VK_KHR_dynamic_rendering_local_read to show how to implement framebuffer fetch with guaranteed fragment ordering using modern Vulkan patterns.

Explore the sample: https://github.com/KhronosGroup/Vulkan-Samples/tree/main/samples/extensions/rasterization_order_attachment_access

reddit.com
u/thekhronosgroup — 4 days ago
▲ 21 r/vulkan

Tile Based Rendering Best Practices

Are you optimizing Vulkan applications for mobile or tile-based GPUs? The Vulkan Guide has a dedicated section on Tile Based Rendering (TBR) best practices worth bookmarking.

Unlike traditional immediate-mode GPUs, tile-based architectures process the framebuffer in small screen regions, keeping work on fast on-chip memory before writing results to main system memory. For Vulkan developers, this means memory bandwidth is often the dominant performance factor.

Key takeaways from the guide:

  • Use load/store ops intentionally. Render pass attachment configuration is your primary tool for controlling bandwidth. Setting the right loadOp and storeOp values tells the driver whether to clear, load, or discard data, directly impacting whether data has to travel off-chip.
  • Keep depth and stencil transient. If depth and stencil buffers are not needed after a render pass, mark them with LOAD_OP_CLEAR and STORE_OP_DONT_CARE. This allows the driver to keep them entirely on-chip and avoid the cost of writing them back to external memory.
  • Favor compact pixel formats. On-chip tile memory is fixed in size. Smaller bit-depth formats allow the hardware to fit more data per tile, reducing spills to external memory and improving efficiency.
  • Optimize for the binning pass. Tilers process geometry twice: once to bin triangles into tiles, then again to shade pixels. Separating vertex positions from other attributes like UVs and normals lets the GPU read only what it needs during binning, cutting unnecessary bandwidth.
  • Provide clear intent to the driver. Because Vulkan abstracts hardware details like tile size, the best way to optimize is to use correct render pass configurations and memory flags so the driver can make informed decisions on your behalf.

The full guide covers these topics in depth, including guidance on MSAA, transient attachments, and dynamic rendering:

https://docs.vulkan.org/guide/latest/tile_based_rendering_best_practices.html

reddit.com
u/thekhronosgroup — 10 days ago
▲ 34 r/vulkan

Khronos has released the Vulkan SC SDK

Vulkan SC Working Group and RasterGrid have released the Vulkan SC SDK, bringing together the full set of tools for safety-critical Vulkan development into a single installer for Linux and Windows. What's included: Loader, Validation Layers, Device Simulation Layer, Pipeline Cache Compiler integration, CMake support, the VulkanSC-cube sample, and more. The goal was to reduce setup friction significantly with automatic environment configuration, better CMake integration, and automatic PCC discovery are all part of it.

Feedback and contributions are very much encouraged. Drop thoughts in the GitHub Discussions thread or on Discord -- the Working Group is actively listening.

https://khr.io/1o0

u/thekhronosgroup — 16 days ago
▲ 9 r/HPC

The IWOMP 2026 Call for Papers is open.

The 22nd International Workshop on OpenMP takes place October 7-9, 2026 at TU Wien in Vienna, Austria. The theme this year is "OpenMP: Adaptability for Heterogeneous Multi-Device Systems."

Topics of interest include accelerated computing and offloading, performance portability, machine learning with OpenMP, runtime environments, tasking, vectorization, memory management, and more.

Submissions are limited to 12 pages (excluding references). Accepted papers will be published in Springer's Lecture Notes in Computer Science (LNCS) series.

Submission deadline: May 29, 2026 (AoE)

Learn more and submit: https://www.iwomp.org/call-for-papers/

u/thekhronosgroup — 22 days ago