u/Aware-Ticket-5585

Been running GPU inference workloads on k8s and got tired of the dcgm-exporter → Prometheus → PromQL → KEDA chain just to autoscale based on GPU utilization. 5 components, 15-30s metric lag, PromQL queries to maintain.


So I built keda-gpu-scaler — a KEDA external scaler that talks to NVML directly on each GPU node via a DaemonSet. Reads GPU utilization, memory, temperature, power and serves them over gRPC to KEDA. Sub-second metrics, no Prometheus in the loop.


Wrote about the architecture and why it has to be an external scaler (not a native one) on the CNCF blog: https://www.cncf.io/blog/2026/05/27/gpu-autoscaling-on-kubernetes-with-keda-building-an-external-scaler/


It ships with pre-built profiles for vLLM, Triton, training jobs, and batch workloads. Scale-to-zero works too.


GitHub: https://github.com/pmady/keda-gpu-scaler
Docs: https://keda-gpu-scaler.readthedocs.io


Still early (v0.1.0) so if you're running GPU workloads on k8s I'd appreciate feedback, bug reports, or contributions. Roadmap and open issues are on the repo.Been running GPU inference workloads on k8s and got tired of the dcgm-exporter → Prometheus → PromQL → KEDA chain just to autoscale based on GPU utilization. 5 components, 15-30s metric lag, PromQL queries to maintain.


So I built keda-gpu-scaler — a KEDA external scaler that talks to NVML directly on each GPU node via a DaemonSet. Reads GPU utilization, memory, temperature, power and serves them over gRPC to KEDA. Sub-second metrics, no Prometheus in the loop.


Wrote about the architecture and why it has to be an external scaler (not a native one) on the CNCF blog: https://www.cncf.io/blog/2026/05/27/gpu-autoscaling-on-kubernetes-with-keda-building-an-external-scaler/


It ships with pre-built profiles for vLLM, Triton, training jobs, and batch workloads. Scale-to-zero works too.


GitHub: https://github.com/pmady/keda-gpu-scaler
Docs: https://keda-gpu-scaler.readthedocs.io


Still early (v0.1.0) so if you're running GPU workloads on k8s I'd appreciate feedback, bug reports, or contributions. Roadmap and open issues are on the repo.

Hi everyone,

I am preparing to file my EB-1A petition and am looking for recommendations for immigration law firms that specialize heavily in the tech, cloud infrastructure, and AI space.

I want to avoid generalist firms. I need an attorney who natively understands modern enterprise tech credentials so I don't have to spend hours explaining the impact of my infrastructure work.

I am using a burner account for privacy, but here is an anonymized, high-level breakdown of my profile:

Role: Senior Cloud Platform Engineer at a Fortune 500 company (US-based).
Elite Credentials: I hold a top-tier global certification in cloud-native architecture (held by fewer than 400 people worldwide), plus a recognized expert designation from a major enterprise cloud provider.
Judging: Peer reviewer for a Tier 1 IEEE Journal, plus 12 papers reviewed across 3 international AI and Cloud computing conferences.
Scholarly Articles: 9 published technical articles in premier industry venues, including an IEEE publication and several official, high-traffic ecosystem blogs.
Original Contributions: Quoted as an AI infrastructure expert in mainstream tech media, featured as an expert panelist for a major software engineering live stream, and armed with strong independent recommendation letters from Fortune 500 tech leaders.

What I am looking for:

Firms with a proven, high approval rate specifically for elite Software/Platform Engineers.
Firms that offer a free initial CV evaluation.
Honest feedback: Should I go with the big names (Chen/WeGreened, Ellis Porter, Wegmuller) or is there a boutique firm that excels with highly technical, non-academic profiles?

If anyone with a similar software/infrastructure background has successfully filed recently, I would highly appreciate your attorney recommendations or DMs.

Thanks in advance!

We built an open-source KEDA external scaler for GPU workloads - no Prometheus needed