Skip to main content

RoleBasedGroup API

RoleBasedGroup Spec

apiVersion: rbgs.sgl-project.dev/v1
kind: RoleBasedGroup
metadata:
name: my-inference
spec:
roles:
- name: prefill
workloadType: StatefulSet
replicas: 2
roleSpec:
# Role-specific configuration
- name: decode
workloadType: LeaderWorkerSet
replicas: 4
roleSpec:
# Role-specific configuration

Role Types

TypeDescription
StatefulSetStateful workloads with stable network identity
DeploymentStateless workloads
LeaderWorkerSetMulti-node distributed inference

RoleBasedGroup Status

status:
roles:
- name: prefill
readyReplicas: 2
- name: decode
readyReplicas: 4
conditions:
- type: Ready
status: "True"

For complete API reference, see the CRD definitions.