Skip to content

Commit

Permalink
Enhance DG self-hosted config
Browse files Browse the repository at this point in the history
  • Loading branch information
Thai Nguyen authored and thainguyensunya committed Mar 8, 2025
1 parent 90fd2e4 commit 224222d
Show file tree
Hide file tree
Showing 3 changed files with 9 additions and 3 deletions.
5 changes: 4 additions & 1 deletion backend/charts/deepgram-self-hosted/dev_omi_values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -60,6 +60,9 @@ scaling:
# Discuss a reasoanble value with your Deepgram Account Representative
# Must also set engine.concurrencyLimit.activeRequests if using request ratio for autoscaling
requestCapacityRatio:
behavior:
scaleUp:
stabilizationWindowSeconds: 120 # Wait 2 minutes before scaling up

api:
image:
Expand Down Expand Up @@ -114,7 +117,7 @@ engine:
gpu: 1
limits:
memory: "40Gi"
cpu: "8000m"
cpu: "12000m"
gpu: 1
# Discuss a reasonable value with your Deepgram Account Representative
# If not using autoscaling, can be left empty, but must be set if using
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -242,7 +242,7 @@ gcloud container node-pools create engine-pool \
--num-nodes 1 \
--enable-autoscaling \
--max-nodes 8 \
--machine-type g2-standard-8 \
--machine-type g2-standard-12 \
--accelerator=type=nvidia-l4,count=1,gpu-driver-version=latest \
--node-labels k8s.deepgram.com/node-type=engine

Expand Down
5 changes: 4 additions & 1 deletion backend/charts/deepgram-self-hosted/prod_omi_values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -60,6 +60,9 @@ scaling:
# Discuss a reasoanble value with your Deepgram Account Representative
# Must also set engine.concurrencyLimit.activeRequests if using request ratio for autoscaling
requestCapacityRatio:
behavior:
scaleUp:
stabilizationWindowSeconds: 120 # Wait 2 minutes before scaling up

api:
image:
Expand Down Expand Up @@ -114,7 +117,7 @@ engine:
gpu: 1
limits:
memory: "40Gi"
cpu: "8000m"
cpu: "12000m"
gpu: 1
# Discuss a reasonable value with your Deepgram Account Representative
# If not using autoscaling, can be left empty, but must be set if using
Expand Down

0 comments on commit 224222d

Please sign in to comment.