Commit 673f1bb

Revert the transport backend to UCX
Signed-off-by: Shixiaowei02 <[email protected]>
1 parent c789000 commit 673f1bb

4 files changed: +5 -5 lines changed

cpp/tensorrt_llm/batch_manager/cacheTransceiver.cpp

Lines changed: 1 addition & 1 deletion
@@ -89,7 +89,7 @@ std::unique_ptr<BaseCacheTransceiver> CacheTransceiverFactory::createCacheTransc
         }
         else
         {
-            backendType = executor::CacheTransceiverConfig::BackendType::NIXL;
+            backendType = executor::CacheTransceiverConfig::BackendType::UCX;
         }
     }
     cacheTransceiverConfig.value().setBackendType(backendType);

docs/source/features/disagg-serving.md

Lines changed: 1 addition & 1 deletion
@@ -118,7 +118,7 @@ cache_transceiver_config:
   max_tokens_in_buffer: <int>
 ```

-`backend` specifies the communication backend for transferring the kvCache, valid options include `DEFAULT`,`UCX`, `NIXL`, and `MPI`, the default backend is NIXL.
+`backend` specifies the communication backend for transferring the kvCache, valid options include `DEFAULT`,`UCX`, `NIXL`, and `MPI`, the default backend is UCX.

 `max_tokens_in_buffer` defines the buffer size for kvCache transfers, it is recommended to set this value greater than or equal to the maximum ISL (Input Sequence Length) of all requests for optimal performance.

examples/disaggregated/README.md

Lines changed: 1 addition & 1 deletion
@@ -12,7 +12,7 @@ The `trtllm-serve` command supports the `extra-llm-config.yaml` parameter. In th

 ```yaml
 cache_transceiver_config:
-  # KV cache transmission backend. Valid options include `DEFAULT` (i.e., NIXL), `UCX`, `NIXL`.
+  # KV cache transmission backend. Valid options include `DEFAULT` (i.e., UCX), `UCX`, `NIXL`.
   backend: <str>
   # KV cache buffer size. Set it ≥ the maximum ISL (Input Sequence Length) for best performance.
   max_tokens_in_buffer: <int>

tensorrt_llm/_torch/pyexecutor/kv_cache_transceiver.py

Lines changed: 2 additions & 2 deletions
@@ -39,9 +39,9 @@ def create_kv_cache_transceiver(
     if cache_transceiver_config.backend == "DEFAULT":
         # When cache_transceiver_config.backend is not set, fallback to env_vars settings
         # NIXL is the default backend
-        cache_transceiver_config.backend = "NIXL"
+        cache_transceiver_config.backend = "UCX"
         # Ordered by priority
-        env_vars = [("TRTLLM_USE_UCX_KVCACHE", "UCX"),
+        env_vars = [("TRTLLM_USE_NIXL_KVCACHE", "NIXL"),
                     ("TRTLLM_USE_MPI_KVCACHE", "MPI")]
         for env_var, be_type in env_vars:
             if getenv(env_var) == "1":
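
For readers skimming the hunk above, the resulting `DEFAULT` resolution can be sketched as a standalone function. This is a minimal illustration, not the code in `kv_cache_transceiver.py`: `resolve_backend` and its `environ` parameter are hypothetical names, and because the loop body is cut off in the diff, the sketch assumes the first environment variable set to `"1"` overrides the default and ends the search.

```python
import os

# Default after this commit, plus the override variables in the
# priority order shown in the diff.
DEFAULT_BACKEND = "UCX"
ENV_OVERRIDES = [("TRTLLM_USE_NIXL_KVCACHE", "NIXL"),
                 ("TRTLLM_USE_MPI_KVCACHE", "MPI")]


def resolve_backend(configured, environ=None):
    """Hypothetical helper (not part of TensorRT-LLM) that mirrors the hunk."""
    environ = os.environ if environ is None else environ
    # An explicitly configured backend is used unchanged.
    if configured != "DEFAULT":
        return configured
    for env_var, backend in ENV_OVERRIDES:
        # Assumption: the first variable set to "1" wins.
        if environ.get(env_var) == "1":
            return backend
    return DEFAULT_BACKEND
```

Under these assumptions, `resolve_backend("DEFAULT", {})` yields `"UCX"`, while setting `TRTLLM_USE_NIXL_KVCACHE=1` selects NIXL again without editing the config file.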
