Collect GPU index assignments from SLURM gres_detail and filter Job Analyzer GPU charts (#129)
```python
GRES_GPU_INDICES: str | None = field(
    default=None,
    metadata={
        "parser": parse_gres_gpu_indices,
        "field_name": "GRES_DETAIL",
        "slurm_field": True,
    },
)
```
can we move to parsed_field here?
parsed_field doesn't support a default parameter — it delegates to field(metadata=...) without passing default. Since GRES_GPU_INDICES must default to None (it's only populated via the REST API path, not squeue CLI), I used field(default=None, metadata=...) directly. This preserves all the same metadata (parser, field_name, slurm_field) that parsed_field would set.
```python
if len(indices) >= 8 or not indices:
    return None
```
can we adapt this to work for nodes with > 8 GPUS?
Done. Removed the hardcoded len(indices) >= 8 threshold from the parser — it now always returns the parsed indices regardless of count. The filtering decision is moved to the PHP caller (FAIRJobAnalyzerPerfAnalyzerModule.php), which compares the number of GPU indices against total GPUs on the node. Added test cases for 10-GPU and 16-GPU nodes to verify.
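The threshold-free decision can be sketched in Python (the PR implements it in Hack in FAIRJobAnalyzerPerfAnalyzerModule.php; the function name and signature here are illustrative assumptions):

```python
def should_filter_gpu_charts(indices: list[int], total_gpus_on_node: int) -> bool:
    """Filter charts to specific GPUs only when the job holds a strict
    subset of the node's GPUs; empty or full allocations show all GPUs.

    Works for any node size (8, 10, 16, ... GPUs) because the comparison
    is against the node's actual GPU count, not a hardcoded 8.
    """
    return 0 < len(indices) < total_gpus_on_node


assert should_filter_gpu_charts([7], 8)                   # partial: filter
assert not should_filter_gpu_charts(list(range(16)), 16)  # full 16-GPU node
assert should_filter_gpu_charts(list(range(4)), 10)       # 10-GPU node, partial
assert not should_filter_gpu_charts([], 8)                # no indices: show all
```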
luccabb left a comment:
since squeue CLI has no GRES_DETAIL this field would always be null, we'd need to add support for Slurm REST (SlurmRestClient) in the collection pipeline
@claude did we miss anything?
Claude finished @luccabb's task in 1m 24s. Review: Did we miss anything?
Beyond the items you already flagged, here are additional issues I found: 1.
Collect GPU index assignments from SLURM gres_detail and filter Job Analyzer GPU charts (facebookresearch#129)

Summary: Adds GPU index collection from the SLURM REST API's gres_detail field to the GCM pipeline, and uses it in the FAIR Job Analyzer to show only the GPUs assigned to a job (instead of all 8 GPUs on the node).

## Background

When a job uses fewer GPUs than available on a node (e.g., `--gpus-per-task=1` on an 8-GPU node), the Job Analyzer previously showed metrics for all 8 GPUs. The existing GPUS_REQUESTED field comes from TRES-PER-NODE (always 8 for the full node), not the per-task allocation. TRES_GPUS_ALLOCATED correctly reports the count (e.g., 1) but not which specific GPU indices are assigned.

The SLURM REST API provides gres_detail, an array of strings with exact GPU index assignments per node (e.g., `"gpu:ampere:1(IDX:7)"`). Verified on AVA RSC: `scontrol show job <id> -d | grep GRES` → `GRES=gpu:ampere:1(IDX:7)`

## Pipeline change (Python)

- parsing.py: Added `parse_gres_gpu_indices()`, which parses gres_detail strings into GPU index lists. Returns a comma-separated string of indices for single-node partial-GPU jobs (e.g., `"7"` or `"0,3,5"`), and None for full-node (8 GPUs) or multi-node jobs. This avoids storing unnecessary data.
- squeue.py: Added a GRES_GPU_INDICES field (nullable, defaults to None) and a `"gres_detail"` → `"GRES_DETAIL"` REST API mapping. Adds one string column to existing fair_job_data rows; no extra entries.
- test_parsers.py: Added 12 test cases covering single GPU, multiple GPUs, range notation, full-node, multi-node, and edge cases (empty, null, N/A).

## Job Analyzer change (Hack)

- FairJob.php: Added a `$gresGpuIndices` property.
- FAIRJobAnalyzerLatestJobInfoModule.php: Queries GRES_GPU_INDICES from the fair_job_data Scuba table.
- FAIRJobAnalyzerPerfAnalyzerModule.php: When gresGpuIndices is available (e.g., `"7"`), filters all 5 GPU ODS charts (utilization, temperature, SM util, SM occupancy, memory) to `gpu=(7)` with a per-GPU reduceTerm. When null (full-node or multi-node), shows all GPUs with the original averaged reduceTerm.

### Scope

- Single-node partial-GPU jobs: shows only the assigned GPUs (100% accurate)
- Single-node full-GPU jobs: shows all GPUs (unchanged, no filtering needed)
- Multi-node jobs: shows all GPUs (unchanged; gres_detail has per-node values that can't be stored in fair_job_data's 1-row-per-job format)

Differential Revision: D99787988
Force-pushed a744964 to 828670e.
@lushengt-meta has exported this pull request. If you are a Meta employee, you can view the originating Diff in D99787988.
Force-pushed 828670e to 7eafc8d.
Correct: GRES_DETAIL is a REST API-only field and is already mapped via "gres_detail": "GRES_DETAIL" in REST_TO_SQUEUE_FIELD_MAP. The _map_job_fields method in SlurmRestClient populates it from the REST API response. For CLI-based collection, slurm_field is set to False so GRES_DETAIL is excluded from JOB_DATA_SLURM_FIELDS and won't be injected into the squeue format spec. The field defaults to None when not populated.
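A hedged sketch of the REST-to-squeue mapping flow just described: the name `REST_TO_SQUEUE_FIELD_MAP` comes from the comment above, but the dictionary contents (beyond the `gres_detail` entry) and the standalone `map_job_fields` function body are illustrative assumptions, not the actual `SlurmRestClient._map_job_fields` implementation:

```python
# Only fields present in the map are lifted out of the REST response;
# "job_id" here is a hypothetical second entry for illustration.
REST_TO_SQUEUE_FIELD_MAP = {
    "gres_detail": "GRES_DETAIL",
    "job_id": "JOBID",
}


def map_job_fields(rest_job: dict) -> dict:
    """Translate a REST API job record into squeue-style field names.

    Fields absent from the REST response are simply omitted, so a
    CLI-only collection path never sees GRES_DETAIL and the dataclass
    default of None applies.
    """
    return {
        squeue_name: rest_job[rest_name]
        for rest_name, squeue_name in REST_TO_SQUEUE_FIELD_MAP.items()
        if rest_name in rest_job
    }


assert map_job_fields({"job_id": 42, "gres_detail": ["gpu:ampere:1(IDX:7)"]}) == {
    "JOBID": 42,
    "GRES_DETAIL": ["gpu:ampere:1(IDX:7)"],
}
assert map_job_fields({"job_id": 7}) == {"JOBID": 7}  # no gres_detail: omitted
```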
Addressed the issues reported by Claude.
Force-pushed 7eafc8d to 6b7e07f.
Force-pushed 6b7e07f to f2bafdd.
Force-pushed f2bafdd to da069ae.
Force-pushed da069ae to 82c9f41.