AIP score flakiness

### Priority Level

Medium

### Task Summary

The Attribute Inference Protection score seems to be fluctuating a lot between runs, specifically for or ai_generated_essays and call_transcripts, which are predominantly text columns. The score just randomly jumps between 10/5 or 10/7.5. 
https://wandb.ai/nemo-llm-service/matt_faiss_removed_reb/table?nw=nwusermkornfield
https://wandb.ai/nemo-llm-service/matt_faiss_still_there_2/table?nw=nwusermkornfield

I'm suspecting something like columns being designated different types between runs. But should confirm. 

### Technical Details & Implementation Plan

_No response_

### Dependencies

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AIP score flakiness #280

Priority Level

Task Summary

Technical Details & Implementation Plan

Dependencies

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

AIP score flakiness #280

Description

Priority Level

Task Summary

Technical Details & Implementation Plan

Dependencies

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions