Skip to content

Conversation

@ChenZiHong-Gavin
Copy link
Collaborator

This PR adds uniform input-validation logic to all readers (CSV, JSON, JSONL, Parquet, Pickle):

  • Every document must contain a "type" field; an AssertionError is raised if missing.
  • Text-type check: when type == "text", the field specified by self.text_column (usually "content") must also be present; otherwise a ValueError is raised.

@gemini-code-assist
Copy link
Contributor

Important

Installation incomplete: to start using Gemini Code Assist, please ask the organization owner(s) to visit the Gemini Code Assist Admin Console and sign the Terms of Services.

@ChenZiHong-Gavin ChenZiHong-Gavin merged commit dec4549 into main Nov 6, 2025
3 checks passed
@ChenZiHong-Gavin ChenZiHong-Gavin deleted the feature/input-file-validation branch November 6, 2025 09:41
CHERRY-ui8 pushed a commit to CHERRY-ui8/GraphGen that referenced this pull request Dec 17, 2025
…t-file-validation

feat: validate content of input files
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants