Add helm managed overrides to audio/video datasets #1

Open
charlesbluca wants to merge 2 commits into jioffe502:audio_video_retrieval from charlesbluca:audio_video_retrieval_helm

@charlesbluca

No description provided.

  # Video dataset (96 files, ~54 GB)
  video:
-   path: /path/to/video_retrieval_data
+   path: /localhome/charlesb/datasets/video_retrieval_data
Owner

We have to change these paths back to the defaults.

Owner

@jioffe502 left a comment

Happy to merge after these small changes.

  datasets:
    bo767:
-     path: /path/to/bo767
+     path: /localhome/charlesb/datasets/bo767
Owner

Suggested change
-     path: /localhome/charlesb/datasets/bo767
+     path: /path/to/bo767

  # Video dataset (96 files, ~54 GB)
  video:
-   path: /path/to/video_retrieval_data
+   path: /localhome/charlesb/datasets/video_retrieval_data
Owner

Suggested change
-   path: /localhome/charlesb/datasets/video_retrieval_data
+   path: /path/to/video_retrieval_data

  # Audio dataset (MP3 files for audio retrieval)
  audio:
-   path: /path/to/audio_retrieval_data_mp3
+   path: /localhome/charlesb/datasets/audio_retrieval_data_mp3
Owner

Suggested change
-   path: /localhome/charlesb/datasets/audio_retrieval_data_mp3
+   path: /path/to/audio_retrieval_data_mp3

  gpu_search: false # Use GPU for search
  embedding_model: auto # auto-detect or specify model name
- vdb_backend: lancedb # milvus or lancedb
+ vdb_backend: milvus # milvus or lancedb
Owner

We are technically defaulting to lancedb. Not sure if you have hesitations about this for helm; let me know.

Owner

Definitely default to lancedb now.
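
For reference, the defaults requested in this review might look roughly like the fragment below. This is only a sketch assembled from the hunks in this thread: the placement of `video` and `audio` under `datasets`, and the top-level position of the three settings keys, are assumptions, since the hunks do not show their parent keys.

```yaml
# Sketch of the committed defaults after applying the review suggestions.
# ASSUMPTION: key nesting beyond datasets -> bo767 -> path is not visible
# in the diff hunks and is guessed here for illustration only.
datasets:
  bo767:
    path: /path/to/bo767                      # placeholder, not a machine-specific path

  # Video dataset (96 files, ~54 GB)
  video:
    path: /path/to/video_retrieval_data       # placeholder

  # Audio dataset (MP3 files for audio retrieval)
  audio:
    path: /path/to/audio_retrieval_data_mp3   # placeholder

# ASSUMPTION: parent section of these keys is not shown in the hunk.
gpu_search: false        # Use GPU for search
embedding_model: auto    # auto-detect or specify model name
vdb_backend: lancedb     # milvus or lancedb; lancedb is the agreed default
```

Machine-specific values such as `/localhome/charlesb/datasets/...` would then stay out of the committed file and be supplied locally instead.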

jioffe502 pushed a commit that referenced this pull request Mar 6, 2026

2 participants