Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New warnings in v3.3.1 acceptable? #998

Open
david-shreffler opened this issue Jan 14, 2025 · 10 comments
Open

New warnings in v3.3.1 acceptable? #998

david-shreffler opened this issue Jan 14, 2025 · 10 comments

Comments

@david-shreffler
Copy link

Finally got v3.3.1 installed by pinning pytorch==2.3.1 and torchaudio==2.3.1 (#967 (comment)).

Transcripts are generated with diarization, but can anyone confirm with this new version that none of these (especially the last one) are an indication of whisperX not performing as expected?

New:

INFO:speechbrain.utils.quirks:Applied quirks (see `speechbrain.utils.quirks`): [allow_tf32, disable_jit_profiling]
INFO:speechbrain.utils.quirks:Excluded quirks specified by the `SB_DISABLE_QUIRKS` environment (comma-separated list): []

Existing in v3.1.1, assuming still okay?

Lightning automatically upgraded your loaded checkpoint from v1.5.4 to v2.5.0.post0. To apply the upgrade to your files permanently, run `python -m pytorch_lightning.utilities.upgrade_checkpoint whisperx/lib/python3.12/site-packages/whisperx/assets/pytorch_model.bin`
Model was trained with pyannote.audio 0.0.1, yours is 3.3.2. Bad things might happen unless you revert pyannote.audio to 0.x.
Model was trained with torch 1.10.0+cu102, yours is 2.3.1+cu118. Bad things might happen unless you revert torch to 1.x.

New:

/home/ubuntu/whisperx/lib/python3.12/site-packages/pyannote/audio/utils/reproducibility.py:74: ReproducibilityWarning: TensorFloat-32 (TF32) has been disabled as it might lead to reproducibility issues and lower accuracy.
It can be re-enabled by calling
   >>> import torch
   >>> torch.backends.cuda.matmul.allow_tf32 = True
   >>> torch.backends.cudnn.allow_tf32 = True
See https://github.com/pyannote/pyannote-audio/issues/1370 for more details.

New:

/home/ubuntu/whisperx/lib/python3.12/site-packages/pyannote/audio/models/blocks/pooling.py:104: UserWarning: std(): degrees of freedom is <= 0. Correction should be strictly less than the reduction factor (input numel divided by output numel). (Triggered internally at ../aten/src/ATen/native/ReduceOps.cpp:1807.)
  std = sequences.std(dim=-1, correction=1)

Lastly, can any of these be suppressed?

@pramadikaegamo
Copy link

same issues with me

@Charlie11-Model
Copy link

Hi, I receive the same feedback. But mine exits with an error messageAn error occurred during diarization: list indices must be integers or slices, not tupleJust after your second last share:See https://github.com/pyannote/pyannote-audio/issues/1370 for more details.Can you help me to get the transcript with diarization successfully?

@pramadikaegamo
Copy link

is there a solution to this issue? @m-bain @Barabazs

@591280714

This comment has been minimized.

@kevdawg94

This comment has been minimized.

4 similar comments
@cczw2010

This comment has been minimized.

@niradar

This comment has been minimized.

@danhurtes

This comment has been minimized.

@GlitchOwl
Copy link

+1

@weiyuwang2024
Copy link

same issue here, any solutions?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

10 participants