Releases: juanmc2005/diart
Version 0.9.1
Bug fixes
This version fixes a crash caused by a change in matplotlib's API (issue #234).
It also introduces a test suite and CI improvements for development.
Quantized ONNX versions of pyannote/segmentation and pyannote/embedding were added under assets/models to run tests independently from the HuggingFace space.
What's Changed
- Add test suite by @juanmc2005 in #237
Full Changelog: v0.9...v0.9.1
Version 0.9
New Features!
This version brings major changes, including compatibility with pyannote 3.*, SpeechBrain, WeSpeaker and NeMo embedding models, totaling 8 new models to create speaker diarization pipelines and 1 new model for voice activity detection.
This version also adds compatibility with ONNX models and a documentation page at diart.readthedocs.io
What's Changed
- Add compatibility with pyannote 3.0 embedding wrappers by @sorgfresser in #188
- Add support for powerset segmentation model by @hbredin in #198
- Add ONNX compatibility by @juanmc2005 in #204
- README improvements by @juanmc2005 in #207
- Add documentation page by @juanmc2005 in #209
- Make ONNX runtime optional by @juanmc2005 in #215
- Add reproducibility warning in README by @juanmc2005 in #216
New contributors
Thank you @sorgfresser for your huge contribution in #188!
Full Changelog: v0.8...v0.9
Version 0.8
What's Changed
- Voice Activity Detection by @juanmc2005 in #143
- Fix sample rate issues by @juanmc2005 in #153
- [joss] Add Conda environment by @sneakers-the-rat in #172
- [joss] Use ABC and abstractmethod rather than NotImplementedError by @sneakers-the-rat and @juanmc2005 in #173 and #179
- Fix torchaudio version incompatibility by @juanmc2005 in #181
- Add special audio source for Apple devices by @juanmc2005 in #182
- Catch keyboard interrupt in `diart.stream` by @juanmc2005 in #183
- Remove `PipelineConfig.from_dict()` by @juanmc2005 in #189
- Fix link to reproducibility section by @hbredin in #191
New Contributors
Thank you @sneakers-the-rat for your extremely valuable feedback and help as part of the JOSS review!
Full Changelog: v0.7...v0.8
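The change in #173 and #179 (ABC and abstractmethod instead of NotImplementedError) can be illustrated with a generic sketch. It uses only the standard library; the class names `LegacyReader`, `Reader`, `IncompleteReader` and `SilentReader` are hypothetical and are not taken from diart's code base. The point is only that an abstract base class rejects an incomplete subclass at instantiation time, while the older pattern fails when the method is finally called.

```python
from abc import ABC, abstractmethod


class LegacyReader:
    """Older pattern (hypothetical class): the gap is only found at call time."""

    def read(self):
        raise NotImplementedError


class Reader(ABC):
    """ABC pattern (hypothetical class): subclasses must implement `read`."""

    @abstractmethod
    def read(self):
        """Return the next chunk of samples."""


class IncompleteReader(Reader):
    """Forgets to implement `read`, so it cannot be instantiated."""


class SilentReader(Reader):
    """Complete implementation that returns four zero-valued samples."""

    def read(self):
        return [0.0] * 4


# Older pattern: instantiation succeeds, the error appears when `read` is called.
try:
    LegacyReader().read()
    legacy_failure = None
except NotImplementedError:
    legacy_failure = "call time"

# ABC pattern: the error appears as soon as the incomplete class is instantiated.
try:
    IncompleteReader()
    abc_failure = None
except TypeError:
    abc_failure = "instantiation time"
```

Failing at instantiation time tends to surface a missing method close to where the subclass is defined, rather than deep inside a running pipeline.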
Version 0.7
What's Changed
- Plug & play speaker diarization pipelines by @juanmc2005 in #101
- Parallel execution of Benchmark by @juanmc2005 in #124
- Better installation instructions by @juanmc2005 in #130
- Rewrite websockets by @juanmc2005 in #131
- Add left padding to short audio files by @juanmc2005 in #135
- Allow to select microphone device by @juanmc2005 in #136
- Update logo and demo gif by @juanmc2005 in #137
- Shift pipeline timestamps by @juanmc2005 in #138
Full Changelog: v0.6...v0.7
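The release notes do not describe how #135 implements left padding, so the following is only a sketch of the general idea under stated assumptions: a short signal is prepended with filler samples until it reaches a required length. The function `pad_left` and its parameters `target_length` and `fill` are hypothetical names, and plain Python lists stand in for audio tensors.

```python
def pad_left(samples, target_length, fill=0.0):
    """Prepend `fill` values so the result has at least `target_length` items.

    Hypothetical helper for illustration; not diart's actual implementation.
    """
    missing = target_length - len(samples)
    if missing <= 0:
        # Already long enough: return an unchanged copy.
        return list(samples)
    return [fill] * missing + list(samples)


# A 2-sample signal padded to 5 samples keeps its content at the end.
padded = pad_left([1.0, 2.0], 5)
```

Padding on the left keeps the original samples aligned with the end of the buffer, which is one plausible reason to prefer it over right padding in a streaming setting.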
Version 0.6
What's Changed
- Compatibility with torchaudio streams by @juanmc2005 in #91
- Online speaker diarization as a block by @juanmc2005 in #92
- Fix bug: RTTM output not being patched when closing plot window by @juanmc2005 in #100
- Add `cropping_mode` to `DelayedAggregation` by @bhigy in #105
- Compatibility with pyannote.audio 2.1.1 requirements by @juanmc2005 in #108
Full Changelog: v0.5.1...v0.6
Version 0.5.1
What's Changed
- Fix wrong config reference and unpatched annotation by @juanmc2005 in #89
Full Changelog: v0.5...v0.5.1
Version 0.5
What's Changed
- Add `study_or_path` as a Path for conversion from string by @AMITKESARI2000 in #74
- Update WebSocketAudioSource by @ckliao-nccu in #78
- Fix bug with empty RTTMs by @zaouk in #81
- Add websocket compatibility + other improvements by @juanmc2005 in #77
- Export csv report in `diart.benchmark` when output is provided by @juanmc2005 in #86
New Contributors
- @AMITKESARI2000 made their first contribution in #74
- @ckliao-nccu made their first contribution in #78
Acknowledgements
Thank you @AMITKESARI2000, @ckliao-nccu and @zaouk for all the bug fixes!
Full Changelog: v0.4...v0.5
Version 0.4
What's Changed
- Replace `resolve_features` with `TemporalFeatureFormatter` by @juanmc2005 in #59
- Make `pyannote.audio` optional (still mandatory to run default pipeline) by @juanmc2005 in #61
- Minor features and improvements by @juanmc2005 in #64
- Adds documentation for some of the classes and methods by @zaouk in #31
- Add hyper-parameter tuning with optuna by @juanmc2005 in #65
Full Changelog: v0.3...v0.4
Version 0.3
What's Changed
- Python 3.7 compatibility and PortAudio error fix by @Yagna24 in #29
- Add citation by @hbredin in #38
- Benchmark script + improvements and bug fixes by @juanmc2005 in #46
- Improve API names by @juanmc2005 in #47
- Add `OverlapAwareSpeakerEmbedding` class by @juanmc2005 in #51
- Add inference API with `RealTimeInference` and `Benchmark` by @juanmc2005 in #55
New Contributors
- Thank you @Yagna24 for your contribution to Python 3.7 compatibility!
Full Changelog: v0.2.1...v0.3
Version 0.2.1
What's Changed
- Fix empty segment in `buffer_output` causing a crash by @juanmc2005 in #24
Full Changelog: v0.2...v0.2.1