Leaderboard: BRIGHT Long gone #1978

Closed
Muennighoff opened this issue Feb 5, 2025 · 0 comments · Fixed by #2041

Comments

@Muennighoff
Contributor

We had two BRIGHT categories previously; now there is only one

Image
@isaac-chung isaac-chung added the leaderboard issues related to the leaderboard label Feb 6, 2025
@KennethEnevoldsen KennethEnevoldsen self-assigned this Feb 12, 2025
KennethEnevoldsen added a commit that referenced this issue Feb 12, 2025
KennethEnevoldsen added a commit that referenced this issue Feb 13, 2025
fix: Add BRIGHT (long) and fix bug in TaskResult.filter_and_validate() (#2041)

* fix: Add BRIGHT Long

Fixes #1978

* fix: Add BRIGHT(long)

* fix bug in task results

* updated bright

* updated tests for TaskResults
silky1708 pushed a commit to silky1708/mteb that referenced this issue Mar 10, 2025
fix: Add BRIGHT (long) and fix bug in TaskResult.filter_and_validate() (embeddings-benchmark#2041)

* fix: Add BRIGHT Long

Fixes embeddings-benchmark#1978

* fix: Add BRIGHT(long)

* fix bug in task results

* updated bright

* updated tests for TaskResults
isaac-chung added a commit that referenced this issue Mar 13, 2025
* misc: Add image classification descriptive stats implementation (#2045)

* add ImageClassificationDescriptiveStatistics

* add MNIST descriptive stats

* use tuples instead

* add label count and update docstrings

* update MNIST example

* Update tasks table

* fix: Add column descriptions to leaderboard (#2039)

* fix: Add column descriptions to leaderboard

* removed existing overlap

* fix: Add BRIGHT (long) and fix bug in TaskResult.filter_and_validate() (#2041)

* fix: Add BRIGHT Long

Fixes #1978

* fix: Add BRIGHT(long)

* fix bug in task results

* updated bright

* updated tests for TaskResults

* 1.34.12

Automatically generated by python-semantic-release

* misc: Add image clustering descriptive stats implementation (#2057)

* add image clustering descriptive stats and run
* finish off last one
* remove script

* fix: Update embed_dim for jina models (#2058)

see embeddings-benchmark/results#117

* Update tasks table

* 1.34.13

Automatically generated by python-semantic-release

* Add giga embeddings (#1741)

* add gigaembeddings

* use jasper

* fix name

* create sentence_transformer instruct wrapper

* apply instruction template

* fix jasper

* update meta

* misc: Add ZS and multilabel image classification descriptive stats implementation (#2059)

* add image clustering descriptive stats and run

* finish off last one

* remove script

* add ImageMultilabelClassificationDescriptiveStatistics

* add VOC2007

* add zeroshot and mnist example

* Update tasks table

* Rename MIEB task classes with duplicated names (#2061)

fix class names

* misc: Add VisualSTS descriptive stats (#2062)

* add visualsts stats

* add last dataset

* Update tasks table

* fix: Added gte models (#1539)

* fix: Added gte models

* fix: Add mixbai models (#1540)

for #1515

* fix: Add climate fever v2 (#1873)

* Updated ClimateFEVER dataset with new version

* Fills in the empty metadata.

* Updates the date tuple

* Update class name

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* Update domains

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* Update task_subtypes

* Update annotations_creators for the first version

* Update date

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* Update task subtypes

* Update path

* Update description

---------

Co-authored-by: Kenneth Enevoldsen <[email protected]>
Co-authored-by: Mina Parham <[email protected]>

* Update tasks table

* fix: Updating paper scripts (#1958)

* change reference revisions to align with paper

* Update author list

* Added code for main results table

* updated minor changes

* added external as a "no_revision_available" case

* revert unintended changes

* format

* 1.34.14

Automatically generated by python-semantic-release

* Add datasets for a benchmark newly introduced for "Engineering" domain (#1911)

* adding clustering tasks (built-bench-clustering S2S & P2P)

* updated built-bench-clustering tasks

* Updated BuiltBenchClustering tasks

* Added "Engineering" as new domain to TaskMetadata.py
* Updated tasks table in docs
* Updated task metadata for BuiltBenchClustering S2S and P2P

* updated metadata for clustering tasks

* Add/update BuiltBench tasks

- Add BuiltBenchRetrieval task
- Add BuiltBenchReranking task
- Update metadata for BuiltBenchClusteringP2P
- Update metadata for BuiltBenchClusteringS2S

* update BuiltBench benchmark

* Update mteb/benchmarks/benchmarks.py

Co-authored-by: Roman Solomatin <[email protected]>

* Update mteb/tasks/Clustering/eng/BuiltBenchClusteringS2S.py

Co-authored-by: Roman Solomatin <[email protected]>

* Update mteb/tasks/Clustering/eng/BuiltBenchClusteringP2P.py

Co-authored-by: Roman Solomatin <[email protected]>

* Update mteb/benchmarks/benchmarks.py

Co-authored-by: Isaac Chung <[email protected]>

* Fix formatting via ruff

---------

Co-authored-by: Roman Solomatin <[email protected]>
Co-authored-by: Isaac Chung <[email protected]>

* Update tasks table

* misc: update model names to adjust for adding to results repo (#2074)

* update model names to adjust for adding to results repo

* update model meta script

* misc: Add all image classification descriptive stats (#2073)

* add most image classification descr stats

* revert changes to encoder

* add stats

---------

Co-authored-by: Roman Solomatin <[email protected]>

* Update tasks table

* ci: Rerun tests that fail due to networking issues. (#2029)

* fix: rerun tests that fail - Networking

* update tests to use tmp_path

* set versions for dev dependencies

* add pytest options to pyproject.toml

* add rerun json.decoder.JSONDecodeError

* remove JSONDecodeError from pyproject.toml

* add huggingface_hub.errors.HfHubHTTPError

* add huggingface_hub.errors.LocalEntryNotFoundError
https://github.com/embeddings-benchmark/mteb/actions/runs/13298535701/job/37139767443?pr=2044

* FileNotFoundError
https://github.com/embeddings-benchmark/mteb/actions/runs/13302915091/job/37147507251?pr=2029

* add doc to pytest rerun

---------

Co-authored-by: sam021313 <[email protected]>

* fix: generate metadata (#2063)

* fix: generate metadata

* use logging not print for script

* lint

* add iso639 to dev pyproject

* fix import

* add memory_usage_mb

* set version for iso639

Co-authored-by: Kenneth Enevoldsen <[email protected]>

---------

Co-authored-by: sam021313 <[email protected]>
Co-authored-by: Kenneth Enevoldsen <[email protected]>
Co-authored-by: Roman Solomatin <[email protected]>

* 1.34.15

Automatically generated by python-semantic-release

* fix: add missing `e5` training datasets (#2065)

add missing training datasets

* 1.34.16

Automatically generated by python-semantic-release

* fix: Ensure voyage model uses different naming scheme (#2083)

* fix: Added make command for running leaderboard locally

* fix: Ensure voyage models don't re-use the name

* 1.34.17

Automatically generated by python-semantic-release

* fix: Freeze model/rank columns in leaderboard (#2044)

* fix: freeze model/rank columns in leaderboard

* freezing zero-shot column

* update min gradio version to 5.16.0 in pyproject.toml

---------

Co-authored-by: Shikhar Shiromani <[email protected]>

* 1.34.18

Automatically generated by python-semantic-release

* fix: Fixed previous incorrect specification of splits for CMTEB ( MTEB(cmn, v1) ) (#2086)

Fixes #2064

* 1.34.19

Automatically generated by python-semantic-release

* Remove duplicated string in docstring of TaskMetadata class (#2087)

* Remove duplicated string in docstring of TaskMetadata class

* Remove duplicated dataset field

* fix: Smarter leaderboard caching with cachetools (#2085)

* Added smarter caching to callbacks

* Added cachetools as a dependency

* Ran linting

* Removed debugging print statement

* Bumped Gradio version

* Dependency fixes

* Dependency fixes

---------

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* fix: Missing fixes for #2086 - change MultilingualSentiment split from test to validation in CMTEB (#2088)

* fix: Fixed previous incorrect specification of splits for CMTEB ( MTEB(cmn, v1) )

Fixes #2064

* change MultilingualSentiment split from test to validation in CMTEB

* 1.34.20

Automatically generated by python-semantic-release

* merge gme models (#2089)

* fix: Add back task filtering by modalities (#2080)

* add back task filtering by modalities

* add unit test

* check if task modalities is a subset of model modalities and fix tests

* add model_modalities_more_than_task_modalities case
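
The subset check named in the modality-filtering fix above can be sketched as follows. This is a minimal illustration rather than the mteb implementation: the helper name `task_is_supported` and the modality strings are hypothetical.

```python
from typing import Iterable


def task_is_supported(
    task_modalities: Iterable[str], model_modalities: Iterable[str]
) -> bool:
    """Return True when every modality the task needs is offered by the model."""
    # Hypothetical helper: the model may support more modalities than the task uses.
    return set(task_modalities) <= set(model_modalities)


# A text-only task fits a text+image model, but not the other way around.
print(task_is_supported(["text"], ["text", "image"]))
print(task_is_supported(["text", "image"], ["text"]))
```

Using a subset test rather than equality covers the "model modalities more than task modalities" case mentioned in the last bullet.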

* 1.34.21

Automatically generated by python-semantic-release

* Added gtr-t5-base/large/xl/xxl metadata to mteb (#2092)

* Added GTR Models to codebase

* Linted gtr models file.

* Added gtr-base/large/xl/xxl to sentence_transformers_models.py

* Added memory_usage_mb and training_datasets

* Reformatted training dataset names

* Reformatted training dataset names

* Reformatted training dataset names

---------

Co-authored-by: sufen <[email protected]>

* misc: Add Any2TextMutipleChoice Descriptive Statistics (#2095)

* add Any2TextMutipleChoiceDescriptiveStatistics

* run on all tasks

* Update tasks table

* fix: Updated model annotations for GTE, e5, gritlm, and SFR models (#2101)

Reported with references to paper + quotes.

* fix: Update links (#2098)

* Fix link

* Fix link

* 1.34.22

Automatically generated by python-semantic-release

* Add model inf-retriever-v1-1.5b (#2106)

Add inf-retriever-v1-1.5b model

* docs: Fix typos & refine text (#2102)

* Update app.py

* Fix typos

* misc: Run Zeroshot Classification Descriptive Stats (#2105)

* add most datasets

* add birdsnap and imgnet1k

* add scimmir and sun397

* add ucf101 zs

* Update tasks table

* fix: add warning about task category conversion (#2108)

add warning about task category conversion

* 1.34.23

Automatically generated by python-semantic-release

* fix: Add codesage-large-v2 (#2090)

* Add codesage-large-v2

* Address comments

* Add training dataset

* Fix issues

* Format code

* Remove unnecessary wrapper

* 1.34.24

Automatically generated by python-semantic-release

* fix: add training data to BGE-m3-custom-fr (#2110)

This ensures that it is correctly filtered as non-zero-shot

* 1.34.25

Automatically generated by python-semantic-release

* fix: Upgrade ruff to be gradio compatible (#2111)

* fix: update ruff to be gradio compatible (>=0.9.3)

* format

* fix: upgrade ruff to latest (same as gradio compatible)

* 1.34.26

Automatically generated by python-semantic-release

* docs: Follow google docstring format (#2115)

Fixes #2113

* Update leaderboard_refresh.yaml (#2121)

* fix InstructSentenceTransformer Model name (#2125)

fix params

* fix voyage (#2127)

* fix: update e5 instruct training data (#2129)

update e5 training data

* 1.34.27

Automatically generated by python-semantic-release

* format

* Update tasks table

* fix: Add 2 new Static Sentence Transformer models (#2112)

* Add 2 new Static Sentence Transformer models

* Add Tatoeba

Co-authored-by: Roman Solomatin <[email protected]>

---------

Co-authored-by: Roman Solomatin <[email protected]>

* 1.34.28

Automatically generated by python-semantic-release

* add is_cross_encoder (#1869)

* add is_cross_encoder

* Update mteb/model_meta.py

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* change value

---------

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* Qodo embed 1 1.5 b (#2137)

* feat: Add Qodo-Embed-1-1.5B model metadata

* fix: Add Qodo models to overview imports

* fix: Add adapted_from field to Qodo model metadata

* Update mteb/models/qodo_models.py

Co-authored-by: Roman Solomatin <[email protected]>

* relint

---------

Co-authored-by: Tal Sheffer <[email protected]>
Co-authored-by: Roman Solomatin <[email protected]>

* misc: merge summary retrieval into bitext mining (#2140)

merge summary retrieval into bitext mining

* test: fix dataset availability test (#2141)

This simplifies the test and removes about 100 test cases that all made the same API call.

* fix: Update NVIDIA-Embed training data (#2143)

Added a few missing annotations for nvidia-embed

* 1.34.29

Automatically generated by python-semantic-release

* fix: Add annotations for Voyage exp (#2144)

* fix: Update NVIDIA-Embed training data

Added a few missing annotations for nvidia-embed

* fix: update annotations for voyage exp

* 1.34.30

Automatically generated by python-semantic-release

* Fix tokens num in cde models (#2148)

fix tokens

* feat: Add Qodo-Embed-1-7B model metadata and rename existing model (#2146)

* feat: Add Qodo-Embed-1-7B model metadata and rename existing model

* lint

* fix revision

* update license name

---------

Co-authored-by: Tal Sheffer <[email protected]>

* 1.35.0

Automatically generated by python-semantic-release

* misc: add Any2AnyRetrievalDescriptiveStatistics (#2139)

add Any2AnyRetrievalDescriptiveStatistics

* Update tasks table

* Added zero-shot percentages and different filtering scheme (#2153)

* Added zero-shot percentages and different filtering scheme

* Update mteb/model_meta.py

Co-authored-by: Roman Solomatin <[email protected]>

---------

Co-authored-by: Roman Solomatin <[email protected]>

* fix: Incorrect annotations for Mistral-based embedding models (#2157)

Fixes #2155

* 1.35.1

Automatically generated by python-semantic-release

* Update FaMTEBRetrieval.py (#2171)

The URL pointed to the settings page instead of the main repo URL. Now it is fixed.

* Update tasks table

* fix: Add Training data annotations (#2173)

* redo to voyage to only training data

* Add training data annotation for Kalm embeddings #2168

* Add correct training data annotations to Stella #2164

* removed fiqa PL as it does not exist

* remove ArxivClusteringS2S.v2 as it does not exist

* Add training data annotation for GIST embedding #2166

* fix max tokens for kalm models #2162

* remove eli 5

* 1.35.2

Automatically generated by python-semantic-release

* feat: Add MIEB and MIEB-lite as benchmarks (#2035)

* add mieb and mieb-lite to benchmarks

* add CompositionalityEvaluation and DocumentUnderstanding types

* add VisionCentric type

* add missing comma

* split STS17MultilingualVisualSTS and STSBenchmarkMultilingualSTS to eng and non-eng

* use aggregate task instead so we can name the subsets

* shorten names

* fix import

* alternative strategy to avoid using get_task

* follow other aggregate tasks and skip metadata test

* run LB without errors when selecting MIEB(-lite)

* add back the capability as task type

* typo

* extend description

* split into mieb(eng) and mieb(multilingual)

* remove unneeded files

* remove aggtask additions for test

* edit descriptions based on screenshots

* shorten

* rename to Compositionality and include ImageCoDeT2IMultiChoice

* re-tag missing VisionCentric tasks

* re-tag rparis and roxford as retrieval and include fixes

* re-tag voc2007 as image cls

* make lint

* correct num task types in descriptions

* add one model to models_to_annotate

* add mieb reference models

* update task types

* relabel to multilingual retrieval task type to align with paper

* fix reference and bibtex

* edit task list to match with final list

* add back agg task to reproduce table column in paper

* fix filtering and import

* update tests

* mieb lite add back missing tasks

* fix metadata test

* multi should have all 4 variants

* fix task counts

* lite has 10 task types

* fix visualSTS-17 lang splits

* Aggregate task can now use subsets & eval langs to filter TaskResults

* fix test and mark VisualSTS17 as multilingual

* fix tests

* add agg task running script

* add voyage meta

* fix citations

* capitalize

* add coarse/fine labels

---------

Co-authored-by: gowitheflow-1998 <[email protected]>

* Update tasks table

* 1.36.0

Automatically generated by python-semantic-release

* fix: update training datasets and revision for jina models (#2179)

* feat: update training datasets and revision for jina models

* feat: update training datasets and revision for jina models

* fix: Add more training data annotations (#2178)

* redo to voyage to only training data

* Add training data annotation for Kalm embeddings #2168

* Add correct training data annotations to Stella #2164

* removed fiqa PL as it does not exist

* remove ArxivClusteringS2S.v2 as it does not exist

* Add training data annotation for GIST embedding #2166

* fix max tokens for kalm models #2162

* remove eli 5

* fix: add training data for Bilingual Embeddings

fixes #2167

* 1.36.1

Automatically generated by python-semantic-release

* Added training data annotation for e5-base-4k (#2186)

* fix: Added training data annotations to MXBAI (#2185)

* fix: Update MTEB(Scandinavian) to use new DanFEVER (#2180)

This also resolves the missing data in the leaderboard.

Fixes #2172

* fix: Added training data annotation for MMLW models (#2188)

* Added training data annotation for MMLW models

* Added GIST annotations Kenneth missed

* Added Stella en 400m training data

* 1.36.2

Automatically generated by python-semantic-release

* fix: Added training data for sentence-croissant (#2189)

* 1.36.3

Automatically generated by python-semantic-release

* fix: update ru models annotation (#2181)

* 1.36.4

Automatically generated by python-semantic-release

* fix: Alphabetical ordering of tasks in dropdowns (#2191)

* 1.36.5

Automatically generated by python-semantic-release

* misc: Speed up qrel creation in any2anyretrieval (#2196)

* use numpy vectorized operations instead of row-by-row

* scores are int

* use 'mteb.MTEB' instead of 'MTEB' for custom model (#2199)

* add base models for e5 (#2183)

* add similar datasets (#2205)

* add similar datasets

* add nano

* update is filled

* Update mteb/abstasks/TaskMetadata.py

Co-authored-by: Kenneth Enevoldsen <[email protected]>

---------

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* add labse annotation (#2182)

* add labse annotation

* Update mteb/models/sentence_transformers_models.py

Co-authored-by: Kenneth Enevoldsen <[email protected]>

---------

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* fix: Fixed leaderboard crash (#2221)

* Fixed leaderboard crash

* Fixed language selection error

* Ran linting

* 1.36.6

Automatically generated by python-semantic-release

* fix: More training data annotations (#2220)

* Added training data annotation for bge-gemma

* Added missing annotations for Voyage models

* Added training data for sts-multilingual-mpnet

* Added all mteb datasets to STS-multilingual training data

* 1.36.7

Automatically generated by python-semantic-release

* Add LLM2CLIP (OpenAI variants) (#2222)

* model loading and get_text_embeddings

* add image_emb, fused_emb, and calc probs methods

* add b16 model

* add llm2clip_openai_l_14_224 (not working yet)

* got llm2clip_openai_l_14_224 working

* make lint

* add training sets and allow py files

* Change `dataset on HF` test to use official api (#2213)

* refactor dataset checking

* increase timeout

* increase timeout

* remove timeout

* Descriptive stats functions for Any2AnyMC and ImageTextPC (#2197)

* Add Any2AnyMC descriptive stats

* Add descriptive stats function for ImageTextPC

* add descriptive stats examples

* linter

* update multi choice descriptive stats

* Update tasks table

* fix: Add training data annotations to udever-bloom models (#2210)

* fix: Add training data annotations to udever-bloom models

fixes #2193

* fix: add mixedbread

---------

Co-authored-by: Márton Kardos <[email protected]>

* 1.36.8

Automatically generated by python-semantic-release

* Add comment to `voyage-3-m-exp` model (#2229)

* remove model size from voyage-3-m-exp model

* Update mteb/models/voyage_models.py

* Update mteb/models/voyage_models.py

* docs: Update description of EURLex (#2231)

* Automatically add similar tasks to training_tasks (#2228)

* refactor dataset checking

* increase timeout

* increase timeout

* remove timeout

* start

* automatically find datasets

* update comment

* fix aggregate task metadata

* fixes

* lint

* rename

* update fetch check

* Remove overlapping legends from radar chart (#2195)

* Remove overlapping legends from radar chart

* ensure graph is not blocked

* Overlapping legend issue of Radar Chart

* misc: Run Any2AnyRetrieval descriptive stats (#2223)

* run a few datasets

* add a few more

* run more tasks

* add more datasets

* remove pdb

* remove newline

* add more datasets

* Update tasks table

* misc: Add rest of the vision centric and compositionality descriptive stats (#2267)

add the rest

* Update tasks table

* Fix `calculate_memory_usage_mb` in adding_a_model.md (#2271)

* Add Arabic-Triplet-Matryoshka-V2 model metadata to MTEB (#2270)

* Add Arabic-Triplet-Matryoshka-V2 model metadata to MTEB

* Update memory_usage_mb with correct calculated value

* Update mteb/models/Arabic_Triplet_Matryoshka_V2.py

Co-authored-by: Roman Solomatin <[email protected]>

* Update mteb/models/Arabic_Triplet_Matryoshka_V2.py

Co-authored-by: Roman Solomatin <[email protected]>

* remove comments

* added correct memory usage

* Update mteb/models/Arabic_Triplet_Matryoshka_V2.py

Co-authored-by: Roman Solomatin <[email protected]>

* Apply linter fixes with ruff

* Update mteb/models/Arabic_Triplet_Matryoshka_V2.py

Co-authored-by: Roman Solomatin <[email protected]>

* Update mteb/models/Arabic_Triplet_Matryoshka_V2.py

Co-authored-by: Roman Solomatin <[email protected]>

* Add Arabic_Triplet_Matryoshka_V2 to overview.py

* Rename model file to ara_models.py and update imports

---------

Co-authored-by: Roman Solomatin <[email protected]>

* fix: Add WebFAQ Retrieval dataset (#2236)

* Add WebFAQ Retrieval dataset

Signed-off-by: Michael Dinzinger <[email protected]>

* Small change WebFAQRetrieval.py

Signed-off-by: Michael Dinzinger <[email protected]>

* Add remaining languages to WebFAQ Retrieval task

Signed-off-by: Michael Dinzinger <[email protected]>

* Add descriptive stats

Signed-off-by: Michael Dinzinger <[email protected]>

---------

Signed-off-by: Michael Dinzinger <[email protected]>

* Update tasks table

* 1.36.9

Automatically generated by python-semantic-release

* fix: Formatting issue in Performance Plot (#2237)

* Formatting issue in Performance Plot

* make lint

* added function for better code readability

* 1.36.10

Automatically generated by python-semantic-release

* ci: run test_dataset_on_hf separately (#2201)

* don't run test_dataset_on_hf in every pr

* lint

* Update call pytest test_datasets

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* Update tests/test_tasks/test_all_abstasks.py

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* not datasets for test

* run dataset loading test for push or pull_request

* apply feedback

---------

Co-authored-by: sam021313 <[email protected]>
Co-authored-by: Kenneth Enevoldsen <[email protected]>

* add gemini-embedding-exp-03-07 (#2279)

* add gemini-embedding-exp-03-07

* remove space for lint

* lint fix

* update link (#2281)

* fix: Run remaining MIEB desc stats (#2288)

* run Vidore

* GLDv2

* run the rest

---------

Co-authored-by: Isaac Chung <[email protected]>

* Update tasks table

* 1.36.11

Automatically generated by python-semantic-release

* fix: Added Filter Modality (#2262)

* Added Filter Modality

* resolve suggestions

* make lint

* make sure tests pass

* make lint

* added exclusive_modality_filter and unit tests

* Integrate tests on overview.py

* Update tests/test_overview.py

Co-authored-by: Roman Solomatin <[email protected]>

* added task related to image modality

* Update mteb/abstasks/AbsTask.py

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* Update mteb/abstasks/AbsTask.py

Co-authored-by: Kenneth Enevoldsen <[email protected]>

* update overview.py

* make lint

* update documentation

---------

Co-authored-by: Roman Solomatin <[email protected]>
Co-authored-by: Kenneth Enevoldsen <[email protected]>

* 1.36.12

Automatically generated by python-semantic-release

* fix: Add `ModelMeta` license & custom validations (#2293)

* license validation

* move licenses

* update imports

---------

Co-authored-by: Isaac Chung <[email protected]>

* 1.36.13

Automatically generated by python-semantic-release

* ci: Add pre-commit hook (#2194)

* make dev life nicer - pre-commit hooks

* add pre-commit to install

* update precommit

* update ruff pre-commit

* lint

* lint

---------

Co-authored-by: sam021313 <[email protected]>

* Update tasks table

* fix: bug in voyage implementation (#2304)

* fix: Fix bug in voyage implementation

"passage" is not a valid input for the voyage API. Remapped to "document".

* Update mteb/models/voyage_models.py

Co-authored-by: Roman Solomatin <[email protected]>

---------

Co-authored-by: Roman Solomatin <[email protected]>
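
The remapping described in the voyage fix above (#2304) could look roughly like the sketch below. Only the `"passage"` to `"document"` mapping comes from the commit message; the function name `to_voyage_input_type`, the mapping table, and the pass-through of `"query"` are assumptions made for illustration and are not taken from `mteb/models/voyage_models.py`.

```python
from typing import Optional

# Hypothetical lookup table. The commit message states only that "passage" is
# rejected by the API and was remapped to "document"; passing "query" through
# unchanged is an assumption.
_INPUT_TYPE_MAP = {"query": "query", "passage": "document"}


def to_voyage_input_type(prompt_type: Optional[str]) -> Optional[str]:
    """Translate an internal prompt type into an API-side input type."""
    if prompt_type is None:
        return None  # no prompt type given, so send no input type
    if prompt_type not in _INPUT_TYPE_MAP:
        raise ValueError(f"Unsupported prompt type: {prompt_type!r}")
    return _INPUT_TYPE_MAP[prompt_type]


print(to_voyage_input_type("passage"))
```

Failing loudly on an unknown prompt type keeps an invalid value from reaching the remote API, which is the failure mode this fix addresses.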

* 1.36.14

Automatically generated by python-semantic-release

* fix: Update voyage name to include Org. (#2322)

* 1.36.15

Automatically generated by python-semantic-release

* Added VDR Model (#2290)

* Added VDR Model

* change custom wrapper to SentenceTransformer Wrapper

* remove kwargs and add TODO for Image Modality

* Update mteb/models/vdr_models.py

Co-authored-by: Roman Solomatin <[email protected]>

---------

Co-authored-by: Roman Solomatin <[email protected]>

* fix: Resolve conflicting dependencies (#2323)

These errors were discovered when trying to install the package using `uv`.

We have a problem with salesforce-lavis, which is not compatible with the current set of dependencies.

* 1.36.16

Automatically generated by python-semantic-release

* fix: remove SyntaxWarnings in py312 (#2325)

* fix: Resolve conflicting dependencies

These errors were discovered when trying to install the package using `uv`.

We have a problem with salesforce-lavis, which is not compatible with the current set of dependencies.

* fix: Remove syntax warnings occurring in python 3.12

```
Python 3.12.0 (main, Oct  2 2023, 20:56:14) [Clang 16.0.3 ] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import mteb # no syntax warnings
>>>
```
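
The commit message does not say what triggered the warnings. One common cause, assumed here for illustration, is an invalid escape sequence such as `"\d"` in a non-raw string literal, which Python 3.12 reports as a `SyntaxWarning` (earlier versions used `DeprecationWarning`). The helper name `count_escape_warnings` is hypothetical.

```python
import warnings


def count_escape_warnings(source: str) -> int:
    """Compile `source` and count the compiler warnings it produces."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")  # make sure nothing is suppressed
        compile(source, "<example>", "exec")
    # SyntaxWarning on Python 3.12+, DeprecationWarning on older versions.
    return sum(
        issubclass(w.category, (SyntaxWarning, DeprecationWarning)) for w in caught
    )


bad = 'pattern = "\\d+"'    # source text: pattern = "\d+"   (invalid escape)
good = 'pattern = r"\\d+"'  # source text: pattern = r"\d+"  (raw string)

print(count_escape_warnings(bad))
print(count_escape_warnings(good))
```

If this is the cause, switching the affected literals to raw strings (or doubling the backslash) removes the warning without changing the string's value.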

* 1.36.17

Automatically generated by python-semantic-release

* fix: add annotation models for stella zh (#2277)

* fix: add annotation models for stella zh

Additionally fixed a few annotation errors

* format

* Update mteb/models/stella_models.py

Co-authored-by: Isaac Chung <[email protected]>

---------

Co-authored-by: Isaac Chung <[email protected]>

* 1.36.18

Automatically generated by python-semantic-release

* fix: Add ModelMeta rubert-mini-frida, BERTA (#2330)

* Add rubert-mini-frida model meta

* Add BERTA model meta

* docs: fix typos

* 1.36.19

Automatically generated by python-semantic-release

* fix: Add WebFAQ bitext mining tasks (#2326)

* Add WebFAQ bitext mining tasks

Signed-off-by: Michael Dinzinger <[email protected]>

* Lower number of language pairs in WebFAQBitextMining

Signed-off-by: Michael Dinzinger <[email protected]>

---------

Signed-off-by: Michael Dinzinger <[email protected]>

* Update tasks table

* 1.36.20

Automatically generated by python-semantic-release

* make lint

* fix validation for license

* fix remaining validation errors

---------

Signed-off-by: Michael Dinzinger <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Kenneth Enevoldsen <[email protected]>
Co-authored-by: github-actions <[email protected]>
Co-authored-by: Roman Solomatin <[email protected]>
Co-authored-by: Mina Parham <[email protected]>
Co-authored-by: Mina Parham <[email protected]>
Co-authored-by: Mehrzad Shahin-Moghadam <[email protected]>
Co-authored-by: Roman Solomatin <[email protected]>
Co-authored-by: Sam <[email protected]>
Co-authored-by: sam021313 <[email protected]>
Co-authored-by: Shikhar Shiromani <[email protected]>
Co-authored-by: Shikhar Shiromani <[email protected]>
Co-authored-by: Ruslan Bel'kov <[email protected]>
Co-authored-by: Márton Kardos <[email protected]>
Co-authored-by: sufen-f <[email protected]>
Co-authored-by: sufen <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]>
Co-authored-by: Samuel Yang <[email protected]>
Co-authored-by: Aradhye Agarwal <[email protected]>
Co-authored-by: Tom Aarsen <[email protected]>
Co-authored-by: talshef <[email protected]>
Co-authored-by: Tal Sheffer <[email protected]>
Co-authored-by: garciasces <[email protected]>
Co-authored-by: gowitheflow-1998 <[email protected]>
Co-authored-by: Wang Bo <[email protected]>
Co-authored-by: Munot Ayush Sunil <[email protected]>
Co-authored-by: Yaya Sy <[email protected]>
Co-authored-by: Imene Kerboua <[email protected]>
Co-authored-by: Eng. Omar Najar <[email protected]>
Co-authored-by: Michael Dinzinger <[email protected]>
Co-authored-by: Jinhyuk Lee <[email protected]>
Co-authored-by: Isaac Chung <[email protected]>
Co-authored-by: sergeyz-zh <[email protected]>