Skip to content

feat: Add chunking function for sequence tagger training on sentences exceeding token limit #2348

feat: Add chunking function for sequence tagger training on sentences exceeding token limit

feat: Add chunking function for sequence tagger training on sentences exceeding token limit #2348

Triggered via pull request August 8, 2024 23:50
Status Failure
Total duration 26m 23s
Artifacts

ci.yml

on: pull_request
Fit to window
Zoom out
Zoom in

Annotations

10 errors and 1 warning
test: tests/embedding_test_utils.py#L44
TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-Hybrid mesons , qq \u0304 states with an admixture] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embedding_test_utils.py#L44
TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-typical proportionalities of \u223c 1nmV \u2212 1 [ 3,4 ] .] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embedding_test_utils.py#L44
TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-\U0001f91f \U0001f91f \U0001f91f h\xfcllo] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embedding_test_utils.py#L44
TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-\U0001f91fhallo \U0001f91f \U0001f91f \U0001f91f \U0001f91f] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embedding_test_utils.py#L44
TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-\U0001f91f] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embedding_test_utils.py#L44
TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-\uf8f9] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embedding_test_utils.py#L86
TestTransformerDocumentEmbeddings.test_keep_batch_order[False] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embedding_test_utils.py#L158
TestTransformerDocumentEmbeddings.test_default_embeddings_stay_the_same_after_saving_and_loading[False] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embedding_test_utils.py#L184
TestTransformerDocumentEmbeddings.test_embeddings_load_in_eval_mode[False] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test: tests/embeddings/test_transformer_document_embeddings.py#L25
test_if_loaded_embeddings_have_all_attributes[False] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
test
The following actions uses Node.js version which is deprecated and will be forced to run on node20: actions/checkout@v3, actions/setup-python@v4, actions/cache@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/