feat: Add chunking function for sequence tagger training on sentences exceeding token limit · flairNLP/flair@b523769

Triggered via pull request August 8, 2024 23:50

MattGPT-ai

synchronize #3520

MattGPT-ai:GH-3519/add-sentence-chunking-method

Status Failure

Total duration 26m 23s

Artifacts –

ci.yml

on: pull_request

test

26m 12s

Annotations

10 errors and 1 warning

test: tests/embedding_test_utils.py#L44

TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-Hybrid mesons , qq \u0304 states with an admixture] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embedding_test_utils.py#L44

TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-typical proportionalities of \u223c 1nmV \u2212 1 [ 3,4 ] .] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embedding_test_utils.py#L44

TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-\U0001f91f \U0001f91f \U0001f91f h\xfcllo] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embedding_test_utils.py#L44

TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-\U0001f91fhallo \U0001f91f \U0001f91f \U0001f91f \U0001f91f] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embedding_test_utils.py#L44

TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-\U0001f91f] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embedding_test_utils.py#L44

TestTransformerDocumentEmbeddings.test_embedding_works_with_weird_text[False-\uf8f9] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embedding_test_utils.py#L86

TestTransformerDocumentEmbeddings.test_keep_batch_order[False] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embedding_test_utils.py#L158

TestTransformerDocumentEmbeddings.test_default_embeddings_stay_the_same_after_saving_and_loading[False] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embedding_test_utils.py#L184

TestTransformerDocumentEmbeddings.test_embeddings_load_in_eval_mode[False] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884

test: tests/embeddings/test_transformer_document_embeddings.py#L25

test_if_loaded_embeddings_have_all_attributes[False] FutureWarning: `clean_up_tokenization_spaces` was not set. It will be set to `True` by default. This behavior will be depracted in transformers v4.45, and will be then set to `False` by default. For more details check this issue: https://github.com/huggingface/transformers/issues/31884
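
All ten error annotations above carry the same `FutureWarning` text, which suggests the test run promotes warnings to errors. The run page does not show the test configuration, so that is an assumption. The sketch below reproduces the mechanism with only the standard `warnings` module: a warning raised under an "error" filter fails the call, and a targeted "ignore" filter for that message lets it pass. The functions `decode_without_explicit_setting` and `run_with_filters` are hypothetical stand-ins, not part of flair or transformers.

```python
import warnings
from typing import Optional


def decode_without_explicit_setting() -> str:
    """Hypothetical stand-in for a library call that emits the warning seen above."""
    warnings.warn(
        "`clean_up_tokenization_spaces` was not set. "
        "It will be set to `True` by default.",
        FutureWarning,
        stacklevel=2,
    )
    return "decoded text"


def run_with_filters(ignore_pattern: Optional[str] = None) -> str:
    """Call the function with warnings promoted to errors.

    If ignore_pattern is given, FutureWarnings whose message matches that
    regex (anchored at the start of the message) are ignored instead.
    """
    with warnings.catch_warnings():
        # Assumption: comparable to a test suite that treats warnings as errors.
        warnings.simplefilter("error")
        if ignore_pattern is not None:
            # Filters added later are checked first, so this overrides "error".
            warnings.filterwarnings(
                "ignore", message=ignore_pattern, category=FutureWarning
            )
        try:
            return decode_without_explicit_setting()
        except FutureWarning as exc:
            return f"failed: {type(exc).__name__}"


if __name__ == "__main__":
    print(run_with_filters())
    print(run_with_filters(r".*clean_up_tokenization_spaces"))
```

If the assumption holds, the same two options apply to the real run: narrow the filter for this one message, or set the tokenizer option explicitly so the warning is never emitted.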

test

The following actions uses Node.js version which is deprecated and will be forced to run on node20: actions/checkout@v3, actions/setup-python@v4, actions/cache@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

feat: Add chunking function for sequence tagger training on sentences exceeding token limit #2348
