I set the attention mask to None, but the model doesn't learn well. My inputs are not padded, so I don't think I need an attention mask. How can I resolve this? I also tried passing an all-ones attention mask, but the results were also very poor.