Email cleaner #335
from .numbers import (
    annotator_split
)
Shouldn't be in this PR
```python
import re

ATTRIBUTE: str = "headline"  # only text attributes
```
Please change `headline` to `text`.
@@ -0,0 +1 @@
This modules removes certain aspects of the email to focus on the context. The aspects being removed are Sentences starting with "EXTERNAL MAIL", Brackets, that starts with "cid.image", everything after the Disclaimer or until the next Mail in case of response-mails and everything after the signiture in brackets or until the next Mail.
Typos
- This modules removes certain aspects of the email to focus on the context. The aspects being removed are Sentences starting with "EXTERNAL MAIL", Brackets, that starts with "cid.image", everything after the Disclaimer or until the next Mail in case of response-mails and everything after the signiture in brackets or until the next Mail.
+ This module removes certain aspects of the email to focus on the context. The aspects being removed are sentences starting with "EXTERNAL MAIL", brackets, that start with "cid.image", everything after the disclaimer or until the next mail in case of response mail, and everything after the signature in brackets or until the next mail.
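For reviewers who want to check the description against behaviour, the removals it lists could look roughly like the sketch below. This is not the code from this PR: the function names (`clean_single_mail`, `clean_email`) and all four patterns are assumptions made for illustration, "sentences" are approximated as whole lines, and a line starting with `From:` or `Von:` is assumed to mark the next mail in a reply chain. The signature rule is left out because the excerpt does not say how a signature is recognised.

```python
import re

# All patterns below are hypothetical; the PR's real regexes are not shown here.
EXTERNAL_MAIL = re.compile(r"^[ \t]*EXTERNAL MAIL[^\n]*\n?", re.MULTILINE)
CID_IMAGE = re.compile(r"\[cid[.:]image[^\]]*\]")
DISCLAIMER = re.compile(r"^[ \t]*Disclaimer\b", re.MULTILINE | re.IGNORECASE)
# Assumed header line that starts the next (quoted) mail in a thread.
NEXT_MAIL = re.compile(r"^(?:From|Von):\s", re.MULTILINE)


def clean_single_mail(text: str) -> str:
    """Clean one mail of a thread (no quoted follow-up mails inside)."""
    text = EXTERNAL_MAIL.sub("", text)        # drop "EXTERNAL MAIL ..." lines
    text = CID_IMAGE.sub("", text)            # drop "[cid.image...]" brackets
    text = re.sub(r"[ \t]{2,}", " ", text)    # tidy gaps left by removals
    match = DISCLAIMER.search(text)
    if match:                                 # cut from the disclaimer onwards
        text = text[: match.start()]
    return text.strip()


def clean_email(text: str) -> str:
    """Split a thread at next-mail markers and clean each mail separately."""
    starts = [0] + [m.start() for m in NEXT_MAIL.finditer(text) if m.start() > 0]
    starts.append(len(text))
    parts = [text[a:b] for a, b in zip(starts, starts[1:])]
    cleaned = (clean_single_mail(part) for part in parts)
    return "\n\n".join(part for part in cleaned if part)


if __name__ == "__main__":
    sample = (
        "EXTERNAL MAIL: be careful\n"
        "Hi Bob, see [cid.image001.png] attached.\n"
        "Disclaimer: confidential\n"
        "From: Bob\n"
        "Thanks Alice\n"
    )
    print(clean_email(sample))
```

Splitting the thread first means a disclaimer in one mail only truncates that mail, which matches the "or until the next mail" wording in the description.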
PR checklist: