BERT(Bidirectional Encoder Representations from Transformers) is a natural language processing (NLP) model developed by Google. It's a neural network that analyzes text by considering the context of words in a sentence, rather than just looking at them one by one. BERT is used to improve the accuracy of search engines, language translation, and conversational AI.
THIS Project Also has multiple Visualizations that helps in understanding the Data Quality Analysis using BERT.
CLASS DISTRIBUTION BEFORE UNDERSAMPLING :-
CLASS DISTRIBUTION AFTER UNDERSAMPLING :-
ANOMALY SCORE DETECTION :-
WORD COUNT DISTRIBUTION :-
CHARACTER COUNT DISTRIBUTION :-
CONFUSION MATRIX :-
PCA VISULAIZATIONS OF BERT EMBEDDINGS :-
FEATURE IMPORTANCE(RANDOM FOREST) :-
ROC CURVE FOR ANOMALY DETECTION :-
FIRST DOWNLOAD THE CODE. CREATE THE FOLDER WITH ANY NAME U WANT AND IN THAT FOLDER PASTE THE PY AND DATASET FILE FROM CODE FILE WHICH U HAVE DOWNLOADED. THEN OPEN VS CODE AND OPEN THE FOLDER U CREATED AND OPEN THE TERMINAL. PASTE THE TEXT FROM THE REQUIREMENT TXT FILE FROM THE DOWNLOADED CODE FILE. WAIT UNTIL IT INSTALLS THE REQUIREMENTS
python bert_anomaly_detection.py








