Shallow baseline models for text in Tensorflow
SparseLM
: Linear model trained over sparse bag-of-words representationLSTM
: Long short-term memory modelLSTMPreTrain
: LSTM with pretrained embeddings
LinearModel
: Linear model over fixed, pretrained embeddingsfastText
: Implementation of fastText in TensorflowfastTextPreTrain
: fastText initialized with pretrained embeddings
TTBB
: Implementation of A Simple but Tough-to-Beat Baseline for Sentence Embeddings in TensorflowTTBBTune
:TTBB
with gradient-tuned embeddings, common component, and smoothing parameterTTBBTuneExact
:TTBB
with gradient-tuned embeddings and smoothing parameter, with common component updated exactly
pip install --requirement python-package-requirements.txt
scripts/get_senna.sh
scripts/get_word_freq.sh
python shalo_test.py
Test data is derived from Pang and Lee (2005).