This repo is about sentiment analysis using Enhanced Disentangled-Language-Focused Collaborative Network
This is a robust framework for multimodal sentiment analysis that effectively handles noisy and incongruent data in real-world scenarios.
EDLFCN builds upon the DLF (Disentangled-Language-Focused) architecture with six novel components designed to enhance performance and robustness:
Adaptively weights modalities based on quality and alignment with language:
Computes gating scores: gm = σ(Wg[Entropy(Xm); Sim(Xm, XL)]) Emphasizes cleaner modalities in shared space while preserving unique features
Applies contrastive learning to align modalities:
Alignment loss: Lalign = -log(e^s(Shm,Shn)/τ / ∑e^s(Shm,Shk)/τ) Preserves relationships between different modalities' shared features
Reinforcement learning optimizes attention weights:
Attention weights: α = softmax(QlangK⊤mod) Policy gradient update: ∇J ∝ R · ∇ log π(α|X) Dynamically focuses on the most relevant audio/visual features
Implements hierarchical processing across multiple linguistic levels:
Ffinal = [CNNword(HL); BiGRUphrase(HL); Transformerutterance(HL)]
GAN with language guidance for robust missing data reconstruction:
Generator: Xm_rec = G(Xm_noisy, HL) Discriminator detects real/fake features Loss: LGAN + λ||Xm - Xm_rec||1
Ensures separation between shared and specific subspaces:
Minimizes redundancy across modalities Enforces orthogonality constraints
git clone https://github.com/yourusername/EDLFCN.git
cd EDLFCN
conda create -n edlfcn python=3.8
conda activate edlfcn
pip install -r requirements.txt