- DeepFabric - Create large-scale synthetic training data for model distillation and fine-tuning of LLMs.
-
CTGAN - Conditional GAN for generating synthetic tabular data.
-
DoppelGANger - Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
-
synner - Generating Realistic Synthetic Data
-
SDV - Synthetic data generation for tabular data
-
TGAN - Generative adversarial training for generating synthetic tabular data.
-
MirrorDataGenerator - MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations
-
plaitpy - plait.py - a fake data modeler
Contributions of any kind welcome, just follow the guidelines!