Skip to content

vdesai-dev/awesome-synthetic-data

Repository files navigation

Awesome Synthetic Data Awesome lint

A list of tools, papers and datasets on synthetic data generations and use!

Contents

Featured

  • DeepFabric - Create large-scale synthetic training data for model distillation and fine-tuning of LLMs.

Tools

  • CTGAN - Conditional GAN for generating synthetic tabular data.

  • DoppelGANger - Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions

  • synner - Generating Realistic Synthetic Data

  • SDV - Synthetic data generation for tabular data

  • TGAN - Generative adversarial training for generating synthetic tabular data.

  • MirrorDataGenerator - MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations

  • plaitpy - plait.py - a fake data modeler

Contributing

Contributions of any kind welcome, just follow the guidelines!

Contributors

Thanks goes to these contributors!

About

An updates, maintained and curated list of synthetic data tools

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors