This content originally appeared on DEV Community and was authored by Izeno
I built a tool because finding datasets for NLP or tabular tasks is tough. It uses an AI API to generate synthetic data. You define columns with names, types, and prompts, then set the number of rows: 50,000 or more, as many as you need. It’s written in Python with a basic interface. The code is on GitHub: https://github.com/VoxDroid/Zylthra. I needed it for some work, and it does the job. If you try it, let me know what’s off.
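To give a rough idea of the column-definition approach, here is a minimal sketch. It is not Zylthra's actual code or API: `ColumnSpec`, `fake_generate`, `build_dataset`, and `to_csv` are hypothetical names made up for illustration, and the AI API call is replaced by a deterministic placeholder so the example runs offline.

```python
import csv
import io
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class ColumnSpec:
    """Hypothetical column definition: a name, a type label, and a prompt."""
    name: str
    dtype: str   # e.g. "text" or "integer" (illustrative labels only)
    prompt: str  # what a real tool would send to the AI API


def fake_generate(column: ColumnSpec, row_index: int) -> str:
    """Stand-in for an AI API call; returns a deterministic placeholder."""
    if column.dtype == "integer":
        return str(row_index)
    return f"{column.name}_{row_index}"


def build_dataset(
    columns: List[ColumnSpec],
    num_rows: int,
    generate: Callable[[ColumnSpec, int], str] = fake_generate,
) -> List[Dict[str, str]]:
    """Produce num_rows rows, calling the generator once per cell."""
    return [
        {col.name: generate(col, i) for col in columns}
        for i in range(num_rows)
    ]


def to_csv(rows: List[Dict[str, str]], columns: List[ColumnSpec]) -> str:
    """Serialize the rows to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=[c.name for c in columns])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


columns = [
    ColumnSpec("review", "text", "A short product review"),
    ColumnSpec("rating", "integer", "A rating from 1 to 5"),
]
rows = build_dataset(columns, num_rows=3)
print(to_csv(rows, columns))
```

Swapping `fake_generate` for a function that sends `column.prompt` to a real model would be the natural next step; how the actual tool batches or validates those calls is not covered here.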

Izeno | Sciencx (2025-03-31T13:57:43+00:00) A Tool I Built for Synthetic Datasets. Retrieved from https://www.scien.cx/2025/03/31/a-tool-i-built-for-synthetic-datasets/