AgentSynth¶
Synthetic, verified multi-step agent trajectories for fine-tuning agentic LLMs — tool-use, grounded code execution, and multi-agent collaboration — with a built-in LLM-as-Judge evaluation loop. It runs offline for free and scales up with any model.
The thesis in one line: the value isn't generating agent trajectories, it's generating ones you can trust. So verification is core, not a side feature.
Install¶
pip install agentsynth-ai # core: generate + evaluate + export
pip install "agentsynth-ai[app]" # + the Gradio UI
pip install "agentsynth-ai[all]" # everything
60-second tour¶
from agentsynth import AgentTrajectoryGenerator, TrajectoryEvaluator, verify_trajectory
gen = AgentTrajectoryGenerator() # offline mock by default
traj = gen.generate("What's the weather in Paris, and 18% tip on $54?")
result = TrajectoryEvaluator().evaluate(traj) # 6-dimension rubric
print(result.overall, result.passed)
print(verify_trajectory(traj).verified) # re-checks tool args, execution, safety
Where to go next¶
- Vision — the problem, the bet, and the principles.
- Architecture — how the pieces fit together.
- Fine-tune & benchmark — turn the data into a model and prove it helps.
- API reference — the public surface.
The live demo runs on Hugging Face Spaces, and the code is on GitHub.