AI Agents Create Their Own Training Data
27 Nov
Summary
- New framework allows AI agents to autonomously generate training data.
- AgentEvolver uses LLM reasoning for efficient environmental exploration.
- This innovation lowers costs and speeds up custom AI assistant development.

Alibaba's Tongyi Lab has introduced AgentEvolver, a framework that lets AI agents generate their own training data by exploring application environments. It uses the reasoning capabilities of large language models to drive autonomous learning, avoiding the high cost and manual labor that dataset creation typically requires.
The self-evolving system operates through three core mechanisms: self-questioning to discover functions and generate diverse tasks, self-navigating to enhance exploration efficiency by learning from past experiences, and self-attributing to provide granular feedback on individual actions. This allows agents to continuously refine their capabilities without predefined tasks or reward functions.
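The three mechanisms described above can be illustrated with a toy loop. This is only a minimal sketch under loud assumptions: it is not AgentEvolver's code or API, and it contains no LLM. The environment (TOY_APP), the Task type, and the functions self_question, self_navigate, and self_attribute are all hypothetical names invented for this illustration. Simple probing, a memory lookup, and leave-one-out credit stand in for the LLM-driven versions of each step.

```python
from dataclasses import dataclass

# Hypothetical toy "application": each function maps (state, arg) -> new state.
TOY_APP = {
    "add_item": lambda state, arg: state + [arg],
    "remove_item": lambda state, arg: [x for x in state if x != arg],
    "clear": lambda state, arg: [],
}


@dataclass(frozen=True)
class Task:
    description: str
    start: tuple
    goal: tuple


def run(app, start, steps):
    """Apply a sequence of (function_name, arg) steps to a start state."""
    state = list(start)
    for name, arg in steps:
        state = app[name](state, arg)
    return tuple(state)


def self_question(app, start=("a", "b"), probe_arg="b"):
    """Probe every function once and turn each observed effect into a task."""
    return [
        Task(f"reach the state produced by {name}", start,
             run(app, start, [(name, probe_arg)]))
        for name in app
    ]


def self_navigate(app, task, memory, args=("a", "b")):
    """Try remembered solutions first, then fall back to one-step search."""
    attempts = 0
    candidates = list(memory.get(task.goal, []))
    candidates += [[(name, arg)] for name in app for arg in args]
    for steps in candidates:
        attempts += 1
        if run(app, task.start, steps) == task.goal:
            memory[task.goal] = [steps]  # store experience for later reuse
            return steps, attempts
    return None, attempts


def self_attribute(app, task, steps):
    """Leave-one-out credit: 1.0 if dropping the step breaks the solution."""
    credits = []
    for i in range(len(steps)):
        without = steps[:i] + steps[i + 1:]
        still_solved = run(app, task.start, without) == task.goal
        credits.append(0.0 if still_solved else 1.0)
    return credits


memory = {}
tasks = self_question(TOY_APP)
first_pass = [self_navigate(TOY_APP, t, memory)[1] for t in tasks]
second_pass = [self_navigate(TOY_APP, t, memory)[1] for t in tasks]
print(first_pass, second_pass)
```

In this sketch the second pass needs fewer attempts per task because each solution is recalled from memory, which loosely mirrors the idea of exploring more efficiently by learning from past experience. The per-step credits show how feedback can be attached to individual actions rather than to a whole trajectory.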
Experiments on benchmarks such as AppWorld showed performance gains in both reasoning and task execution. For enterprises, AgentEvolver lowers the barrier to building bespoke AI agents and custom AI assistants, supporting more scalable, cost-effective, and continually improving systems.