Agents
1. Concept
Agents in CAMEL are autonomous entities capable of performing specific tasks through interaction with language models and other components. Each agent is designed with a particular role and capability, allowing them to work independently or collaboratively to achieve complex goals.
1.1. Base Agent Architecture
All CAMEL agents inherit from the BaseAgent
abstract class, which defines two core methods:
reset()
: Resets the agent to its initial statestep()
: Performs a single step of the agent’s operation
1.2. Chat Agent
The ChatAgent
is the primary implementation that handles conversations with language models. It supports:
- System message configuration for role definition
- Memory management for conversation history
- Tool/function calling capabilities
- Response formatting and structured outputs
- Multiple model backend support with scheduling strategies
- Async operation support
2. Types
2.1. ChatAgent
The main agent implementation for handling conversations with language models. Features include:
- Tool integration and management
- Memory management with customizable window sizes
- Output language control
- Response termination handling
- Structured output support via Pydantic models
2.2. CriticAgent
Specialized agent for evaluating and critiquing responses or solutions. Used in scenarios requiring quality assessment or validation.
2.3. DeductiveReasonerAgent
Agent focused on logical reasoning and deduction. Breaks down complex problems into smaller, manageable steps.
2.4. EmbodiedAgent
Agent designed for embodied AI scenarios, capable of understanding and responding to physical world contexts.
2.5. KnowledgeGraphAgent
Specialized in building and utilizing knowledge graphs for enhanced reasoning and information management.
2.6. MultiHopGeneratorAgent
Agent designed for handling multi-hop reasoning tasks, generating intermediate steps to reach conclusions.
2.7. SearchAgent
Focused on information retrieval and search tasks across various data sources.
2.8. TaskAgent
Handles task decomposition and management, breaking down complex tasks into manageable subtasks.
3. Usage
3.1. Basic Chat Agent Usage
3.2. Simplified Agent Creation
The ChatAgent
supports multiple ways to specify the model:
3.3. Using Tools with Chat Agent
3.4. Structured Output
4. Best Practices
4.1. Memory Management
- Use appropriate window sizes to manage conversation history
- Consider token limits when dealing with long conversations
- Utilize the memory system for maintaining context
4.2. Tool Integration
- Keep tool functions focused and well-documented
- Handle tool errors gracefully
- Use external tools for operations that should be handled by the user
4.3. Response Handling
- Implement appropriate response terminators for conversation control
- Use structured outputs when specific response formats are needed
- Handle async operations properly when dealing with long-running tasks
4.4. Model Specification
- Use the simplified model specification methods for cleaner code
- For default platform models, just specify the model name as a string
- For specific platforms, use the tuple format (platform, model)
- Use enums for better type safety and IDE support
5. Advanced Features
5.1. Model Scheduling
The agent supports multiple model backends with customizable scheduling strategies:
5.2. Output Language Control
Control the language of agent responses: