Agent Foundry

Agent Problems

Practice building AI agents with hands-on challenges.

103 problems23 Easy47 Medium33 Hard· 0 solved
1Fix the Hallucinating AgentPrompt DesignEasy2Right Tool for the JobTool CallingEasy3Handle the API FailureError RecoveryTool CallingMed.4Structured Output EnforcerPrompt DesignEasy5Chain-of-Thought ActivatorPrompt DesignEasy6Few-Shot Tool SelectorPrompt DesignTool CallingEasy7Role Persona EnforcerPrompt DesignEasy8Build a Weather ToolTool CallingEasy9Build a Database Query ToolTool CallingEasy10Ambiguity ResolverPrompt DesignMed.11Multi-Language AgentPrompt DesignMed.12Tool Parameter ValidatorTool CallingGuardrailsMed.13Conversation Boundary SetterPrompt DesignGuardrailsMed.14Dynamic Prompt AssemblyPrompt DesignOrchestrationMed.15Instruction HierarchyPrompt DesignHard16Self-Correction LoopPrompt DesignEvaluationHard17Prompt CompressionPrompt DesignCost OptimizationHard18API Wrapper ToolTool CallingEasy19Tool with ConfirmationTool CallingGuardrailsEasy20Multi-Step Tool ChainTool CallingOrchestrationMed.21Parallel Tool ExecutionTool CallingOrchestrationMed.22Tool Fallback StrategyTool CallingError RecoveryMed.23Dynamic Tool RegistryTool CallingMed.24Tool Output ParserTool CallingMed.25Nested Tool CallsTool CallingOrchestrationMed.26Tool Rate LimiterTool CallingCost OptimizationHard27Composable Tool PipelineTool CallingOrchestrationHard28Tool Schema EvolutionTool CallingHard29Async Tool with PollingTool CallingOrchestrationHard30Tool with Structured InputTool CallingMed.31Multi-Tool Conflict ResolutionTool CallingHard32Add Conversation HistoryMemoryEasy33Context Window TruncationMemoryCost OptimizationEasy34Remember User PreferencesMemoryMed.35Summarization MemoryMemoryCost OptimizationMed.36Selective Memory RecallMemoryMed.37Memory Conflict ResolutionMemoryMed.38Entity Extraction MemoryMemoryMed.39Scoped MemoryMemoryOrchestrationMed.40Time-Aware MemoryMemoryHard41Memory Budget ManagerMemoryCost OptimizationHard42Cross-Session PersistenceMemoryHard43Multi-Agent Shared MemoryMemoryMulti-AgentHard44Conversation Fork HandlingMemoryOrchestrationHard45Memory-Augmented Tool UseMemoryTool CallingMed.46Basic Document QARAGEasy47Chunk and EmbedRAGEasy48Source CitationRAGMed.49No Match DetectorRAGMed.50Multi-Document SynthesisRAGMed.51Metadata FilterRAGTool CallingMed.52Conversational RAGRAGMemoryMed.53Hybrid SearchRAGMed.54RAG with TablesRAGHard55RAG Query RewritingRAGPrompt DesignMed.56Agentic RAGRAGOrchestrationHard57Multi-Index RAGRAGHard58RAG Evaluation PipelineRAGEvaluationHard59Knowledge Conflict ResolutionRAGPrompt DesignHard60Graceful Timeout HandlerError RecoveryEasy61Input SanitizerGuardrailsEasy62PII Detector and RedactorGuardrailsEasy63Output Length LimiterGuardrailsCost OptimizationEasy64Retry with Exponential BackoffError RecoveryMed.65Prompt Injection DetectorGuardrailsPrompt DesignMed.66Topic Restriction GuardGuardrailsMed.67Output Schema ValidatorGuardrailsMed.68Sensitive Action GatekeeperGuardrailsTool CallingMed.69Circuit Breaker PatternError RecoveryOrchestrationMed.70Partial Failure HandlerError RecoveryMed.71Infinite Loop DetectorError RecoveryOrchestrationHard72Multi-Layer DefenseGuardrailsHard73Adversarial Prompt ResistanceGuardrailsPrompt DesignHard74Cascading Failure IsolationError RecoveryMulti-AgentHard75Two-Agent RouterMulti-AgentEasy76Agent as a ToolMulti-AgentTool CallingEasy77Sequential Agent PipelineMulti-AgentOrchestrationEasy78Researcher-Writer CrewMulti-AgentMed.79Critic-Generator LoopMulti-AgentEvaluationMed.80Hierarchical DelegationMulti-AgentOrchestrationMed.81Agent SpecializationMulti-AgentMed.82Structured Inter-Agent MessagesMulti-AgentMed.83Agent Handoff with ContextMulti-AgentMemoryMed.84Competitive Agent SelectionMulti-AgentEvaluationHard85Dynamic Agent SpawningMulti-AgentOrchestrationHard86Agent Consensus ProtocolMulti-AgentHard87Cross-Framework PipelineMulti-AgentOrchestrationHard88Multi-Agent DebuggingMulti-AgentEvaluationHard89Sequential Workflow BuilderOrchestrationEasy90Conditional Branch RouterOrchestrationEasy91Human-in-the-Loop ApprovalOrchestrationMed.92Parallel Branch and MergeOrchestrationMed.93Map-Reduce over DocumentsOrchestrationRAGMed.94Streaming Intermediate ResultsOrchestrationMed.95Model Routing by ComplexityCost OptimizationMed.96Token Budget ManagerCost OptimizationMed.97Caching LayerCost OptimizationMed.98Workflow CheckpointingOrchestrationError RecoveryHard99Dynamic Workflow AssemblyOrchestrationHard100Event-Driven AgentOrchestrationHard101Agent Evaluation HarnessEvaluationHard102A/B Test Two AgentsEvaluationCost OptimizationHard103Production ObservabilityEvaluationOrchestrationHard