All Insights
December 2025

Sunday Coffee & Code: Claude Code charges ahead!!!

Today's job was to split the existing PoC into a multi-agent system, making the RFP Responder system easier to manage, understand conceptually, architecturally sound and learn about A2A with Microsoft MAF.

By Steve Harris

Todayโ€™s job was to split the existing PoC into a multi-agent system, making the RFP Responder system easier to manage, understand conceptually, architecturally sound and learn about A2A with Microsoft MAF. To tackle this I decided to use Claude Code web (in Research Preview), connect it to a new repo and see how it went. To be honest my expectations have truly been exceeded (Renรฉ Fournier - I now get it!!). After pulling together a prompt to get Claude Code to base itโ€™s work on the public PoC I left it running - it developed a plan, executed and left me with a pull request to review. I read through the code and the readme changes it made and every single thing makes sense (it even created startup and shutdown scripts for the Agents). Next thing to do is to test it - will be testing it in the real world in early January with a real RFP response. I asked it to write a LinkedIn post to explain what it did - hereโ€™s the raw post: --------------- ๐Ÿš€ Just Transformed a Monolithic AI System into a Multi-Agent Architecture I recently refactored an RFP response automation system from a single 708-line Python script into a distributed multi-agent architecture using the Microsoft Agent Framework. Hereโ€™s what that journey looked like: ๐—ง๐—ต๐—ฒ ๐—–๐—ต๐—ฎ๐—น๐—น๐—ฒ๐—ป๐—ด๐—ฒ: The original PoC worked great - it automated RFP responses using Ollama (qwen3:14b), Docling for PDF parsing, and ChromaDB for RAG. But as a monolithic script, it wasnโ€™t production-ready for scaling or distributed deployment. ๐—ง๐—ต๐—ฒ ๐—ฆ๐—ผ๐—นu๐˜๐—ถ๐—ผ๐—ป: Built 4 specialized agents communicating via REST APIs (A2A pattern): ๐ŸŽฏ Orchestrator Agent - Coordinates the workflow ๐Ÿ“„ RFP-Extractor Agent - PDF โ†’ Structured JSON ๐Ÿ“š CompanyInfo-Extractor Agent - Ingests company data into vector DB โœ๏ธ RFP-Prep Agent - Generates responses using RAG ๐—ž๐—ฒ๐˜† ๐—•๐—ฒ๐—ป๐—ฒ๐—ณ๐—ถ๐˜๐˜€: โœ… Independent scaling - Each agent scales based on its workload โœ… Better separation of concerns - Single responsibility per agent โœ… Production-ready - Service-based architecture with health checks โœ… Easier maintenance - Update one agent without touching the others โœ… Zero functionality loss - 100% based on proven working code ๐—ง๐—ฒ๐—ฐ๐—ต ๐—ฆ๐˜๐—ฎ๐—ฐ๐—ธ: โ€ข Microsoft Agent Framework for agent orchestration โ€ข Flask for REST APIs โ€ข ChromaDB for vector storage โ€ข Ollama for local LLM inference โ€ข Python with async/await patterns ๐—ง๐—ต๐—ฒ ๐—ฅ๐—ฒ๐˜€๐˜‚๐—น๐˜: ~3,100 lines of well-structured, documented code across 17 files. Each agent can run standalone or as part of the coordinated system. Full deployment automation with start/stop scripts and comprehensive documentation. The transformation took what was a great proof-of-concept and made it ready for real-world deployment at scale. Anyone else working on multi-agent systems or migrating monoliths to microservices? Would love to hear about your experiences!

Want to Discuss This Topic?

Steve is always happy to have a direct conversation.