Todayโs job was to split the existing PoC into a multi-agent system, making the RFP Responder system easier to manage, understand conceptually, architecturally sound and learn about A2A with Microsoft MAF. To tackle this I decided to use Claude Code web (in Research Preview), connect it to a new repo and see how it went. To be honest my expectations have truly been exceeded (Renรฉ Fournier - I now get it!!). After pulling together a prompt to get Claude Code to base itโs work on the public PoC I left it running - it developed a plan, executed and left me with a pull request to review. I read through the code and the readme changes it made and every single thing makes sense (it even created startup and shutdown scripts for the Agents). Next thing to do is to test it - will be testing it in the real world in early January with a real RFP response. I asked it to write a LinkedIn post to explain what it did - hereโs the raw post: --------------- ๐ Just Transformed a Monolithic AI System into a Multi-Agent Architecture I recently refactored an RFP response automation system from a single 708-line Python script into a distributed multi-agent architecture using the Microsoft Agent Framework. Hereโs what that journey looked like: ๐ง๐ต๐ฒ ๐๐ต๐ฎ๐น๐น๐ฒ๐ป๐ด๐ฒ: The original PoC worked great - it automated RFP responses using Ollama (qwen3:14b), Docling for PDF parsing, and ChromaDB for RAG. But as a monolithic script, it wasnโt production-ready for scaling or distributed deployment. ๐ง๐ต๐ฒ ๐ฆ๐ผ๐นu๐๐ถ๐ผ๐ป: Built 4 specialized agents communicating via REST APIs (A2A pattern): ๐ฏ Orchestrator Agent - Coordinates the workflow ๐ RFP-Extractor Agent - PDF โ Structured JSON ๐ CompanyInfo-Extractor Agent - Ingests company data into vector DB โ๏ธ RFP-Prep Agent - Generates responses using RAG ๐๐ฒ๐ ๐๐ฒ๐ป๐ฒ๐ณ๐ถ๐๐: โ Independent scaling - Each agent scales based on its workload โ Better separation of concerns - Single responsibility per agent โ Production-ready - Service-based architecture with health checks โ Easier maintenance - Update one agent without touching the others โ Zero functionality loss - 100% based on proven working code ๐ง๐ฒ๐ฐ๐ต ๐ฆ๐๐ฎ๐ฐ๐ธ: โข Microsoft Agent Framework for agent orchestration โข Flask for REST APIs โข ChromaDB for vector storage โข Ollama for local LLM inference โข Python with async/await patterns ๐ง๐ต๐ฒ ๐ฅ๐ฒ๐๐๐น๐: ~3,100 lines of well-structured, documented code across 17 files. Each agent can run standalone or as part of the coordinated system. Full deployment automation with start/stop scripts and comprehensive documentation. The transformation took what was a great proof-of-concept and made it ready for real-world deployment at scale. Anyone else working on multi-agent systems or migrating monoliths to microservices? Would love to hear about your experiences!
All Insights
December 2025
Sunday Coffee & Code: Claude Code charges ahead!!!
Today's job was to split the existing PoC into a multi-agent system, making the RFP Responder system easier to manage, understand conceptually, architecturally sound and learn about A2A with Microsoft MAF.
By Steve Harris
Want to Discuss This Topic?
Steve is always happy to have a direct conversation.
