Jason Haley

Ramblings from an Independent Consultant

Hands-on Boston AgentCon Workshop

This past Saturday, I held my latest hands on workshop: From Static Files to Smart Agents: Unlocking Document Intelligence at AgentCon Boston. I ended up with around 30 attendees. There was quite a bit of interaction, with some real practical questions being asked about RAG and GenAI applications. Hand-on-Lab: From Static Files to Smart Agents: Unlocking Document Intelligence The repository is located at: https://github.com/JasonHaley/agentcon-boston-workshop What is it? The end result (which is the “completed” branch of the repo), is a Chainlit application designed to provide four use cases: Compare a clause between two contracts and get detail and risk analysis of the clause Analyze a clause that is in one contract but not the other Compare two complete contracts Create a …

Microsoft Build 2025: Azure AI Search

Microsoft Build 2025: Azure AI Search If you use Azure AI Search for your RAG applications, you are going to be excited about the new features that were just announced at Microsoft Build 2025. If you want a video that provides a good overview, I’d recommend watching Pablo Castro’s talk Knowledge Retrieval: RAG for enterprise agents with Azure AI Search, he did a great job highlighting most of the new features. Agentic Retrieval (new feature) From the documentation site: “… agentic retrieval is a new parallel query processing architecture that incorporates user conversation history and Azure OpenAI models to plan, retrieve and synthesize queries for improved results. It produces high-quality grounding data for custom chat and generative AI solutions that include agents.” This …

Custom AI Search Skill to Convert Markdown Tables to JSON

Custom Azure AI Search Skill to Convert Markdown Tables to JSON I was recently asked what options there are for handling large tables in PDFs, when ingesting files for a RAG application. The question was concerning a retrieval system that was having a problem identifying relevant data in some tables. The problem seemed to be due to the headings not being in the same chunk as the data - so the data doesn’t have any context to indicate its meaning. This blog by Lu Zhang (Microsoft) - A Heuristic Method of Merging Cross-Page Tables based on Document Intelligence Layout Model describes how to identify and merge tables using the Azure AI Document Intelligence Layout Model, she also provides working code to do it on GitHub. She goes over how to get the tables, locate them and figure out …