AI Con USA 2026 - MLOps & AIOps
Sunday, June 7
Fundamentals of AI—ICAgile Certification (ICP-FAI)
Monday, June 8
Build Your Own AI News Agent with Redis + LangGraph.js.
NewTired of sifting through endless news sources and missing the topics you care about? This hands-on workshop empowers you to build a personalized AI news assistant using Redis and LangGraph.js. You’ll learn to ingest articles from RSS feeds, summarize and analyze them, extract structured data, and generate embeddings—all stored in Redis for powerful hybrid (vector + structured) search. Your agent will provide a web interface for browsing and semantic search, and even chat with you about current events, retrieving relevant articles and metadata using RAG techniques. LangGraph.js orchestrates...
Wednesday, June 10
Architecting the Digital Work System: The Shift to Multi-Agent Enterprise Workflows
The era of single-turn AI prompting is over, giving way to the sophisticated "digital assembly line." This session guides technical leaders, CTOs, and AI product managers through the critical leap from isolated AI tools to coordinated, multi-agent constellations capable of autonomously executing complex enterprise workflows. Marshall will dissect the four levels of agentic evolution, focusing on today's epicenter of innovation: cross-system orchestration. Because the risk of cumulative failure multiplies with every automated handoff, he will unpack the architectural strategies required to...
Tracing the Mind of the Machine: Observability for AI Agents
AI agents have evolved beyond LLM chatbots; they possess the ability to plan, reason, and act autonomously. However, as their autonomy increases, understanding how they make decisions becomes more challenging. Traditional methods of observability—such as metrics, logs, and traces—capture outcomes but do not reveal the underlying reasoning. This session will explore how AI Agent Observability can shed light on the decision-making process by collecting and analyzing agent traces. We will discuss emerging standards like the Model Context Protocol (MCP), which provides structured and shareable...
Revolutionizing Healthcare with AI
Agentic AI at the Edge: Building Mobile AI Agents for Automotive & MedTech
Shipping AI into regulated, safety-critical mobile apps is more than “just call an LLM.” In automotive and medical contexts, we must meet strict privacy, latency, and auditability requirements while working with flaky connectivity and device constraints. In this talk, Ronak will share a field-tested architecture for agentic AI on the edge: a hybrid edge–cloud agent that uses tool-calling plugins, a policy engine for pre/post-prompt guardrails, offline fallbacks, and structured telemetry for audits. He will walk through how his team bounds agent behavior with capability cards, redacts PHI/...
Unlock Exponential Productivity: The AI Maturity Model for Product Engineering
Are you ready to transform your productivity from incremental gains to exponential growth? Whether you're an individual contributor or engineering leader, this session introduces the AI Maturity Model, a proven framework that guides technical teams through five distinct levels of AI adoption—from 33% productivity boosts to an extraordinary 1000% increase. Discover how to navigate each stage: Level 1 (Foundation, 33%) builds essential AI awareness; Level 2 (Literacy, 75%) develops practical AI skills; Level 3 (Fluency, 300%) masters AI-assisted workflows; Level 4 (Agents, 500%) implements...
Thursday, June 11
The Master Control Policy Server: A Model-Agnostic Architecture for Enterprise LLM Governance and Risk Mitigation
As Large Language Models (LLMs) transition from conversational tools to autonomous AI agents capable of high-stakes, multi-step actions, the reliance on generalized, default safety alignment from model providers has become insufficient and often hazardous. This session addresses the critical need for application-specific, custom guardrails necessitated by persistent jailbreak vulnerabilities, the high cost of achieving effective fine-tuning alignment, and the challenge of enforcing proprietary organizational compliance (e.g., contextual PII protection or specific regulatory mandates unique...
Engineering AI Infrastructure for Efficient Inference at Scale
As AI models grow in complexity and scale, inference efficiency has emerged as a critical engineering challenge for enterprise deployment. Traditional infrastructure built for training workloads often fails to meet the latency, throughput, and cost demands of large-scale inference operations. In this session, Sandeep will be sharing practical insights from engineering AI infrastructure at Broadcom, focusing on the end-to-end optimization of compute, networking, and storage subsystems. The talk explores techniques such as dynamic workload placement, adaptive batching, model quantization,...
From Reactive to Proactive: Intuit's Journey in AI-Powered Incident Management
With thousands of Intuit services, managing incidents efficiently and effectively becomes essential. In this talk, Akshay will examine how they have moved beyond traditional incident management methods to embrace AI-driven solutions for speed, resilience, and continuous improvement. The session will outline how Intuit has scaled incident management across thousands of services, ensuring robust support across their ecosystem. See how the team utilized real-time insights and advanced automation to enhance AWS-related incident detection and resolution across Intuit's infrastructure, as well...
Evals Are a Team Sport: Building Scalable Evaluation Pipelines for Trustworthy AI
Modern AI systems fail not only because of flawed models but because evaluation is often treated as a one-time task rather than an ongoing discipline. This session addresses the challenge of scaling evaluation across teams and pipelines to ensure model reliability, fairness, and performance. Drawing from real-world experience in large-scale financial analytics, Anusha Dwivedula will examine how product and data teams can collaborate to design a continuous evaluation framework that integrates precision, recall, and drift metrics with observability, lineage, and quality controls. She will...
Proven, Not Promised: Real-World Insights from Nelnet’s AI Transformation
Engineering leaders are under constant pressure to deliver faster, improve quality, and reduce costs despite fixed resources. At Nelnet, this challenge was compounded by ongoing acquisitions, a complex application landscape, and competing team priorities. Brittany Wilson will discuss how they evaluated the impact of Generative AI (GenAI) on the software development lifecycle without disrupting critical delivery work and how Nelnet adopted a measured pilot approach. Over 12 weeks, a side-by-side experiment compared a GenAI-enabled team with a business-as-usual team working on similar...