AI Con USA 2025 - Data Engineer

Customize your AI Con USA 2025 experience with sessions covering data engineering.

Monday, June 9

Matt Eland
Leading EDJE
MB

Beginning Data Analysis and Machine Learning with Jupyter Notebooks

Monday, June 9, 2025 - 8:30am to 12:00pm

In this beginner-friendly workshop you'll see how you can get started with data analytics and data science using Jupyter Notebooks. Matt will start with the basics of notebooks and then move on to using Python, Pandas, and NumPy to perform basic exploratory data analysis. See how you can use Plotly Express to create interactive charts and visuals with only a minimal amount of code. Once you've grasped the basics of understanding and visualizing the data Matt will move on to machine learning with SciKit-Learn as you train and evaluate predictive regression and classification models. The...

Joshua Powers
Dev Technology Group
MC

Get Your Data Ready for AI/ML

Monday, June 9, 2025 - 8:30am to 12:00pm

Understanding the readiness of your source data before you launch an expensive AI/ML project lets you take corrective data engineering measures that will streamline the project and give you the best probability of a successful outcome.  Artificial Intelligence (AI) and Machine Learning (ML) projects can provide significant returns on investment when they are applied to narrow but difficult business problems and are supported by adequate amounts of relevant, quality data. Many such projects start with high hopes but get derailed due to fundamental problems with source data, which were...

Justin Castilla
Elastic
MG

Introduction to RAG Applications: Building Conversational AI for Domain-specific Search

New
Monday, June 9, 2025 - 1:00pm to 4:30pm

This beginner-friendly workshop introduces participants to the fundamentals of Retrieval-Augmented Generation (RAG) applications. Using a pre-configured Docker environment featuring Python, Elasticsearch for vector storage, and OpenAI as the LLM, attendees will learn how to build a RAG-powered conversational portal. Throughout the session, participants will create a RAG application to consume and query a sample dataset of Washington State regulation documents. Replace these sample documents with your own PDF files, and you’ll interact with your data in no time! By the end, attendees will...

Tuesday, June 10

Justin Castilla
Elastic
TC

Clean Your Filthy RAGS! Optimizing, Accelerating, and Evaluating RAG Applications

New
Tuesday, June 10, 2025 - 8:30am to 12:00pm

Retrieval-Augmented Generation (RAG) applications are becoming essential for companies, combining AI with real-time data retrieval to enhance customer experiences. While Large Language Models (LLMs) handle general conversation well, they struggle with domain-specific, up-to-date information, often producing inaccurate or unhelpful responses. This workshop will empower participants with the necessary skills to optimize RAG applications using existing best practices. Justin will walk through integrating RAGAS, a framework designed to evaluate, monitor, and fine-tune the performance of RAG...

Matt Eland
Leading EDJE
TD

Common Software, Data, and AI Architectures and the Ways They Fail

New
Tuesday, June 10, 2025 - 8:30am to 12:00pm

This tutorial will examine the complexity that lives in software systems, data ingestion workflows, MLOps pipelines, and artificial intelligence systems. This session blends together cloud architecture, quality assurance, risk management, and security mindsets as Matt Eland explores how modern systems are structured, the problems their complexity helps us solve, and the ways these systems can break - or be broken. The session will alternate between interactive lectures with practical illustrations and group exercises around case studies as you explore how existing systems can fail and what...

TH

AI Deep Dive: Exploring AWS Using Real-World Scenarios

Tuesday, June 10, 2025 - 1:00pm to 4:30pm

Deepen your AI and machine learning expertise using AWS in an Immersive, hands-on workshop. You’ll use real-world AI challenges while leveraging AWS services like Amazon SageMaker, Bedrock, and Lambda to build and optimize AI-driven solutions. As the session unfolds, new constraints and data anomalies will emerge, mirroring the complexities of real-world AI/ML implementation. Gain insight into how AI solutions perform under evolving conditions, learning to adapt, optimize, and troubleshoot unexpected challenges. Learn the importance of collaboration, strategic thinking, problem-solving,...

Wednesday, June 11

erica_greene_headshot
Yahoo News
K1

Best Practices for Using AI for Structured Data Extraction

Wednesday, June 11, 2025 - 8:30am to 9:30am

Structured data extraction, or data tagging, is one of the easiest and impactful applications of modern AI. Before the wide availability of pre-trained AI models, the process of “understanding” unstructured data either required constructing complex heuristic logic or investing in a machine learning team who could train models in-house. Now, cheap and powerful tagging machines are an API call away, redefining what is possible for how we can understand our data. In this talk, I'll share how we’ve used AI at Yahoo News to improve our content understanding pipelines. Yahoo was a pioneer in...

sida_peng_headshot
Microsoft
K2

The Impact of AI on Developer Productivity

Wednesday, June 11, 2025 - 9:35am to 10:20am

Generative AI tools hold promise to increase human productivity.  In the world of software development, GitHub Copilot was one of the first practical applications of the use of generative AI to support developer productivity.  However, measuring software productivity is non-trivial.  For example, developer productivity gains is more than just producing code faster. If these artifacts don’t meet quality standards or themselves bring cost efficiency challenges then there may not be much overall improvement. To truly understand the benefits and challenges of AI-powered copilots requires real-...

Jason_Arbon
Checkie.AI
W1

Building Reliable AI Agent Flows

Wednesday, June 11, 2025 - 11:00am to 11:45am

AI agents are revolutionizing how we interact with software, but their reliability remains a critical challenge. In this talk, Jason Arbon explores the key principles and techniques for designing AI agent flows that are predictable, testable, and robust. Attendees will learn how to structure AI-driven interactions to minimize failure points, leverage validation mechanisms, and integrate automated testing strategies to ensure smooth execution. Drawing from real-world applications and lessons learned in AI-driven software testing, this session will provide practical insights for engineers,...

W3

Testing Powered by AI/ML Synthetic Test Data: A Game Changer

Wednesday, June 11, 2025 - 11:00am to 11:45am

The session covers a testing approach that utilizes a new age Test Data Management (TDM) technique which learns from production data. Machine learning analyzes the data to create a model capable of generating synthetic data with identical source data attributes. Multiple learning iterations refine the model, enhancing its accuracy with each cycle. The data model incorporates security measures like differential privacy, enabling safe movement to lower environments. Later, generative AI leverages this model to produce desired volumes of test data for various testing types, including...

Vijay Panwar
Panasonic Avionics Corporation
W10

Navigating AI Governance: Building Trust in a Regulated Future

Wednesday, June 11, 2025 - 2:25pm to 3:10pm

As artificial intelligence systems increasingly influence critical decisions across industries, ensuring compliance with evolving governance and regulatory standards is both a challenge and a necessity. This presentation will explore the complexities of AI governance, focusing on balancing innovation with compliance in a rapidly changing regulatory environment. Vijay Panwar will examine real-world challenges such as bias mitigation, data privacy adherence, and ethical transparency, providing actionable strategies to design AI systems that comply with global standards like GDPR and emerging...

Mary Thorn
S&P Global
K3

Five Ways to Operationalize AI at Scale

Wednesday, June 11, 2025 - 3:50pm to 4:35pm

Enterprises often struggle with how to incorporate AI and machine learning in a repeatable, sustainable manner. In today’s competitive landscape, AI is no longer just a trend but a necessity. Generative AI has quickly become an essential tool for businesses—and with companies expected to explain its usage and justify any lack thereof—organizations are looking for ways to leverage AI at scale. This keynote provides a strategic roadmap to unlock the full potential of AI within organizations, focusing on cultural readiness, enablement, ethical considerations, and addressing biases. Attendees...

Dona Sarkar
Microsoft
K4

AI for Real People—Panel

Wednesday, June 11, 2025 - 4:40pm to 5:40pm

We have heard a LOT of hype about AI and AGI and ASI and all of the nonsense. But in year 3 of Generative AI, people are actually moving beyond talking and finding real value in AI. Come and hear from REAL AI implementers about how AI is having an impact on businesses across the board including small businesses, non-profits, big companies, and more.

Thursday, June 12

Venkata Kampana
Amazon Web Services
K5

AWS Public Health Modernization: Leveraging GenAI for Government Innovation

Thursday, June 12, 2025 - 8:30am to 9:30am

Join Venkata Kampana, Senior Solutions Architect from the AWS Health and Human Services team, and Tim Collinson, the CTO of 11:59, an AWS consulting partner, for an insightful discussion on transforming public health systems across federal, state, and local governments. This session will showcase real-world implementations of GenAI and AWS technologies that are revolutionizing public health operations. They will demonstrate innovative solutions, including their IDP implementation utilizing Bedrock's Data Automation (BDA) feature with confidence scoring and bounding box capabilities,...

andreas_bohman
University of Washington
K6

AI at Scale: Balancing Innovation, Governance, and Risk in Large Organizations

Thursday, June 12, 2025 - 9:35am to 10:20am

As AI reshapes industries, large organizations must scale innovation while upholding governance, security, and ethical responsibility. Deploying AI at scale isn’t just a technical challenge—it’s a strategic balancing act between agility and compliance, risk and reward. Andreas Bohman, CIO at the University of Washington, will discuss strategies to drive AI-powered innovation without compromising regulatory obligations, operational effectiveness, or public trust. He will share governance strategies that enable innovation rather than restrict it. He’ll also talk about addressing critical...

Matt Payne
Width.ai
T6

RAG Has Evolved - Enhance Your RAG Pipeline with These Concepts

Thursday, June 12, 2025 - 11:40am to 12:25pm

The majority of businesses today can set up a fundamental RAG pipeline that effectively handles most use cases. However, this basic setup eventually reaches its limitations in terms of functionality and accuracy, hindering further advancements. Matt Payne aims to detail the necessary pipeline components for building advanced RAG pipelines. For each component, he will explain the what, when, why, and how and provide real-world examples. Key areas of focus include leveraging tools and function calling, which enables you to create a systematic approach to using knowledge from multiple sources...