AI Con USA 2025 - Data Architecture
Sunday, June 8
Certified Data & Analytics Tester (DAU-CDAT)
Monday, June 9
Beginning Data Analysis and Machine Learning with Jupyter Notebooks
In this beginner-friendly workshop you'll see how you can get started with data analytics and data science using Jupyter Notebooks. Matt will start with the basics of notebooks and then move on to using Python, Pandas, and NumPy to perform basic exploratory data analysis. See how you can use Plotly Express to create interactive charts and visuals with only a minimal amount of code. Once you've grasped the basics of understanding and visualizing the data Matt will move on to machine learning with SciKit-Learn as you train and evaluate predictive regression and classification models. The...
Get Your Data Ready for AI/ML
Understanding the readiness of your source data before you launch an expensive AI/ML project lets you take corrective data engineering measures that will streamline the project and give you the best probability of a successful outcome. Artificial Intelligence (AI) and Machine Learning (ML) projects can provide significant returns on investment when they are applied to narrow but difficult business problems and are supported by adequate amounts of relevant, quality data. Many such projects start with high hopes but get derailed due to fundamental problems with source data, which were...
Tuesday, June 10
Clean Your Filthy RAGS! Optimizing, Accelerating, and Evaluating RAG Applications
NewRetrieval-Augmented Generation (RAG) applications are becoming essential for companies, combining AI with real-time data retrieval to enhance customer experiences. While Large Language Models (LLMs) handle general conversation well, they struggle with domain-specific, up-to-date information, often producing inaccurate or unhelpful responses. This workshop will empower participants with the necessary skills to optimize RAG applications using existing best practices. Justin will walk through integrating RAGAS, a framework designed to evaluate, monitor, and fine-tune the performance of RAG...
Common Software, Data, and AI Architectures and the Ways They Fail
NewThis tutorial will examine the complexity that lives in software systems, data ingestion workflows, MLOps pipelines, and artificial intelligence systems. This session blends together cloud architecture, quality assurance, risk management, and security mindsets as Matt Eland explores how modern systems are structured, the problems their complexity helps us solve, and the ways these systems can break - or be broken. The session will alternate between interactive lectures with practical illustrations and group exercises around case studies as you explore how existing systems can fail and what...
Wednesday, June 11
Testing Powered by AI/ML Synthetic Test Data: A Game Changer
The session covers a testing approach that utilizes a new age Test Data Management (TDM) technique which learns from production data. Machine learning analyzes the data to create a model capable of generating synthetic data with identical source data attributes. Multiple learning iterations refine the model, enhancing its accuracy with each cycle. The data model incorporates security measures like differential privacy, enabling safe movement to lower environments. Later, generative AI leverages this model to produce desired volumes of test data for various testing types, including...