| Episode | Status |
|---|---|
| Episode | Status |
|---|---|
Today's guest is Vaithi Bharath, Associate Director of Data Science & AI Solutions at Bayer. Bharath joins Emerj Editorial Director Matthew DeMello to break down why clinical R&D timelines often slip ...
Vaithi Bharath from Bayer explains that clinical R&D delays stem primarily from fragmented systems, slow handoffs, and validation requirements—not AI model performance. He details how data moves through 4-5 disconnected systems from EDC capture to final submission, with each handoff requiring manual reconciliation and extensive validation. The solution involves layering AI-guided workflows on top of existing validated systems via APIs, maintaining human-in-the-loop approval while reducing cycle time from weeks to days without triggering full revalidation processes.
Bharath maps the complete clinical data lifecycle from EDC systems (Medidata, Veeva) through CTMS, safety databases, statistical computing environments, to final submission deliverables. He explains how data passes through 4-5 major handoff points requiring manual file transfers, format conversions (CRF to SDTM/ADAM), and reconciliation across disparate teams and vendors, causing database lock cycles to stretch from days to weeks.
Bharath distinguishes validation from testing: validation asserts what must and must not happen, not just that something works. He shares a real example where a vendor integration issue forced 6-8 months of manual ZIP file handoffs because the CSV release process for a modern integration was too expensive and time-consuming to fast-track.
Bharath explains FDA's 21 CFR Part 11 requirements for electronic records and signatures, detailing what auditors actually look for: tamper-proof logs, complete access justifications, training evidence, and reproducible results. He emphasizes that FDA expects to reproduce exact results using the same code and data, requiring immutable audit trails at every step.
Bharath describes how AI tools from vendors like SAS are introducing guided workflows for SDTM/ADAM transformations and protocol deviation checks. The key is maintaining human clinical review of AI suggestions while creating immutable audit trails. He emphasizes these are pilots and proofs-of-concept, not fully mature implementations, with humans still validating every AI-generated suggestion.
Bharath explains how standardized AI-guided workflows enable reusable validation macros, increasing reusability from 50% to 80% across studies. This translates to faster database locks, fewer study analysis plan amendments (which can cost hundreds of thousands to millions), and more consistent quality regardless of team experience level.
Bharath advocates for building API-based wrappers around existing validated systems rather than expensive rip-and-replace projects. He explains how to leverage EDC APIs (like Veeva) to add AI-guided workflow layers, plug QC checks into existing ETL pipelines, and enhance validation frameworks without revalidating the entire stack—saving months to years of implementation time.
Bharath shares Bayer's consolidation approach: moving from 7-8 disparate system integrations to a three-tier stack (data management/prep, lakehouse for transformation, analytics environment). This shifts data wrangling burden upstream, allowing statistical teams to focus on analysis rather than data preparation, while adopting market-standard platforms with plug-and-play API capabilities.
Bharath and the host synthesize the discussion into a practical roadmap: identify 8-10 routine weekly questions causing the most rework, map data owners and approvers, pilot 2 questions as guided workflows in a governed sandbox, track days saved and rework avoided, then scale what passes audits. The key is complementing validated systems with explainable workflows that maintain human approval and audit-ready traceability.
Reducing R&D Cycle Time in Pharma Without Increasing Regulatory Risk - with Vaithi Bharath of Bayer
Ask me anything about this podcast episode...
Try asking: