Category: Data Management
-

Part 2: Enhancing EHR Data Quality: Key Findings from Recent Studies
This post is Part 2 of our 2-part series on recent research studies looking at EHR data quality. Part 1 presented 3 studies from 2025, and Part 2 presents 2 more recent studies and a proposed technological shift that avoids the pitfalls of traditional data pipelines. Study #4. Interoperability Remains a Pain Point While previous…
-

Part 1: Reactive vs. Proactive: The High Cost of Unvalidated EHR Data
Electronic Health Records were meant to be the backbone of modern healthcare and AI clinical prediction system. Instead, the 2025 research landscape paints a sobering picture. EHR data is incomplete, inconsistent, and often wrong at a scale that threatens clinical research, AI development, and even day‑to‑day patient care. Across multiple new studies, a consistent theme…
-

Autonomous Agents Visiting Data
The Google AI Agents Intensive Course (5DGAI) debuted in March 2025 (First 5DGAI) and returned in November 2025 (second 5DGAI), offering developers a front-row seat to the rapid evolution of agentic systems. The First 5DGAI course focused on foundational skills: writing prompts, training agents, customizing them using Retrieval-Augmented Generation (RAG), and deploying them via MLOps.…
-

The FAIRLYZ Survey Report: Optimizing Data Sharing for Scientific Progress
Biomedical data sharing, especially crucial for AI’s reliance on large, high-quality datasets, fuels scientific progress by enabling new research and collaborations around data reuse, and the development of new tools trained on the data. To gain insights into current data sharing practices and challenges, the FAIRLYZ project surveyed NIAID-funded researchers. An analysis of the survey…

