The pattern that makes agents trustworthy: ingest external data into a Cloud Storage lake, refine it through BigQuery, and serve it to agents via structured and semantic retrieval. End-to-end on Google Cloud, from raw bytes to agent context, with a curated 2024–2026 research reading list.