Who is Daniel Astudillo?

Daniel Astudillo is a software engineer based in New York City. He currently works at S&P Global building data platforms and full-stack applications, and previously built payment and benefit systems at Visa.

What technologies does Daniel Astudillo work with?

Daniel works across the stack with React, TypeScript, and Next.js on the frontend, and .NET Core, C#, Spring Boot, and Java on the backend. He has deep experience with PostgreSQL, BigQuery, gRPC, and distributed messaging systems.

Where is Daniel Astudillo based?

Daniel is based in New York City, where he works as a software engineer at S&P Global.

What has Daniel Astudillo built?

Daniel has taken a data API from a 21-second worst case to roughly 200–300ms (Storage Write API, then PostgreSQL), built a real-time event pipeline processing 100K+ daily events at 99.99% uptime, and modernized Visa payment and eligibility APIs at production scale (including paths above 20M requests per month). He writes about this work on his blog.

What is Daniel Astudillo's educational background?

Daniel graduated from Williams College with a Bachelor of Arts in Computer Science and Mathematics.

May 2026Updated June 20266 min readStart hereCase study

Cutting a data API from 21s to ~250ms

Warehouse DML on the request path was the bug. Stage 1 fixed writes; stage 2 fixed reads. Neither stage was query tuning.

Performance
BigQuery
gRPC
PostgreSQL
.NET

#TL;DR

Phase	Change	Latency (hot paths)
Baseline	BigQuery DML (`INSERT` / `MERGE` / `UPDATE`) over REST	~21–30s tail under load
Stage 1	Storage Write API v2 over gRPC (append rows, not SQL jobs)	under 2s
Stage 2	PostgreSQL + EF Core OLTP; warehouse for enrichment only	~200–300ms

Interactive UI operations — list views, keyed lookups, validation saves — were backed by a warehouse used as an OLTP store. Below: how I measured it, the two migrations, and what I refused to optimize.

#Symptoms: tail latency, not averages

Tickets piled up on paths that should feel instant. Means on cached routes looked fine; p95/p99 sat at 21–30s on write-heavy and full-list reads.

Root causes after instrumentation:

Signal	Interpretation
Every DML = query job	BigQuery schedules, plans, executes — even for single-row writes
DML quota contention	Concurrent mutations throttle under load
Columnar engine on keyed reads	OLTP access pattern on analytical storage

Structured benchmarks on INSERT/MERGE/UPDATE/DELETE over REST confirmed: Capacitor-backed storage + job semantics ≠ row store semantics. Not "bad SQL" — wrong system for synchronous UI.

#Before

.NET API

REST client → BigQuery jobs

BigQuery

DML + SELECT on same store

Web client

blocks on warehouse latency

#Stage 1: Storage Write API (gRPC)

Replace hot-path DML with AppendRows on a committed write stream:

var client = await BigQueryWriteClient.CreateAsync();
var stream = await client.CreateWriteStreamAsync(new CreateWriteStreamRequest
{
    Parent = tableReference,
    WriteStream = new WriteStream { Type = WriteStream.Types.Type.Committed },
});
 
await using var append = client.AppendRows();
await append.WriteAsync(new AppendRowsRequest
{
    WriteStream = stream.Name,
    ProtoRows = new ProtoRows
    {
        WriterSchema = protoSchema,
        SerializedRows = { rowBatch.Select(r => r.ToByteString()) },
    },
});

Why this beats DML REST:

DML over REST	Storage Write API v2
SQL parse + plan per call	Predefined proto schema
Query job slot	Streaming append
DML concurrency limits	Write stream throughput
High per-op overhead	Amortized stream session

Measured result: hot write paths under 2s (~60% vs baseline). Still too slow for UI — reads remained on BigQuery.

Stream lifecycle matters: create stream → append batches → finalize. Reusing streams avoids per-request stream creation cost (we pooled streams per table partition in practice).

#Stage 2: PostgreSQL OLTP + hybrid reads

Writes improved; reads still hit the warehouse. UI needed:

Keyed lookups (WHERE id = @id, WHERE parent_id IN (...))
Small bounded result sets (nav counts, picker lists)
Transactional updates with row-level consistency

#Schema + index strategy

Model only what the UI queried — not a warehouse-wide mirror:

SQL

CREATE TABLE records (
  id          UUID PRIMARY KEY,
  tenant_id   UUID NOT NULL,
  status      SMALLINT NOT NULL,
  updated_at  TIMESTAMPTZ NOT NULL
);
CREATE INDEX ix_records_tenant_status ON records (tenant_id, status);

EF Core migrations versioned alongside API routes. Partial indexes on hot filters where cardinality justified them.

#Cutover mechanics

Route-by-route — no big bang:

Backfill — batch copy from warehouse with row-count + checksum parity
Shadow read — API handler queries Postgres + warehouse, compares hashes in logs (not user-visible)
Flip read — feature flag per route; monitor p95/error rate
Flip write — dual-write window, then warehouse write off
Enrichment only — hierarchy/analytics stay async off hot path

Shadow read handler shape:

var pgRow = await _pg.Records.FindAsync(id);
var bqRow = await _bqEnrichment.FetchAsync(id); // async, optional
if (_shadowMode) _comparer.LogDelta(pgRow, bqRow);
return Map(pgRow, bqRow); // UI reads Postgres only

#After

.NET API

EF Core · OLTP routes

PostgreSQL

primary read/write

BigQuery

warehouse · hierarchy joins

Measured result: hot reads 200–300ms p95, variance collapsed.

#What we refused to do

Index-tune warehouse DML into sub-second (wrong tool)
Big-bang schema migration (rollback must be per-route)
Drop warehouse entirely (enrichment still valuable)

#Adjacent production constraints

Latency isn't valid in isolation:

Multi-pod cache — centralized Redis Next.js cache handler; without it, pods serve divergent cached pages
Gateway + API test coverage — regressions on migrated routes caught in CI
Multi-step workflows — approval state machines require correctness gates; fast wrong writes are worse than slow right ones

#Lessons

Profile p99, not mean — UI trust follows worst-case.
Write API ≠ read database — stage 1 and 2 solve different bottlenecks; order mattered.
Hybrid architecture is valid — OLTP for interaction, warehouse for analytical enrichment.
Ask "should this request touch this system?" before "how do I tune this query?"

#Stage-by-stage latency (measured)

Stage	Symptom	Change	Outcome
Baseline	~21s p99 on list endpoints	Profiled query plan	Warehouse scan on OLTP path
Stage 1	Still slow writes	Routed reads to Postgres replica + partial indexes	Interactive lists under 1s
Stage 2	Enrichment still blocking	Async BigQuery enrichment off hot path	~250ms p99 on migrated routes

The win was not one index — it was stopping analytical reads from sharing the OLTP connection pool during business hours.

#Multi-pod cache gotcha

After horizontal scaling the Next.js BFF, pods served different cached pages until a centralized Redis cache handler was wired. Latency looked fixed on one pod and flaky globally — always validate cache coherence when the API tier scales out.

#Closing thought

Slow reads are usually a topology problem: the wrong database tier on the wrong path, not a missing index on the first hop. Fix where the request walks before you argue about ORM choice.

#Reader field guide

When this pattern applies: A product UI waits on warehouse DML or full-table scans; p99 latency dominates support noise while averages look fine; you need keyed reads, small writes, or transactional updates on the request path. Skip it if the workload is batch-only analytics with no synchronous caller.

Operational checklist

Measure p95/p99 per route before tuning SQL — means hide warehouse job latency.
Classify each handler: OLTP (keyed read/write) vs enrichment (can be async).
For hot writes still on BigQuery, pilot Storage Write API append streams before debating indexes.
Backfill Postgres (or your row store) with row-count + checksum parity; run shadow reads before flipping traffic.
Cut over one route at a time with a feature flag; watch p95 and error rate, not deploy count.
Keep warehouse on the path only for hierarchy joins and analytics that do not block the UI thread.

Where to put the data (request path)

Caller need	Warehouse DML/SELECT	Storage Write API	PostgreSQL (OLTP)
Single-row UI save	Poor (job semantics)	Good for append-heavy writes	Best
Keyed lookup by id	Poor (columnar scan)	N/A for reads	Best
Bulk nightly ETL	Good	Good	Usually wrong tier
Real-time enrichment	Async/off hot path	Async/off hot path	Primary row + async join

#On this site

Post	Why
Building a microfrontend platform for data products	What happened after the API was fast enough for humans — frontend platform constraints
Bridging AMQPS and JMS for real-time events	Async domain events off the request path (different bottleneck class)

#References (curated)

I still sanity-check warehouse semantics against Google’s docs before arguing for Postgres—DML job latency is easy to misread when averages look fine.

Reference	Notes
BigQuery Storage Write API	Append streams for row ingestion without DML jobs on the hot path.
BigQuery DML limitations	Why `INSERT`/`MERGE` behave like batch work, not keyed OLTP.
PostgreSQL partial indexes	Smaller indexes when route predicates are stable.