Engineering · Infrastructure

Scaling & Load Balancing Engineer

One call is a demo. Ten thousand at once is a company. You will own the systems that place, balance, and sustain a fleet of concurrent phone calls: the telephony capacity, the concurrency, and the reliability under real load.

๐Ÿ“ San Francisco (in person) ๐Ÿ•‘ Full time ๐Ÿงญ Infrastructure ๐Ÿ’ต Competitive + equity

About the role

Every Delfino call ties up real, scarce resources: a phone line, a voice pipeline, model capacity, and a slice of compute, all held live for the length of the call. As we grow from hundreds to many thousands of concurrent calls, the interesting problem stops being "can it place a call" and becomes "can it place ten thousand, balanced across carriers and regions, without a single one dropping."

That is your problem. You will own capacity and concurrency: how calls are queued, scheduled, and load-balanced across telephony providers and workers; how we stay inside carrier rate limits; how we scale up for a morning rush and down after; and how the whole fleet stays reliable and observable when something upstream fails.

What you'll do

What we're looking for

Nice to have

Design for the failure

Phone networks are messy and carriers fail in creative ways. We assume things will break and build so a bad line degrades a single call, never the fleet. You will set that reliability bar and hold us to it with metrics, not vibes.

Scale the fleet that never drops a call

If keeping thousands of live calls balanced and healthy sounds like fun, we should talk.

Connect with Delfino AI

✉ Contact