Another day, another “Using PostgreSQL for…” thing it wasn’t designed for. This ...

direwolf20 · 2026-01-25T18:44:35 1769366675

The other system you're using that isn't Postgres can also go down.

Many developers overcomplicate systems. In the pursuit of 100% uptime, if you're not extremely careful, you removed more 9s with complexity than you added with redundancy. And although hyperscalers pride themselves on their uptime (Amazon even achieved three nines last year!) in reality most customers of most businesses are fine if your system is down for ten minutes a month. It's not ideal and you should probably fix that, but it's not catastrophic either.

hinkley · 2026-01-25T20:08:52 1769371732

What I’ve found is that, particularly with internal customers, they’re fine with an hour a month, possibly several, as long as not all of your eggs are in one basket.

The centralization pushes make a situation where if I have a task to do that needs three tools to accomplish, and one of them goes down, they’re all down. So all I can do is go for coffee or an early lunch because I can’t sub in another task into this time slot. They’re all blocked by The System being down, instead of a system being down.

If CI is borked I can work on docs and catch up on emails. If the network is down or NAS is down and everything is on that NAS, then things are dire.

plaguuuuuu · 2026-01-26T02:19:14 1769393954

good luck doing anything if kafka is down though

reactordev · 2026-01-25T21:50:19 1769377819

>The other system you're using that isn't Postgres can also go down.

Only if DC gets nuked.

Many developers overcomplicate systems and throw a database at the problem.

mwigdahl · 2026-01-25T23:32:04 1769383924

Wow, TIL there was an atomic attack on the capitol in October!

reactordev · 2026-01-25T23:53:30 1769385210

DC=Data Center

DC!=Washington, DC

mwigdahl · 2026-01-26T05:01:57 1769403717

I wondered, but the lack of "the" before "DC" tipped me toward interpreting it as the place name, especially as AWS us-east-1 is in Northern Virginia. Thanks for clarifying!

direwolf20 · 2026-01-26T00:53:02 1769388782

Which system is immune to all downtime except the DC getting nuked?

reactordev · 2026-01-26T01:27:01 1769390821

Properly designed distributed systems.

Challenge: Design a fault tolerant event-driven architecture. Only rule, you aren’t allowed to use a database. At all. This is actually an interview question for a top employer. Answer this right and you get a salary that will change your life.

direwolf20 · 2026-01-26T11:30:12 1769427012

No, those go down all the time. AWS had three nines last year. Bitcoin had the value overflow incident.

reactordev · 2026-01-26T17:43:20 1769449400

Credit cards still worked…

Email still worked…

Again, there are fault tolerant distributed systems out there that don’t rely on a single point of failure.

That’s not to say failure doesn’t happen.

fcarraldo · 2026-01-25T18:21:01 1769365261

There are a ton of job/queue systems out there that are based on SQL DBs. GoodJob and SupaBase Queues are two examples.

It’s not usable for high scale processing but most applications just need a simple queue with low depth and low complexity. If you’re already managing PSQL and don’t want to add more management to your stack (and managed services aren’t an option), this pattern works just fine. Go back 10-15yrs and it was more common, especially in Ruby shops, as teams willing to adopt Kafka/Cassandra/etc were more rare.

reactordev · 2026-01-25T21:47:22 1769377642

And there are a ton that aren’t.

tlb · 2026-01-26T12:47:53 1769431673

I think the PG designers would be surprised by the claim that it wasn't designed for this. Database designers try very hard to support the widest possible range of uses.

If all queue actions are failing instantly, you probably want a separate throttle to not remove them from the Kafka queue, since you'd rather keep them there and resume processing them normally instead of from the DLQ when queue processing is working again. In fact, the rate limit implicitly enforced by adding failure records to the DLQ helps with this.

hnguyen14 · 2026-01-25T18:05:20 1769364320

How so? There are queues that use SQL (or no-SQL) databases as the persistence layer. Your question is more specific to the implementation, not the database as persistence layer itself. And there are ways to address it.

senbrow · 2026-01-25T18:11:41 1769364701

Criticism without a better solution is only so valuable.

How would you do this instead, and why?

reactordev · 2026-01-25T21:48:53 1769377733

Watching a carpenter try to weld is equally only so valuable. I think the explanation is clear.

odie5533 · 2026-01-25T17:48:25 1769363305

You wouldn't ack the message if you're not up to process it.

trympet · 2026-01-25T20:57:13 1769374633

I prefer using MS Exchange mailboxes for my message queue.