Should We Be Muddying the Relational Waters? | Use Cases for MySQL & MongoDB

Many of you know I publish a newsletter monthly. One thing I love about it is that after almost a decade of writing it regularly, the list has grown considerably. And I’m always surprised at how many former colleagues are actually reading it.

So that is a really gratifying thing. Thanks to those who are, and if you’re not already on there, sign up here.

My Personal Thoughts on: Should We Be Muddying the Relational Waters?

Recently a CTO & former customer of mine reached out. He asked:

“I’m interested to hear your thoughts on the pros and cons of using a json column to embed data (almost like a poor-man’s Mongo) vs having a separate table for the bill of materials.”

Interesting question. Here are my thoughts.

1. Be Clean or Muddy?

In my view, these types of design decisions are always about tradeoffs.

The old advice was to normalize everything to start off with. Then, as you’re performance tuning, denormalize in the special cases where it’ll eliminate messy joins. Those special cases then also need to be handled at insert & update time, because you now have duplicated data to keep in sync.

NoSQL & Mongo introduce all sorts of new choices. So does Postgres, with its JSON column types.

We know that putting everything in one table makes reads fast, as you don’t need to join. Reads will also cache cleanly, ideally as a lookup on a single ID or a small set of IDs.
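To make that insert & update cost concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The table and column names (`parts`, `bom_lines`, `part_name`) are my own invention for illustration, not from any real schema. The part name is copied into the bill-of-materials table so reads can skip the join, which means every rename has to touch both places.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE parts (
        part_id INTEGER PRIMARY KEY,
        name    TEXT NOT NULL
    );
    -- Denormalized: part_name is duplicated here so reads can skip the join.
    CREATE TABLE bom_lines (
        product_id INTEGER NOT NULL,
        part_id    INTEGER NOT NULL REFERENCES parts(part_id),
        part_name  TEXT NOT NULL,
        quantity   INTEGER NOT NULL
    );
""")

def add_bom_line(product_id, part_id, quantity):
    """Insert a BOM line, copying the current part name at write time."""
    (name,) = conn.execute(
        "SELECT name FROM parts WHERE part_id = ?", (part_id,)
    ).fetchone()
    conn.execute(
        "INSERT INTO bom_lines VALUES (?, ?, ?, ?)",
        (product_id, part_id, name, quantity),
    )

def rename_part(part_id, new_name):
    """Rename a part and every duplicated copy of its name in one transaction."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "UPDATE parts SET name = ? WHERE part_id = ?", (new_name, part_id)
        )
        conn.execute(
            "UPDATE bom_lines SET part_name = ? WHERE part_id = ?",
            (new_name, part_id),
        )

conn.execute("INSERT INTO parts VALUES (1, 'bolt')")
add_bom_line(100, 1, 4)
rename_part(1, "hex bolt")

# The read needs no join, but only because rename_part kept the copy in sync.
rows = conn.execute(
    "SELECT part_name, quantity FROM bom_lines WHERE product_id = 100"
).fetchall()
print(rows)
```

If any code path updates `parts` without going through something like `rename_part`, the two copies drift apart. That is the cost you take on with the duplication.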

2. Go Relational

For example, you might choose MySQL or Postgres as your datastore, and use it for what it’s good at. Keep your data in rows & columns, so you can later query it in arbitrary ways. That’s the discipline up front and the benefit & beauty down the line.
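As a rough sketch of what "query it in arbitrary ways" buys you, here is the bill of materials from the question above kept in its own table. I'm using Python's built-in `sqlite3` so the example runs anywhere; the schema and sample data are assumptions for illustration, and the same SQL idea carries over to MySQL or Postgres.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE products (
        product_id INTEGER PRIMARY KEY,
        name       TEXT NOT NULL
    );
    CREATE TABLE parts (
        part_id         INTEGER PRIMARY KEY,
        name            TEXT NOT NULL,
        unit_cost_cents INTEGER NOT NULL
    );
    -- One row per (product, part) pair: the bill of materials.
    CREATE TABLE bom_lines (
        product_id INTEGER NOT NULL REFERENCES products(product_id),
        part_id    INTEGER NOT NULL REFERENCES parts(part_id),
        quantity   INTEGER NOT NULL,
        PRIMARY KEY (product_id, part_id)
    );
    INSERT INTO products VALUES (1, 'bike'), (2, 'scooter');
    INSERT INTO parts VALUES (10, 'wheel', 2500), (11, 'frame', 8000);
    INSERT INTO bom_lines VALUES (1, 10, 2), (1, 11, 1), (2, 10, 2);
""")

# Question 1: what does each product cost in parts?
cost_per_product = conn.execute("""
    SELECT p.name, SUM(b.quantity * pt.unit_cost_cents)
    FROM products p
    JOIN bom_lines b ON b.product_id = p.product_id
    JOIN parts pt    ON pt.part_id = b.part_id
    GROUP BY p.product_id, p.name
    ORDER BY p.name
""").fetchall()

# Question 2, asked later and never planned for: which products use a wheel?
products_using_wheel = [
    name for (name,) in conn.execute("""
        SELECT p.name
        FROM products p
        JOIN bom_lines b ON b.product_id = p.product_id
        JOIN parts pt    ON pt.part_id = b.part_id
        WHERE pt.name = 'wheel'
        ORDER BY p.name
    """)
]

print(cost_per_product)
print(products_using_wheel)
```

Neither question required changing the schema or reloading data. The second one is just another join over the same rows.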

I would shy away from the NoSQL add-ons that some relational vendors have introduced to compete with their newer database cousins. This starts to feel like a fashion contest after a while.

3. Go Distributed

If you’d like to go the NoSQL route, you could choose MongoDB, for example. You’ll gain advantages like distribution out of the box, eventual consistency, and easy coding & integration with applications.

The downside is you’ll have to rearrange the data and/or pipeline it to a relational database or warehouse (Redshift?) if & when you need arbitrary reports from it. For example, there may be new reports & ways of slicing & dicing the data that you can’t foresee right now.
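Here is a minimal sketch of what that rearranging step can look like. The documents below are hypothetical Python dicts standing in for what an application might store in a document database, and an in-memory SQLite table stands in for the warehouse. A real pipeline would read from the actual store and load into something like Redshift, but the flattening idea is the same.

```python
import sqlite3

# Hypothetical documents: each product embeds its bill of materials.
documents = [
    {"_id": 1, "name": "bike",
     "bom": [{"part": "wheel", "qty": 2}, {"part": "frame", "qty": 1}]},
    {"_id": 2, "name": "scooter",
     "bom": [{"part": "wheel", "qty": 2}]},
]

def flatten(docs):
    """Yield one (product_id, product_name, part, qty) row per embedded BOM entry."""
    for doc in docs:
        for line in doc.get("bom", []):
            yield (doc["_id"], doc["name"], line["part"], line["qty"])

# Stand-in for the reporting database or warehouse.
warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE bom_report ("
    "product_id INTEGER, product_name TEXT, part TEXT, qty INTEGER)"
)
warehouse.executemany("INSERT INTO bom_report VALUES (?, ?, ?, ?)", flatten(documents))

# Once the data is in rows, an unforeseen report is an ordinary GROUP BY.
part_totals = warehouse.execute(
    "SELECT part, SUM(qty) FROM bom_report GROUP BY part ORDER BY part"
).fetchall()
print(part_totals)
```

The catch is that `flatten` has to know the shape of the documents, so it becomes one more piece of code to maintain whenever that shape changes.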

4. Hazards of Muddy Models

Given those two options, I lean against the model of muddying the waters. My feeling is that features like JSON blobs in Postgres and the memcached plugin in MySQL are ones the db designers are adding to compete in the fashion show with the NoSQL offerings, and still keep you in their ecosystem. But those interfaces within the relational (legacy?) databases are often cumbersome and clunky compared to their NoSQL cousins like Mongo.
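To show where the muddy model starts to pinch, here is a small sketch of the "poor-man's Mongo" layout from the question: the bill of materials embedded as JSON in a column. To keep it portable I store the JSON as plain text in SQLite and parse it with Python's `json` module. That is an assumption of this sketch, not how Postgres does it. Postgres has native JSON types and operators that can push some of this work into the database. The table and helper names are made up for illustration.

```python
import json
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE products ("
    "product_id INTEGER PRIMARY KEY, name TEXT NOT NULL, bom_json TEXT NOT NULL)"
)
conn.executemany(
    "INSERT INTO products VALUES (?, ?, ?)",
    [
        (1, "bike", json.dumps([{"part": "wheel", "qty": 2},
                                {"part": "frame", "qty": 1}])),
        (2, "scooter", json.dumps([{"part": "wheel", "qty": 2}])),
    ],
)

def load_bom(product_id):
    """The easy case: fetch one product's whole BOM with a single ID lookup."""
    row = conn.execute(
        "SELECT bom_json FROM products WHERE product_id = ?", (product_id,)
    ).fetchone()
    return json.loads(row[0]) if row else None

def products_using(part_name):
    """The awkward case: a cross-product question scans and parses every blob."""
    names = []
    for name, bom_json in conn.execute(
        "SELECT name, bom_json FROM products ORDER BY name"
    ):
        if any(line["part"] == part_name for line in json.loads(bom_json)):
            names.append(name)
    return names

bike_bom = load_bom(1)
frame_users = products_using("frame")
print(bike_bom)
print(frame_users)
```

Reading one product by ID is as simple as it gets. The cross-product question is where the relational engine stops helping and the application takes over, unless you lean on the vendor-specific JSON features.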

5. Tradeoffs of Isolation

Daniel Abadi and Jose Faleiro published an interesting article on a closely related topic.

The upshot is that in databases you can choose *TWO* of these three characteristics: Fairness, Isolation & Throughput.

Conclusion

Relational databases sacrifice throughput for fairness & isolation. Distributed databases sacrifice isolation to bring you gains in throughput & horizontal scalability of writes. That’s a lot of big words to say one simple thing.
