Serverless and NoSQL: Crappy Names, Great Ideas

When I first heard the term NoSQL, I was mad as hell.

I’m a database guy, making my living with Microsoft SQL Server. I love it dearly, despite some faults here and there. Hearing the term “NoSQL” angered me because removing the SQL language wasn’t the solution to tough development problems.

At first, the NoSQL movement literally meant no SQL language: developers dumped all their data into key/value stores and did all joins in the application. Good for a laugh, but obviously that can’t last too long.
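To make "joins in the application" concrete, here's a rough sketch of what that looks like. Everything in it is made up for illustration: the plain dicts stand in for a key/value store, and the names (customers, orders, orders_with_customer_names) are hypothetical.

```python
# Hypothetical data: plain dicts stand in for two key/value "tables".
customers = {
    "c1": {"name": "Ada"},
    "c2": {"name": "Grace"},
}
orders = {
    "o1": {"customer_id": "c1", "total": 25.0},
    "o2": {"customer_id": "c2", "total": 40.0},
    "o3": {"customer_id": "c1", "total": 10.0},
}


def orders_with_customer_names(orders, customers):
    """Join orders to customers in application code, one lookup per order."""
    joined = []
    for order_id, order in sorted(orders.items()):
        # The "join": fetch the matching customer by key.
        customer = customers.get(order["customer_id"], {})
        joined.append({
            "order_id": order_id,
            "customer_name": customer.get("name"),
            "total": order["total"],
        })
    return joined


rows = orders_with_customer_names(orders, customers)
```

That loop is the work a relational engine does for you with a single JOIN clause, and against a real store every lookup could be a network round trip.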

NoSQL later got translated into Not Only SQL, which makes more sense because bigger applications usually need a mix of relational data, key-value stores, cache layers, etc. A lot of stuff gets dumped into the NoSQL bucket, and I’m a big fan of it. After all, not everything belongs in a relational database.

Today, in 2016, I don’t take the term NoSQL to be a personal insult, or an arrow to the heart of the technology I know and love. It’s just a term – albeit a crappy one – to describe a lot of great ideas.

The “serverless” name is NoSQL all over again.

I’d been following AWS Lambda with interest for a while. In theory, it’s simple: write code, upload it, and let it do the work.

Oh, there are most definitely servers, but you don’t manage:

  • What hardware the servers are running
  • How servers are networked together
  • Security and permissions on servers
  • Deployment of code across servers
  • The quantity of servers
  • The rate at which the servers are spun up & down

(Before you leave an angry comment, read those bullet points carefully. You still manage all of that stuff indirectly, you just don’t manage it on servers.)

You just pay Amazon or Google or Microsoft, and in theory, they do a decent job of all this for you. Do they do as good a job as you or I might do? Well, that’s a tricky question, and it depends on the amount of time you have available per day to do those things – and I’ve got a lot of stuff to do.

The term “serverless” refers to jobs, not apps.

Over the years, I’ve been a manager, then a sysadmin, then a developer, then a DBA. (I’m skipping stuff too – it’s been a long, meandering ride.) I love so many of those roles so much – I really enjoy unboxing server parts and putting ’em together.

But today, I’m a small business owner, and I wanna get some fun stuff to market as quickly and inexpensively as I can. There’s a lot of work involved:

  1. Funding
  2. Designing
  3. Building
  4. Managing
  5. Supporting
  6. Marketing

Serverless architecture does not mean a reduction in overall costs.

It just means a change in allocation. You spend less time managing servers, and more time managing processes. If anything, the building and supporting parts skyrocket right now because serverless platforms and tools involve tougher hiring and training. There’s no free lunch, and everything’s a tradeoff.

Bringing stuff to life is a really fun game that involves a few questions:

  • What do you want to bring to life?
  • How much money & time do you think each of those steps will take?
  • How much money do you need to squirrel away in order to do it?
  • What shortcuts would you be willing to take to go live faster?

For example, how could I build something asynchronous where the server response times don’t matter? Could I take a task that used to take human beings hours, do it asynchronously, and deliver it via email in minutes, and seem blazing fast in comparison?
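Here's a minimal sketch of that shape, runnable on a laptop. It assumes a lot: an in-process queue stands in for a hosted message queue, a list stands in for an email service, and submit, analyze, and send_email are hypothetical names, not any vendor's API.

```python
import queue
import threading

jobs = queue.Queue()  # stand-in for a hosted message queue
outbox = []           # stand-in for an email service


def send_email(to, body):
    """Hypothetical mailer: records the message instead of sending it."""
    outbox.append((to, body))


def submit(email, text):
    """Accept the request right away; the real work happens later."""
    jobs.put({"email": email, "text": text})
    return "queued"


def analyze(text):
    """Placeholder for the slow task a human used to do by hand."""
    return f"{len(text.split())} words"


def worker():
    """Pull jobs off the queue and email each result when it is ready."""
    while True:
        job = jobs.get()
        if job is None:  # sentinel: stop the worker
            jobs.task_done()
            break
        send_email(job["email"], analyze(job["text"]))
        jobs.task_done()


thread = threading.Thread(target=worker, daemon=True)
thread.start()
status = submit("user@example.com", "slow report goes here")
jobs.put(None)  # tell the worker to stop once the queue drains
jobs.join()
thread.join()
```

The user only waits for submit to return, so nobody notices if the worker takes a few seconds to wake up.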

Sure, in a perfect world, I’d get VC funding for everything, build it on the Microsoft stack that I know and love, hosted on real physical servers in my own data center. It’s fun building a little empire of flashing lights and whirring fans. I’m not being sarcastic there: I really enjoy that.

But as a small business owner, I can’t afford to do that, and I have to get creative. Serverless architecture isn’t the solution – it’s just a part of some solutions. Like NoSQL, you have to understand where it makes sense.

The Pros and Cons of Serverless Architecture

Today’s serverless architecture design is new. Really, really new. (Yeah, yeah, Grandpa, you could argue that your mainframe apps were serverless, but you’re missing the point. Go back to yelling at the cloud.)

That immaturity has a lot of drawbacks by itself:

  • Learning serverless is hard – the awesome list of serverless resources is a good place to start
  • Hiring is nearly impossible – the tech is so new that few people know it yet, and those who do are expensive
  • Getting help is tough – there are hardly any questions & answers on StackOverflow, for example (AWS Lambda, Windows Azure Service Fabric)
  • Best practices don’t exist yet
  • If it’s down, it’s just down – it’s outside of your control
  • Vendor lock-in – each vendor implements it differently right now, and porting code between Amazon and Microsoft would be very expensive

That last one is ugly because in theory, you’re worried about any vendor screwing it up. They could jack up prices, make a breaking change, or just deprecate the whole platform.

The next major drawback is single-transaction performance. Today’s serverless platforms have much higher latency – for example, if your function hasn’t run recently, AWS Lambda has to start up a container for it. Forget running an e-commerce site on this – after a second or two, Google-referred users will just hit the back button and try someone else’s store.
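A small sketch of why the first call hurts more than later ones, assuming the usual Python handler shape of handler(event, context) and that a warm container keeps the loaded module around between calls. The counter and timestamp are illustrative names of my own, and calling the function twice locally only imitates what the platform does.

```python
import time

# Module-level code runs once, when a fresh container loads this file.
# That one-time setup is the cold-start cost.
_loaded_at = time.time()
_invocations = 0


def handler(event, context):
    """Report whether this call was the first one in its container."""
    global _invocations
    _invocations += 1
    return {
        "cold_start": _invocations == 1,
        "container_age_seconds": round(time.time() - _loaded_at, 3),
    }


# Local imitation: two calls against the same loaded module.
first = handler({}, None)
second = handler({}, None)
```

Only the first call pays for the setup. If your function sits idle long enough for its container to be reclaimed, the next user pays it again.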

The cons above add up to one thing: if you’re a midsize profitable company, building a traditional application or web site, you should probably not use serverless design. Your application will be slower to build, slower to access, and harder to troubleshoot.

But if you go in knowing those drawbacks, the advantages can make it a good fit for a few types of applications, like the ones we’re building at the moment.

The Advantages of Using Serverless Architecture

Someone else manages uptime and that related staffing. When the serverless provider’s servers go down, they’re the ones who have to manage it, not me. For a non-critical app like the one we’re working on now, that makes perfect sense.

Hosting costs and performance scale linearly. If no one is using your app, you don’t pay. As more people use it, your costs go up. For the applications we’re working on now, we’re only projecting dozens of users per hour, which means hardware or VMs would be sitting around idle. If it catches on later, great – but even if only dozens of folks use it, we’re still quite happy with the costs.

We’re getting valuable experience. We have a lot of application & service ideas that all involve asynchronous access (queues), low performance requirements, and analyzing stored data. With one company, we picked an app that was the easiest one to bring to production first, and we’re testing whether serverless architecture will work for the rest of the ideas.

Choosing a Serverless Platform

Mid-2016 is a tough time to bet on a platform. Several smaller independent players got in before the big guns, and we won’t review the smaller folks here since we don’t have any experience with them. Focusing on the big ones:

Amazon Web Services offers Lambda, which charges by the number of times your code runs, plus a per-second cost for the memory you use. You can run Node.js, Python, and Java code as Lambda functions.
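To get a feel for that pricing model, here's a back-of-the-envelope sketch. The two default rates are placeholders I've assumed for illustration, not figures quoted here, so check the vendor's pricing page before trusting any number that comes out. It also ignores free tiers and every other service you'd pay for.

```python
def monthly_cost(invocations, avg_seconds, memory_gb,
                 price_per_million_requests=0.20,   # assumed rate, USD
                 price_per_gb_second=0.00001667):   # assumed rate, USD
    """Estimate a monthly bill: a per-request fee plus memory-time used."""
    request_cost = invocations / 1_000_000 * price_per_million_requests
    compute_cost = invocations * avg_seconds * memory_gb * price_per_gb_second
    return request_cost + compute_cost


# Roughly "dozens of users per hour": 50 runs an hour, around the clock,
# each assumed to take 1 second with 0.5 GB of memory.
runs_per_month = 50 * 24 * 30
estimate = monthly_cost(runs_per_month, avg_seconds=1.0, memory_gb=0.5)
```

Under those assumptions the estimate lands well under a dollar a month, and doubling the traffic simply doubles the bill. Zero traffic costs zero.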

Microsoft’s equivalent is Azure Functions, but it’s brand spankin’ new.

Microsoft is playing one heck of a game of catch-up in the cloud business. Given how new and undocumented AWS Lambda is, Microsoft stands a pretty good chance of being competitive in the serverless space.

Finally, Google Cloud Functions is only in alpha, and the documentation includes this terrifying disclaimer:

This is an Alpha release of Google Cloud Functions. This feature might be changed in backward-incompatible ways and is not recommended for production use. It is not subject to any SLA or deprecation policy.

Ouch. Your platform decision will come down to the serverless landscape at the time you’re making the decision, plus your reliance on the other cloud services provided by each vendor.