I’ve long held this opinion but I consistently get drowned out.
DevOps has a different meaning depending on whom you’re talking to; even definitions that appear similar can differ in nuanced but important ways.
All that “devops” as a job title has done is muddy responsibilities and give many folks the wrong impression of what an operations discipline should be.
There is also a lot of rewriting of history that gets thrown in, similar to how, when people talk about cloud, the only alternative presented is making CPUs by hand and building your own nuclear reactors. It’s the idea of what came before, not the reality, that people seem to be defensive of.
It’s honestly exhausting to discuss.
So instead I became a CTO so I could solve this mess properly. I don’t hire “devops”; I hire infra engineers, build engineers, release engineers, and backend engineers.
Roles simple enough that you already have a clue what they do, which is sort of the point of job titles.
Agreed. Any time I talk with someone about DevOps, we end up having to hash out the entire process to really know what either of us is actually talking about. Otherwise you get situations like:
"Yea the DevOps guy messed up the widget and nobody notic---"
"Wait, what is the DevOps guy doing even touching that widget.... what is even DevOps to you?"
What is your definition of Spaghetti Architecture? Netflix had a good SOA that enabled rapid development and had strong cut lines between services, with no way to access the data of a service without going through the service's API.
I think that's where most people go wrong. They put a bunch of services in front of a shared database, which means they don't have to go through a service's API to get to its data, and that's what breaks everything.
This is really important, and I've gotten a lot of quizzical looks when making this assertion over the years: data is owned by one and only one service. If two pieces of code assert ownership of it by mutating that data or looking past the public encapsulation of that data, then that code is the same service.
If you see a queue between two services, that is usually an indication that some ownership is being transferred (even briefly), and that is a critical point where you need introspection and observation.
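A minimal sketch of that ownership rule, with hypothetical service and table names: the orders service is the only code that touches its table, and every other service goes through its API (here modeled as plain method calls standing in for HTTP).

```python
import sqlite3

class OrdersService:
    """Sole owner of the `orders` table; all access goes through these methods."""
    def __init__(self):
        self._db = sqlite3.connect(":memory:")  # private to this service
        self._db.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT)")

    def create_order(self) -> int:
        cur = self._db.execute("INSERT INTO orders (status) VALUES ('open')")
        self._db.commit()
        return cur.lastrowid

    def get_status(self, order_id: int) -> str:
        row = self._db.execute(
            "SELECT status FROM orders WHERE id = ?", (order_id,)).fetchone()
        return row[0]

class BillingService:
    """A separate service: it never opens the orders database itself."""
    def __init__(self, orders_api: OrdersService):
        self._orders = orders_api  # stand-in for an HTTP client

    def can_invoice(self, order_id: int) -> bool:
        # Goes through the owning service's API, never its tables.
        return self._orders.get_status(order_id) == "open"

orders = OrdersService()
billing = BillingService(orders)
oid = orders.create_order()
print(billing.can_invoice(oid))  # → True
```

If BillingService opened the same database and mutated `orders` directly, then by the rule above the two would really be one service.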
> An important rule for microservices architecture is that each microservice must own its domain data and logic. Just as a full application owns its logic and data, so must each microservice own its logic and data under an autonomous lifecycle, with independent deployment per microservice.
The big issue I've seen is that this takes a lot of work, so people cut corners. The problem, of course, is that when you cut corners with microservices and rely on a shared database, for instance, you're suddenly dealing with 40% of the costs of microservices and 0% of the benefits.
That's completely untrue and glosses over the very real costs of transaction management in such an environment.
Using a shared database allows you to punt a lot of that complexity to a system that's been specifically designed for it and has been working well for probably 20+ years.
Too many people think microservices don't have their own severe downsides. The likes of Netflix, Google, et al. can afford to pay people whose entire job is to manage the complexities of these approaches, complexities that flat out don't exist in other scenarios.
But it's a hell of a lot simpler to use a single database if you can get away with it.
A shared database can be a reasonable microservice boundary, especially when using database-as-queue or database-as-microservice. In the former, services A, B, and C can insert and query but only D can update, i.e. any service can create a work order but only D can mark it completed. In the latter, nobody can read or write tables directly, and all access is done through stored procedures.
I don't recommend either, and it's still a code smell, but with clear definitions they can work.
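For what it's worth, the database-as-queue variant described above (any service may insert and query, only D may update) might look roughly like this; the schema and names are made up for illustration:

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE work_orders (id INTEGER PRIMARY KEY, payload TEXT, done INTEGER DEFAULT 0)")

def create_work_order(payload: str) -> int:
    """Services A, B, and C may all insert and query work orders..."""
    cur = db.execute("INSERT INTO work_orders (payload) VALUES (?)", (payload,))
    db.commit()
    return cur.lastrowid

def complete_next():
    """...but only service D runs this, so only D ever updates a row."""
    row = db.execute(
        "SELECT id FROM work_orders WHERE done = 0 ORDER BY id LIMIT 1").fetchone()
    if row is None:
        return None
    db.execute("UPDATE work_orders SET done = 1 WHERE id = ?", (row[0],))
    db.commit()
    return row[0]

create_work_order("resize image")
create_work_order("send email")
print(complete_next())  # → 1
```

The boundary only holds by convention here; a real setup would enforce it with per-service database credentials and GRANTs.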
The benefits of the microservice pattern are that you can build separate teams responsible for different business logic, and those teams can have their own deployment schedules. You don't get either of those benefits when A, B, and C have to coordinate on their work-order creation logic, and A, B, C, and D all need to be deployed at the same time any time the schema changes.
Btw, some of the confusion might be over what I mean by a shared database. I didn't mean two services sharing a limited set of tables, using the database as a RabbitMQ replacement; I meant sharing the backing database of the microservices. It sounds like we probably agree; I just wasn't very clear about what I meant by sharing a database.
(Ironically, I am literally using a database as a queue to share data between two services we are running in prod. I don't think of it as a microservice because it's not separate teams; it's just that our monolith is hosted on a platform that doesn't support certain libraries, so we had to roll those libraries onto a separate platform that does support them.)
I think we're in violent agreement. A database can be a queue, and that doesn't make the two things the same service, but it's easy to break those promises, and you need to be clear about the promise to begin with. If two "microservices" have to move in strict lockstep, they're not microservices; they're components of a larger service.
Yeah, services should be formed around the needs of data, and you shouldn't run multiple services in the same image/container just to split the code up. It shouldn't ever be about the code.
Amazon's SOA architecture, at least, all started out as pulling bits of data spread over hundreds of servers into services that could take advantage of caching.
I have seen the "data service" design pattern that tries to work around this: a CRUD microservice fronting a database, providing hundreds of APIs to read/write the data for 50 other microservices. If one of these CRUD microservices goes down, everything breaks.
I feel like it has its place in large organizations. But being a large organization means the odds of spaghetti approach 100% no matter what you do.
Has there been an architecture movement that formally embraces the spaghetti, aiming for peaceful coexistence? Detractors will certainly point at SOA and shout "that one!", but I mean one that openly admits..
I’m not sure this architecture has a name other than “Event Sourcing”, but I think a giant org-wide shared message bus, with all services freely pushing and pulling data from it, is the closest to embracing spaghetti.
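As a toy sketch of why such a bus embraces spaghetti (all names here are invented): nothing stops any service from subscribing to anything, so the coupling is real but invisible.

```python
from collections import defaultdict

class Bus:
    """Org-wide message bus: any service may publish or subscribe to any topic."""
    def __init__(self):
        self._subs = defaultdict(list)

    def subscribe(self, topic, handler):
        self._subs[topic].append(handler)

    def publish(self, topic, event):
        for handler in self._subs[topic]:
            handler(event)

bus = Bus()
seen = []
# Billing and shipping both hook the same event; a third team could quietly
# do the same tomorrow, which is exactly the spaghetti being embraced.
bus.subscribe("order.created", lambda e: seen.append(("billing", e["id"])))
bus.subscribe("order.created", lambda e: seen.append(("shipping", e["id"])))
bus.publish("order.created", {"id": 1})
print(seen)  # → [('billing', 1), ('shipping', 1)]
```

The publisher has no idea who consumes its events, which is both the flexibility and the danger of the pattern.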
There was an angry Ask HN post a while back about how terrible the "new guys" are that are out there.
It then complained that the new hire wrote a horrible authentication service.
I thought it was an intentionally absurd post about the expectations put on some rando new guy to write something important that they shouldn't have been working on alone... but they were serious.
This "devops" is also a dumping ground for anything not happening on a developer's computer which they should control but won't, because "that is devops' job". For some reason this happens a lot with Node projects.
"anything else not happening on a developers computer which they should control but wont"
Ok, I've seen this, but IME (24y in the industry, the last 6 as a consultant), in the vast majority of cases it's more like "things devs should control but CAN'T [because CI/CD etc. are siloed and owned jealously by an overburdened ops team unable or unwilling to facilitate self-service]".
Some of the 'jealousy' may also come from bad experiences.
It only takes a few instances of people royally messing stuff up in production for their deployment rights to be stripped away – and other groups catch the fallout too, as new "procedures" get implemented.
Also, some developers do not seem to know what kind of impact a production issue can have.
Having something not work during the development cycle is annoying for one person or a team. Having something break in production usually means people get paged, and (depending on what you do) the resulting issues can have major societal as well as economic impact.
In the ops world, this is especially visible as you go further down the stack (from applications down to the network).
Network architecture moves at a toad's pace compared to web development, which is a good thing, considering that breaking the network will usually break every other system in an IT department/landscape.
"Safety rules are written in blood" applies to parts of the organisation too.
Sure, there are some overbearing procedure monkeys who really want a process on everything, but a lot of "protection" rules are there because something really bad (and expensive, financially or reputationally) has happened before.
>>the only alternative is to start making CPUs by hand
Agreed. For some applications the cloud difference is significant; for many (most?) others, though, "Cloud" is just a rebrand of "Hosted". And even for the more cloudy offerings (while I'm in a very specific and different part of IBM), some of the old-timers/architects/powers-that-be keep trying to explain "We had that in 1969!!!" :-D
Agreed also on the rewriting of history when it comes to development/support/operations models. My dad was an IT director, and he chuckles when I talk to him about "new and exciting paradigms", which he of course sees as coming full circle to what they had in the '70s and '80s :)
> My dad has been IT director and he chuckles when I talk to him about "new and exciting paradigms" which he of course sees as turning a circle to what they had in 70's and 80's :)
As someone with 20+ years in IT, I agree - a lot of these "new and exciting paradigms" are not new at all.
My personal favourite is how many large multinationals are now building in-house clouds.
WTF is the difference between an "in-house cloud" and a shared-use datacenter from the 1990's?
The interface to the shared-use datacenter, if you're lucky, is a spreadsheet that declares the static resources you own and a remote hands guy that can tackle things beyond the capabilities of your remote KVM. If you need more capacity you need to work with the datacenter folks to order physical machines that might show up in a few months.
The interface to the in-house cloud is an API. In most instances, developers are completely abstracted away from the physical infrastructure and don't need to take a lock on some human in the datacenter to get their work done.
I still think that's a bit of "rewriting the history". E.g., VMware enabled fast self-provisioned VMs and nobody called them cloud. Heck, in 1999 or so, while at university and way before I was an IBMer, I could sign up for some IBM development program as a student, get an account, and provision Linux VMs on a mainframe.
Not to say your scenario isn't valid and real, but I live that scenario every day today with in-house cloud and virtualization too: it takes months of approvals and solutioning and security assessment and network engineering and procurement and costing and whatnot... to deploy a Windows VM.
> Vmware enabled fast self-provisioned VMs and nobody called them cloud
Because it’s missing all the glue. I need to self-service the VMs, the database, the load balancer, the dns records, certs, and hook them all up so I can receive production traffic all via an API I could theoretically do in Terraform.
> WTF is the difference between an "in-house cloud" and a shared-use datacenter from the 1990's?
An in-house cloud sounds like the mainframe installed in the raised-floor computer room at the school district office I worked at in the late '90s; of course, the 12-foot-long Unisys mainframe was replaced with a Unisys 4U Pentium Pro box pretending to be a mainframe, and then there was a lot of extra floor space.
If you're running your own 'cloud' in a (shared) colo, I dunno that that's really in-house. I guess it's still 'private cloud' though.
The only possible value over the last iteration is if they cleaned house and the new team manages to be more permissive than the old one. I'd much rather have on-prem, but the sad fact is that for cloud I just need a signed check. For on-prem I also need buy-in from other divisions before I can even start experimenting with a new service.
I still have the exact same problem with respect to never having exactly the ratio of CPU to memory that would make my app happy.
I had a couple of telco customers that ran their own internal cloud, and I’d say the main difference is that it’s way more expensive than the public cloud, heaps less flexible, and when you need compute it has to go through an approval process.
I mean … sometimes I shake my head in wonder, and other times I just shake my head.
An in-house cloud will just be a bunch of commodity servers running a hypervisor that exposes an API for automating the provisioning of infrastructure.
I am guessing that in the 80s you weren't writing Infrastructure as Code to define exactly what resources you needed for your software, having it all set up automatically, and so on.
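The core of the Infrastructure as Code idea can be sketched in a few lines (resource names are invented): declare the desired state as data, then compute the delta against what actually exists, which is roughly what Terraform-style tools automate.

```python
# Desired state is declared as data; the tool diffs it against reality and
# derives the provisioning actions, instead of a human clicking or emailing.
desired = {"web-1": {"cpus": 2}, "web-2": {"cpus": 2}, "db-1": {"cpus": 8}}
actual = {"web-1": {"cpus": 2}, "db-1": {"cpus": 4}}

to_create = sorted(set(desired) - set(actual))   # in desired, not yet real
to_update = sorted(n for n in desired
                   if n in actual and desired[n] != actual[n])
to_delete = sorted(set(actual) - set(desired))   # real, no longer wanted

print(to_create, to_update, to_delete)  # → ['web-2'] ['db-1'] []
```

The 1980s datacenter had the machines; what it lacked was this declare-and-reconcile loop as a routine, self-service workflow.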
The two of you have validated my very existence as a coder. I keep jokingly telling everyone that we’re often going around in circles. No one ever believes me (lol!)
I don't understand how seemingly 90% of developers can't do anything outside their narrow scope of experience. Writing a shell script? Throw a temper tantrum; that's devops' job! Have to work with legacy code? I can't believe this!
Especially seniors and beyond. They force leetcode interviews they somehow pass or are grandfathered through and gatekeep "trash devs" by slamming gotchas about how the whiteboarded code discussion doesn't technically compile and how "returnAverage()" isn't a method ("did you know you have to write methods before they work? What does returnAverage() supposedly even do? Return a random character in the alphabet?")
I get excited about working in different tech areas. I'm exceptional at fixing other's bugs and maintaining code. Absolutely nobody seems to be hiring for this though. It's all about college exam trivia and leetcode.
When I do get into these companies, I have to work with people throwing around casual racism with HR joining in and f-slurs like it's corporate 4chan. I ask if anyone can help look at a critical bug I discovered, and nobody speaks up. Even when it turns out it was their last code change that caused it and they are the sole master expert in this area, and they were just working on organizing their desktop instead.
"I can try to look at the SQL problem, since nobody else spoke up." Then I'm explaining the basics of SQL to some guy sitting around blank faced and it turns out he was hired because of his 10+ years of expertise in SQL. His whole job is to tackle problems like that.
Meanwhile I'm out here looking for jobs when panic cuts happen and it takes forever because I'm drained from all the gotchas and gatekeeping in these interviews.
There's no way I could live with myself if I put this experience onto others. I lose sleep when I fail to call out someone talking over a quiet person on my team. I've never seen anybody else stand up for anybody, however.
I learned SQL because my coworker was stuck on a bug she couldn't figure out and 'knew' I'd be able to solve her problem.
I just kept asking questions until I found the one she missed.
I also ended up becoming a bespoke VxWorks admin because the guy who volunteered was never available, and someone asserted my code wasn't working well on VxWorks, so of course I had to know enough to do benchmarks. I fixed a few problems but the real issue was the hard drive wasn't doing DMA due to the kernel not recognizing the processor revision number.
Somewhere between those two events I realized that if my part of the project is great but the whole project is on fire, nobody cares about my stuff. I don't get points for being right and the team being wrong. I mean, I do for some people, but I still feel bad at the end of the project, and those are the 'points' I have to live with the most.
The way that played out on the latter project is that once we got our shit together, we started work-stealing from our peer orgs. Volunteering to carve off little pieces of interface between us and "take care of" this bit of data handling here and that one over there. I came to realize that the org subconsciously knew this, and any past success they had was due to self-organization and collective work-stealing.
Throw people at it, turn a blind eye to precise mission statements, and hope for the best.
You hit the nail on the head: at many places, the hiring process seems to be great at blocking people who are great at solving real-world problems but aren't as good at whiteboard leet-coding.
If you truly enjoy this part of the job, you can go for a freelance career: build a network and a reputation as the person you call to solve the hard stuff, and do mostly that. No leetcode interviews required...
I like to build up my teams with people that act and think like you. Sometimes I need a sniper and when I do I'll get one. But my teams need to be excited about building and running the systems that we make.
To be fair, shell scripts are pretty horrible to write and maintain from a programmer's point of view. They're good in that they're portable, but shell is not a good programming language. I don't object to writing them, but I'd rather there were an alternative.
Anyway, it sounds like you've worked at some really shitty companies, with some really shitty people.
Sounds like medium or bigger sized company office politics.
At smaller companies the title tends to be only a title, and many people are basically full-stack and know a bit of everything, trading hats to get jobs done.
> I don’t hire devops, I hire infra engineers, build engineers, release engineers, and backend engineers.
This is a great way to do it. There seems to be a correlation between unnecessary product complexity and unnecessary corporate complexity. Being direct about roles goes a long way towards simplifying corporate complexity.
> There seems to be a correlation between unnecessary product complexity and unnecessary corporate complexity.
That's Conway's law.
"Any organization that designs a system (defined broadly) will produce a design whose structure is a copy of the organization's communication structure."
Most of the useful insights in the Devops movement can be derived from knowing Conway's law and thinking about organizational patterns holistically. As usual the success stories get cargo culted by others without understanding the principles and thought processes that led there, and usually fail because every organization is different and has different needs. Change is hard. This will happen with every shiny new movement like Devops or Agile.
I'm sure we're due for a new one soon which will follow the same path, since people seem to be admitting that Devops has problems on here more and more. Will that cause deeper reflection? Probably not for many orgs, because soon some consultants will dream up a new brand of silver bullet to slay the immortal monster of organizational dysfunction and that's much more exciting.
Conway's Law is sorely overlooked. It really does have broad applicability in the industry, and sometimes you can use it to evaluate vendors better (read: this product is a bit haphazard...the vendor probably is also. Warning.)
Roles are muddy because life is muddy...and because people can do more than one thing at a time. Also because save for a few roles, businesses rarely need people 100% allocated to one role or task. Responsibilities are nice, and job titles mean people mostly do x or y.
My favorite example is that of a restaurant. You hire waiters, dishwashers, bartenders, line cooks, prep cooks, hosts and hostesses, and managers. Your waiters might be 100% focused on taking orders during prime dining hours, but at the end of the night they help clean up, assuming other roles. Your bartender might be busy making drinks, but they take a table or two if they can. Your prep cook might finish the prep early and come onto the line to help out. Your manager focuses on expediting everything and keeping the ship running smoothly, but can also fill holes at times.
Communication with the other roles you work with is key in that case, though; otherwise you'll have chaos, e.g. because the waiters don't know that the bartender took the table, or because the cooks need things the next morning that aren't where they're supposed to be.
You wouldn’t expect a bartender to wait on every person, prepare all their drinks, cook all their food, and clean up all their tables.
But that’s kind of what we keep trying to do, because we have new bartending tools that mean you need fewer bartenders, and because we claim communication is harder than doing everything in one role.
Then, when that person invariably gets overloaded, we hire more bartenders and complain they’re not good enough at cooking your specific dish.
That depends on the complexity of the development lifecycle, doesn’t it? If a single person can maintain it, having three different roles would mean unnecessary corporate complexity.
Yep, this has been the hardest fight in our "culture shift towards devops" - getting the idea and wording right.
After a lot of discussion, we mostly realized that we get the most value if we define more specific operational roles. We now have the ideas of infra-ops and product-ops: infra-ops provides a deployment platform (a container runtime and persistence), while product-ops is responsible for deploying and running the different products from development on this platform.
And this is giving people good ideas. Some products are very simple, without harsh requirements; in those cases, one of the backend devs just takes over the product-ops role by setting up a deployment pipeline, a job, and some migration handling. Other products are bigger. For example, we provide some of our systems as essentially managed systems to individual tenants and customers. In such a case, part of the product operations sits with the product team (writing jobs, releasing containers and artifacts) and other parts of the product-ops role take place in the managed-services consultancy. These products are currently looking at wordings to split up the product-ops role according to their needs in their context. And that's totally fine: the infra-ops role is also divided depending on the system the engineer is working on. A Postgres admin is an infra operator, for example.
And all of this results in a devops oriented culture of cooperating across team boundaries through automated processes.
Yerp. As a former *nix admin, around the time that "DevOps" became a term, I read it as "system administrator, who also now has to fix developer code." Which, honestly, was already part of the job at a lot of places I've worked, owing to the age-old problem of people who "test" something on their weird personal desktop environment, then hand it off and shrug off any questions with "well, it works on my desktop". That said, the part I found offensive was that I was doing two jobs but only getting paid for one.
Then came DevSecOps. Around that time I switched to just Security. While I miss the ability to make and push changes to hundreds of thousands of machines to get things done, I don't miss the pressure or blame that automatically got lumped onto sysadmin shoulders every time anything went wrong, or the complete lack of appreciation for all the times nothing went wrong, times that were entirely the result of a tireless, efficient systems administration team.
> I switched to just Security, and while I miss the ability to make and push changes to hundreds of thousands of machines to get things done, I don't miss any of the pressure or blame that automatically got lumped onto the sysadmin shoulders every time anything went wrong, and the complete lack of appreciation of all the times nothing went wrong, that were entirely the result of a tireless, efficient systems administration team.
This reminds me of a joke my networking colleagues used to tell the new hires.
Welcome to the networking team, where we are responsible for the network, as well as anything that runs on the network, because developers never bothered to pay attention in their networking class.
It's tough, until you stop caring. And obviously, hitting the "don't care" point is pretty sub-optimal, so far as being able to muster any enthusiasm necessary to keep doing the job.
Yeah, I've often seen "DevOps" described as "System Administration, but automating everything you can with scripting or whatever", to which my first thought was "WTF do you think Sysadmins do, if not exactly that?!?"
It's like they'd never heard a BOFH tell someone, "Go away now, or I will replace you with a small and uncomplicated shell script."
Glad someone said it. I mean, all the CI/CD pipelines are just what used to be bash/perl/awk/sed/python/ruby (fill in the blank) scripts back in the *nix/BSD days. One problem as IT/software development became more mainstream is that everything got another layer of obfuscation, especially in corporate cultures and among sales, recruiters, and management. "Development Operations pipeline" sounds sooo much better than "bash scripts".
A bash script alone isn't enough if there's no automated process that runs it every time developers make changes to your main/release trunk. If you still run it manually to verify your build, it's not CI. If you're still doing manual release deployment, it's not CD.
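A toy illustration of that distinction (everything here is deliberately simplified): the check itself is just a script anyone could run by hand; it only becomes CI once automation reruns it on every change with no human in the loop.

```python
import hashlib

def run_checks(source: str) -> bool:
    """Stand-in for the build/test script a developer could also run manually."""
    return "bug" not in source

# The "CI" part is this loop, not the script: every new revision is detected
# and checked automatically, without anyone deciding to run the build.
last_seen = None
results = []
for revision in ["print('v1')", "print('v1')", "print('bug')"]:
    h = hashlib.sha256(revision.encode()).hexdigest()
    if h != last_seen:  # only rebuild when the tree actually changed
        results.append(run_checks(revision))
        last_seen = h

print(results)  # → [True, False]
```

The unchanged middle revision triggers no build; the broken third one is caught immediately rather than whenever someone remembers to verify by hand.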
To grossly oversimplify, there are essentially two views of DevOps.
1.) Probably the traditional view: breaking down the walls between devs and ops, developers carrying pagers, etc. I.e., at least in an idealized world, there are no devs and ops, only DevOps.
2.) As you suggest (and which probably more closely matches how "DevOps" works especially in larger organizations), an internal (or external) operations team provides a platform that developers can use. Developers are still going to be exposed to some operational details, but a lot of them are abstracted away.
3) The way I first heard it described years ago, bringing developer practices into ops: version control, testing, code reviews, continuous integration, etc
> bringing developer practices into ops: version control, testing, code reviews, continuous integration, etc
In my opinion, this last subset just doesn't work well for a lot of branches of operations.
Sure, it might work if you run stuff in AWS and have an operations team managing that, but what if your team is responsible for things like storage arrays or networking equipment?
Doing continuous integration and code review on some of these is hard, if not impossible.
> Your devs shouldn't be required to do all of the actual ops stuff. They need access to an ops platform.
This becomes a lot more apparent once you're dealing with B2B customers and their nonfunctional requirements.
We've had situations in-house where devs built some simple microservice to handle a connection to a customer's BI/DWH system: just two days of Spring Boot chugging to wrangle APIs around without state, nothing bad. But then the customer started blasting that team with SLAs and backup questions like RTO, RPO, retention, regulatory adherence for retention, and the whole stack of IT security down to physical access control. That poor PO was caught like a deer in the headlights.
This is some of the operational responsibility we're taking over for our dev teams. We provide the persistence, and that includes a defined backup-and-recovery strategy, a security strategy, and such. It also includes experience in dealing with these insane questionnaires and correctly pricing absurd requirements for custom backup solutions. In fact, "we will have to schedule a discussion with ops about this" has ended quite a few of these requirements with "oh... it's not that important". Intimidation with long job titles and appeals to external authority do work.
DevOps is part of my job description. I was fuzzy on what it meant before; after reading this article, I have no idea what it means. As a general term it's far too vague, when in practice it seems to just mean that you'll split your time between writing regular application code and writing infrastructure as code. Oh, and you get to be on-call.
I'm working on an RFP response that has the word "DevSecOps" sprinkled around here and there. That's an even more subjective and ambiguous term than devops :/
I'm convinced the use of the term "DevOps" continues only because of its incredible polymorphic vagueness...
Two intelligent, sincere, experienced tech people can discuss devops, in detail, for a good length of time, and still be talking right past each other without really noticing.
It's OK to have a build engineer doing release engineering if it's simple enough, but doing build engineering properly (source control, artifact control, shared caches, dependency management of the compiler outside of the standard toolchain) is quite a large job.
Release engineering itself gets complicated when you take into consideration the different target platforms: most games get published on PlayStation, Xbox, and one of a handful of PC platforms (notably Steam), plus the online systems and of course the development environments and internal release systems, which are ubiquitous.
Build Engineering usually is the step between developers and QA, and release engineering is the bit after QA.
Both interact a lot with QA so could be rolled into the same role.
Do those different breeds of engineers work in one team or in separate organizational branches?
The problem the term 'devops' is addressing, the way I understand it, isn't insufficient individual jack-of-all-tradesness, but too much separation between (internal) organisations. It should be perfectly possible to build a "devops" team from deeply specialized experts, and turning it into a job description seems quite a stretch to me.
But you might actually want some jack-of-all-trades types nonetheless, because those isolated organizations weren't completely without merit: they are good at solving the problem affectionately called the bus factor. A single ops guy on a devops team is a single point of failure, and the mitigation of keeping some of the not-so-ops peers sufficiently in the loop depends much more on organic motivation than its counterpart in a specialist company branch, where substitutability would be much easier to ensure with formal process.
> Do those different breeds of engineers work in one team or in separate organizational branches?
It doesn’t matter as long as communication is open.
Proper ops is all about mitigating the bus factor. Procedures shouldn't depend on individual people; they depend on being good at communicating.
If your ops person got hit by a car, I would expect everything to continue running, and I would also expect to find clear documentation. This is mandatory.
DevOps to me is all about bridging what used to be a huge gap between operations and development. Developers need to know the OWASP Top 10; one cannot simply say "it works locally". Ops people need to understand, at a high level at minimum, how your tools and programs work.
I get called an "Infrastructure Engineer" sometimes, I've also had the title of SRE and SWE. I don't really feel like any of these actually fit me.
I do work on Cloud Infrastructure at times, but honestly it's the smallest part of what I do. It's usually addressed in the architectural designs and I have to touch it incrementally. What I do much more often is writing tools, daemons, and services for distributed systems. I end up calling myself a Distributed Systems Software Engineer to reflect the idea that what I work on is systems and software, and most of them are non-monoliths (from a systems perspective).
Do you have any thoughts on Systems Engineers or the title that I prefer to call myself?
In the games industry we have a role called “tools programmer”: a person who works on tooling to accelerate development. Sometimes these are editor extensions for a specialisation of a game (say, a tyre builder for a racing game), and sometimes they're broader (a distribution tool using libtorrent so that a new build of the game can be shipped worldwide very quickly).
There are people who might refer to you as a backend programmer (if the focus is services and daemons), or platform engineer (if the focus is developer velocity).
I don't know, every time I see a title split people start saying that's "X's job", you can't do that because it's "y's job", or I don't want to be involved in "Z".
You hire those engineers and then you have them implement and support Continuous Delivery for your business, yes? DevOps is a silly term, but one that companies seem to use for "infrastructure engineers who help us continuously deliver value to our customers, efficiently and pain-free". Give your people whatever title you feel comfortable with - but essentially they are "doing the DevOps". :P
> I hire infra engineers, build engineers, release engineers and: backend engineers.
I always thought DevOps the “function” just meant this, and being a DevOps engineer at a small company meant you did these with decreasing emphasis, where by the time you’re in the backend it’s just helping enforce logging, tracing, other observable components.
This has worked for me both in hiring and in being hired, and almost everyone I know understands it this way.
I think everyone can agree that for most complicated things there are many sides to the story that makes up the present state of the system. Taking a very opinionated stand on a complicated subject by itself negates the possibility of a mature and complete analysis. So the title of this article is unfortunate.
Some time ago, I was talking to an MBA/project manager type who had a completely different definition of what "devops" meant. To him it was some kind of project management approach.
We have a way to prostitute words in this field... agile, QA, devops.
I agree with this but I’d add that it’s also important that the roles aren’t overly siloed. A front end engineer, for example, should be able to and even encouraged to look at backend code, and even submit PRs for it.
And more importantly, the backend engineers should be encouraged to be grateful for the PRs (even if it’s not acceptable for some reason). The more eyes on the code, the better.
I think developers naturally tend to specialise (at least for a time) but ultimately we all need to understand and contribute to code regardless if it’s front end, backend, build, test, …
>DevOps has different meaning depending on who you’re talking to
1000% this.
I think it's an instance of how the conversation in software always talks about solutions as if they're end-all-be-all answers, rather than explain a problem in specifics and why that made the solution the right answer at the time.
Then that "solution" becomes a buzzword version of itself, and popular buzzwords are tools leaders use to overcome institutional inertia. Which is a good thing - inertia needs overcoming. But what comes after misses the insight and understanding that turned into the buzzword in the first place.
I am interviewing right now and my previous experience includes devops roles and SRE roles, so I get contacted for both by recruiters. After hearing about the responsibilities and examples for these roles I can only come to the conclusion that titles are a waste of time. Even the "level" of a title of Staff, Principal, Lead, etc are a waste of time. Just call the job what it is and if you can't decide on what it is at least to an 80% level, maybe you're asking too much for that job?
Yes, I am, the project is mostly in stealth mode but please feel free to drop me a CV:
jan [at] competition [dot] company
If you want more information on the project we're working on the site is https://rennsport.gg
We're building a hard-core racing simulator game with a backend which can persist car ownership in a way that feels authentic (i.e., not just tied to a single game).
> DevOps has different meaning depending on who you’re talking to, even some definitions that appear similar are different in nuanced but important ways.
This is always a dead giveaway that something is a buzzword
Same for REST. Sometimes when people use it, it just means JSON over HTTP requests; other times it's supposed to be some kind of architectural style.
Yes, at least when there's disagreement about what OOP means. You and I can disagree about it, but we can more easily answer "Does it match what Alan Kay wrote about?" with a yes or no.
I agree - I think it is much better to use titles like these, and with even a short JD describing what technology is used (much of which could be "devops" tooling for infra engineers), everything becomes much clearer.
>> I became CTO so I can solve this mess properly, I don’t hire devops, I hire infra engineers, build engineers, release engineers and: backend engineers.
This makes a lot of sense, but you need to realize that this is only a tiny part of a successful organization. There are many setups like the one you describe, across many industries, that have failed for reasons beyond the role definitions. It's not enough to establish the organization; you have to keep the behaviors in check over time, like when you hire a build engineer with different aspirations.
I have bewilderingly tried to discern why software development continues to grow more and more complex.
It wasn’t always like this. There was a time when we talked about languages and OSes and libraries as if they made a difference in how much you could get done with as few people and as little cognitive load as possible (the claims were very much overrated, but the point was we acted like it mattered).
And then it started ballooning. It seems to me that much of that coincides with a lot of new money being dumped into the economy, and software moving past necessary-evil-so-how-do-I-drive-down-my-costs and on to the gold rush where you must have a web app for your service that rivals a video game in complexity and visual finesse. It seems to me that as long as there is so much free money sloshing around venture capital funding, it was inevitable that the process of making software would complexify to soak up the extra cash. After all, you can only add so many levels and varieties of managers to add value. After that comes the variegation in roles of software development.
I like the Uncle Bob explanation: the number of developers has a doubling rate of ~5 years due to the constant entry of new developers [1]. Over time, the number of inexperienced developers far outweighs the experienced ones. Inexperienced developers naturally gravitate toward complexity because they don't know any better.
Couple that with a social drift toward hyper-specialization. That sort of hierarchy naturally creates minds that don't think at the systems level. This is the general theme of Buckminster Fuller's Operating Manual For Spaceship Earth: the shift toward hyper-specialization (or narrow focus) has long-term disastrous consequences.
When you couple inexperience with narrow focus, you get messes.
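Taken at face value, the doubling-rate claim pins down how much of the field is inexperienced at any given moment. A quick sketch (the 5-year figure is the claim's, not measured data):

```python
# If the developer population grows as N(t) = N0 * 2**(t / T) with
# doubling period T, the fraction of today's developers with fewer than
# y years of experience is 1 - N(t - y) / N(t) = 1 - 2**(-y / T).
def fraction_less_experienced(years, doubling_period=5):
    """Fraction of current developers with fewer than `years` of experience."""
    return 1 - 2 ** (-years / doubling_period)

# With a 5-year doubling period, half the industry always has under
# 5 years of experience, and three quarters has under 10.
print(fraction_less_experienced(5))   # 0.5
print(fraction_less_experienced(10))  # 0.75
```

Under that model the majority-inexperienced state isn't a phase that passes; it's a permanent property of an exponentially growing field.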
Part of this is to blame on the industry-standard thinking that because someone works for Company X, they're a competent, logical engineer (and should be granted authority/responsibility over essential products/projects). This is the downfall of "code tests" and "whiteboard coding": they don't evaluate systems-level thinking, so a developer with poor creativity and logic skills slips through the cracks because they're great at eking performance out of a function, which impresses the lollipop guild.
There are thresholds of effort for certain things at largely constant levels. E.g.: designing a new language, build tool, web application framework, or database all have some minimum effort that needs to be invested. In the past, there just weren't enough developers in the entire world to "overdo" these things, so there was a relatively small pool of languages, tools, and frameworks to choose from.
Now, individual corporations have armies of junior developers spitting out frameworks and query languages like a machine gun. There are so many now that you and I haven't even heard of 99% of them!
As the number of available developers grows exponentially, so does their capacity to "reinvent wheels". Their ignorance of existing wheels they could be reusing grows exponentially too. The result is an exponentially exploding set of ad-hoc, incompatible systems.
In my recent semi-DevOps, semi-Cloud-Engineer role I've come across an absolutely bewildering array of tools even when I've generally restricted things to one cloud and one language's ecosystem. Heaven help you if your project has multiple languages!
Any organization that designs a system (defined broadly) will produce a design whose structure is a copy of the organization's communication structure.[2][3]
— Melvin E. Conway
The problem isn't necessarily software itself, but how we organize people (more than 2 or 3), how we communicate, how we mirror operations, expectations, etc. in software.
Scaling sustainable software feels like an "unsolved" problem because society hasn't figured out how to organize better.
Small teams can be super productive, but unless the people in them are there for the full run of a product, you'll end up with a problem: taking over the work of one developer will take more than one developer.
I'm sure - but haven't witnessed this myself yet, so take it with a grain of salt - that if one productive developer builds an application in a year, it needs a team of 5-10 to continue development at a similar level, and even then it may not make it.
Companies need to focus on keeping software as simple as possible, well documented, and transferable. Unfortunately this also means curbing people's enthusiasm.
> I have bewilderingly tried to discern why software development continues to grow more and more complex.
It's because of growth. Software development today is both simpler and way more complex than before, and that's entirely due to growth in the sector. In many ways it's simpler than it ever was: I'm writing an API using Lambda/API Gateway in AWS and it blows me away how quickly I am able to get services stood up and configure my API. But in another way it's so much more complicated - for this same API I spent 2-3 weeks experimenting and researching IAM roles and how all my AWS resources would interact with one another.
I would say the floor of software development has become way simpler, deploying a site to Netlify is way easier than dealing with webservers of the past. But while the floor is lower than ever before, the ceiling is in the stratosphere, with extremely complex systems that you can string together in the "Public Cloud".
> writing an API using Lambda/API Gateway in AWS and it blows me away how quickly I am able to get services stood up and configure my API
> for this same API I spent 2-3 weeks experimenting and researching IAM roles and how all my AWS resources would interact with one another.
same experience here, and trying to understand/test un/poorly documented AWS behavior with public/private APIs and EC2 resources has been a huge time sink :(
My theory is we need to add "software history" to computer science education programs. How many developers whose careers began after 2000 have ever hand-written a Makefile, or know the power of one? I've worked at research labs and major animation studios where, back in the '90s, the entire infrastructure was fully automated through Makefiles, and it purred like a well-fed cat. There are hundreds of thousands of forgotten, perfectly fine software tools, ignored simply because our industry has absolutely no respect for its own history or lessons painfully learned, beyond pulling up disparaging stories for new marketing angles.
Very interesting comment. What older software should devs know about?
My problem with Make and similar tools is I don't want to learn more ad-hoc syntax to accomplish something nearly trivial. But I don't know if there's any alternative.
I guess as far as other, older software goes: I'd need to know your interests or field. How I write software has completely changed at least 5 times, and now I'm deep-diving on my 6th (docker/k8s) to deliver the same things I was delivering back in the 80's with software development process one. Actually, add 3 more complete re-wirings of how I write code - I forgot working on game consoles: every new console is a complete start from zero in how to write, debug, package, and ship software - software that is not that much different from what was delivered with the last platform and dev process.
Original Make was butt simple: any line that begins without a tab is a filename followed by a list of other file dependencies - files whose modified dates must be older than the file at the start of the line. Then any lines immediately beneath it that start with a tab character are the shell commands, executed in order, that bring the first file up to date against its dependencies. That was it; that was the entire original Makefile syntax. Then software vendors started adding extensions...
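A complete example in that original style might look like this (file names hypothetical). The untabbed lines declare the dependencies; the tab-indented lines are the commands that rebuild a stale target:

```makefile
# Rebuild 'hello' whenever it is older than hello.o;
# rebuild hello.o whenever it is older than hello.c.
hello: hello.o
	cc -o hello hello.o

hello.o: hello.c
	cc -c hello.c
```

Running `make hello` walks the dependency chain and executes only the commands for files that are out of date.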
While make is very powerful, it's certainly not a tool that is nice to use. There's a reason people built layers and layers of complexity on top of it.
The way to define options is interesting.
It's hard to see the benefits from the docs in the README, though, given that it's yet another syntax to learn.
I suggest adding a complete example of common use cases, like running the unit tests or building targets while setting the correct options depending on the host (Linux vs Windows vs macOS).
It has blown me away too. There was a time when it seemed like the default "devops" strategy for startups was to use Heroku, and developers could spend all their time working on code. I worked at a startup a few years ago. We had 2 devs and one devops guy building out Kubernetes and Docker, running on AWS. I had to constantly spend my time with him troubleshooting why this or that thing wasn't working rather than writing application code. We had maybe 100 concurrent users.
Now I am working at a large org on an internal application. There is an ops team; I push code to GitHub, Jenkins runs the CI and deploys it to a dev environment, and I push a button in Jenkins to deploy to production. In two years there has been one ops-related issue, where I had to bump the memory limit from 2GB to 4GB. There is a single server and a database. The setup works fine for a few thousand concurrent users. My skills are in development and understanding business reqs, not mucking about in config files. There are people who are good at that and enjoy doing it; let them handle it.
> We had 2 devs and one devops guy building out Kubernetes and Docker, running on AWS. I had to constantly spend my time with him troubleshooting why this or that thing wasn't working rather than writing application code. We had maybe 100 concurrent users.
I'm glad you said this. 2-dev shop here and we've used Heroku the last five years. We spend all our time developing and supporting our application and almost never worry about infrastructure. I keep getting tempted to move to K8s; Heroku is expensive for what we get and seems to have suffered some serious brain-drain. I've dabbled with K8s in side-projects and I can geek out on all the terraform and yaml stuff for days. But its probably not a great idea to inflict it on a 2-dev shop.
Because you have, in 90 (99?) percent of instances, a smart person (probably IQ 120+) doing a stupid menial job that requires a level of competence only someone smart can provide.
So to prevent themselves from ending it all, they invent things to do to keep themselves sane while they churn out CRUD apps all day long for decades. Sometimes you get truly amazing software, but most of the time it's just reinventing the wheel, but worse.
Even worse, some of them become infatuated with doing things "Like Google," and errant CTOs enable them, leading to a 6 person team supporting a piece of software that could be replaced with WordPress, to the betterment of the primary stakeholder.
I've been consulting for over a decade now. The number of times a potential client has told me they've spent into the six figures (or eight) for something that free software does out of the box but better is depressing. The worst part? I usually can't convince that person they've been absolutely swindled, and so I have to let them keep on their merry way.
> So to prevent themselves from ending it all, they invent things to do to keep themselves sane while they churn out CRUD apps all day long for decades
Or they spend the spare time bullshitting on web forums.
I semi-agree but my take is that money can buy you people, it can buy you lines of code, more moving parts, more stuff. But it can’t buy elegance, efficiency and good design - those have to be nurtured. Most engineers nowadays haven’t ever seen a codebase that isn’t a hot mess (no offence folks). The demand for talented and experienced engineers eclipses the available supply. The industry is growing too fast for that. The engineers that really are good are needed in leadership, or become founders.
Your guess is as good as anyone's at this point. There are just many over-corrections that resemble strong levels of grandstanding. React is complex? Okay, let's ditch JavaScript and just do basic server-rendered pages.
It's a laughable overreaction (pun unavoidable).
These are manic states the dev community enters, and all pragmatism is lost in our discourse on the solution.
Cooler heads are not prevailing. There is certainly a solution in the middle.
Devops is bullshit is also an over correction. It’s worth saying that making infra more accessible is a democratization of that entire sub-field. How do we preserve the good part of that?
I completely agree that 20+ years of cheap money have absolutely caused much of the complexity we see today. That's because people aren't working on problems that matter; instead, they're just trying to make VCs rich--effectively passing the buck on to the greater fool. I think this era is coming to an end, and we should see a drastic reduction in software development costs with the end of cheap debt.
We may see a strange hangover from this period: over the last 20 years we've created tech billionaires and trillion-dollar tech corporations - both entities that are not used to being told "no" and will spend their own money to extend this party that is their reality. Case in point: FB/Meta and their doomed Metaverse.
I hope I'm not being pedantic here, so please forgive me. Didn't interest rates go to zero after 2008? So that's 14 years. I guess 8 years before that there was also Dot Com.
The complexity of what we are building keeps expanding. There's no large conspiracy. We have a lot of tooling to build things that were near impossible to build in the past.
This is like asking whether innovation is necessary. Capitalism pushes us towards it. New forms of efficiency and value are unlocked by moving further out on the software complexity curve. Not everything that's complex works well, but some of it does solve the problem.
What's "capitalism" in this context, and what's its actual connection to software development practices? What are "forms of efficiency" and "forms of value", and how does complexity "unlock" them?
Capitalism is a company that is attempting to solve X problem or do Y thing etc. Forms of efficiency are "how fast can they solve that problem". And complexity doesn't unlock anything. It's just inherent in the problem space a company takes on.
>Capitalism is a company that is attempting to solve X problem or do Y thing etc
So basically any organized human activity is now "capitalism"?
>Forms of efficiency are "how fast can they solve that problem"
From which follows that "new forms of efficiency" means that previously it didn't matter how fast they can solve that problem, but now, due to "moving further out on the software complexity curve", it has begun to matter?
>And complexity doesn't unlock anything.
Then what is meant by "moving further out on the software complexity curve unlocks new forms of efficiency"?
The SaaS model requires software to be always online (strict SLAs), and you need to be able to update it with little or no downtime. That's the reason. Telecom software, for example, which always had these requirements, was always complex.
My theory is a variation on that: the money has placed so many more people in the space that you inevitably end up with more people working in different directions - the JS ecosystem especially comes to mind here.
There is this moronic obsession with "scaling" in this industry, and it just blows my mind. You can have a wildly successful and profitable company while serving just a few hundred customers that requires nothing more than a basic LAMP stack.
Meanwhile people are now spending millions of dollars and years of person-hours building MVPs in the cloud that won't ever go anywhere because the business model sucks.
Focus on delivering value to your customer first. How you get there is quite literally irrelevant to your customer.
> There is this moronic obsession with "scaling" in this industry, and it just blows my mind. You can have a wildly successful and profitable company while serving just a few hundred customers that requires nothing more than a basic LAMP stack.
This is us. We have a couple thousand customers in a B2B space with large revenue per customer. Growing rapidly and our node.js stack is running on a tiny EC2 instance and RDS server.
I spent a lot of time in the last couple years trying to fight back against attempts to overcomplicate this by using new services & new tooling. I just see it all as stuff that's going to slow down the single most important thing to us from a business perspective - which is writing new features to optimise our internal workflows - and give us more headaches from an operational perspective.
Even the stack we're using feels ridiculously over the top and complicated compared to a basic LAMP application. Everything in node.js feels like a huge pain in the ass. If there's not a node module you can install to immediately solve your problem (which of course just adds a different set of problems) even writing basic things feels exceptionally arcane and time consuming compared to a basic PHP implementation of the same thing. Maybe it's just a side effect of our dev team size (currently very small, only five full time devs) but the overhead of it all just feels like it's not worth it.
I sometimes marvel at how few people worked on various pieces of commercial software in the 90s. You'd have 4x the headcount and still take twice as long, today. And it'd probably be webshit instead of native because "productivity matters more than performance".
This is my theory too, and as someone learning to program (hopefully as a career), I'm very worried what a new paradigm of tightening will do to the field. I feel like LOTS of people who think their stack is secure are going to get dropped because at the end of the day, lots of the B2B SaaS doesn't actually deliver anything to the world of material needs.
I tried out cloudflare for the first time yesterday using a personal website. Previously I was just using a domain registrar and setting DNS to digital ocean. cloudflare is a very different/complex beast, but I can imagine that the pretty interface alone brings in customers.
I don't like this hyper-expansion either, but when you have an army of monkeys on typewriters, well, a lot gets "produced".
This just reads like a "Damn kids, get off my lawn!" type comment to my admittedly young mind. Same with a lot of similar comments I see floating around HN, especially ones that have anything to do with web development.
This echoes the philosophy of Google Site Reliability Engineering, which (this is key) is an engineering discipline.
The job of DevOps is not to close tickets. That'd be like driving a car by shouting directions at someone lying on the floorboards holding a wrench to the steering pinion.
The job of DevOps is to build a steering wheel (and ideally, teach SWEs how to drive... at least enough that they understand what a "road" is and why it's a pleasant experience for everyone if you stay on it. If the road doesn't go where they need to be, then it's time to file a ticket, but that ticket had better be "Build a new road," not "Offroad this one car to the cabin in the woods and call it a job well done").
The raw hardware of an enterprise deployment is so flexible it solves nobody's problem. DevOps is in the business of writing the operating system for a mega-computer physically represented by hundreds to possibly millions of heterogeneous computers. It's a process of continuous growth to make that work.
Well, they explicitly use the term SRE and not DevOps, and I think that's very intentional and related to the gist of the article. DevOps was never meant to be a role played by a person or a team. It's meant to reflect aligned incentives between dev and ops - whether you have an SRE team, a platform team, or a bunch of kitchen-sink teams.
I much prefer the SRE distinction; it gives more focus and, especially combined with the book and other materials, a much more professional workspace.
You want us to run and manage your software? Sure, here's a checklist of what it has to conform to. Oh it's unstable? We will no longer run it for you, here's the pager back.
> If the “DevOps” team ships a Postgres RDS instance it will run fine forever, that is until an application starts using it. All of a sudden a cascade of N+1s hit, the CPU spikes, and queries grind to a halt. Who is woken up? And why does this always happen at 2 AM? In this scenario, there is nothing for operations personnel to do, yet here they are.
This is definitely a symptom of a broken model and not what I would call devops. IMO the most important tenet of devops is "if you build it, you run it," meaning the appdev team that decided to use Postgres RDS is the one getting woken up at 2am.
It's also, in my experience, one of the best ways to reduce masturbatory engineering decisions and get people to focus on picking boring technology that works. Coding up a serverless application in Rust that's using a CockroachDB backend at a Python/MySQL shop would get a lot of engineers excited, but those people would be less excited knowing they're going to be the ones paged at 2am when this new and exciting architecture falls over in an unfamiliar way (as opposed to Python/MySQL, where a wealth of operational knowledge at the org has already been built up).
Similarly, it naturally reduces architectural complexity. Younger senior engineers love drawing boxes of queues, multiple microservices, event buses, etc. to show off their skill in creating the ultimate engineering fantasy, but once you throw enough late-night operational incidents at a senior engineer, suddenly the preferred architecture becomes "an executable running on a box that I can SSH into when things go wrong."
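The N+1 pattern from the quoted RDS scenario is easy to reproduce. A minimal sketch with a throwaway SQLite schema (table and column names are made up for illustration):

```python
import sqlite3

# Hypothetical schema, standing in for the app behind the RDS instance.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE posts (id INTEGER PRIMARY KEY, author_id INTEGER, title TEXT);
    INSERT INTO authors VALUES (1, 'alice'), (2, 'bob');
    INSERT INTO posts VALUES (1, 1, 'a'), (2, 1, 'b'), (3, 2, 'c');
""")

def titles_n_plus_1():
    # 1 query for the list, then 1 query per row: fine with 2 authors,
    # a CPU spike at 2 AM with 200,000.
    result = {}
    for author_id, name in conn.execute("SELECT id, name FROM authors"):
        rows = conn.execute(
            "SELECT title FROM posts WHERE author_id = ?", (author_id,))
        result[name] = sorted(title for (title,) in rows)
    return result

def titles_joined():
    # The fix: one JOIN, one round trip, regardless of row count.
    result = {}
    query = """SELECT a.name, p.title
               FROM authors a JOIN posts p ON p.author_id = a.id"""
    for name, title in conn.execute(query):
        result.setdefault(name, []).append(title)
    return {name: sorted(titles) for name, titles in result.items()}
```

Both functions return the same mapping; the difference only shows up as query volume under production load, which is exactly why it slips past local testing.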
> This is definitely a symptom of a broken model and not what I would call devops.
Am I the crazy one here? This kind of work is my bread and butter as a "devops" person. PagerDuty fires, I find the bad query, match it up with the most recent PRs to find where the N+1 got introduced, and either patch it right there at 2 AM with the on-call manager's approval or roll it back. Then we have a postmortem in the morning with the team.
I'm the person positioned the best to do this work because I'm god in my little ops domain, have the most visibility and the biggest toolbox of potential fixes.
This sounds like what I mean with building it and running it being devops, yeah -- the fact that you have access to make pull requests or code patches, or even know where the code repository exists in the first place, shows that you're at least somewhat involved in building the application. Contrast this to what was traditionally ops and is now "SRE"; in those roles, applications are usually black boxes where an ops person doesn't know/care how they're developed, because they're responsible for the overall health of the system, which could be managing 100 applications made by 50 development teams.
I don't understand. I want to develop code. I don't want to become an AWS/S3/Github/Jenkins/Action/terraform/etc expert. I know enough of this to be dangerous but not at a level that passes as professional. Yet I am regularly tasked with maintaining the full deployment of code.
There's a reason to have a team of people doing this "DevOps" work. Just like we have a team of people who do SRE. It creates a standard and a single point which all work flows through. Then you don't wander onto a new project only to realize they use $BESPOKE_DEPLOYMENT_METHOD because "it's what we used 6 months ago". Or worse, you don't have a developer playing with a massive, nuclear powered, foot gun like Terraform and accidentally destroying infrastructure.
Making DevOps/DevSecOps/$BUZZWORD the responsibility of developers is a cost-cutting measure not a responsibility measure.
It just doesn't work for the most part. Maybe as an ops person I just want to do ops and not have to understand your code - but Lambda has specific limitations on how long code can run. I can't allocate CPU and memory resources in Kubernetes without a deep understanding of the application. S3 has limitations on how files can be distributed and accessed. Integrating with CI gets complicated quickly and requires understanding the code being integrated. A lot of the time, building things in Terraform, I spend three times as long getting the information out of a developer as they would spend doing it themselves.

I mean, yes, there's the 60% or so of my job that involves working on lower-level infrastructure that doesn't touch a dev, and we do need ops engineers for that. But the other 40%? It works both ways: a good "devops engineer" needs to understand code, but if we don't have a shared language we both just end up banging our heads against the wall.
If you're building a crud-app in a common framework with low volume, sure you can toss that over the wall.
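To make the Kubernetes point concrete: the values an ops person can't guess are exactly the ones in a resources stanza. A sketch with hypothetical numbers that would have to come from the dev team or from profiling:

```yaml
# Hypothetical deployment fragment. Every number here encodes knowledge
# of the application: its steady-state heap, its burst behavior, its
# tolerance for throttling.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: example-api
spec:
  template:
    spec:
      containers:
        - name: api
          image: example/api:latest
          resources:
            requests:
              cpu: 250m       # typical load, from profiling
              memory: 512Mi   # steady-state usage plus headroom
            limits:
              memory: 1Gi     # OOM-kill threshold; set too low, this is a 2 AM page
```

Get the requests wrong and the scheduler packs nodes badly; get the limits wrong and the kernel kills the process - neither failure mode is visible from the infrastructure side alone.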
At companies I've worked for where they actually do this the DevOps people are generally assigned to a team. So you might have 10 developers on a project, and one guy managing just the devops exactly to solve this problem. In smaller companies/etc I can see where this is a problem. I agree, and there's nothing wrong with knowledge sharing. This could even be as simple as having PR descriptions including benchmarks/code/etc and making sure the devops people make their points clear during design/planning.
In theory, devops shouldn't be responsible for maintaining the performance of code. The specification should say what it should run on, the devops guys set up a pipeline and manage that thing, and the developers are the ones taking heat for not hitting that goal. If devops guys are taking the heat for that it sounds more like cost-cutting measures flowing the other direction.
Yeah, and this is a great solution (aside from the bus factor). Maybe the person I responded to was really concerned less about knowledge and more about expertise (the word they actually used was "expert"). At a company I worked with we called this T-shaped engineers: deep in one thing, but broadly knowledgeable. Devs have to have knowledge of ops, but not "expertise" - that is ultimately what ops is for. We may just be fighting over where the knowledge line sits and what constitutes "expertise" :-). I, for instance, think Terraform is not that much of a footgun and provides good rails to be used by developers.
I think you make a strong case for ops who can dev, but a fairly weak one for devs who can ops so I think you and the parent are actually agreeing. And this mirrors my experience pretty well, I need to know the code to be able to ops effectively but it's much rarer that devs need to know how to ops to dev effectively.
And in some ways this is by design, I want to have some distance between dev and ops because it gives me the freedom to rearrange infrastructure transparently. I can move workloads between Lambda, ECS, and EC2 based on the observed performance characteristics without anyone being the wiser.
This is going to depend on a lot of things. Size of company, cloud native or not, org structure. I mean everyone would love to live in the google world where a team of SRE's run everything. But even in the world where a Devops engineer is embedded on a team there's bus factor to consider.
I think most modern AWS services are more the equivalent of an API or a microservice than of a server, and you need to understand the limitations of the services you integrate with. If you're a cloud-native company and don't have a mature platform engineering team, devs are going to have to know a lot about AWS.
If the developer has all the information necessary to create an S3 bucket, Lambda function, Kinesis stream, etc., does it make sense for them to offload 10 pieces of information to me, or to learn HCL and interact with it themselves? Especially if it's something they do often, and especially if there's a central dev/ops team and they're the limiting factor.
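For the simple cases, the HCL in question really is small. A hypothetical example of the kind of resource being discussed (bucket name and tags invented):

```hcl
# The developer already knows every input here; relaying them through a
# ticket adds a round trip without adding any expertise.
resource "aws_s3_bucket" "artifacts" {
  bucket = "acme-build-artifacts"  # made-up name

  tags = {
    team = "payments"
    env  = "staging"
  }
}
```

The argument isn't that devs should own the Terraform state backend or the IAM policy design; it's that filling in ten values they already know is cheaper than dictating them to someone else.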
Devops taking over every infrastructure change for a broad team of devs is inefficient and expensive. It's also probably frustrating for devs that are aware of the above factors. Lots of devs I know would prefer to do it themselves.
And all of this is to mention that infrastructure and integrations with infrastructure are not static. Should I be reviewing all PRs to a system that touch the client code for an AWS service, because the dev team doesn't want to learn how that service works? Maybe, but in the end I don't know if this makes anyone happy.
Platform engineering is certainly the goal. Where ops creates a platform and dev just consumes that platform. But I don't know if it is realistic. Every system on rails is great until you try to take the rails off, and most devs I know hate rails :-) Fundamentally "devops" is all meant to solve human problems, not tech problems so it will have to be dynamic.
This makes me wonder if ops allocating compute resources is really a good use of time if you're needing precise details of an app (which can and do evolve). This isn't a slam against ops, either, it's a knock against the tech itself that it forces all this incidental complexity on you.
Yeah, I mean, fundamentally it's so complex because you have to make tradeoffs and people hate tradeoffs
"I don't want to have to worry about what machine my app runs on" vs "kubernetes is too complex"
"Dependencies change too often" vs "I don't have time to maintain this thing I wrote myself"
"I just want the infrastructure to figure out what I need" vs "I want to be able to build whatever I want with a bespoke language/framework/database/architecture"
> it's a knock against the tech itself that it forces all this incidental complexity on you.
If I were to say that Kubernetes is the magic secret sauce that fixed all the incidental complexity, I would get laughed out of the room. There is no magic secret sauce for incidental complexity; the more we try to fix it, the more we create (it's the cheap-fast-good problem; this would probably be easy if there were no cost limitations).
Our devops team runs the infrastructure, but details like "what resources do we allocate where" are decided primarily by the software devs. I don't really see the conflict here. I don't know and don't care how to put together the infrastructure required so that I can change the CPU allocation on a Kubernetes pod, but I also don't expect devops to know jackshit about our code.
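For reference, the split being described is roughly: devops owns the cluster, and devs own a stanza like this in the pod spec (values are illustrative, not a recommendation):

```yaml
# Per-container resource allocation -- the part that requires knowing the
# app, not the part that requires knowing how the cluster is wired together.
resources:
  requests:
    cpu: "250m"      # a quarter of a core; used by the scheduler for placement
    memory: "256Mi"
  limits:
    cpu: "1"         # the container is throttled above one core
    memory: "512Mi"  # the container is OOM-killed above this
```

Picking those four numbers well requires knowing the application's memory profile and concurrency; standing up the cluster that honors them doesn't.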
> I don't understand. I want to develop code. I don't want to become an AWS/S3/Github/Jenkins/Action/terraform/etc expert.
They’re tools for the job, like your compiler and the language you use to program.
That’s like saying “I like to use Python and couldn’t care less for Java”. It’s fine to disagree with the team’s choice of tools, but one needs to eventually commit to the choice, even if it’s not your preferred one!
There’s an old adage that “if you’re writing clever code then you may not be clever enough to debug it”. It’s true! Operations requires deep understanding of the code running in production, the business rules and the customer. The person who will operate your code will eventually be smart enough to develop it entirely too, eventually cutting out the “dev”. I’ve personally seen this happen over and over.
DevOps came about because of developers who "just wanted to write code". They would write something, then throw the dead cat over the wall and say "you figure out how to run it". That... doesn't really work. Somebody needs to explain to the Ops people how to run the code. Hence: DevOps... a way to get Dev and Ops to avoid throwing dead cats over walls.
If you don't want to think about AWS S3, GitHub Actions/Jenkins, Terraform, etc, .... then we need to work together. All those tools and services exist because all the software developers are sitting in their sandbox, and don't want to come out and play. The systems and tools that we run your code with... suck. A lot. We need programmers to make the systems better. We (in Ops) are a little busy with trying to just figure out how to run your apps without them falling down. We don't have a lot of time to reinvent the state of the art of computer systems.
For example, we need a distributed operating system. Not some fucked-up kludge of a monolith of microservices overseen by a company that has more engineers than brains... but an honest-to-god, stable-ABI, simple, composable, general operating system. We need Linux to come out of the box ready to run distributed applications, in a way that doesn't require a PhD. Once we have that, then you - yes, you, the developer! - will be able to make applications that automatically scale so easily that we will never need to utter the phrase "container" ever again. You will rarely ever need us again, because the system will just be so simple, so general, that anybody who can use the terminal can build and deploy applications without ever learning anything outside of your programming framework.
But we need you to make that distributed operating system. Until you do, we will just have more stupid kludges, more bizarre unnecessary complexity, in the futile attempt to constrain all the crazy shit we want to do with technology, while trying to run your apps for you. Please, I'm begging you - put me out of a job.
> Yet I am regularly tasked with maintaining the full deployment of code.
Same shoes, but a different perspective. I can figure out where it's not working, and if it's my/our team's area of responsibility then we go fix it.
Since we handle infrastructure (as code) and deployments within the team, along with all the development, most of the problems are handled by us. Unless it's clearly something that isn't ours, e.g. some API that we consume that keeps throwing 500s; we can't fix that.
Our operations is helped by 100s of automated tests and 1000s of metrics. I always thought that this is DevOps, but it sounds different from what most of the people here are alluding to.
> Making DevOps/DevSecOps/$BUZZWORD the responsibility of developers is a cost-cutting measure not a responsibility measure.
My background is large multinationals, so my view here is a bit biased, but I don't think cost cutting is the driver.
Large orgs get large change management processes and procedures. Over time, these change management teams become overwhelming behemoths with minds of their own.
I think "DevOps" was designed as a way to "bypass" the bureaucracy?
"We just use this CI/CD pipeline and no need to sit on a 3 hour change management review call..."
Almost everywhere I've worked that I helped run software in production for had a step in CD which filed an automatically-approvable change request via automation, just like the automated deploys.
It becomes just robots pushing around paper for compliance.
Alan Kay said that “Everyone who is serious about software should make their own hardware.” How can you be a good developer if you don’t understand the architectural limitations and choices?
When I design a backend system, I need to think about how the front end developers are going to interact with it. My data storage characteristics and scaling. I need to know am I designing anything that’s hard to deploy. How will logging work and be aggregated. I have to be able to think about the entire system.
It’s not just “cost cutting”; at a certain point in your career you are expected to know more than just “how to code”. I’m not saying learn AWS. But I would expect any senior developer to know about what their code runs on top of.
I don’t think you’re disagreeing with me. Developers should know what their code runs on. They shouldn’t have to add managing that to an already full schedule of work. That’s the difference.
Back when I was in the real world [1] working for a startup, I would do your typical serverless solution with Lambdas, S3, SQS, etc. I couldn’t just use ClickOps, create everything in the console, and expect someone else to recreate it all with IaC. I had to know how to do it.
I think the pushback is rightfully coming from the “ops” part. I consider writing the CloudFormation/CDK/Terraform code part of “development”, part of coding.
If you use Docker, wouldn’t you consider creating the Dockerfile as part of development?
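For what it’s worth, the Dockerfile case seems like the clearest one: even a minimal example (base image, paths, and entrypoint invented here for illustration) is almost entirely statements about the application, not the infrastructure:

```dockerfile
# Nearly every line encodes application knowledge: the runtime version,
# the dependency manifest, and the entrypoint.
FROM python:3.12-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
CMD ["python", "app.py"]
```

An ops person could write this, but only by first asking the developer for every fact it contains.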
Yes I knew AWS pretty well by the time I left and I needed to know it to be a good developer in that context and designed most of the processes around it. But I refused to do “operations” - ie “infrastructure babysitting”
There is a huge distinction between “I don’t think I should have to know how everything works” and “don’t call me in the middle of the night when something goes down “.
[1] I’m the first to admit that I left the “real world” once I started working in the cloud consulting department at $BigTech
> I think the pushback is rightfully coming from the “ops” part. I consider writing the CloudFormation/CDK/Terraform code part of “development”, part of coding.
> If you use Docker, wouldn’t you consider creating the Dockerfile as part of development?
Sure, you could argue a developer could, or even should, create these things in theory. The problem is that then, when it goes down, I have made two problems out of one. Now I have to manage both the infrastructure of a system and what is running on it. Realistically, and even in my current job, it's actually several systems. Now when something breaks I have to pray I can fix it. Instead of allowing a team of infrastructure professionals to at least ensure the hardware is working, my 8-hour day turns into 14 or 16 very quickly the second one thing goes wrong.
So if I need to create a bunch of Lambdas, queues, SNS topics, a few DynamoDB tables, an S3 bucket, etc., and tie it all together, are you proposing that the developer should just create everything in the console and then call over someone else to go behind me and write the infrastructure as code?
It is more nuanced. I care, but my skillset demands I spend more time on the Development side, so Operations must take the backseat in my mind because of this incentive structure. Asking one person to do two (actually three) job functions is a scam.
Most developers have to do Frontend, Backend, and Ops. These have wildly different mindsets and feedback loops and not enough time exists. Don't hate the player, hate the game. The orgs are fucked, not the workers.
Race car driver vs bus driver. The race car driver knows how their car functions and works with engineers/mechanics on improvements. The bus driver doesn't give a shit (no offense to bus drivers). Decide which company you are - a race team or a bus line.
I will extend this analogy to say I am a race car mechanic but I'm having to pick up how the driver drives the car in order to make the car perform better, with the eventual goal of being able to drive the car myself (ops -> devops -> dev) so I can do any of the above roles.
Software devs who close their mind to the lower parts of the stack give me the opportunity to learn and do their jobs as well, becoming a more valuable employee as a result.
Thank you, yes this is the perfect analogy. I think every developer should have some kind of baseline understanding of devops stuff. But it’s a whole field in and of itself, and is also innovating fairly rapidly.
There are only so many hours in the day, it’s hard enough to stay on top of my core skill set.
She doesn't impose multiple business-critical schedules on that, so you can just wash them, pick your nose for a while, take out the garbage, and go play games. There is nothing wrong with doing related jobs solo when the situation allows, but this example is also far from learning and using the "modern" full stack of stacks or doing surgeries.
That is an extremely bad analogy. The purpose of chores is to spread the load onto an otherwise unproductive member of the household. (And to teach said household member how to do them.)
Unless you're arguing that engineers are unproductive members of a company, I suppose.
Well, I didn't have to get an education to do that, and if I mess up washing the dishes I don't cause a death (as in the doctor's case) or a multi-million-dollar outage (as in tech).
I think the argument is that calling ops different is like saying error handling is another department. To be a complete dev you must understand the code in situ regardless of your specialty.
> Asking one person to do two (actually three) job functions is a scam.
The word "scam" is overused. It's a scam if it was a bait-and-switch. It's not a scam if it was advertised this way up front. In the latter case it's just a job whose description you don't like.
It is a scam. It should absolutely be assumed that you could perform these job functions - it makes you a way better engineer! But being expected to is a scam because the skill ceiling on each is so amazingly high that only a scam artist would expect someone to be able to perform all three to a specialist level. The same issue happened when QA departments got the boot. I deeply understand testing strategies (both manual and automated) but this is an important enough job that it deserves a dedicated person. Not having this person is exclusively because the business is trying to save money and make the worker easily replaceable.
It is not better for the department members or the software itself. It is only good for the bottom line. Yes, if you know it all you'll be valued higher - but the company is still reaping the majority of that benefit from you and not rewarding you with 3 salaries. You should not perform all the roles because it's economically idiotic to do so. You could contract out all 3 services and make a shit ton more money than you would performing them in a salaried position.
I perform all three roles for one company. I agreed to it upfront. I do it in less than 40 hours a week at a very respectable salary. I get maybe 1 after hours call a year for an Ops task. My boss knows that I won't do QA or Ops nearly as well as I do Dev, and he knows that QA and Ops time come out of Dev time (not personal time) and he's okay with that because that's the stage we're in.
If you think that is a scam, I don't know what to tell you.
Sometimes the engineers _do_ care about the deployment and operations of their apps and management doesn't let them administer them in production under the guise of "separation of responsibilities" or "regulatory compliance". It makes the ops team unhappy that they're on the hook for stuff they didn't build and the dev team unhappy because they're steering their metaphorical ship by proxy.
I'm not sure why this happens but it does and I don't like it (as somebody who's very M-shaped).
It's because often your software engineers know software development and just enough administration and operations to be dangerous, and your administrators know operations and just enough software development to be dangerous.
Letting them wade into each other's responsibilities often introduces a lot of that danger.
System administration is an entire discipline with a history and way of thinking and approaching problems at a systems level. Software engineering is the same, but with an approach that often emphasizes handling problems in the application.
Letting solutions be put into applications that are best handled at the system level (where "system" isn't just an OS, but might be a complex network of them), or into the system when they could easily be handled by the application, causes inefficiency and problems.
The solution is to either hire only very accomplished people that can do both and make the right decision of where to put solutions, or hire people that can do one or the other and put them together with a few (maybe one) person that has a good grasp of both and can make executive decisions when the right time and place is for specific solutions (an architect).
I absolutely care where and how my code runs. And yet I don't give a damn about writing terraform, managing IAM, VPCs, keeping up with backups, upgrades, etc. I want specialists who know what they're doing to deal with that stuff.
I'm also not convinced that the average junior or mid-level application dev needs to know where or how their stuff runs beyond some high-level concepts (i.e. our app uses auto scaling).
I've done some, and it feels like reading an arcane text in another language. I've seen the thing I want before, so I know there exists a magical YAML incantation that brings it about but I have no idea how to even start looking for it. It devolves to an extremely slow feedback loop of "did that work? No."
I've never found anything that explained how I'm supposed to create a systematic approach to creating anything I need.
> You have people who build stuff without caring where and how it runs.
In the old days, developers would also configure the design of the system it ran on.
"Alright, we're going to use some fast-CPU front-end nodes with smaller, faster disks, and then have a load balancer; our database server is going to have the most RAM; now let's set these resources up on a CDN to serve faster; this memcached server is really going to lighten the load..."
I think the idea was to use tools that automatically did all this work for you, but it just ended up creating another tier of people that configure this side of it.
Yup. I basically came here to quote the same line answering with "And I don't want to work with them."
Though mind you, the article touches on - but doesn't go into - the real reason things go to shit. It's not ops, it's not devops; your knowledge silo can be pressured too, and in my experience (which is longer than the writer's, lol) _will be_.
If the estimates are consistently too low (because honest ones would render many - maybe most - projects non-viable), the requirements are often bullshit (because otherwise the lies in the estimate would be too obvious, and everyone needs to agree on the lie), and the rewards are not the ones the job claims to offer (in every workplace, your job is to improve your resume. That's all. That's the only thing that, materially, matters for your career. I can care about the software I write, but if anything I end up punished for it) - then no amount of renaming things, shifting org charts around, or motivational reading for managers is going to fix that. Nor will a Cloud or the latest orchestration software.
You'll spread yourself too thin as a developer if you need to do a lot of ops too. Specialisation is a good thing.
You don't want a doctor who brags that she can do heart surgery, perform talk therapy, and treat your skin rash. You want someone who knows a lot about your specific problem because she's prioritised it over other things.
But isn't this the norm in many other fields? The engineers who design automobiles, aircraft, kitchen blenders, etc. aren't expected to maintain and repair the cars, planes, blenders, etc. after they're put into use. They go on to design other products. Architects and construction/civil-engineering firms design and build large structures and then hand them off to others to maintain. Authors generally write a book and then go on to write the next book rather than dedicating themselves to carefully curating a list of errata for a single book. There are academic authors who do play a part in churning out new, slightly modified editions of major textbooks every few years, but they're not on call, being woken up in the middle of the night to fix a punctuation error on page 982 of the physics text they co-authored.
It's always been like this. I'm a backend engineer that also does cloud operations / systems administration / network engineering / "devops" work. The job titles have changed, but the truth remains: (most) developers want to develop, not deal with infrastructure.
Agreed. I get a rush from designing a system from the ground up and bringing it to life with all the infra, but only when that's necessary. If it's just a basic CRUD app without any major performance or resiliency requirements, I'm more than happy to do as little of that infra work as is necessary to get it into production and move on with my life.
Equally common I think are engineers who care where and how things run, but are either disincentivized from or even disallowed from working on those aspects of systems.
Well what about Google Sheets? Lots of people are able to build things in Google Sheets without knowing where and how it runs.
Perhaps most of our organizations are using the wrong tools and that the way we divide responsibilities is the problem.
Imagine if:
1.) instead of general purpose languages being used by product developers we had domain-specific languages and tooling that were agnostic to things like deployment, memory-management, data storage
2.) computer engineers built the domain-specific languages and tooling and focused on things like deployment, memory-management and data storage.
DevOps is generally solving for general purpose development while developers are generally solving domain-specific problems with general purpose tools. This seems to be a large part of the disconnect!
I’m currently working on a small team of computer engineers who support a custom Turing-complete DSL for an “operations” team that both programs and directly interfaces with clients.
It doesn’t seem that different from what Apple or Microsoft do, which is write custom languages and tools for developing on their platforms, albeit specifically for general purpose programming.
Previously we had better division of labor IMHO. 1) UX people (Photoshop, CSS) who made the front-end look nice, 2) Developers (JavaScript, C#, Ruby, Java) who wrote application code and business logic, 3) DBAs who controlled the database, and 4) Sys Admins (Unix, Windows Server) who deployed the built code and ensured it ran correctly and securely in production. Now some of us seem to think this should all be the same discipline, and if you don't care to, you are a bad engineer.
You mean people relaxing WFH in an unbelievably great job market don't like being on call 24/7 and supporting random stuff they didn't build. Sounds like we need a recession.
That does seem a bit short of the mark. E.g. that should have been automated provisioning in 2018, never mind 2022.
Counter points as to why DevOps is not BS:
Today there's just an acceptance that you version your code in VCS; it wasn't always that way. "Hey, this doesn't look right. I know you said you based your change on gui-app.latest-final2.zip, but was that Steve's or Laura's version of latest-final2?" If you didn't work through this period you'll struggle to believe how common it was for shipping products not to be fully up to date in VCS, or not to have VCS at all. DevOps changed this.
Continuous integration? No, there were people hired as "merge masters" or "build managers" - I promise I am not making this up. DevOps changed this; the idea that you wouldn't do at least CI, or perhaps CD, is unexpected today.
Deployment automation? Sure, you email it to the ops team, send a few more emails with attachments late on Friday with amendments, and hope that they deploy the right one. "Automated" deployment, as far as the developer was concerned. The ops person on the other end? Sure, they had a Batman belt full of hand-crafted tools and scripts, but it was definitely pets, not the cattle DevOps has made us strive for.
Testing automation? The testers sit on level 3 not next to dev on level 4, there's about 4 banks of desks over by the cupboards, that's all the testers. They have lotus notes databases with checkboxes to confirm when they test something. If they find an issue a regression report will arrive with the dev team in under a week.
I could go on and on. Platform teams are great when used correctly. You can say something similar about DevOps.
At uni back in ~2000 was the first time I ever heard of version control. It was a bit of a curiosity, nobody seemed to be using it.
In 2004, engineering at the place I worked (you've heard of it) were developing PL/SQL in production with no VCS.
In 2007, I worked in IT at the same place. They were developing in SharePoint without VCS. When I asked if I could use VCS, I was told to use a 1996 or so Visual SourceSafe which was more temperamental than Subversion.
The first place I worked that used VCS was in 2008, and only because the devs had started using it against the wishes of the principal developer.
> That does seem a bit short of the mark. E.g. that should have been automated provisioning in 2018, never mind 2022.
Serious question, as I think this has been part of my thought process in the challenges of platform engineering: what does it mean to automatically provision a database?
I can think of lots of different examples that are insufficient in one way or another. (I'm mostly talking about UX here, i.e. how many questions the user has to answer in one form of infrastructure as code or another, not whether the user should have to apt-get install postgres, which I think should rather obviously be automated.) But if infrastructure as code is defined as automation, this can conflict with developers who don't want to learn Terraform, and thus still leads to "file a ticket with devops".
If we’re talking in the context of an organisation which has or is considering having a devops team and / or a platform team, like in the article then yeah i agree you’re not going to be apt installing a db.
There will be some mandated platform choices so that the org has a fighting chance of managing complexity and knowledge/skills within the db platform team.
I’d expect you’ll have to specify, in a web form or maybe an API call, that you need a document db / OLTP / OLAP, that you want it in region A and of size medium, and that you want an indefinite expiry date on the lease. I wouldn’t expect to have much more freedom than that; e.g. backups, point-in-time restore options, etc. will all be standardised. I’d expect to be immediately given an appropriate pre-built instance that had been provisioned and kept warm, waiting for the next request for a db of these specs.
That allows the developer minimal friction but also allows the db platform team to say we provision a postgres for document db use cases not a mongodb because it simplifies platform ownership (and we just mail a link to some docs explaining how to use a postgres as a document db to the user account that requested the instance).
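A hypothetical request to that kind of provisioning API (every field name and value below is invented to illustrate the shape of the interface) might be as small as:

```json
{
  "kind": "document-db",
  "region": "region-a",
  "size": "medium",
  "lease_expiry": null
}
```

Everything else - engine choice, backups, restore policy - is the platform team's decision, which is precisely what keeps the platform ownable.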
Over the last ten years, the market has tried to kill off the hardware, systems, network and security people, and mostly succeeded.
As a result, it's relatively easy to find someone who advertises as "full stack devops" who has never actually operated any infrastructure more complex than a LAMP webserver cluster. And it's hard to find a senior sysadmin who has enough years of experience to understand and troubleshoot all of your infrastructure from layer 0 up. People with that experience have moved into management or consulting or retirement, and there are no jobs for new folks.
Based on my own experience, one of the most disappointing elements of devops is the lack of understanding of how things work. They run to GitHub and pull other people's recipes to build out certain environments. Even when they build automation from scratch, they miss tons of core OS-level tunables, and rather than adjusting those, they'll just add more servers to the mix.
I worked at a place that had an Elasticsearch cluster with 30 nodes because they kept hitting open-file limits, when 8 servers could easily have handled the traffic with basic tuning.
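For context on the kind of "basic tuning" meant here: Elasticsearch's own docs call for a high open-file limit, which is a couple of lines of configuration rather than 22 extra servers. A sketch, assuming the service runs as an `elasticsearch` user and PAM limits apply:

```
# /etc/security/limits.conf -- raise the per-process open-file ceiling
# for the Elasticsearch user; the project recommends at least 65535.
elasticsearch  soft  nofile  65535
elasticsearch  hard  nofile  65535
```

(Under systemd the equivalent is `LimitNOFILE=65535` in the unit file; either way it is a one-line fix, not a capacity problem.)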
You could replace "devops" in your first paragraph with "developers" and it would still make sense. Everybody is out there grabbing stuff from GitHub and shoving it into their projects. Most technical folks today don't really know how a computer works, they poke at it until it does what they want and call it a day.
And I would say that your specific example is a failing of that team, not "DevOps" as a whole.
> Over the last ten years, the market has tried to kill off the hardware, systems, network and security people, and mostly succeeded.
In my experience, people who want to do interesting work on networking and systems (think building large-scale networks and systems with high resilience) don't want to work for smaller companies anymore.
They either work at:
A) highly specialized companies which fit their niche (ISPs or IXPs, for instance), or
B) do consulting and project-based work for lots of companies with very specific requirements (think building an identity provider system for a country, etc.).
Small-scale complexity has been disappearing because everyone just throws hardware at the problem instead of thinking about how to architect their infra "correctly".
Why optimize your OS's scheduler parameters to get more performance from your filesystem when you have nearly free VC money and a budget to burn on resources in AWS?
It cracks me up every time I talk to some very competent and senior devs who have zero knowledge of hardware - cloud has spoiled us!
Also amazing is the number of devs who have had to work in shops with really crappy on-prem infrastructure setups - I guess I have just been lucky to work at places with solid infra.
I had a very weird sensation while reading this where I was shaking my head disapprovingly at this article. Specifically when they were talking about the commoditization of DevOps.
Speaking personally, I used to work on a Platform Engineering team for a major multinational that rhymes with ay-do-bay. The requirements for the workloads we needed to build and host were very unusual, because you have teams running completely different languages and toolchains, especially from acquisitions. I can argue that template K8s setups wouldn't have worked there.
In the author's case, the only reason they feel the way they do is because we finally have some sense of standardization on what infra looks like for most companies. It's no longer a question for most folks whether they should adopt Docker; we have accepted images into our lives. Same for K8s (after a certain point of scale). So uh... yeah. Catchy blog title to sell some thin layer on top of K8s which doesn't solve the root issue they're talking about.
I'm not sure if having two teams is always going to be better than having one DevOps team, but my experience in having two teams is that it's rare to have the incentives aligned. The author of the post pointed out that dev teams will cut corners and throw broken applications over the wall to ops to deal with.
When ops gets woken up at 2am because someone in dev cut corners, what happens? Does dev feel the pain? Almost never.
The same thing happens when outside contractors develop code. They often provide a buggy and undocumented mess, and then no longer work on the project. They never feel the pain, so they're not incentivized to provide good code.
Until we find ways to align incentives, we're going to keep getting crap whenever more than one team is involved.
Reminds me of that time when my team was called into a meeting where the CTO "advised" us that "code does not have to be perfect", when all we wanted to do was review the code for a PoC that we were ordered to "own" and deploy to production (even the creator said he cannot guarantee the code he copied from Stack Overflow for the PoC is production-ready).
In the same meeting, the CTO was ranting about the instability of a service (which was also a PoC that was pushed to production before we were _also_ made to "own" it, yet never given the budget to even get acquainted with the codebase), claiming the reason for that was that we devs are lazy and unprofessional.
I highly doubt making the people who _respond_ to the incentives "feel the pain" will fix much. I suspect things are more likely to get fixed if the people who _create_ the incentives are the ones "feeling the pain".
As they say in Hunger Games, "remember who the real enemy is".
On the other side, the ops team sets up a system of slow and complicated deployments, but they don’t have to deliver features with it, they don’t feel the pain of working with it as much as the feature devs.
I disagree. DevOps was intended to break the organizational mindset of having this group of people here doing this set of activities and this other group of people over there doing that set of activities when in reality both groups of people are needed to work together and deploy software. It was about breaking down those organizational silos. Anybody creating a dedicated "DevOps" team was way off the mark, unless that team was a coaching team whose job it was to help other teams become self-sufficient. Unfortunately the fact that an engineering team had responsibility for their software from implementation, to deployment, to operations was a news flash to large organizations having traditional IT staff. The DevOps mindset has led to much better operational excellence.
The DevOps mindset has also led to the Internal Developer Platform this article discusses. Honestly, I don't see how traditional IT organizations are going to easily arrive at such a platform without having adopted the DevOps mindset first.
So DevOps isn't bullshit, but it's not the end goal either. It's a necessary step needed on your journey for getting somewhere better.
I think that was not a mentality, at least not at first. Maybe after years of working that way it turned into one.
It was an employee-utilization approach: you hire 1 DBA and he runs all the DB stuff, because hiring a DBA for each team doesn't make financial sense when there isn't enough day-to-day work for a DBA specialist on a single project/product.
The other thing is that DBAs/SysAdmins have to have access to customer and company data, so you still need separation of duties (no dev access on prod systems), and it's easier to make that guy part of an "Ops Team" and give him access across all systems than to configure an "Ops" person per project/product.
So why "DevOps", if you get operational overhead and can't "utilize the employee 100%"? Because in most companies delivering new features is more important than having Joe DBA close tickets like in a factory - we learned that it actually isn't efficient when "important feature X" is delayed, even though Joe was doing his job fine, simply because feature X was sitting in his queue.
Good things happen when developers are involved in Ops. Most developers would prefer to develop and not focus so much on Ops, and so they take the steps to automate the Ops as much as possible and essentially make the problem go away. Dedicated Ops staff won't do that because that would put them out of work.
in my opinion, the ops folk who are good at their jobs do automate the boring stuff. (because doing grunt work is uninteresting)
I highly doubt ops will ever go away, considering someone has to think about maintaining stability in a production environment. That's a different way of thinking that a lot of developers are simply not inclined toward.
When I ran a fairly small team of engineers, I created what I called an "Infrastructure Engineer." I staffed it with a fairly junior, but still brilliant, engineer.
He rapidly became the most popular member of my team.
His job was to commoditize configuration management, and strip away as much of the overhead from the coders, as possible. He didn't do release management, and we didn't really work automatic testing into our release workflow. This was because Japan did not trust auto-testing, so each engineer did their own unit and harness testing. Japan also wanted each engineer to make their own "official" release, as opposed to having a CD system spit it out.
There were reasons. I didn't necessarily find them that compelling, but they were the boss, so I gave them what they wanted.
Japan liked him, as it gave them one single person to talk to, and he also helped them to streamline their own infrastructure. In fact, he is the only employee that I ever had (including myself), that traveled to Japan before being there a year.
For myself, I find using things like Fastlane, JIRA, and Jenkins, aren't actually helpful, for a one-man shop. I tend to do a lot of stuff by hand.
Most developers have no fucking clue how Linux, networking, storage or whatever works under the hood. They know how to develop whatever stack you're at, but stuff like latency, packet loss, redundancy factors, backup policies, monitoring or other classic ops topics are completely beyond the comprehension of 99% of developers.
"DevOps" usually means some C-level execs say "fire the expensive neckbeards that have the time to properly understand a system" followed by them saying one of
- "oh fuck, someone managed to compromise a service and because no one knows what the fuck firewalls are / Kubernetes doesn't come with ones OOTB the hacker got complete control of everything"
- "oh fuck, production is down because someone fat-fingered in Elastic Beanstalk which recreated the environment, dropped the RDS database and there were no backups" (I've been personally bitten in the arse by their definition of "recreate" - all I wanted it to do was to replace the damn EC2 instance)
- "oh fuck, we're seeing insane AWS bills because someone DDoS'd us and no one created a cost limit or a sensible limit for autoscale or a simple dumb monitor that alerts someone"
- "oh fuck, we're seeing an insane AWS bill because someone got his AWS access credentials stolen and some shithead spun up a ton of g3.xlarge instances to mine shitcoins"
Other C-level execs see constant issues between "ops and dev teams" because their team leads are friends of the silo model and decide that instead of getting rid of the dumbass managers they're getting rid of the ops team because "cloud", with the exact same result.
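The "simple dumb monitor" from the scenarios above really can be that dumb. A minimal sketch, with invented numbers; in AWS you would feed this from Cost Explorer data or simply configure a CloudWatch billing alarm instead:

```python
# Sketch of a dumb cost monitor: project today's spend linearly from the
# hours elapsed so far and compare it to a budget. All figures invented.

def projected_daily_spend(spend_so_far: float, hours_elapsed: float) -> float:
    """Naive linear projection of the full day's spend."""
    return spend_so_far / hours_elapsed * 24

def should_alert(spend_so_far: float, hours_elapsed: float,
                 daily_budget: float) -> bool:
    return projected_daily_spend(spend_so_far, hours_elapsed) > daily_budget

# Six hours in, $180 spent, $500/day budget: projects to $720, so page someone.
alarm = should_alert(spend_so_far=180, hours_elapsed=6, daily_budget=500)
```

Ten lines of this, pointed at a Slack webhook, would have caught both of those "oh fuck" bills before they hit five figures.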
I'm not sure I agree with author. Sounds like they have worked at shitty places that are using the word "DevOps" but really they are doing things the old way.
My company practices "DevOps" and it feels great:
- Infra team build self-service tools (build, deploy, scale, observe)
- Infra team write good documentation for these tools
- Dev use tools to build, deploy, and monitor their apps
- Dev not use words like: AWS, k8s, Envoy. These are abstracted by tools.
- if problem with app (very common) Dev fix it using the tools
- if problem with tool (very rare) Infra fix it
We have no build engineers, release engineers, etc. However we do have a rotation (similar to on-call) whereby Dev is responsible for releasing code that week.
Sure there are sometimes problems, frustrations, etc. No system is perfect. But you are getting paid lots of $$$ so shush with your whining & instead help improve the system.
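The abstraction described in that list can be sketched as a tiny spec-expansion step: developers fill in a few fields, and the infra team's tool expands them into platform details. Every field name and default below is invented for illustration:

```python
# Sketch: a dev-facing app spec gets expanded into infra-facing config.
# The infra-side decisions (resource limits, probes) live in the tool,
# so devs never have to say "Kubernetes", "Envoy", or "AWS".

DEV_SPEC_FIELDS = {"name", "replicas", "memory_mb"}

def expand_spec(spec: dict) -> dict:
    unknown = set(spec) - DEV_SPEC_FIELDS
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    return {
        "deployment": {
            "app": spec["name"],
            "replicas": spec.get("replicas", 2),
            "resources": {"memory": f"{spec.get('memory_mb', 256)}Mi"},
            "probes": {"liveness": "/healthz"},   # platform-wide default
        }
    }

rendered = expand_spec({"name": "billing", "replicas": 3})
```

The point of the validation step is the same as the point of the whole setup: the surface area devs can get wrong is tiny, and everything below it is owned by Infra.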
DevOps has suffered the same fate as Agile. In particular there is one thing that both had in common that got lost somewhere in most implementations: cross-functional teams [0].
Agile teams were supposed to be composed of not just devs doing everything (quickly, because agile is about raw speed, right?) but people competent in the various aspects of the system development, potentially deployment, customer needs, etc. working together as multidisciplinary teams to meet a particular objective. The purpose of this was to break the silo that is common because it feels natural to many managers (the same sort, I presume, who don't like when their mashed potatoes touch their fried chicken on the plate).
Silos impede communication and promote the "throw it over the wall" approach to product/system development. A system engineer (or team of) made the design after sales (and possibly only sales) talked to the users. Throw that design over the wall and let the devs build it. Devs throw it over the wall to test, maybe there's a volley. Eventually it's tossed to ops. A goat is sacrificed and maybe it works.
Multidisciplinary teams are able to communicate across those boundaries because instead of the role-silo the roles are all in the same team, working (more clearly) towards one common objective. But then businesses managed to fuck it up. They got rid of test, devs do all testing now. Devs do all the database logic. There are no UI/UX experts anymore, it's all full-stack, and on and on.
DevOps was supposed to be the same. It was supposed to take that Agile cross-functional team and add in the sysadmin/operators (among other things). The critical problem being solved was the two (really more, but at the limit) silos of dev and ops failing to communicate. A Friday release with Ops spending the weekend rolling things back because Dev wasn't there and didn't even know the system wouldn't work as intended. Maybe their test environment was too different from the operational environment, the reasons matter a bit but there are too many to enumerate.
Instead, businesses did what they do best, they fucked it up. Again. They said, "Take the sysadmins and teach them to code. If they can't, dump them in the woods. The devs will take over the world." Then they started making "devops" job listings and full-stack grew to encompass a new set of skills.
Funny, I thought DevOps was introduced as a way of having the specialists be part of the same team, so that the dev side would keep ops in mind and the ops side would be getting dev help in automating. Creating a separate DevOps team seems like a manager reading an article and implementing without understanding.
The problem in a lot of orgs is having various priesthoods that have their own goals that aren't aligned with other teams/the business. For example:
- hardware purchase
- software purchase
- DBA
- ops/infra
- networking (firewalls esp)
- security
- HR/recruiting
- business analysis
- front end/design/ux/dev
- back end dev
- technical documentation
All of these are roles and they can have specialists, but you probably want them aligned and rewarded for keeping the business running, not serving specific measurement goals like uptime, to the detriment of selling product. These priesthoods often have their own religious virtues that they espouse, like 3rd normal form or low TCO, that they pursue in absence of directions and understanding of their role in the business.
It's easy to see the problem but wicked hard to prevent it or fix it. I have a small organization and it's difficult to get people really on board with a vision.
> Creating a separate DevOps team seems like a manager reading an article and implementing without understanding.
You need more than one specialist because they need to be allowed to get sick and take vacation. And once you have multiple dev teams it’s not economical to staff each team with multiple expensive specialists. So you put them on their own team and structure it organizationally so the devops team is an extension of every team. The devops team keeps up with all the work every team is doing and reacts accordingly as well as being a resource each team can tap.
Crucially, you don’t put any kind of ticketing system or similar in between.
I was semi salty reading this....but they are spot on in a lot of ways. There are times where my code looks like a bad 4 year old wrote it, but 10 minutes later I'm in a conversation where I have to explain basic security concepts, like not leaving everything wide open to the internet, to some random Sr Software Dev.
For every operations person without software development skills, there are FORTY engineers without cloud operations skills. If you are going to build an internal platform, you’ll need experts with overlapping experience in both fields working together.
I guess I just need to find a role with more inter-team collaboration; Being able to mostly self teach is great, until you have no one to learn from anymore.
The problem is that devs simply can't accept that ops exist.
We spoiled devs way too much.
Software is still being thrown over the wall and ops takes all the blame.
The problem is that many devs can get away with spinning up servers and doing the easy parts of ops; when it comes to the parts that require rules and discipline, we all know what happens: over-engineering and security holes.
See the index on the left. Do you need to know everything about all of these things for every single project? No. But as you grow into a senior, you'll be touching almost everything in that list.
It's an absolute explosion in complexity. Web development once was barely considered engineering, now it's one of the most complicated roles in the industry. Consider also that almost everything on that list is constantly evolving, this list being 4 years old.
Has all this added cognitive load and complexity resulted in massive productivity wins and dramatically better outcomes (UX, quality)? I'd say no, or at least it's questionable.
My point is that this is already too much. I work in teams with a distribution like this: senior (20%), medior (50%), junior (30%). So the vast majority of them are around the median. And the median programmer is severely lacking against our ever-growing demands. It's crude, but the typical programmer really sucks at programming.
So if next you're going to add even more to this pile with all sorts of devops and funky cloud tooling, the issue becomes clear: we're over-asking.
We over-value flexibility and scaling but ignore its dramatic costs.
I hate devops. We used to have a dedicated systems team. Now programmers are expected to both write code and manage their own cloud infrastructure. These are two entirely different skill sets.
> The problem is most engineers don’t want to do operations work.
Nah, the problem is that most (software) engineers don't have the skills to do operations work.
Let's not fool ourselves: software development is 95% development (writing code/docs/test etc) and 5% system administration (getting your local mysql or whatever up and running etc) whereas operations is usually 95% screaming at the machines (various linux tasks, writing glue scripts, infrastructure, networking etc) and 5% development (the aforementioned glue scripts, writing internal docs).
The fields do overlap, but very little.
They require two different skill sets.
They require two different mindsets, too: a software engineer is usually optimistic (works on my machine, will work in prod too) whereas a sysadmin/operations person is usually pessimistic (what's our disaster recovery strategy?).
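That pessimism is a concrete habit, not just an attitude. A minimal sketch of the ops reflex that a backup you have never restored is not a backup, round-tripping a SQLite database through its backup API and reading the copy back:

```python
# Sketch: take a "backup" and then verify it by actually restoring from it.
# The verification read at the end is the step people skip.
import sqlite3

src = sqlite3.connect(":memory:")
src.execute("CREATE TABLE users (id INTEGER, name TEXT)")
src.execute("INSERT INTO users VALUES (1, 'ada')")
src.commit()

restore_target = sqlite3.connect(":memory:")
src.backup(restore_target)          # full copy via the sqlite3 backup API

# Don't trust the backup until you've read your data out of it.
restored_name = restore_target.execute("SELECT name FROM users").fetchone()[0]
```

The same loop scaled up (restore last night's dump into a scratch environment, run a smoke query, alert on failure) is exactly the kind of unglamorous work the "optimistic" mindset never schedules.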
i wouldn't say the operations view is pessimistic per se.
It's a different frame of mind, mainly because it's the place where the rubber meets the road, so to speak. Once things are in production, you can't just revoke or redesign your product/system, because people are directly reliant on it.
Managing complexity should be job 1 for technical leadership. We've failed utterly when it comes to DevOps. The model should be Heroku-like deploys for 99% of apps/companies. Instead we have an army of DevOps experts for k8s, helm, terraform, etc. Changing an environment variable now requires a DevOps person and/or a PR in your GitOps setup. The feedback loop for troubleshooting issues in these environments is too long.
At this one place of employment, I had been struggling to divest myself of system administration duties for some time. I can do them, but I don't particularly like them. I have a few things I'm good at in that area (troubleshooting, being paranoid), but I really despise middle of the night calls, early morning patching, and so on. I had finally gotten to a good place.
And then someone newly promoted decided to rewrite everyone's job titles and I was suddenly DevOps, sucked back in.
It takes really good engineering leadership to build a platform team. With the wherewithal to follow through on migrations and marketing of the platform. Also, sourcing the right people internally and elevating them. These engineers become true full stack (the whole OSI model) engineers and their work most definitely shouldn’t be thankless or taken for granted by product leaders. I’ve seen efforts like this fall apart in very ugly ways due to a lack of leadership.
And I know it's somewhat exaggerated but if the product people don't have visibility into your wins as an ops person then you need to make a more conscious effort to talk about them.
Once you peel away the jargon, and tbf doing this is a skill that requires practice, non-techies do think ops work is cool.
Disclaimer: I am an Ops engineer turned product manager.
But I don’t like this rift. There are many ops people having a good understanding of product and vice versa. It’s just the stereotype which gets reproduced all the time, including in this blog post.
Sure, if management doesn’t get that there is value (and cost!) in proper ops work and having product management attached to it, then all is turned to “DevOps” anyways
The other thing that this misses is that the deployments to the cloud are only half the battle. CI/CD pipelines, Dev environments, and other SDLC phases like planning and testing are just as important as terraform. Making the infra as code better and more reusable is a great step in the right direction but only part of the puzzle to making software development workflows better in the big picture.
PaaS is great; sure, it doesn't fully eliminate operations, but it makes it much easier for you to do what the article described.
Like most things it comes at a cost (for a good PaaS mainly the running cost).
But I increasingly come to believe that for small startups, which fundamentally don't have the resources to do what the article describes or anything close to it, using PaaS and keeping operational complexity as low as possible is the way to go.
DevOps is an education and communication position. You won't get the devs to do ops, since they are good at development and don't know much about ops and the same applies to ops in reverse. You need people (ideally, on each product team) who understand enough of both sides to efficiently communicate problems and facilitate solutions. Before you make your devs write provisioning scripts for databases (which they can do, but probably won't get the trade-offs of different parameters), you might want to explain the difference between running database migration scripts on application server startup on a single-instance dev laptop and on a distributed master-master replicated database with multiple application server instances. You increase awareness, you provide platforms, you communicate. That's what DevOps and especially DevOps teams do. They are glue between people of different competence areas. The alternative is Peter's principle applied to both sides.
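The migration example above is worth making concrete. With many app instances booting at once, startup migrations need coordination; the sketch below uses an in-process lock as a stand-in for a cross-instance mechanism like a Postgres advisory lock (`pg_advisory_lock`):

```python
# Sketch: "run migrations on app startup" with N replicas booting at once.
# Without coordination they race; with a shared lock, exactly one migrates.
import threading

migrations_run = []              # which "replicas" actually ran the migration
migration_lock = threading.Lock()
schema_version = {"v": 0}

def start_app(instance_id: int, target_version: int) -> None:
    """What each replica does on boot."""
    with migration_lock:         # in prod: take the lock in the database itself
        if schema_version["v"] < target_version:
            migrations_run.append(instance_id)   # only the lock holder migrates
            schema_version["v"] = target_version

threads = [threading.Thread(target=start_app, args=(i, 1)) for i in range(5)]
for t in threads:
    t.start()
for t in threads:
    t.join()
# Exactly one of the five replicas performed the migration; the others
# took the lock, saw the schema was current, and booted normally.
```

A dev who has only ever run this on a single-instance laptop never hits the race; someone who has seen it in production knows to ask for the lock. That gap is the communication job being described.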
"devops" is just when developers do operations, and most developers generally don't want to because they are more interesting in writing software. If your job is just to manage infrastructure and you use terraform, you're still just a sysadmin. It's a "boring" job but someone has to do it.
I knew DevOps jumped the shark when I started seeing "DevSecOps" and other such nonsense. But I don't think it was always so.
Several years ago, I did "DevOps" work for a year or two. My gig essentially consisted of automating the repetitive Ops tasks and giving engineers self-service tools to get their stuff into prod efficiently.
At the time, things like Puppet & Chef were commonly-used tools, Vagrant was a widely-used development tool and Ansible was the new kid on the block. I don't remember there being much YAML yet, and I'd only heard a few things about Docker. I wrote several custom tools in Python, and we used Jenkins as our CI/CD server.
"DevOps" in those days was a cultural thing. The DevOps engineers owned the infrastructure and the software engineers owned the software. It was the DevOps engineers' responsibility to give the software engineers the tools needed to deploy, and also to ensure that the SWEs weren't causing outages (where we were the first line of defense).
It started to get bollocksed up when hiring managers started defining DevOps roles in terms of tools used, and the tools themselves supplanted communication & culture as the definition of DevOps.
With YAML, k8s and the rest of the nonsense, I don't think DevOps is very possible these days, because when your config is tens to hundreds of thousands of lines of YAML, the tooling doesn't even allow for a culture of self-service & communication. SWEs more or less have no choice but to chuck stuff over the wall, and DevOps or DevSecOps engineers (or whatever buzzword nonsense the industry has adopted) have to perform augury to construct the configuration. Because of cloud vendor lock-in or just sheer complexity, whatever runs in prod is quite different than the dev environment, and everything just limps along.
EDIT: Now I read things about "MLOps" and so on. When will the madness stop?! Apparently we keep allowing recruiters and dimwit HR bureaucrats to define software processes and culture.
>platform engineering and enabling developer self-service.
in any sufficiently large company platforms aren't engineered, they are acquired or purchased/licensed based on their viability as an "enterprise grade" asset that drives business success and reduces cost through identifiable if not meaningless KPI and record keeping.
In any sufficiently large company self-service is supplanted by rigid controls, authorizations, approvals, and annual reviews. this is done in the service of jira and the need to make-pretend work by an ever growing cavalcade of pseudoworkers who recognize self-service as the killing stroke of their career.
it can then be said, grudgingly and with scorn, that devops seems designed by its very definition to operate as an antipattern to some of the world's largest, most successful corporations.
DevOps is bullshit in the same way that Agile is bullshit. So, it's actually not at all, but just has become so fashionable that the term is sufficiently abused to become almost useless.
Agree. Both devops and agile were about empowering developers, the ones who actually make the products people use. And then managers and management consultants got hold of the terms and mutated the ideas and they ossified into "best practices" that get in everyone's way.
Very shortsighted article, ignoring a lot of responsibilities developers don't want to assume - they want somebody else to be on the hook for costs, security, breaches, on-call rotations, SOC 2 compliance, you name it. With freedom comes responsibility!
Call the person or the team or the practice whatever you'd like: DevOps is the merger of development and business operations. It is not developer + sysadmin. Companies with no "devops" (people/team/practice) build big sticky balls of mud. If they're lucky, they hit product-market-fit before the mud dries - if not, well, this is why most software companies die and why most large organizations cannot make progress. DevOps simply plots the trajectory of our drying ball of mud, and attempts to keep it moist for as long as possible.
I don't care if I'm by myself, if I have a team, if I have a devops title - all I want is to prevent the existential death-by-garbage-fire that eventually consumes all technology. If you're a developer and you're not aware that you're marching slowly towards a complexity-cliff, that's fine - we just have different jobs. If you're a developer and you're aware commits aren't by default equivalent to progress, congrats, you're an SRE/DevOps/Senior Engineer/Platform/whatever.
A lot of this rings very true, but I think it's missing the actual noble intent of DevOps, which still holds even if it's been hopelessly corrupted. He touches on a lot of the counterexamples that led to the rise of DevOps. Things like ops/admin teams making themselves a gatekeeper or bottleneck to delivery. That's a classic problem in traditional IT departments. Dev is incentivized to ship features, Ops is incentivized to avoid downtime. The two goals conflict. DevOps says we all work together to ship features that don't cause downtime. That's a philosophical aim and not a new discipline. The maturity and sophistication of the tools and practitioners is orthogonal. If you hire a team, set them apart from Dev, give them an incentive to avoid downtime, and call them DevOps, then you've accomplished nothing. If you hire SREs or platform engineers or whatever else you call them and align their incentives to dev and dev to them, then congratulations.
It also seems like we cycle through ideas, so what was outdated a few years ago becomes interesting again due to a new perspective on modern technology as well as new developers entering unfamiliar spaces.
There must be some rite of passage or initiation ritual that I haven't been good enough for yet where you need to write an opinion piece about the term DevOps, because otherwise I'm not sure why in 2022 we're still having to read these articles. Whoever wrote the first one has trolled an entire generation of engineers.
Because, like Agile, it was seized on as a label for "whatever reorg I want to introduce in my three year term at this company before I leave for the next one".
DevOps is still a great idea, just like Lean and Agile are a great idea. But a great idea isn't enough. A million people have "great ideas" for businesses all the time. How many of those succeed? You can't just have a great idea, you have to execute really well on the great idea.
Most people don't know what DevOps is. Of the very very few that do know what it is, they are powerless to get other people onboard with the idea, because people are lazy and don't want to learn things or change how they work. Even if DevOps is great, if the executives in your company don't force it down everyone's throat with business policies, training, etc, nobody will ever actually do it, because nobody really wants to.
Platform engineering is not gonna solve what DevOps is failing to solve. It's just another silo. "Let's empower developers" sounds great. But developers still lack most of the knowledge to maintain a complex system. Give them a million super-powered tools and they will still screw things up. Most developers I meet today don't even understand the concept of DNS. That's not an exaggeration - they literally don't understand how hostnames work, record types, zone delegation, authoritative records, TTLs, much less propagation or transfers. And you want to, what, give them a fancy tool to change the system that they don't understand? You still need operations, in any business, not just tech. Somebody has to be paid to care about the boring shit that keeps the business working. There is no way to automate away that responsibility, in any complex system in the world. A platform eng team is just adding another team on top of the Ops team you will always need.
What would actually solve a lot of this - and nobody is going to like this - is boring-ass business management best practice from the 50s. W.E. Deming. Lean. Six Sigma. The stupid shit that MBAs nerd out over? That stuff works. High-performing businesses that don't just pay it lip service, but actually do PDSA, actually train their workforce, actually continuously improve their process, and make better business outcomes. But who among the tech nerds wants to listen to that? They just want to play with their toys and have no responsibility. "Build me a platform so I don't have to use my brain."
People have been studying businesses for the better part of a century. There is no easy way out of the morass. No single team or tool or paradigm will make things better. Until you consider everything, holistically, and put into place a barrage of different solutions, and actually teach people to do their jobs better, the actual outcomes of the work won't improve.
So a symbol i.e. "DevOps" gets corrupted. What do you do? Make a new symbol, like the author is suggesting? Or try to reclaim the term? This happens time and again and I find the question fascinating.
FWIW, "Platform Engineering" used to be something different than what the author is suggesting.
> Congrats, that’s not DevOps. I’d wager most of what they are doing is using Terraform and YAML to do menial tasks for the engineering team.
> Need a database? File a ticket with DevOps.
Shouldn't the "DevOps" team build APIs and consoles so eng teams can provision their resources without knowing TFE/YAML from a mile away? Really, this is year 2202. Please, learn from AWS and Netflix. Do software development. Get rid of the god damn ticketing process. Friends don't let friends touch TFE/YAML or whatever configuration files.
- Systems get messy unless you have configuration management that converges on top of ephemeral instances
- Build packages that work the same in all environments if possible
- Dev-prod parity: if that means running Docker locally or on some throw-away dev servers, do it
- 12factor principles
- Every business department has an API and shares the same or similar messaging and storage tech
- Change control: aim for high availability (HA) so that you're never upgrading individual machines and are always backing up/restoring/replacing with fresh nodes
- Repeatable, precise, cached, incremental, distributed builds. Even if it means throwing out timestamps in the binaries
- Cache build artifacts forever (almost)
- Cryptographically sign commits and build artifacts (don't sign hashes because those are weaker)
- Canary, sharded deployments
- Rarely/never allow changes directly to boxes
- Require PRs made through configuration management
- Require PRs have to have another developer code review them
- Automate the heck out of linting and testing before it's generally allowed to drop in prod
- Store secrets either in conf mgmt or in a separate system like Vault
- Have SREs who know how things will end up in prod to troubleshoot the complexity and provide CI/CD and {I,P,Db,S}aaS
- Reduce the number of duplicated systems to a minimum but not to where it is too awkward or difficult to maintain
- Remember that prod, corp, and endpoints (phones and laptops) aren't the same but try to share bits as much as possible while isolating differences
- Corp tends to run more heterogeneous than prod. It's okay but don't let it get out-of-hand without having recommended (as opposed to mandated) standards except for security and third-party component review
- Eliminate technical debt: don't allow layers of crap or be precious about gross code just because it works
- Consider the risks and impact before making changes
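The "canary, sharded deployments" item from the list above can be sketched as a simple promotion gate (shard names, thresholds, and the error-rate source are all invented):

```python
# Sketch: push the new version to one shard first, gate on an error-rate
# threshold, then widen the rollout to the remaining shards or abort.

def rollout(shards, error_rate_for, threshold=0.01, canary_size=1):
    """Deploy to `canary_size` shards, check the gate, then widen or abort."""
    canary, rest = shards[:canary_size], shards[canary_size:]
    deployed = list(canary)                    # canary shards go first
    if max(error_rate_for(s) for s in canary) > threshold:
        # Abort before the blast radius grows past the canary.
        return {"status": "rolled_back", "deployed": []}
    deployed.extend(rest)
    return {"status": "complete", "deployed": deployed}

healthy = rollout(["s1", "s2", "s3"], error_rate_for=lambda s: 0.001)
broken = rollout(["s1", "s2", "s3"], error_rate_for=lambda s: 0.25)
```

A real system would source `error_rate_for` from the monitoring stack and bake in soak time between stages, but the control flow is this small.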
I like doing "devops"/operations work. Getting a service out the door with well thought out infrastructure, plus observability/alerting/runbooks to be handed off to an Ops/SOC team is a great feeling. Then you work on the next service.
If you don't have some sort of Devops team (yes, the term is too broad) to push back on infrastructure complexity and ensure that logging/metrics/alerting/documentation is done, developers just won't do it.
I think the DevOps team maintains infra and monitors ops so that other teams can "plug in" their applications. For example, the DevOps team sets up Terraform, but dev teams plug in their own repos and configurations to use that private enterprise Terraform "cluster". Another example: the DevOps team sets up Airflow clusters, but data engineering teams use them.
At least this is how the role works in my company.
> In production, we are running in containers, but developing in the container is too slow, so the team leans towards asdf and a README full of stuff to copy, paste, and pray. During a sprint, an engineer adds convert (ImageMagick) to the mix to support manipulating images and forgets to update the Dockerfile, and then production goes down.
Wow, it's like you can prove anything with a contrived example!
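Contrived or not, the drift in the quoted example (a tool added to dev machines but not the Dockerfile) is cheap to catch in CI. A crude sketch — the manifest file and function names are hypothetical, and this is a string check, not a real package resolver:

```python
def missing_from_dockerfile(required_tools: list[str], dockerfile_text: str) -> list[str]:
    """Return tools the team's manifest requires that the Dockerfile never mentions.

    Only catches the "engineer added ImageMagick locally and forgot the
    Dockerfile" case; it does not verify the install actually succeeds.
    """
    text = dockerfile_text.lower()
    return [t for t in required_tools if t.lower() not in text]
```

A CI job would read something like a `required-tools.txt` next to the Dockerfile and fail the build when this returns a non-empty list.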
Well sure, I would like to do DevOps as well as software engineering, but I'm getting drowned in agile ceremonies already. Meanwhile the DevOps guy has like 10 more hours a week, since he doesn't have all the agile re-/pre-/post-refinement/retro/intro/outro/daily/weekly etc.
They do maybe a bidaily DevOps meeting or 1-on-1 calls and look after their infra.
To clarify - what companies are now calling "DevOps" is not DevOps, it's not rooted in bringing about cultural change in part by bringing software development and operations closer together / de-siloing. It's just another label / title thrown on people/teams that have become another silo.
How many companies are there where a Dev team writes code and "throws it over the wall" for Ops to deploy and maintain? I hear people talking about this as an anti-pattern all the time, but I haven't been in enough workplaces to see what this looks like when it happens recently.
We had this pattern in a previous role. Essentially, there was a repo with yaml config files in it. A dev team would write their code they wanted to deploy, configure the yaml files, and infrastructure would be deployed in AWS. That infrastructure was owned by the operations team.
This caused as many disasters as you’d expect. For example, when something breaks (like ECS, or a bad AMI is deployed), the dev team is stuck waiting on the operations team to fix it because the infrastructure is owned by the ops team.
I was part of transitioning to a different model where the same general concept applies; a bunch of yaml leads to infrastructure being deployed. But that infrastructure was fully owned by the dev team that deployed the code.
It generally made things easier; the trade-off was that if the deployment code broke (i.e. the code responsible for converting the yaml files to infrastructure, like Terraform modules), we were the ones who had to fix it.
Ultimately it was just a fancier and more guided version of Kubernetes.
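The "bunch of yaml leads to infrastructure" pattern above can be sketched in a few lines. The point of the second model is that ownership is explicit in the rendered resources, so a broken deploy is unambiguously the dev team's to fix. All keys and resource names here are hypothetical, loosely Terraform-JSON-shaped:

```python
def render_service_infra(config: dict) -> dict:
    """Expand a team-owned config blob (parsed from yaml) into an infra
    resource description. The owning team is tagged on the resource itself,
    mirroring the 'dev team fully owns what it deploys' model."""
    return {
        "resource": {
            "aws_ecs_service": {
                config["name"]: {
                    "desired_count": config.get("replicas", 1),
                    "tags": {"owner_team": config["team"]},  # ownership is explicit
                }
            }
        }
    }
```

In the first model described above, the same rendering would exist but the resulting resources would be tagged to (and gated on) the central ops team instead.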
I don't understand why we can't take more ownership of our work. Even if you are not immediately and directly compensated for every inch that you go above and beyond, you are still in a position to make yourself irreplaceable in terms the business cannot ignore.
Think about the long play. You join a startup with a broken, hot-potato-style "devops" process. Instead of saying "not my problem" all day, you can take some ownership of the items slightly outside your space and try to arrive at a better solution. If you do this often enough and with enough persistence, the end customer will eventually see benefit. At some point, management will likely notice the correlation as well. Even if they don't, you have gained far more experience than you would have otherwise and can maybe go start your own damn company, realizing up to 100% of the value of your labor.
This is why I try hard, even if someone doesn't make me.
> and the business will never let you actually own it.
This is not true in many startup environments. You'd have a hell of a time getting me to work on a new project without some sort of equity arrangement.
I started out as a junior developer. I am now in the C-suite.
Equity is also not a replacement for salary, which we clearly just take for granted these days.
Taking an adversarial stance in business is the fastest way to wind up nowhere. Being able to "fire the c-suite" is not a good thing. If you have to do that, you likely don't have a viable business in the first place.
I feel like I need some definitions to understand this article and discussion. What is dev? What is ops? What is devops? What is system administration? What is sysadmin?
I'm 47 and what is described seems like the old problems that I thought devops solved.
I’m also confused by this article. Perhaps it is more applicable to companies that develop software tightly coupled with the web and cloud services. I work in the embedded industry, and from my understanding (as naive as it may be), DevOps has simply meant the automation of building, testing and releasing our products to a customer-accessible endpoint, and that is life-changing. It massively cuts back on the manual human interaction needed to get our software to our customers. Perhaps because our product is nowhere near as monolithic as the products whose DevOps processes grow to match them in scale and complexity, DevOps is something wonderful that I would never define as ‘bullshit’.
I think of DevOps as "deploying and maintaining infrastructure on which applications run", vs SW Engs as "building applications". Not sure if that's the "accepted" definition or not, but works for us. YMMV.
The article was interesting. It seems like it misses that the complaints of why DevOps is dead is a culture problem, which I've found are hard to solve by throwing new tech at the problem.
I've worked with a "devops" team that spent hundreds of thousands of dollars on Puppet consultants and never got it working. Another team spent its time badmouthing various technology choices, but then didn't realize that our customer's on-site deployment dictated the technology choices, and chose the wrong tech.
The real reason devops exists (what used to be called Application Engineering) is because development is pretty clueless about the customer's runtime environment. They don't really understand the tuning necessary to create a production system.
Example: very few developers understand how big your DB connection pool should be, because they tend to only test single-connection scenarios. That's assuming they actually know enough to use a connection pool at all. And they almost never handle failover scenarios.
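For context on why single-connection dev testing misleads here: a common starting-point heuristic (popularized by the HikariCP "About Pool Sizing" guide) derives the pool size from hardware, and the answer is usually far smaller than developers guess:

```python
def suggested_pool_size(cpu_cores: int, effective_spindles: int = 1) -> int:
    """Starting-point heuristic for DB connection pool sizing:

        connections = (cores * 2) + effective_spindle_count

    The takeaway is that the right number is small and hardware-derived --
    something a single-connection dev test will never reveal. Treat it as a
    tuning starting point, not a rule."""
    return cpu_cores * 2 + effective_spindles
```

On an 8-core DB host with one SSD, that suggests a pool of 17, not the hundreds of connections apps are often configured with.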
DevOps should be the purview of the senior engineers. If you're building without an understanding of your deployment environment then you're screwed. And more importantly, you're not taking advantage of platform features.
As an example, deploying to SSDs (which everyone should be doing) means your database performance has just gotten an order of magnitude better for free. You need to retest and get rid of a lot of those performance-related changes.
Let's put it this way: one monolithic Java/spring application I worked on was basically a SOAP server with a UI that took up gigs of RAM. It was that big because the scaffolding required to handle all that was, well, huge. But really, it was essentially a web server that served pages to connected clients. All that other shit was overhead...so on AWS it got transformed into a few lambda functions and REST apis. Without an understanding of the deployment environment (and the possibilities associated with that) it would never have happened.
TL;DR: senior engineers should be the devops people, because how and where you deploy software should determine how you should make software.
This is one of those things that just drives people out of tech and leaves the ones still in it super frustrated. So many buzzwords and PR nonsense from non-tech management.
Re-posting a comment I made in a different thread, because I think the friction in devops is primarily a socio-cultural issue whose origins lie outside of tech orgs.
===
A colleague of mine once shared his lay sociological theory about dev vs ops, and if taken for what it is -- an essentialization -- it's an interesting perspective.
The idea is that ops people have inherited a blue-collar culture, whereas devs have inherited an office-worker/academic culture.
Ops people conceive of their work as fundamentally operational: progress is measured in terms of actions taken, and while automation is greatly valued, there is nothing inherently "messy" with one-off fixes; the objective is to get things working now. The pathological case for this mindset is that of constantly being on the back-foot, responding to incidents with one-off fixes without recognizing that many of them share a common cause that could be addressed.
Dev people conceive of their work as fundamentally intellectual: progress is attained when the problem is correctly conceived, at which point the solution follows naturally. While writing code is greatly valued, most effort should be spent understanding the problem; the objective is to solve it correctly, once and for all. The pathological case for this mindset is that of over-engineering by an ivory-tower idealist, disconnected from the messiness of real-world praxis.
In nearly all orgs I've seen, the proportion of first-generation college grads is greater in ops teams than in dev teams. So too is the proportion of people who come from blue-collar families, or are mechanically inclined (ex: look around and see who tinkers with cars). Likewise, the proportion of people holding graduate degrees is greater in the dev crowd (ex: look around and see who's into math).
It should hopefully be clear that neither is superior to the other. The point is rather that the divide between dev and ops is partly sociological, which means it is largely based on values. Ops will tend to over-value "honest hard work" and dev will tend to over-value "clear articulate thought". There is also some latent, historical tension between these sociological groups, which has a funny way of masquerading as a technical problem. It is helpful to view arguments about "just ship it" vs "design it the right way" through this lens.
Far from being a cute "just so" story, it's been my experience that this dynamic is very important for two reasons: (1) it's harder than you might expect to foster the sense of common destiny required for real "devops"-style collaboration, and (2) each side of the dev-ops divide has a lot to gain from learning when the other side's mindset is helpful, and how to cultivate it in themselves.