From where I sit in the DevOps community, there is often more focus on dev than on ops. Damon Edwards (@damonedwards) of SimplifyOps sought to change that with his talk, Ops Happens: DevOps Beyond Deployment, at the All Day DevOps conference.
Damon dove into the primary, systemic force behind most DevOps problems - silos. The product development process goes like this: Planning-->Dev-->Release-->Operate. The problem is that many enterprises tend to place similar functions together. Everyone ends up in a silo. Then walls build between the silos. Eventually, people only know life in their silo, making handoffs even harder.
We often find that application knowledge and business intent are heavily emphasized on the development side, but light on the operations side. Likewise, operational knowledge is heavy on the ops side, but light on the development side. Furthermore, development has ownership, but limited accountability, while operations have accountability, but no ownership.

While many enterprises are striving towards building cross-functional teams, the reality is that transformations often stop short of truly integrating operations. The result? Siloes remain.
So, one has to ask, why is this so hard?

The reality is that enterprise operations are under tremendous pressure. One side tells them to go faster and open it up, and the other side tells them to be more secure and reliable. These are often seen as competing priorities.
To solve this, enterprises need to "shift left" in the product development cycle operations activity as much as possible. They need to do as much as possible during development. For the deploy function, enterprises should be:
-
Writing/running automated tests
-
Writing/exercising deploy automation
-
Running security scanning tools
For the operate function, enterprises should be:
-
Writing/exercising automated runbooks
-
Writing/exercising monitoring/metrics
-
Operational control (safely!)
However, shifting operations to the left is much harder. How do you do it? Embed ownership.
Let me repeat: embed ownership.
First, those who build something define the procedures to fix it, and those who build something fix it when it breaks.
That sounds simple, but raise questions:
-
How do you safely and securely give access?
-
How do you enable the experts to contribute remediations?
-
How do you give the experts visibility into operations?
-
How do you postmortems days/weeks/months later?
Damon recommended four steps.
Step 1 -- Establish a Secure Ops Portal

Step 2 - Establish a SDLC for Ops Procedures

Step 3 - Connect with Enterprise Management Systems

Step 4 - Make Compliance Really Happy

Ticketmaster is a real-life example of this working at a large, prominent scale. Ticketmaster calls their system, "Support at the Edge" and it involves:
-
Automated Ops procedures written/vetted by the delivery teams
-
Ops remained in full control of what can run and security policy
-
Empowered support teams with self-service ops tasks
-
Empowered developers with limited self-service operations
-
Combined with a new incident response model
Ticketmaster has seen transformative results. Before Support at the Edge, the average mean time to respond was 47 minutes. Now, support at the Edge has reduced that to just 3.8 minutes, in addition to decreasing escalations 50% and support costs 55%. Ticketmaster has seen real results.

Derek serves as vice president and DevOps advocate at Sonatype and is the co-founder of All Day DevOps -- an online community of 65,000 IT professionals.
Explore All Posts by Derek WeeksTags
Code 3x Faster with Less False Positives
Build, test, and launch secure applications without rework. Explore how the Sonatype platform can enhance productivity and security.